I use flame graphs to identify hgfs as the bottleneck in my build.
Recent articles
Bash Performance Tricks
My coworkers presented a silly programming-interview-style question to me the other day: given a list of words, find the largest set of words from that list that all have the same hash value. Everyone was playing around with a different language, and someone made the claim that it …
Achieving maximum memory bandwidth
I embarked upon a quest to understand some unexpected behavior and write a program that achieved the theoretical maximum memory bandwidth.
A cross-platform monotonic timer
I've been working on a memory bandwidth benchmark for a while and needed a monotonic timer to compute accurate timings. I have since learned that this is more challenging than I initially expected, and each platform has a different way of doing it.
Why is omp_get_num_procs so slow?
Some students had some difficulty profiling their code because omp_get_num_procs was dominating the profiling traces. I tracked it down and found that the profiling tools emitted misleading results when the library didn't have symbols.
Introduction to Using Profiling Tools
In this article, you will see several performance tools used to identify bottlenecks in a simple program.