Code Arcana

(feed for performance engineering posts)

Recent articles

Why everyone fails at monitoring; and what you can do about it

Thu 05 October 2017
In performance engineering.

tags: monitoring

People monitor their systems for two main reasons: to keep their system healthy and to understand its performance. Almost everyone does both wrong, for the same reasons: they monitor so they can react to failures, rather than measuring their workload so that they can predict problems.

What should I use …

read more
Fast query log with tcpdump and tshark

Thu 21 July 2016
In performance engineering.

tags: Linux tcpdump wireshark dbbench

dbbench is a tool I've been working on for a while at MemSQL. It is an open source database workload driver engineers at MemSQL and I use for performance testing. One often-overlooked feature in dbbench is the ability to replay query log files. Previously, this was a somewhat manual process …

read more
An informal survey of Linux dynamic tracers

Sat 09 January 2016
In performance engineering.

tags: Linux tracing perf_events

I survey some dynamic tracers (e.g. perf, sysdig) available on Linux.

read more
Dtrace isn't just a tool; it's a philosophy

Sun 03 January 2016
In performance engineering.

tags: Linux perf_events

I document some pain points from recent performance investigations and then speculate that such issues are endemic to the Linux community.

read more
Using off-cpu flame graphs on Linux

Sun 20 December 2015
In performance engineering.

tags: Linux perf_events flamegraph

I use off-cpu flame graphs to identify that repeated mmap calls are slowing my database.

read more
Why are builds on HGFS so slow?

Fri 04 December 2015
In performance engineering.

tags: profiling vmware make

We use flame graphs to identify that hgfs is the bottleneck in my build.

read more
TCP Keepalive is a lie

Fri 28 August 2015
In performance engineering.

tags: tcp linux networking perf_events

In the past few months, I’ve had to debug some gnarly issues related to TCP_KEEPALIVE. Through these issues, I’ve learned that it is harder than one might think to ensure that your sockets fail after a short time when the network is disconnected. This blog post is intended …

read more
Bash Performance Tricks

Tue 06 August 2013
In performance engineering.

tags: bash profiling

My coworkers presented a silly programming interview style question to me the other day: given a list of words, find the largest set of words from that list that all have the same hash value. Everyone was playing around with a different language, and someone made the claim that it …

read more
Achieving maximum memory bandwidth

Sat 18 May 2013
In performance engineering.

tags: profiling

I embarked upon a quest to understand some unexpected behavior and write a program that achieved the theoretical maximum memory bandwidth.

read more
A cross-platform monotonic timer

Wed 15 May 2013
In performance engineering.

tags: profiling

I've been working on writing a memory bandwidth benchmark for a while and needed to use a monotonic timer to compute accurate timings. I have since learned that this is more challenging to do that I initially expected and each platform has a different way of doing it.

read more
Why is omp_get_num_procs so slow?

Fri 10 May 2013
In performance engineering.

tags: profiling

Some students had some difficulty profiling their code because omp_get_num_procs was dominating the profiling traces. I tracked it down and found that the profiling tools emitted misleading results when the library didn't have symbols.

read more
Introduction to Using Profiling Tools

Tue 26 February 2013
In performance engineering.

tags: profiling

In this article, you will see several performance tools used to identify bottlenecks in a simple program.

read more
Analysis of a Parallel Memory Allocator

Fri 11 May 2012
In performance engineering.

tags: malloc

I implemented and tested different configurations of a modern parallel memory allocator.

read more

Recent articles

What should I use …