Include mb.h after mutex.h, in case it actually has to use the mutex-based memory barrier implementation.
jemalloc is a general-purpose scalable concurrent malloc(3) implementation. The INSTALL file contains information on how to configure, build, and install jemalloc.