Include mb.h after mutex.h, in case it actually has to use the mutex-based memory barrier implementation.