This avoids having to choose bin shard on the fly, also will allow flexible bin binding for each thread.
smallocx
JEMALLOC_VERSION_GID