server-skynet-source-3rd-jemalloc

project-base/server-skynet-source-3rd-jemalloc

Author	SHA1	Message	Date
Jason Evans	5065156f3f	Fix threads-related profiling bugs. Initialize bt2cnt_tsd so that cleanup at thread exit actually happens. Associate (prof_ctx_t ) with allocated objects, rather than (prof_thr_cnt_t ). Each thread must always operate on its own (prof_thr_cnt_t *), and an object may outlive the thread that allocated it.	2010-04-13 21:17:11 -07:00
Jason Evans	1bb602125c	Update stale JEMALLOC_FILL code. Fix a compilation error due to stale data structure access code in tcache_dalloc_large() for junk filling.	2010-04-13 21:17:02 -07:00
Jason Evans	799ca0b68d	Revert re-addition of purge_lock. Linux kernels have been capable of concurrent page table access since 2.6.27, so this hack is not necessary for modern kernels.	2010-04-08 20:31:58 -07:00
Jason Evans	0656ec0eb4	Fix build system problems. Split library build rules up so that parallel building works. Fix autoconf-related dependencies. Remove obsolete JEMALLOC_VERSION definition.	2010-04-07 23:37:35 -07:00
Jason Evans	f18c982001	Add sampling activation/deactivation control. Add the E/e options to control whether the application starts with sampling active/inactive (secondary control to F/f). Add the prof.active mallctl so that the application can activate/deactivate sampling on the fly.	2010-03-31 18:43:24 -07:00
Jason Evans	a02fc08ec9	Make interval-triggered profile dumping optional. Make it possible to disable interval-triggered profile dumping, even if profiling is enabled. This is useful if the user only wants a single dump at exit, or if the application manually triggers profile dumps.	2010-03-31 17:35:51 -07:00
Jason Evans	0b270a991d	Reduce statistical heap sampling memory overhead. If the mean heap sampling interval is larger than one page, simulate sampled small objects with large objects. This allows profiling context pointers to be omitted for small objects. As a result, the memory overhead for sampling decreases as the sampling interval is increased. Fix a compilation error in the profiling code.	2010-03-31 16:45:04 -07:00
Jason Evans	169cbc1ef7	Re-add purge_lock to funnel madvise(2) calls.	2010-03-26 18:10:19 -07:00
Jason Evans	19b3d61892	Track dirty and clean runs separately. Split arena->runs_avail into arena->runs_avail_{clean,dirty}, and preferentially allocate dirty runs.	2010-03-18 20:36:40 -07:00
Jason Evans	dafde14e08	Remove medium size classes. Remove medium size classes, because concurrent dirty page purging is no longer capable of purging inactive dirty pages inside active runs (due to recent arena/bin locking changes). Enhance tcache to support caching large objects, so that the same range of size classes is still cached, despite the removal of medium size class support.	2010-03-17 16:27:39 -07:00
Jason Evans	f00bb7f132	Add assertions. Check for interior pointers in arena_[ds]alloc(). Check for corrupt pointers in tcache_alloc().	2010-03-15 16:44:12 -07:00
Jason Evans	05b21be347	Purge dirty pages without arena->lock.	2010-03-14 19:41:18 -07:00
Jason Evans	86815df9dc	Push locks into arena bins. For bin-related allocation, protect data structures with bin locks rather than arena locks. Arena locks remain for run allocation/deallocation and other miscellaneous operations. Restructure statistics counters to maintain per bin allocated/nmalloc/ndalloc, but continue to provide arena-wide statistics via aggregation in the ctl code.	2010-03-14 17:38:09 -07:00
Jason Evans	1e0a636c11	Simplify small object allocation/deallocation. Use chained run free lists instead of bitmaps to track free objects within small runs. Remove reference counting for small object run pages.	2010-03-13 20:38:29 -08:00
Jason Evans	3fa9a2fad8	Simplify tcache object caching. Use chains of cached objects, rather than using arrays of pointers. Since tcache_bin_t is no longer dynamically sized, convert tcache_t's tbin to an array of structures, rather than an array of pointers. This implicitly removes tcache_bin_{create,destroy}(), which further simplifies the fast path for malloc/free. Use cacheline alignment for tcache_t allocations. Remove runtime configuration option for number of tcache bin slots, and replace it with a boolean option for enabling/disabling tcache. Limit the number of tcache objects to the lesser of TCACHE_NSLOTS_MAX and 2X the number of regions per run for the size class. For GC-triggered flush, discard 3/4 of the objects below the low water mark, rather than 1/2.	2010-03-13 20:38:18 -08:00
Jason Evans	2caa4715ed	Modify dirty page purging algorithm. Convert chunks_dirty from a red-black tree to a doubly linked list, and use it to purge dirty pages from chunks in FIFO order. Add a lock around the code that purges dirty pages via madvise(2), in order to avoid kernel contention. If lock acquisition fails, indefinitely postpone purging dirty pages. Add a lower limit of one chunk worth of dirty pages per arena for purging, in addition to the active:dirty ratio. When purging, purge all dirty pages from at least one chunk, but rather than purging enough pages to drop to half the purging threshold, merely drop to the threshold.	2010-03-04 22:49:59 -08:00
Jason Evans	698805c525	Simplify malloc_message(). Rather than passing four strings to malloc_message(), malloc_write4(), and all the functions that use them, only pass one string.	2010-03-03 17:45:38 -08:00
Jason Evans	a40bc7afe8	Add release versioning support. Base version string on 'git describe --long', and provide cpp macros in jemalloc.h. Add the version mallctl.	2010-03-02 13:01:16 -08:00
Jason Evans	22ca855e8f	Allow prof.dump mallctl to specify filename.	2010-03-02 12:11:35 -08:00
Jason Evans	74025c85bf	Edit rb documentation.	2010-03-02 12:10:52 -08:00
Jason Evans	b9477e782b	Implement sampling for heap profiling.	2010-03-01 20:15:26 -08:00
Jason Evans	f3ff75289b	Rewrite red-black trees. Use left-leaning 2-3 red-black trees instead of left-leaning 2-3-4 red-black trees. This reduces maximum tree height from (3 lg n) to (2 lg n). Do lazy balance fixup, rather than transforming the tree during the down pass. This improves insert/remove speed by ~30%. Use callback-based iteration rather than macros.	2010-02-28 15:00:18 -08:00
Jason Evans	3b5ee5e857	Fix #include ordering for mb.h. Include mb.h after mutex.h, in case it actually has to use the mutex-based memory barrier implementation.	2010-02-11 15:56:23 -08:00
Jason Evans	cd90fca928	Wrap mallctl* references with JEMALLOC_P().	2010-02-11 14:55:25 -08:00
Jason Evans	376b1529a3	Restructure source tree.	2010-02-11 14:45:59 -08:00

25 Commits