server-skynet-source-3rd-jemalloc

project-base/server-skynet-source-3rd-jemalloc

Author	SHA1	Message	Date
Jason Evans	7e11b389aa	Move size class table to man page. Move the table of size classes from jemalloc.c to the manual page. When manually formatting the manual page, it is now necessary to use: nroff -man -t jemalloc.3	2010-09-11 22:52:16 -07:00
Jason Evans	58a6f5c9be	Add posix_memalign test.	2010-09-11 20:59:16 -07:00
Jason Evans	2dbecf1f62	Port to Mac OS X. Add Mac OS X support, based in large part on the OS X support in Mozilla's version of jemalloc.	2010-09-11 18:20:16 -07:00
Jason Evans	b267d0f86a	Add the thread.arena mallctl. Make it possible for each thread to manage which arena it is associated with. Implement the 'tests' and 'check' build targets.	2010-08-13 17:36:00 -07:00
Jason Evans	dcd15098a8	Move assert() calls up in arena_run_reg_alloc(). Move assert() calls up in arena_run_reg_alloc(), so that a corrupt pointer will likely be caught by an assertion before it is dereferenced.	2010-08-05 12:13:42 -07:00
Jason Evans	2541e1b083	Add a missing mutex unlock in malloc_init_hard(). If multiple threads race to initialize malloc, the loser(s) busy-wait until initialization is complete. Add a missing mutex lock so that the loser(s) properly release the initialization mutex. Under some race conditions, this flaw could have caused one or more threads to become permanently blocked. Reported by Terrell Magee.	2010-07-22 11:35:59 -07:00
Jason Evans	b43b7750a6	Fix the libunwind version of prof_backtrace(). Fix the libunwind version of prof_backtrace() to set the backtrace depth for all possible code paths. This fixes the zero-length backtrace problem when using libunwind.	2010-06-04 15:10:43 -07:00
Jason Evans	7013d10a9e	Avoid unnecessary isalloc() calls. When heap profiling is enabled but deactivated, there is no need to call isalloc(ptr) in prof_{malloc,realloc}(). Avoid these calls, so that profiling overhead under such conditions is negligible.	2010-05-11 18:17:02 -07:00
Jason Evans	ed3d152ea0	Fix next_arena initialization. If there is more than one arena, initialize next_arena so that the first and second threads to allocate memory use arenas 0 and 1, rather than both using arena 0.	2010-05-11 12:00:22 -07:00
Jordan DeLong	2206e1acc1	Add MAP_NORESERVE support. Add MAP_NORESERVE to the chunk_mmap() case being used by chunk_swap_enable(), if the system supports it.	2010-05-11 11:46:53 -07:00
Jason Evans	ecea0f6125	Fix junk filling of cached large objects. Use the size argument to tcache_dalloc_large() to control the number of bytes set to 0x5a when junk filling is enabled, rather than accessing a non-existent arena bin. This bug was capable of corrupting an arbitrarily large memory region, depending on what followed the arena data structure in memory (typically zeroed memory, another arena_t, or a red-black tree node for a huge object).	2010-04-28 12:00:59 -07:00
Jason Evans	5055f4516c	Fix tcache crash during thread cleanup. Properly maintain tcache_bin_t's avail pointer such that it is NULL if no objects are cached. This only caused problems during thread cache destruction, since cache flushing otherwise never occurs on an empty bin.	2010-04-14 11:27:13 -07:00
Jason Evans	38cda690dd	Fix profiling regression caused by bugfix. Properly set the context associated with each allocated object, even when the object is not sampled. Remove debug print code that slipped in.	2010-04-14 11:24:45 -07:00
Jason Evans	6d68ed6492	Remove autom4te.cache in distclean (not relclean).	2010-04-13 22:01:55 -07:00
Jason Evans	8d4203c72d	Fix arena chunk purge/dealloc race conditions. Fix arena_chunk_dealloc() to put the new spare in a consistent state before dropping the arena mutex to deallocate the previous spare. Fix arena_run_dalloc() to insert a newly dirtied chunk into the chunks_dirty list before potentially deallocating the chunk, so that dirty page accounting is self-consistent.	2010-04-13 21:17:18 -07:00
Jason Evans	5065156f3f	Fix threads-related profiling bugs. Initialize bt2cnt_tsd so that cleanup at thread exit actually happens. Associate (prof_ctx_t ) with allocated objects, rather than (prof_thr_cnt_t ). Each thread must always operate on its own (prof_thr_cnt_t *), and an object may outlive the thread that allocated it.	2010-04-13 21:17:11 -07:00
Jason Evans	1bb602125c	Update stale JEMALLOC_FILL code. Fix a compilation error due to stale data structure access code in tcache_dalloc_large() for junk filling.	2010-04-13 21:17:02 -07:00
Jason Evans	5523399169	Update documentation.	2010-04-11 19:02:43 -07:00
Jason Evans	5fe764f83f	Generalize ExtractSymbols optimization (pprof). Generalize ExtractSymbols to handle all cases of library address overlap with the main binary.	2010-04-08 23:23:53 -07:00
Jason Evans	799ca0b68d	Revert re-addition of purge_lock. Linux kernels have been capable of concurrent page table access since 2.6.27, so this hack is not necessary for modern kernels.	2010-04-08 20:31:58 -07:00
Jason Evans	68f91893bd	Fix P/p reporting in stats_print(). Now that JEMALLOC_OPTIONS=P isn't the only way to cause stats_print() to be called, opt_stats_print must actually be checked when reporting the state of the P/p option.	2010-04-08 19:14:51 -07:00
Jason Evans	3395860921	Don't build with -march=native. Don't build with -march=native by default, because the generated code may perform especially poorly on ABI-compatible, but internally different, systems.	2010-04-07 23:41:00 -07:00
Jason Evans	0656ec0eb4	Fix build system problems. Split library build rules up so that parallel building works. Fix autoconf-related dependencies. Remove obsolete JEMALLOC_VERSION definition.	2010-04-07 23:37:35 -07:00
Jason Evans	af366593a4	Improve ExtractSymbols (pprof). Iterated downward through both libraries and PCs. This allows PCs to resolve even when library address ranges overlap.	2010-04-07 19:52:15 -07:00
Jason Evans	7cb5b5ea21	Fix error path in prof_dump(). Remove a duplicate prof_leave() call in an error path through prof_dump().	2010-04-06 12:21:46 -07:00
Jason Evans	fd88bd577e	Report E/e option state in jemalloc_stats_print().	2010-04-06 12:20:23 -07:00
Jason Evans	ec5344eba2	Optimize ExtractSymbols (pprof). Modify ExtractSymbols to operate on sorted PCs and libraries, in order to reduce computational complexity from O(N*M) to O(N+M).	2010-04-02 18:49:34 -07:00
Jason Evans	a53610130d	Use addr2line only for --line option (pprof).	2010-04-02 18:48:27 -07:00
Jason Evans	a91f210929	Import pprof from google-perftools, svn r91. Fix divide-by-zero error in pprof. It is possible for sample contexts to currently have no associated objects, but the cumulative statistics are still useful, depending on how the user invokes pprof. Since jemalloc intentionally does not filter such contexts, take care not to divide by 0 when re-scaling for v2 heap sampling. Install pprof as part of 'make install'. Update pprof documentation.	2010-04-02 14:41:02 -07:00
Jason Evans	18ad8234b6	Don't disable leak reporting due to sampling. Leak reporting is useful even if sampling is enabled; some leaks may not be reported, but those reported are still genuine leaks.	2010-04-02 13:48:39 -07:00
Jason Evans	f18c982001	Add sampling activation/deactivation control. Add the E/e options to control whether the application starts with sampling active/inactive (secondary control to F/f). Add the prof.active mallctl so that the application can activate/deactivate sampling on the fly.	2010-03-31 18:43:24 -07:00
Jason Evans	a02fc08ec9	Make interval-triggered profile dumping optional. Make it possible to disable interval-triggered profile dumping, even if profiling is enabled. This is useful if the user only wants a single dump at exit, or if the application manually triggers profile dumps.	2010-03-31 17:35:51 -07:00
Jason Evans	0b270a991d	Reduce statistical heap sampling memory overhead. If the mean heap sampling interval is larger than one page, simulate sampled small objects with large objects. This allows profiling context pointers to be omitted for small objects. As a result, the memory overhead for sampling decreases as the sampling interval is increased. Fix a compilation error in the profiling code.	2010-03-31 16:45:04 -07:00
Jason Evans	169cbc1ef7	Re-add purge_lock to funnel madvise(2) calls.	2010-03-26 18:10:19 -07:00
Jason Evans	c03a63d68d	Set/clear CHUNK_MAP_ZEROED in arena_chunk_purge(). Properly set/clear CHUNK_MAP_ZEROED for all purged pages, according to whether the pages are (potentially) file-backed or anonymous. This was merely a performance pessimization for the anonymous mapping case, but was a calloc()-related bug for the swap_enabled case.	2010-03-22 11:45:01 -07:00
Jason Evans	19b3d61892	Track dirty and clean runs separately. Split arena->runs_avail into arena->runs_avail_{clean,dirty}, and preferentially allocate dirty runs.	2010-03-18 20:36:40 -07:00
Jason Evans	dafde14e08	Remove medium size classes. Remove medium size classes, because concurrent dirty page purging is no longer capable of purging inactive dirty pages inside active runs (due to recent arena/bin locking changes). Enhance tcache to support caching large objects, so that the same range of size classes is still cached, despite the removal of medium size class support.	2010-03-17 16:27:39 -07:00
Jason Evans	e69bee01de	Fix a run initialization race condition. Initialize small run header before dropping arena->lock, arena_chunk_purge() relies on valid small run headers during run iteration. Add some assertions.	2010-03-15 22:25:23 -07:00
Jason Evans	f00bb7f132	Add assertions. Check for interior pointers in arena_[ds]alloc(). Check for corrupt pointers in tcache_alloc().	2010-03-15 16:44:12 -07:00
Jason Evans	6b5974403b	Widen malloc_stats_print() output columns.	2010-03-15 15:50:48 -07:00
Jason Evans	d9ef75fed4	arena_chunk_purge() arena->nactive fix. Update arena->nactive when pseudo-allocating runs in arena_chunk_purge(), since arena_run_dalloc() subtracts from arena->nactive.	2010-03-15 12:43:07 -07:00
Jason Evans	992242c545	Change xmallctl() --> CTL_GET() where possible.	2010-03-14 19:55:32 -07:00
Jason Evans	19b6a5537d	Fix malloc_stats_print() man page prototype.	2010-03-14 19:52:26 -07:00
Jason Evans	e00572b384	mmap()/munmap() without arena->lock or bin->lock.	2010-03-14 19:43:56 -07:00
Jason Evans	05b21be347	Purge dirty pages without arena->lock.	2010-03-14 19:41:18 -07:00
Jason Evans	86815df9dc	Push locks into arena bins. For bin-related allocation, protect data structures with bin locks rather than arena locks. Arena locks remain for run allocation/deallocation and other miscellaneous operations. Restructure statistics counters to maintain per bin allocated/nmalloc/ndalloc, but continue to provide arena-wide statistics via aggregation in the ctl code.	2010-03-14 17:38:09 -07:00
Jason Evans	1e0a636c11	Simplify small object allocation/deallocation. Use chained run free lists instead of bitmaps to track free objects within small runs. Remove reference counting for small object run pages.	2010-03-13 20:38:29 -08:00
Jason Evans	3fa9a2fad8	Simplify tcache object caching. Use chains of cached objects, rather than using arrays of pointers. Since tcache_bin_t is no longer dynamically sized, convert tcache_t's tbin to an array of structures, rather than an array of pointers. This implicitly removes tcache_bin_{create,destroy}(), which further simplifies the fast path for malloc/free. Use cacheline alignment for tcache_t allocations. Remove runtime configuration option for number of tcache bin slots, and replace it with a boolean option for enabling/disabling tcache. Limit the number of tcache objects to the lesser of TCACHE_NSLOTS_MAX and 2X the number of regions per run for the size class. For GC-triggered flush, discard 3/4 of the objects below the low water mark, rather than 1/2.	2010-03-13 20:38:18 -08:00
Jason Evans	2caa4715ed	Modify dirty page purging algorithm. Convert chunks_dirty from a red-black tree to a doubly linked list, and use it to purge dirty pages from chunks in FIFO order. Add a lock around the code that purges dirty pages via madvise(2), in order to avoid kernel contention. If lock acquisition fails, indefinitely postpone purging dirty pages. Add a lower limit of one chunk worth of dirty pages per arena for purging, in addition to the active:dirty ratio. When purging, purge all dirty pages from at least one chunk, but rather than purging enough pages to drop to half the purging threshold, merely drop to the threshold.	2010-03-04 22:49:59 -08:00
Jason Evans	3c2d9c899c	Print version in malloc_stats_print().	2010-03-03 17:55:03 -08:00

... 4 5 6 7 8

385 Commits