server-skynet-source-3rd-jemalloc

project-base/server-skynet-source-3rd-jemalloc

Author	SHA1	Message	Date
Jason Evans	81b4e6eb6f	Fix a heap profiling regression. Call prof_ctx_set() in all paths through prof_{m,re}alloc(). Inline arena_prof_ctx_get().	2010-10-20 20:52:00 -07:00
Jason Evans	4d6a134e13	Inline the fast path for heap sampling. Inline the heap sampling code that is executed for every allocation event (regardless of whether a sample is taken). Combine all prof TLS data into a single data structure, in order to reduce the TLS lookup volume.	2010-10-20 19:05:59 -07:00
Jason Evans	93443689a4	Add per thread allocation counters, and enhance heap sampling. Add the "thread.allocated" and "thread.deallocated" mallctls, which can be used to query the total number of bytes ever allocated/deallocated by the calling thread. Add s2u() and sa2u(), which can be used to compute the usable size that will result from an allocation request of a particular size/alignment. Re-factor ipalloc() to use sa2u(). Enhance the heap profiler to trigger samples based on usable size, rather than request size. This has a subtle, but important, impact on the accuracy of heap sampling. For example, previous to this change, 16- and 17-byte objects were sampled at nearly the same rate, but 17-byte objects actually consume 32 bytes each. Therefore it was possible for the sample to be somewhat skewed compared to actual memory usage of the allocated objects.	2010-10-20 17:39:18 -07:00
Jason Evans	21fb95bba6	Fix a bug in arena_dalloc_bin_run(). Fix the newsize argument to arena_run_trim_tail() that arena_dalloc_bin_run() passes. Previously, oldsize-newsize (i.e. the complement) was passed, which could erroneously cause dirty pages to be returned to the clean available runs tree. Prior to the CHUNK_MAP_ZEROED --> CHUNK_MAP_UNZEROED conversion, this bug merely caused dirty pages to be unaccounted for (and therefore never get purged), but with CHUNK_MAP_UNZEROED, this could cause dirty pages to be treated as zeroed (i.e. memory corruption).	2010-10-18 17:45:40 -07:00
Jason Evans	088e6a0a37	Fix arena bugs. Split arena_dissociate_bin_run() out of arena_dalloc_bin_run(), so that arena_bin_malloc_hard() can avoid dissociation when recovering from losing a race. This fixes a bug introduced by a recent attempted fix. Fix a regression in arena_ralloc_large_grow() that was introduced by recent fixes.	2010-10-18 00:04:44 -07:00
Jason Evans	8de6a02823	Fix arena bugs. Move part of arena_bin_lower_run() into the callers, since the conditions under which it should be called differ slightly between callers. Fix arena_chunk_purge() to omit run size in the last map entry for each run it temporarily allocates.	2010-10-17 20:57:30 -07:00
Jason Evans	12ca91402b	Add assertions to run coalescing. Assert that the chunk map bits at the ends of the runs that participate in coalescing are self-consistent.	2010-10-17 19:56:09 -07:00
Jason Evans	940a2e02b2	Fix numerous arena bugs. In arena_ralloc_large_grow(), update the map element for the end of the newly grown run, rather than the interior map element that was the beginning of the appended run. This is a long-standing bug, and it had the potential to cause massive corruption, but triggering it required roughly the following sequence of events: 1) Large in-place growing realloc(), with left-over space in the run that followed the large object. 2) Allocation of the remainder run left over from (1). 3) Deallocation of the remainder run before deallocation of the large run, with unfortunate interior map state left over from previous run allocation/deallocation activity, such that one or more pages of allocated memory would be treated as part of the remainder run during run coalescing. In summary, this was a bad bug, but it was difficult to trigger. In arena_bin_malloc_hard(), if another thread wins the race to allocate a bin run, dispose of the spare run via arena_bin_lower_run() rather than arena_run_dalloc(), since the run has already been prepared for use as a bin run. This bug has existed since March 14, 2010: e00572b384c81bd2aba57fac32f7077a34388915 mmap()/munmap() without arena->lock or bin->lock. Fix bugs in arena_dalloc_bin_run(), arena_trim_head(), arena_trim_tail(), and arena_ralloc_large_grow() that could cause the CHUNK_MAP_UNZEROED map bit to become corrupted. These are all long-standing bugs, but the chances of them actually causing problems was much lower before the CHUNK_MAP_ZEROED --> CHUNK_MAP_UNZEROED conversion. Fix a large run statistics regression in arena_ralloc_large_grow() that was introduced on September 17, 2010: 8e3c3c61b5bb676a705450708e7e79698cdc9e0c Add {,r,s,d}allocm(). Add debug code to validate that supposedly pre-zeroed memory really is.	2010-10-17 17:52:14 -07:00
Jason Evans	397e5111b5	Preserve CHUNK_MAP_UNZEROED for small runs. Preserve CHUNK_MAP_UNZEROED when allocating small runs, because it is possible that untouched pages will be returned to the tree of clean runs, where the CHUNK_MAP_UNZEROED flag matters. Prior to the conversion from CHUNK_MAP_ZEROED, this was already a bug, but in the worst case extra zeroing occurred. After the conversion, this bug made it possible to incorrectly treat pages as pre-zeroed.	2010-10-16 16:19:10 -07:00
Jason Evans	004ed142a6	Fix a regression in CHUNK_MAP_UNZEROED change. Fix a regression added by revision: 3377ffa1f4f8e67bce1e36624285e5baf5f9ecef Change CHUNK_MAP_ZEROED to CHUNK_MAP_UNZEROED. A modified chunk->map dereference was missing the subtraction of map_bias, which caused incorrect chunk map initialization, as well as potential corruption of the first non-header page of memory within each chunk.	2010-10-14 00:28:31 -07:00
Jason Evans	ac6f3c2bb5	Re-organize prof-libgcc configuration. Re-organize code for --enable-prof-libgcc so that configure doesn't report both libgcc and libunwind support as being configured in. This change has no impact on how jemalloc is actually configured/built.	2010-10-07 11:59:12 -07:00
Jason Evans	9f3b0a74fd	Fix tests build when --with-install-suffix is set. Add test/jemalloc_test.h.in, which is processed to include jemalloc/jemalloc@install_suffix@.h, so that test programs can include it without worrying about the install suffix.	2010-10-07 09:53:26 -07:00
Jason Evans	1506a1b903	Move variable declaration out of for loop header. Move a loop variable declaration out of for(usigned i = 0; ...) in order to avoid the need for C99 compilation.	2010-10-07 08:52:32 -07:00
Jason Evans	c6e950665c	Increase PRN 'a' and 'c' constants. Increase PRN 'a' and 'c' constants, so that high bits tend to cascade more.	2010-10-03 00:22:46 -07:00
Jason Evans	9ce3bfd92d	Fix leak context count reporting. Fix a bug in leak context count reporting that tended to cause the number of contexts to be underreported. The reported number of leaked objects and bytes were not affected by this bug.	2010-10-02 22:39:59 -07:00
Jason Evans	588a32cd84	Increase default backtrace depth from 4 to 128. Increase the default backtrace depth, because shallow backtraces tend to result in confusing pprof output graphs.	2010-10-02 22:38:14 -07:00
Jason Evans	a881cd2c61	Make cumulative heap profile data optional. Add the R option to control whether cumulative heap profile data are maintained. Add the T option to control the size of per thread backtrace caches, primarily because when the R option is specified, backtraces that no longer have allocations associated with them are discarded as soon as no thread caches refer to them.	2010-10-02 21:40:26 -07:00
Jason Evans	4d5c09905e	Print prof-libgcc configure setting.	2010-10-02 21:35:27 -07:00
Jason Evans	3c26a7d68e	Remove malloc_swap_enable(). Remove malloc_swap_enable(), which was obsoleted by the "swap.fds" mallctl. The prototype for malloc_swap_enable() was removed from jemalloc/jemalloc.h, but the function itself was accidentally left in place.	2010-10-02 12:04:41 -07:00
Jason Evans	d65cdfe233	Update pprof from google-perftools 1.6. Import updated pprof from google-perftools 1.6, with a patch applied to fix a division by zero error (see http://code.google.com/p/google-perftools/issues/detail?id=235).	2010-10-02 11:31:36 -07:00
Jason Evans	c2fc8c8b3a	Use offsetof() when sizing dynamic structures. Base dynamic structure size on offsetof(), rather than subtracting the size of the dynamic structure member. Results could differ on systems with strict data structure alignment requirements.	2010-10-01 18:02:43 -07:00
Jason Evans	3377ffa1f4	Change CHUNK_MAP_ZEROED to CHUNK_MAP_UNZEROED. Invert the chunk map bit that tracks whether a page is zeroed, so that for zeroed arena chunks, the interior of the page map does not need to be initialized (as it consists entirely of zero bytes).	2010-10-01 17:53:37 -07:00
Jason Evans	7393f44ff0	Omit chunk header in arena chunk map. Omit the first map_bias elements of the map in arena_chunk_t. This avoids barely spilling over into an extra chunk header page for common chunk sizes.	2010-10-01 17:35:43 -07:00
Jason Evans	37dab02e52	Disable interval-based profile dumps by default. It is common to have to specify something like JEMALLOC_OPTIONS=F31i, because interval-based dumps are often unuseful or too expensive. Therefore, disable interval-based dumps by default. To get the previous default behavior it is now necessary to specify 31I as part of the options.	2010-09-30 17:10:17 -07:00
Jason Evans	6005f0710c	Add the "arenas.purge" mallctl.	2010-09-30 16:55:08 -07:00
Jason Evans	075e77cad4	Fix compiler warnings and errors. Use INT_MAX instead of MAX_INT in ALLOCM_ALIGN(), and #include <limits.h> in order to get its definition. Modify prof code related to hash tables to avoid aliasing warnings from gcc 4.1.2 (gcc 4.4.0 and 4.4.3 do not warn).	2010-09-20 19:53:25 -07:00
Jason Evans	355b438c85	Fix compiler warnings. Add --enable-cc-silence, which can be used to silence harmless warnings. Fix an aliasing bug in ckh_pointer_hash().	2010-09-20 19:20:48 -07:00
Jason Evans	6a0d2918ce	Add memalign() and valloc() overrides. If memalign() and/or valloc() are present on the system, override them in order to avoid mixed allocator usage.	2010-09-20 16:52:41 -07:00
Jason Evans	a09f55c87d	Wrap strerror_r(). Create the buferror() function, which wraps strerror_r(). This is necessary because glibc provides a non-standard strerror_r().	2010-09-20 16:05:41 -07:00
Jason Evans	28177d466f	Remove bad assertions in malloc_{pre,post}fork(). Remove assertions that malloc_{pre,post}fork() are only called if threading is enabled. This was true of these functions in the context of FreeBSD's libc, but now the functions are called unconditionally as a result of registering them with pthread_atfork().	2010-09-20 11:24:24 -07:00
Jason Evans	79d660d35d	Store full git GID in VERSION.	2010-09-17 17:38:24 -07:00
Jason Evans	a094babe33	Add gcc attributes for *allocm() prototypes.	2010-09-17 17:35:42 -07:00
Jason Evans	8e3c3c61b5	Add {,r,s,d}allocm(). Add allocm(), rallocm(), sallocm(), and dallocm(), which are a functional superset of malloc(), calloc(), posix_memalign(), malloc_usable_size(), and free().	2010-09-17 15:46:18 -07:00
Jason Evans	4cc6a60a4f	Update modification date in man page.	2010-09-11 23:40:24 -07:00
Jason Evans	8d7a94b275	Fix porting regressions. Fix new build failures and test failures on Linux that were introduced by the port to OS X.	2010-09-11 23:38:12 -07:00
Jason Evans	7e11b389aa	Move size class table to man page. Move the table of size classes from jemalloc.c to the manual page. When manually formatting the manual page, it is now necessary to use: nroff -man -t jemalloc.3	2010-09-11 22:52:16 -07:00
Jason Evans	58a6f5c9be	Add posix_memalign test.	2010-09-11 20:59:16 -07:00
Jason Evans	2dbecf1f62	Port to Mac OS X. Add Mac OS X support, based in large part on the OS X support in Mozilla's version of jemalloc.	2010-09-11 18:20:16 -07:00
Jason Evans	b267d0f86a	Add the thread.arena mallctl. Make it possible for each thread to manage which arena it is associated with. Implement the 'tests' and 'check' build targets.	2010-08-13 17:36:00 -07:00
Jason Evans	dcd15098a8	Move assert() calls up in arena_run_reg_alloc(). Move assert() calls up in arena_run_reg_alloc(), so that a corrupt pointer will likely be caught by an assertion before it is dereferenced.	2010-08-05 12:13:42 -07:00
Jason Evans	2541e1b083	Add a missing mutex unlock in malloc_init_hard(). If multiple threads race to initialize malloc, the loser(s) busy-wait until initialization is complete. Add a missing mutex lock so that the loser(s) properly release the initialization mutex. Under some race conditions, this flaw could have caused one or more threads to become permanently blocked. Reported by Terrell Magee.	2010-07-22 11:35:59 -07:00
Jason Evans	b43b7750a6	Fix the libunwind version of prof_backtrace(). Fix the libunwind version of prof_backtrace() to set the backtrace depth for all possible code paths. This fixes the zero-length backtrace problem when using libunwind.	2010-06-04 15:10:43 -07:00
Jason Evans	7013d10a9e	Avoid unnecessary isalloc() calls. When heap profiling is enabled but deactivated, there is no need to call isalloc(ptr) in prof_{malloc,realloc}(). Avoid these calls, so that profiling overhead under such conditions is negligible.	2010-05-11 18:17:02 -07:00
Jason Evans	ed3d152ea0	Fix next_arena initialization. If there is more than one arena, initialize next_arena so that the first and second threads to allocate memory use arenas 0 and 1, rather than both using arena 0.	2010-05-11 12:00:22 -07:00
Jordan DeLong	2206e1acc1	Add MAP_NORESERVE support. Add MAP_NORESERVE to the chunk_mmap() case being used by chunk_swap_enable(), if the system supports it.	2010-05-11 11:46:53 -07:00
Jason Evans	ecea0f6125	Fix junk filling of cached large objects. Use the size argument to tcache_dalloc_large() to control the number of bytes set to 0x5a when junk filling is enabled, rather than accessing a non-existent arena bin. This bug was capable of corrupting an arbitrarily large memory region, depending on what followed the arena data structure in memory (typically zeroed memory, another arena_t, or a red-black tree node for a huge object).	2010-04-28 12:00:59 -07:00
Jason Evans	5055f4516c	Fix tcache crash during thread cleanup. Properly maintain tcache_bin_t's avail pointer such that it is NULL if no objects are cached. This only caused problems during thread cache destruction, since cache flushing otherwise never occurs on an empty bin.	2010-04-14 11:27:13 -07:00
Jason Evans	38cda690dd	Fix profiling regression caused by bugfix. Properly set the context associated with each allocated object, even when the object is not sampled. Remove debug print code that slipped in.	2010-04-14 11:24:45 -07:00
Jason Evans	6d68ed6492	Remove autom4te.cache in distclean (not relclean).	2010-04-13 22:01:55 -07:00
Jason Evans	8d4203c72d	Fix arena chunk purge/dealloc race conditions. Fix arena_chunk_dealloc() to put the new spare in a consistent state before dropping the arena mutex to deallocate the previous spare. Fix arena_run_dalloc() to insert a newly dirtied chunk into the chunks_dirty list before potentially deallocating the chunk, so that dirty page accounting is self-consistent.	2010-04-13 21:17:18 -07:00

1 2 3 4 5

219 Commits