server-skynet-source-3rd-jemalloc

project-base/server-skynet-source-3rd-jemalloc

Author	SHA1	Message	Date
Qi Wang	f4a0f32d34	Fast-path improvement: reduce # of branches and unnecessary operations. - Combine multiple runtime branches into a single malloc_slow check. - Avoid calling arena_choose / size2index / index2size on fast path. - A few micro optimizations.	2015-11-10 14:28:34 -08:00
Joshua Kahn	e8ab0ab9c0	Add function to destroy tree ex_destroy iterates over the tree using post-order traversal so nodes can be removed and processed by the callback function without paying the cost to rebalance the tree. The destruction process cannot be stopped once started.	2015-11-09 15:56:18 -08:00
Joshua Kahn	13b4015531	Allow const keys for lookup Signed-off-by: Steve Dougherty <sdougherty@barracuda.com> This resolves #281.	2015-11-09 15:48:05 -08:00
Steve Dougherty	bd418ce11e	Assert compact color bit is unused Signed-off-by: Joshua Kahn <jkahn@barracuda.com> This resolves #280.	2015-11-09 15:44:30 -08:00
Jason Evans	a784e411f2	Fix a xallocx(..., MALLOCX_ZERO) bug. Fix xallocx(..., MALLOCX_ZERO to zero the last full trailing page of large allocations that have been randomly assigned an offset of 0 when --enable-cache-oblivious configure option is enabled. This addresses a special case missed in `d260f442ce` (Fix xallocx(..., MALLOCX_ZERO) bugs.).	2015-09-24 22:21:55 -07:00
Craig Rodrigues	66814c1a52	Fix tsd_boot1() to use explicit 'void' parameter list.	2015-09-20 21:57:32 -07:00
Jason Evans	6d91929e52	Address portability issues on Solaris. Don't assume Bourne shell is in /bin/sh when running size_classes.sh . Consider __sparcv9 a synonym for __sparc64__ when defining LG_QUANTUM. This resolves #275.	2015-09-15 10:42:36 -07:00
Jason Evans	708ed79834	Resolve an unsupported special case in arena_prof_tctx_set(). Add arena_prof_tctx_reset() and use it instead of arena_prof_tctx_set() when resetting the tctx pointer during reallocation, which happens whenever an originally sampled reallocated object is not sampled during reallocation. This regression was introduced by `594c759f37` (Optimize arena_prof_tctx_set().)	2015-09-14 23:57:58 -07:00
Jason Evans	ea8d97b897	Fix prof_{malloc,free}_sample_object() call order in prof_realloc(). Fix prof_realloc() to call prof_free_sampled_object() after calling prof_malloc_sample_object(). Prior to this fix, if tctx and old_tctx were the same, the tctx could have been prematurely destroyed.	2015-09-14 23:57:52 -07:00
Jason Evans	cec0d63d8b	Make one call to prof_active_get_unlocked() per allocation event. Make one call to prof_active_get_unlocked() per allocation event, and use the result throughout the relevant functions that handle an allocation event. Also add a missing check in prof_realloc(). These fixes protect allocation events against concurrent prof_active changes.	2015-09-14 23:55:48 -07:00
Jason Evans	676df88e48	Rename arena_maxclass to large_maxclass. arena_maxclass is no longer an appropriate name, because arenas also manage huge allocations.	2015-09-11 20:50:20 -07:00
Jason Evans	560a4e1e01	Fix xallocx() bugs. Fix xallocx() bugs related to the 'extra' parameter when specified as non-zero.	2015-09-11 20:40:34 -07:00
Jason Evans	a00b10735a	Fix "prof.reset" mallctl-related corruption. Fix heap profiling to distinguish among otherwise identical sample sites with interposed resets (triggered via the "prof.reset" mallctl). This bug could cause data structure corruption that would most likely result in a segfault.	2015-09-09 23:16:10 -07:00
Jason Evans	b4330b02a8	Fix pointer comparision with undefined behavior. This didn't cause bad code generation in the one case spot-checked (gcc 4.8.1), but had the potential to to so. This bug was introduced by `594c759f37` (Optimize arena_prof_tctx_set().).	2015-09-04 10:31:41 -07:00
Jason Evans	594c759f37	Optimize arena_prof_tctx_set(). Optimize arena_prof_tctx_set() to avoid reading run metadata when deciding whether it's actually necessary to write.	2015-09-02 14:52:24 -07:00
Jason Evans	b5c2a347d7	Silence compiler warnings for unreachable code. Reported by Ingvar Hagelund.	2015-08-19 23:28:34 -07:00
Jason Evans	d01fd19755	Rename index_t to szind_t to avoid an existing type on Solaris. This resolves #256.	2015-08-19 15:21:32 -07:00
Jason Evans	5ef33a9f2b	Don't bitshift by negative amounts. Don't bitshift by negative amounts when encoding/decoding run sizes in chunk header maps. This affected systems with page sizes greater than 8 KiB. Reported by Ingvar Hagelund <ingvar@redpill-linpro.com>.	2015-08-19 14:16:30 -07:00
Jason Evans	85ae064e96	Fix a comment.	2015-08-13 14:54:06 -07:00
Jason Evans	fead75fd52	Fix gcc build failure (define __has_builtin).	2015-08-12 16:46:09 -07:00
Jason Evans	7928f62273	Check whether gcc version supports __builtin_unreachable().	2015-08-12 16:38:39 -07:00
Jason Evans	694d0829c0	Update list of private symbols.	2015-08-12 13:03:43 -07:00
Jason Evans	1f27abc1b1	Refactor arena_mapbits_{small,large}_set() to not preserve unzeroed. Fix arena_run_split_large_helper() to treat newly committed memory as zeroed.	2015-08-11 16:45:47 -07:00
Jason Evans	6bdeddb697	Fix build failure. This regression was introduced by `de249c8679` (Arena chunk decommit cleanups and fixes.). This resolves #254.	2015-08-10 23:42:33 -07:00
Jason Evans	45186f0c07	Refactor arena_mapbits unzeroed flag management. Only set the unzeroed flag when initializing the entire mapbits entry, rather than mutating just the unzeroed bit. This simplifies the possible mapbits state transitions.	2015-08-10 23:03:34 -07:00
Jason Evans	de249c8679	Arena chunk decommit cleanups and fixes. Decommit arena chunk header during chunk deallocation if the rest of the chunk is decommitted.	2015-08-10 17:13:59 -07:00
Jason Evans	8fadb1a8c2	Implement chunk hook support for page run commit/decommit. Cascade from decommit to purge when purging unused dirty pages, so that it is possible to decommit cleaned memory rather than just purging. For non-Windows debug builds, decommit runs rather than purging them, since this causes access of deallocated runs to segfault. This resolves #251.	2015-08-07 00:50:58 -07:00
Daniel Micay	67c46a9e53	work around _FORTIFY_SOURCE false positive In builds with profiling disabled (default), the opt_prof_prefix array has a one byte length as a micro-optimization. This will cause the usage of write in the unused profiling code to be statically detected as a buffer overflow by Bionic's _FORTIFY_SOURCE implementation as it tries to detect read overflows in addition to write overflows. This works around the problem by informing the compiler that not_reached() means code in unreachable in release builds.	2015-08-04 17:09:43 -04:00
Matthijs	c1a6a51e40	MSVC compatibility changes - Decorate public function with __declspec(allocator) and __declspec(restrict), just like MSVC 1900 - Support JEMALLOC_HAS_RESTRICT by defining the restrict keyword - Move __declspec(nothrow) between 'void' and '*' so it compiles once more	2015-08-04 09:01:48 -07:00
Jason Evans	b49a334a64	Generalize chunk management hooks. Add the "arena.<i>.chunk_hooks" mallctl, which replaces and expands on the "arena.<i>.chunk.{alloc,dalloc,purge}" mallctls. The chunk hooks allow control over chunk allocation/deallocation, decommit/commit, purging, and splitting/merging, such that the application can rely on jemalloc's internal chunk caching and retaining functionality, yet implement a variety of chunk management mechanisms and policies. Merge the chunks_[sz]ad_{mmap,dss} red-black trees into chunks_[sz]ad_retained. This slightly reduces how hard jemalloc tries to honor the dss precedence setting; prior to this change the precedence setting was also consulted when recycling chunks. Fix chunk purging. Don't purge chunks in arena_purge_stashed(); instead deallocate them in arena_unstash_purged(), so that the dirty memory linkage remains valid until after the last time it is used. This resolves #176 and #201.	2015-08-03 21:49:02 -07:00
Jason Evans	d059b9d6a1	Implement support for non-coalescing maps on MinGW. - Do not reallocate huge objects in place if the number of backing chunks would change. - Do not cache multi-chunk mappings. This resolves #213.	2015-07-24 18:39:14 -07:00
Jason Evans	87ccb55547	Fix huge_palloc() to handle size rather than usize input. huge_ralloc() passes a size that may not be precisely a size class, so make huge_palloc() handle the more general case of a size input rather than usize. This regression appears to have been introduced by the addition of in-place huge reallocation; as such it was never incorporated into a release.	2015-07-23 17:18:49 -07:00
Jason Evans	4becdf21dc	Fix sa2u() regression. Take large_pad into account when determining whether an aligned allocation can be satisfied by a large size class. This regression was introduced by `8a03cf039c` (Implement cache index randomization for large allocations.).	2015-07-23 17:14:11 -07:00
Jason Evans	71cd2f08ff	Leave PRI* macros defined after using them to define FMT. Macro expansion happens too late for the #undef directives to work as a mechanism for preventing accidental direct use of the PRI macros.	2015-07-23 15:50:09 -07:00
Jason Evans	5fae7dc1b3	Fix MinGW-related portability issues. Create and use FMT* macros that are equivalent to the PRI* macros that inttypes.h defines. This allows uniform use of the Unix-specific format specifiers, e.g. "%zu", as well as avoiding Windows-specific definitions of e.g. PRIu64. Add ffs()/ffsl() support for compiling with gcc. Extract compatibility definitions of ENOENT, EINVAL, EAGAIN, EPERM, ENOMEM, and ENORANGE into include/msvc_compat/windows_extra.h and use the file for tests as well as for core jemalloc code.	2015-07-23 13:56:25 -07:00
Jason Evans	e42c309eba	Add JEMALLOC_FORMAT_PRINTF(). Replace JEMALLOC_ATTR(format(printf, ...). with JEMALLOC_FORMAT_PRINTF(), so that configuration feature tests can omit the attribute if it would cause extraneous compilation warnings.	2015-07-22 15:44:47 -07:00
Jason Evans	5bd879646c	Change default chunk size from 256 KiB to 2 MiB. This change improves interaction with transparent huge pages, e.g. reduced page faults (at least in the absence of unused dirty page purging).	2015-07-15 17:15:26 -07:00
Jason Evans	aa2826621e	Revert to first-best-fit run/chunk allocation. This effectively reverts `97c04a9383` (Use first-fit rather than first-best-fit run/chunk allocation.). In some pathological cases, first-fit search dominates allocation time, and it also tends not to converge as readily on a steady state of memory layout, since precise allocation order has a bigger effect than for first-best-fit.	2015-07-15 17:15:19 -07:00
Jason Evans	dde067264d	Fix an integer overflow bug in {size2index,s2u}_compute(). This {bug,regression} was introduced by `155bfa7da1` (Normalize size classes.). This resolves #241.	2015-07-09 21:36:33 -07:00
Jason Evans	0313607e66	Fix MinGW build warnings. Conditionally define ENOENT, EINVAL, etc. (was unconditional). Add/use PRIzu, PRIzd, and PRIzx for use in malloc_printf() calls. gcc issued (harmless) warnings since e.g. "%zu" should be "%Iu" on Windows, and the alternative to this workaround would have been to disable the function attributes which cause gcc to look for type mismatches in formatted printing function calls.	2015-07-07 20:10:28 -07:00
Matthijs	a1aaf949a5	Optimizations for Windows - Set opt_lg_chunk based on run-time OS setting - Verify LG_PAGE is compatible with run-time OS setting - When targeting Windows Vista or newer, use SRWLOCK instead of CRITICAL_SECTION - When targeting Windows Vista or newer, statically initialize init_lock	2015-06-25 22:53:58 +02:00
Jason Evans	241abc601b	Fix size class overflow handling when profiling is enabled. Fix size class overflow handling for malloc(), posix_memalign(), memalign(), calloc(), and realloc() when profiling is enabled. Remove an assertion that erroneously caused arena_sdalloc() to fail when profiling was enabled. This resolves #232.	2015-06-23 18:56:14 -07:00
Jason Evans	0a9f9a4d51	Convert arena_maybe_purge() recursion to iteration. This resolves #235.	2015-06-22 18:50:58 -07:00
Jason Evans	713b844bff	Update a comment.	2015-06-15 12:01:05 -07:00
Chi-hung Hsieh	c073f8167a	Fix type errors in C11 versions of atomic_*() functions.	2015-05-27 20:33:18 -07:00
Jason Evans	836bbe9951	Impose a minimum tcache count for small size classes. Now that small allocation runs have fewer regions due to run metadata residing in chunk headers, an explicit minimum tcache count is needed to make sure that tcache adequately amortizes synchronization overhead.	2015-05-19 17:47:16 -07:00
Jason Evans	6591ff09d8	Fix arena_dalloc() performance regression. Take into account large_pad when computing whether to pass the deallocation request to tcache_dalloc_large(), so that the largest cacheable size makes it back to tcache. This regression was introduced by `8a03cf039c` (Implement cache index randomization for large allocations.).	2015-05-19 17:44:45 -07:00
Jason Evans	fd5f9e43c3	Avoid atomic operations for dependent rtree reads.	2015-05-15 17:02:30 -07:00
Jason Evans	c451831264	Fix type punning in calls to atomic operation functions.	2015-05-07 22:35:40 -07:00
Jason Evans	8a03cf039c	Implement cache index randomization for large allocations. Extract szad size quantization into {extent,run}_quantize(), and . quantize szad run sizes to the union of valid small region run sizes and large run sizes. Refactor iteration in arena_run_first_fit() to use run_quantize{,_first,_next(), and add support for padded large runs. For large allocations that have no specified alignment constraints, compute a pseudo-random offset from the beginning of the first backing page that is a multiple of the cache line size. Under typical configurations with 4-KiB pages and 64-byte cache lines this results in a uniform distribution among 64 page boundary offsets. Add the --disable-cache-oblivious option, primarily intended for performance testing. This resolves #13.	2015-05-06 13:27:39 -07:00

1 2 3 4 5 ...

332 Commits