server-skynet-source-3rd-jemalloc

Author	SHA1	Message	Date
Qi Wang	57ed894f8a	Fix arena_bind(). When tsd is not in nominal state (e.g. during thread termination), we should not increment nthreads.	2016-09-23 14:39:29 -07:00
Jason Evans	fa09fe798a	Fix rallocx() sampling code to not eagerly commit sampler update. rallocx() for an alignment-constrained request may end up with a smaller-than-worst-case size if in-place reallocation succeeds due to serendipitous alignment. In such cases, sampling may not happen.	2016-06-08 10:14:25 -07:00
Elliot Ronaghan	9de0094e6e	Fix a Valgrind regression in calloc(). This regression was caused by `3ef51d7f73` (Optimize the fast paths of calloc() and [m,d,sd]allocx().).	2016-06-07 14:27:24 -07:00
Jason Evans	c1e00ef2a6	Resolve bootstrapping issues when embedded in FreeBSD libc. `b2c0d6322d` (Add witness, a simple online locking validator.) caused a broad propagation of tsd throughout the internal API, but tsd_fetch() was designed to fail prior to tsd bootstrapping. Fix this by splitting tsd_t into non-nullable tsd_t and nullable tsdn_t, and modifying all internal APIs that do not critically rely on tsd to take nullable pointers. Furthermore, add the tsd_booted_get() function so that tsdn_fetch() can probe whether tsd bootstrapping is complete and return NULL if not. All dangerous conversions of nullable pointers are tsdn_tsd() calls that assert-fail on invalid conversion.	2016-05-10 22:51:33 -07:00
Jason Evans	0c12dcabc5	Fix tsd bootstrapping for a0malloc().	2016-05-07 16:55:36 -07:00
Jason Evans	3ef51d7f73	Optimize the fast paths of calloc() and [m,d,sd]allocx(). This is a broader application of optimizations to malloc() and free() in `f4a0f32d34` (Fast-path improvement: reduce # of branches and unnecessary operations.). This resolves #321.	2016-05-06 14:37:39 -07:00
Jason Evans	c2f970c32b	Modify pages_map() to support mapping uncommitted virtual memory. If the OS overcommits: - Commit all mappings in pages_map() regardless of whether the caller requested committed memory. - Linux-specific: Specify MAP_NORESERVE to avoid unfortunate interactions with heuristic overcommit mode during fork(2). This resolves #193.	2016-05-05 18:56:17 -07:00
Jason Evans	108c4a11e9	Fix witness/fork() interactions. Fix witness to clear its list of owned mutexes in the child if platform-specific malloc_mutex code re-initializes mutexes rather than unlocking them.	2016-04-26 10:47:22 -07:00
Jason Evans	174c0c3a9c	Fix fork()-related lock rank ordering reversals.	2016-04-25 23:16:20 -07:00
Jason Evans	259f8ebbfc	Fix arena_choose_hard() regression. This regression was caused by `66cd953514` (Do not allocate metadata via non-auto arenas, nor tcaches.).	2016-04-22 22:21:31 -07:00
Jason Evans	66cd953514	Do not allocate metadata via non-auto arenas, nor tcaches. This assures that all internally allocated metadata come from the first opt_narenas arenas, i.e. the automatically multiplexed arenas.	2016-04-22 15:19:59 -07:00
Jason Evans	b2c0d6322d	Add witness, a simple online locking validator. This resolves #358.	2016-04-14 02:09:28 -07:00
Jason Evans	39f58755a7	Fix a potential tsd cleanup leak. Prior to `767d85061a` (Refactor arenas array (fixes deadlock).), it was possible under some circumstances for arena_get() to trigger recreation of the arenas cache during tsd cleanup, and the arenas cache would then be leaked. In principle a similar issue could still occur as a side effect of decay-based purging, which calls arena_tdata_get(). Fix arenas_tdata_cleanup() by setting tsd->arenas_tdata_bypass to true, so that arena_tdata_get() will gracefully fail (an expected behavior) rather than recreating tsd->arena_tdata. Reported by Christopher Ferris <cferris@google.com>.	2016-02-27 21:18:15 -08:00
Jason Evans	9d2c10f2e8	Add more HUGE_MAXCLASS overflow checks. Add HUGE_MAXCLASS overflow checks that are specific to heap profiling code paths. This fixes test failures that were introduced by `0c516a00c4` (Make *allocx() size class overflow behavior defined.).	2016-02-25 16:42:15 -08:00
Jason Evans	0c516a00c4	Make *allocx() size class overflow behavior defined. Limit supported size and alignment to HUGE_MAXCLASS, which in turn is now limited to be less than PTRDIFF_MAX. This resolves #278 and #295.	2016-02-25 15:29:49 -08:00
Jason Evans	767d85061a	Refactor arenas array (fixes deadlock). Refactor the arenas array, which contains pointers to all extant arenas, such that it starts out as a sparse array of maximum size, and use double-checked atomics-based reads as the basis for fast and simple arena_get(). Additionally, reduce arenas_lock's role such that it only protects against arena initialization races. These changes remove the possibility for arena lookups to trigger locking, which resolves at least one known (fork-related) deadlock. This resolves #315.	2016-02-24 23:58:10 -08:00
Jason Evans	9e1810ca9d	Silence miscellaneous 64-to-32-bit data loss warnings.	2016-02-24 13:03:48 -08:00
Jason Evans	0931cecbfa	Use ssize_t for readlink() rather than int.	2016-02-24 13:03:48 -08:00
Jason Evans	8f683b94a7	Make opt_narenas unsigned rather than size_t.	2016-02-24 13:03:48 -08:00
Jason Evans	9bad079039	Refactor time_* into nstime_*. Use a single uint64_t in nstime_t to store nanoseconds rather than using struct timespec. This reduces fragility around conversions between long and uint64_t, especially missing casts that only cause problems on 32-bit platforms.	2016-02-21 21:39:05 -08:00
Jason Evans	243f7a0508	Implement decay-based unused dirty page purging. This is an alternative to the existing ratio-based unused dirty page purging, and is intended to eventually become the sole purging mechanism. Add mallctls: - opt.purge - opt.decay_time - arena.<i>.decay - arena.<i>.decay_time - arenas.decay_time - stats.arenas.<i>.decay_time This resolves #325.	2016-02-19 20:56:21 -08:00
Jason Evans	db927b6727	Refactor arenas_cache tsd. Refactor arenas_cache tsd into arenas_tdata, which is a structure of type arena_tdata_t.	2016-02-19 20:32:37 -08:00
Jason Evans	f829009929	Add --with-malloc-conf. Add --with-malloc-conf, which makes it possible to embed a default options string during configuration.	2016-02-19 20:29:06 -08:00
Cosmin Paraschiv	9cb481a73f	Call malloc_tsd_boot0() from malloc_init_hard_recursible(). When using LinuxThreads, malloc bootstrapping deadlocks, since malloc_tsd_boot0() ends up calling pthread_setspecific(), which causes recursive allocation. Fix it by moving the malloc_tsd_boot0() call to malloc_init_hard_recursible(). The deadlock was introduced by `8bb3198f72` (Refactor/fix arenas manipulation.), when tsd_boot() was split and the top half, tsd_boot0(), got an extra tsd_wrapper_set() call.	2016-01-11 11:10:39 -08:00
Qi Wang	f4a0f32d34	Fast-path improvement: reduce # of branches and unnecessary operations. - Combine multiple runtime branches into a single malloc_slow check. - Avoid calling arena_choose / size2index / index2size on fast path. - A few micro optimizations.	2015-11-10 14:28:34 -08:00
Jason Evans	21523297fc	Add mallocx() OOM tests.	2015-09-17 15:27:28 -07:00
Jason Evans	3263be6efb	Simplify imallocx_prof_sample(). Simplify imallocx_prof_sample() to always operate on usize rather than sometimes using size. This avoids redundant usize computations and more closely fits the style adopted by i[rx]allocx_prof_sample() to fix sampling bugs.	2015-09-17 10:19:28 -07:00
Jason Evans	4be9c79f88	Fix irallocx_prof_sample(). Fix irallocx_prof_sample() to always allocate large regions, even when alignment is non-zero.	2015-09-17 10:17:55 -07:00
Jason Evans	38e2c8fa9c	Fix ixallocx_prof_sample(). Fix ixallocx_prof_sample() to never modify nor create sampled small allocations. xallocx() is in general incapable of moving small allocations, so this fix removes buggy code without loss of generality.	2015-09-17 10:05:56 -07:00
Jason Evans	9a505b768c	Centralize xallocx() size[+extra] overflow checks.	2015-09-15 14:39:58 -07:00
Jason Evans	8c485b02a6	Fix ixallocx_prof() to check for size greater than HUGE_MAXCLASS.	2015-09-15 00:51:09 -07:00
Jason Evans	708ed79834	Resolve an unsupported special case in arena_prof_tctx_set(). Add arena_prof_tctx_reset() and use it instead of arena_prof_tctx_set() when resetting the tctx pointer during reallocation, which happens whenever an originally sampled reallocated object is not sampled during reallocation. This regression was introduced by `594c759f37` (Optimize arena_prof_tctx_set().)	2015-09-14 23:57:58 -07:00
Jason Evans	23f6e103c8	Fix ixallocx_prof_sample() argument order reversal. Fix ixallocx_prof() to pass usize_max and zero to ixallocx_prof_sample() in the correct order.	2015-09-14 23:57:09 -07:00
Jason Evans	ce9a4e3479	s/max_usize/usize_max/g	2015-09-14 23:55:54 -07:00
Jason Evans	d9704042ee	s/oldptr/old_ptr/g	2015-09-14 23:55:54 -07:00
Jason Evans	cec0d63d8b	Make one call to prof_active_get_unlocked() per allocation event. Make one call to prof_active_get_unlocked() per allocation event, and use the result throughout the relevant functions that handle an allocation event. Also add a missing check in prof_realloc(). These fixes protect allocation events against concurrent prof_active changes.	2015-09-14 23:55:48 -07:00
Jason Evans	ef363de701	Fix irealloc_prof() to prof_alloc_rollback() on OOM.	2015-09-14 23:54:42 -07:00
Jason Evans	46ff049128	Optimize irallocx_prof() to optimistically update the sampler state.	2015-09-14 22:47:18 -07:00
Jason Evans	4acb6c7ff3	Fix ixallocx_prof() size+extra overflow. Fix ixallocx_prof() to clamp the extra parameter if size+extra would overflow HUGE_MAXCLASS.	2015-09-14 22:47:12 -07:00
Mike Hommey	0a116faf95	Force initialization of the init_lock in malloc_init_hard on Windows XP. This resolves #269.	2015-09-04 10:35:20 -07:00
Jason Evans	30949da601	Fix arenas_cache_cleanup() and arena_get_hard(). Fix arenas_cache_cleanup() and arena_get_hard() to handle allocation/deallocation within the application's thread-specific data cleanup functions even after arenas_cache is torn down. This is a more general fix that complements `45e9f66c28` (Fix arenas_cache_cleanup().).	2015-08-27 20:32:35 -07:00
Christopher Ferris	45e9f66c28	Fix arenas_cache_cleanup(). Fix arenas_cache_cleanup() to handle allocation/deallocation within the application's thread-specific data cleanup functions even after arenas_cache is torn down.	2015-08-21 12:33:17 -07:00
Matthijs	c1a6a51e40	MSVC compatibility changes - Decorate public function with __declspec(allocator) and __declspec(restrict), just like MSVC 1900 - Support JEMALLOC_HAS_RESTRICT by defining the restrict keyword - Move __declspec(nothrow) between 'void' and '*' so it compiles once more	2015-08-04 09:01:48 -07:00
Jason Evans	00632609df	Move JEMALLOC_NOTHROW just after return type. Only use __declspec(nothrow) in C++ mode. This resolves #244.	2015-07-21 08:21:13 -07:00
Mike Hommey	50cd636eed	Remove JEMALLOC_ALLOC_SIZE annotations on functions not returning pointers. As per gcc documentation: The alloc_size attribute is used to tell the compiler that the function return value points to memory (...). This resolves #245.	2015-07-21 09:16:07 +09:00
Jason Evans	ae93d6bf36	Avoid function prototype incompatibilities. Add various function attributes to the exported functions to give the compiler more information to work with during optimization, and also specify throw() when compiling with C++ on Linux, in order to adequately match what __THROW does in glibc. This resolves #237.	2015-07-10 16:09:40 -07:00
Matthijs	a1aaf949a5	Optimizations for Windows - Set opt_lg_chunk based on run-time OS setting - Verify LG_PAGE is compatible with run-time OS setting - When targeting Windows Vista or newer, use SRWLOCK instead of CRITICAL_SECTION - When targeting Windows Vista or newer, statically initialize init_lock	2015-06-25 22:53:58 +02:00
Jason Evans	241abc601b	Fix size class overflow handling when profiling is enabled. Fix size class overflow handling for malloc(), posix_memalign(), memalign(), calloc(), and realloc() when profiling is enabled. Remove an assertion that erroneously caused arena_sdalloc() to fail when profiling was enabled. This resolves #232.	2015-06-23 18:56:14 -07:00
Jason Evans	dc0610a714	Add alignment assertions to public aligned allocation functions.	2015-06-22 18:48:58 -07:00
Jason Evans	8a03cf039c	Implement cache index randomization for large allocations. Extract szad size quantization into {extent,run}_quantize(), and quantize szad run sizes to the union of valid small region run sizes and large run sizes. Refactor iteration in arena_run_first_fit() to use run_quantize{,_first,_next}(), and add support for padded large runs. For large allocations that have no specified alignment constraints, compute a pseudo-random offset from the beginning of the first backing page that is a multiple of the cache line size. Under typical configurations with 4-KiB pages and 64-byte cache lines this results in a uniform distribution among 64 page boundary offsets. Add the --disable-cache-oblivious option, primarily intended for performance testing. This resolves #13.	2015-05-06 13:27:39 -07:00
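
The arithmetic in the cache index randomization entry (`8a03cf039c`) can be sketched as follows. This is only an illustration of the "4-KiB pages and 64-byte cache lines give 64 offsets" statement, not jemalloc's implementation: the macro names `PAGE_SIZE` and `CACHELINE_SIZE` and the helpers `offset_count()` and `random_offset()` are hypothetical, and the pseudo-random input `r` is assumed to come from some generator supplied by the caller.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/* Assumed values, taken from the "typical configuration" in the commit message. */
#define PAGE_SIZE      ((size_t)4096)
#define CACHELINE_SIZE ((size_t)64)

/* Number of distinct cache-line-aligned offsets within one page. */
static inline size_t
offset_count(void)
{
	return PAGE_SIZE / CACHELINE_SIZE;
}

/*
 * Map a pseudo-random value r (hypothetical input) to an offset from the
 * start of the first backing page.  The result is always a multiple of
 * the cache line size and strictly less than the page size.
 */
static inline size_t
random_offset(uint64_t r)
{
	size_t slot;

	assert(offset_count() > 0);
	slot = (size_t)(r % offset_count());
	return slot * CACHELINE_SIZE;
}
```

With the assumed sizes, `offset_count()` evaluates to 64, and because 64 divides 2^64 evenly, a uniformly distributed 64-bit `r` selects each of the 64 offsets with equal probability.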

194 Commits