project-base/server-skynet-source-3rd-jemalloc

Author	SHA1	Message	Date
Jason Evans	61d26425e5	Fix JSON-mode output for !config_stats and/or !config_prof cases. These bugs were introduced by b599b32280e1142856b0b96293a71e1684b1ccfb (Add "J" (JSON) support to malloc_stats_print().), which was first released in 4.3.0. This resolves #615.	2017-02-26 11:04:58 -08:00
Jason Evans	adae7cfc4a	Fix chunk_alloc_dss() regression. Fix chunk_alloc_dss() to account for bytes that are not a multiple of the chunk size. This regression was introduced by e2bcf037d445a84a71c7997670819ebd0a893b4a (Make dss operations lockless.), which was first released in 4.3.0.	2017-02-26 10:53:26 -08:00
Qi Wang	c2323e13a5	Get rid of witness in malloc_mutex_t when !(configured w/ debug). We don't touch witness at all when config_debug == false. Let's only pay the memory cost in malloc_mutex_s when needed. Note that when !config_debug, we keep the field in a union so that we don't have to do #ifdefs in multiple places.	2017-02-24 09:41:29 -08:00
Jason Evans	08c24e7c1a	Relax witness assertions related to prof_gdump(). In some cases the prof machinery allocates (in order to modify the bt2gctx hash table), and such operations are synchronized via bt2gctx_mtx. Rather than asserting that no locks are held on entry into functions that may call prof_gdump(), make the weaker assertion that no "core" locks are held. The prof machinery enqueues dumps triggered by prof_gdump() calls when bt2gctx_mtx is held, so this weakened assertion avoids false failures in such cases.	2017-02-23 10:08:42 -08:00
Jason Evans	f56cb9a68e	Add witness_assert_depth[_to_rank](). This makes it possible to make lock state assertions about precisely which locks are held.	2017-02-23 10:08:42 -08:00
Jason Evans	7034e6baa1	Enable mutex witnesses even when !isthreaded. This fixes interactions with witness_assert_depth[_to_rank](), which was added in dad74bd3c811ca2b1af1fd57b28f2456da5ba08b (Convert witness_assert_lockless() to witness_assert_lock_depth().).	2017-02-23 10:08:42 -08:00
David Goldblatt	44e50041dc	CI: Run --enable-debug builds on windows. This will hopefully catch some windows-specific bugs.	2017-02-23 10:06:15 -08:00
Jason Evans	e85e588e45	Use MALLOC_CONF rather than malloc_conf for tests. malloc_conf does not reliably work with MSVC, which complains of "inconsistent dll linkage", i.e. its inability to support the application overriding malloc_conf when dynamically linking/loading. Work around this limitation by adding test harness support for per test shell script sourcing, and converting all tests to use MALLOC_CONF instead of malloc_conf.	2017-02-23 10:06:15 -08:00
Jason Evans	3ecc3c8486	Fix/refactor tcaches synchronization. Synchronize tcaches with tcaches_mtx rather than ctl_mtx. Add missing synchronization for tcache flushing. This bug was introduced by 1cb181ed632e7573fb4eab194e4d216867222d27 (Implement explicit tcache support.), which was first released in 4.0.0.	2017-02-23 10:06:15 -08:00
Jason Evans	de49674fbd	Use MALLOC_CONF rather than malloc_conf for tests. malloc_conf does not reliably work with MSVC, which complains of "inconsistent dll linkage", i.e. its inability to support the application overriding malloc_conf when dynamically linking/loading. Work around this limitation by adding test harness support for per test shell script sourcing, and converting all tests to use MALLOC_CONF instead of malloc_conf.	2017-02-23 08:57:02 -08:00
Jason Evans	fdba5ad5cc	Repair file permissions. This regression was caused by 8f61fdedb908c29905103b22dda32ceb29cd8ede (Uniformly cast mallctl[bymib]() oldp/newp arguments to (void *).). This resolves #538.	2017-02-22 00:24:32 -08:00
Jason Evans	8ac7937eb5	Remove remainder of mb (memory barrier). This complements 94c5d22a4da7844d0bdc5b370e47b1ba14268af2 (Remove mb.h, which is unused).	2017-02-22 00:24:14 -08:00
Jason Evans	664ef652d9	Avoid -lgcc for heap profiling if unwind.h is missing. This removes an unneeded library dependency when falling back to intrinsics-based backtracing (or failing to enable heap profiling at all).	2017-02-21 12:46:58 -08:00
Jason Evans	54269dc0ed	Remove obsolete arena_maybe_purge() call. Remove a call to arena_maybe_purge() that was necessary for ratio-based purging, but is obsolete in the context of decay-based purging.	2017-02-21 12:46:41 -08:00
Jason Evans	003ca8717f	Move arena_basic_stats_merge() prototype (hygienic cleanup).	2017-02-21 12:46:20 -08:00
David Goldblatt	d4f3f9a03f	Beef up travis CI integration testing. Introduces gen_travis.py, which generates .travis.yml, and updates .travis.yml to be the generated version. The travis build matrix approach doesn't play well with mixing and matching various different environment settings, so we generate every build explicitly, rather than letting them do it for us. To avoid abusing travis resources (and save us time waiting for CI results), we don't test every possible combination of options; we only check up to 2 unusual settings at a time.	2017-02-21 12:45:59 -08:00
Jason Evans	2dfc5b5aac	Disable coalescing of cached extents. Extent splitting and coalescing is a major component of large allocation overhead, and disabling coalescing of cached extents provides a simple and effective hysteresis mechanism. Once two-phase purging is implemented, it will probably make sense to leave coalescing disabled for the first phase, but coalesce during the second phase.	2017-02-16 20:11:50 -08:00
Jason Evans	c1ebfaa673	Optimize extent coalescing. Refactor extent_can_coalesce(), extent_coalesce(), and extent_record() to avoid needlessly repeating extent [de]activation operations.	2017-02-16 20:11:50 -08:00
Jason Evans	b0654b95ed	Fix arena->stats.mapped accounting. Mapped memory increases when extent_alloc_wrapper() succeeds, and decreases when extent_dalloc_wrapper() is called (during purging).	2017-02-16 15:52:11 -08:00
Jason Evans	f8fee6908d	Synchronize arena->decay with arena->decay.mtx. This removes the last use of arena->lock.	2017-02-16 09:39:46 -08:00
Jason Evans	d433471f58	Derive {allocated,nmalloc,ndalloc,nrequests}_large stats. This mildly reduces stats update overhead during normal operation.	2017-02-16 09:39:46 -08:00
Jason Evans	ab25d3c987	Synchronize arena->tcache_ql with arena->tcache_ql_mtx. This replaces arena->lock synchronization.	2017-02-16 09:39:46 -08:00
Jason Evans	6b5cba4191	Convert arena->stats synchronization to atomics.	2017-02-16 09:39:46 -08:00
Jason Evans	fa2d64c94b	Convert arena->prof_accumbytes synchronization to atomics.	2017-02-16 09:39:46 -08:00
Jason Evans	b779522b9b	Convert arena->dss_prec synchronization to atomics.	2017-02-16 09:39:46 -08:00
Jason Evans	0721b895ff	Do not generate unused tsd_*_[gs]et() functions. This avoids a gcc diagnostic note: "note: The ABI for passing parameters with 64-byte alignment has changed in GCC 4.6". This note related to the cacheline alignment of rtree_ctx_t, which was introduced by 4a346f55939af4f200121cc4454089592d952f18 (Replace rtree path cache with LRU cache.).	2017-02-13 10:47:16 -08:00
Jason Evans	cd2501efd6	Fix extent_alloc_dss() regression. Fix extent_alloc_dss() to account for bytes that are not a multiple of the page size. This regression was introduced by 577d4572b0821a15e5370f9bf566d884b7cf707c (Make dss operations lockless.), which was first released in 4.3.0.	2017-02-10 14:06:31 -08:00
Jason Evans	6b8ef771a9	Fix rtree_subkey() regression. Fix rtree_subkey() to use uintptr_t rather than unsigned for key bitmasking. This regression was introduced by 4a346f55939af4f200121cc4454089592d952f18 (Replace rtree path cache with LRU cache.).	2017-02-10 09:05:02 -08:00
Jason Evans	7f55dbef9b	Enable mutex witnesses even when !isthreaded. This fixes interactions with witness_assert_depth[_to_rank](), which was added in d0e93ada51e20f4ae394ff4dbdcf96182767c89c (Add witness_assert_depth[_to_rank]().).	2017-02-09 17:05:47 -08:00
Jason Evans	db7da56359	Spin adaptively in rtree_elm_acquire().	2017-02-08 18:50:03 -08:00
Jason Evans	de8a68e853	Enhance spin_adaptive() to yield after several iterations. This avoids worst case behavior if e.g. another thread is preempted while owning the resource the spinning thread is waiting for.	2017-02-08 18:50:03 -08:00
Jason Evans	5f11830754	Replace spin_init() with SPIN_INITIALIZER.	2017-02-08 18:50:03 -08:00
Jason Evans	650c070e10	Remove rtree support for 0 (NULL) keys. NULL can never actually be inserted in practice, and removing support allows a branch to be removed from the fast path.	2017-02-08 18:50:03 -08:00
Jason Evans	f5cf9b19c8	Determine rtree levels at compile time. Rather than dynamically building a table to aid per level computations, define a constant table at compile time. Omit both high and low insignificant bits. Use one to three tree levels, depending on the number of significant bits.	2017-02-08 18:50:03 -08:00
Jason Evans	ff4db5014e	Remove rtree leading 0 bit optimization. A subsequent change instead ignores insignificant high bits.	2017-02-08 18:50:03 -08:00
Jason Evans	cdc240d501	Make non-essential inline rtree functions static functions.	2017-02-08 18:50:03 -08:00
Jason Evans	c511a44e99	Split rtree_elm_lookup_hard() out of rtree_elm_lookup(). Anything but a hit in the first element of the lookup cache is expensive enough to negate the benefits of inlining.	2017-02-08 18:50:03 -08:00
Jason Evans	4a346f5593	Replace rtree path cache with LRU cache. Rework rtree_ctx_t to encapsulate an rtree leaf LRU lookup cache rather than a single-path element lookup cache. The replacement is logically much simpler, as well as slightly faster in the fast path case and less prone to degraded performance during non-trivial sequences of lookups.	2017-02-08 18:50:03 -08:00
Jason Evans	0ecf692726	Optimize a branch out of rtree_read() if !dependent.	2017-02-08 18:50:03 -08:00
Jason Evans	3bd6d8e41d	Conditionalize lg_tcache_max use on JEMALLOC_TCACHE.	2017-02-07 12:15:36 -08:00
Jason Evans	5177995530	Fix extent_record(). Read adjacent rtree elements while holding element locks, since the extents mutex only protects against relevant like-state extent mutation. Fix management of the 'coalesced' loop state variable to merge forward/backward results, rather than overwriting the result of forward coalescing if attempting to coalesce backward. In practice this caused no correctness issues, but could cause extra iterations in rare cases. These regressions were introduced by d27f29b468ae3e9d2b1da4a9880351d76e5a1662 (Disentangle arena and extent locking.).	2017-02-06 20:05:49 -08:00
Jason Evans	6737d5f61e	Fix a race in extent_grow_retained(). Set extent as active prior to registration so that other threads can't modify it in the absence of locking. This regression was introduced by d27f29b468ae3e9d2b1da4a9880351d76e5a1662 (Disentangle arena and extent locking.), via non-obvious means. Removal of extents_mtx protection during extent_grow_retained() execution opened up the race, but in the presence of that locking, the code was safe. This resolves #599.	2017-02-04 12:15:13 -08:00
Jason Evans	1bac516aaa	Optimize compute_size_with_overflow(). Do not check for overflow unless it is actually a possibility.	2017-02-03 19:13:05 -08:00
Jason Evans	767ffa2b5f	Fix compute_size_with_overflow(). Fix compute_size_with_overflow() to use a high_bits mask that has the high bits set, rather than the low bits. This regression was introduced by 5154ff32ee8c37bacb6afd8a07b923eb33228357 (Unify the allocation paths).	2017-02-03 19:13:05 -08:00
Jason Evans	d27f29b468	Disentangle arena and extent locking. Refactor arena and extent locking protocols such that arena and extent locks are never held when calling into the extent_*_wrapper() API. This requires extra care during purging since the arena lock no longer protects the inner purging logic. It also requires extra care to protect extents from being merged with adjacent extents. Convert extent_t's 'active' flag to an enumerated 'state', so that retained extents are explicitly marked as such, rather than depending on ring linkage state. Refactor the extent collections (and their synchronization) for cached and retained extents into extents_t. Incorporate LRU functionality to support purging. Incorporate page count accounting, which replaces arena->ndirty and arena->stats.retained. Assert that no core locks are held when entering any internal [de]allocation functions. This is in addition to existing assertions that no locks are held when entering external [de]allocation functions. Audit and document synchronization protocols for all arena_t fields. This fixes a potential deadlock due to recursive allocation during gdump, in a similar fashion to b49c649bc18fff4bd10a1c8adbaf1f25f6453cb6 (Fix lock order reversal during gdump.), but with a necessarily much broader code impact.	2017-02-01 16:43:46 -08:00
Jason Evans	1b6e43507e	Fix/refactor tcaches synchronization. Synchronize tcaches with tcaches_mtx rather than ctl_mtx. Add missing synchronization for tcache flushing. This bug was introduced by 1cb181ed632e7573fb4eab194e4d216867222d27 (Implement explicit tcache support.), which was first released in 4.0.0.	2017-02-01 16:43:46 -08:00
Jason Evans	d0e93ada51	Add witness_assert_depth[_to_rank](). This makes it possible to make lock state assertions about precisely which locks are held.	2017-02-01 16:43:46 -08:00
Jason Evans	ace679ce74	Synchronize extent_grow_next accesses. This should have been part of 411697adcda2fd75e135cdcdafb95f2bd295dc7f (Use exponential series to size extents.), which introduced extent_grow_next.	2017-02-01 16:43:46 -08:00
Jason Evans	5033a9176a	Call prof_gctx_create() without owning bt2gctx_mtx. This reduces the probability of allocating (and thereby indirectly making a system call) while owning bt2gctx_mtx. Unfortunately it is an incomplete solution, because ckh insertion/deletion can also allocate/deallocate, which requires more extensive changes to address.	2017-02-01 16:43:46 -08:00
Jason Evans	397f54aa46	Conditionalize prof fork handling on config_prof. This allows the compiler to completely remove dead code.	2017-02-01 16:43:46 -08:00

2026 Commits