server-skynet-source-3rd-jemalloc

project-base/server-skynet-source-3rd-jemalloc

Author	SHA1	Message	Date
Qi Wang	94a88c26f4	Implement huge arena: opt.huge_threshold. The feature allows using a dedicated arena for huge allocations. We want the addtional arena to separate huge allocation because: 1) mixing small extents with huge ones causes fragmentation over the long run (this feature reduces VM size significantly); 2) with many arenas, huge extents rarely get reused across threads; and 3) huge allocations happen way less frequently, therefore no concerns for lock contention.	2018-06-29 10:35:02 -07:00
Qi Wang	77a71ef2b7	Fall back to the default pthread_create if RTLD_NEXT fails.	2018-06-28 13:18:21 -07:00
David Goldblatt	d1e11d48d4	Move tsd link and in_hook after tcache. This can lead to better cache utilization down the common paths where we don't touch the link.	2018-06-27 13:39:02 -07:00
Qi Wang	0ff7ff3ec7	Optimize ixalloc by avoiding a size lookup.	2018-06-05 21:03:51 -07:00
David Goldblatt	a7f749c9af	Hooks: Protect against reentrancy. Previously, we made the user deal with this themselves, but that's not good enough; if hooks may allocate, we should test the allocation pathways down hooks. If we're doing that, we might as well actually implement the protection for the user.	2018-05-18 11:43:03 -07:00
David Goldblatt	0379235f47	Tests: Shouldn't be able to change global slowness. This can help ensure that we don't leave slowness changes behind in case of resource exhaustion.	2018-05-18 11:43:03 -07:00
David Goldblatt	59e371f463	Hooks: Add a hook exhaustion test. When we run out of space in which to store hooks, we should return EAGAIN from the mallctl, but not otherwise misbehave.	2018-05-18 11:43:03 -07:00
David Goldblatt	126e9a84a5	Hooks: move the "extra" pointer into the hook_t itself. This simplifies the mallctl call to install a hook, which should only take a single argument.	2018-05-18 11:43:03 -07:00
David Goldblatt	cb0707c0fc	Hooks: hook the realloc pathways that move/expand.	2018-05-18 11:43:03 -07:00
David Goldblatt	67270040a5	Hooks: hook the realloc paths that act as pure malloc/free.	2018-05-18 11:43:03 -07:00
David Goldblatt	226327cf66	Hooks: hook the pure-allocation functions.	2018-05-18 11:43:03 -07:00
David Goldblatt	5ae6e7cbfa	Add "hook" module. The hook module allows a low-reader-overhead way of finding hooks to invoke and calling them. For now, none of the allocation pathways are tied into the hooks; this will come later.	2018-05-18 11:43:03 -07:00
David Goldblatt	06a8c40b36	Add the Seq module, a simple seqlock implementation. This allows fast reader-writer concurrency in cases where writers are rare. The immediate use case is for the hooking implementaiton.	2018-05-18 11:43:03 -07:00
David Goldblatt	c7a87e0e0b	Rename hooks module to test_hooks. "Hooks" is really the best name for the module that will contain the publicly exposed hooks. So lets rename the current "hooks" module (that hook external dependencies, for reentrancy testing) to "test_hooks".	2018-05-18 11:43:03 -07:00
David Goldblatt	e870829e64	TSD: Add the ability to enter a global slow path. This gives any thread the ability to send other threads down slow paths the next time they fetch tsd.	2018-05-18 11:43:03 -07:00
David Goldblatt	feff510b9f	TSD: Pull name mangling into a macro.	2018-05-18 11:43:03 -07:00
David Goldblatt	39d6420c0c	TSD: Make state atomic. This will let us change the state of another thread remotely, eventually.	2018-05-18 11:43:03 -07:00
David Goldblatt	982c10de35	TSD: Make all state access happen through a function. Shortly, tsd state will be atomic and have some complicated enough logic down the state-setting path that we should be aware of it.	2018-05-18 11:43:03 -07:00
David Goldblatt	e74a1a37c8	Atomics: Add atomic_u8_t, force-inline operations. We're about to need an atomic uint8_t for state operations. Unfortunately, we're at the point where things won't get inlined into the key methods unless they're force-inlined. This is embarassing and we should do something about it, but in the meantime we'll force-inline a little more when we need to.	2018-05-18 11:43:03 -07:00
Qi Wang	312352faa8	Fix background thread index issues with max_background_threads.	2018-05-15 12:25:23 -07:00
Qi Wang	0fadf4a2e3	Add UNUSED to avoid compiler warnings.	2018-04-16 13:50:21 -07:00
Jason Evans	2a80d6f15b	Avoid a printf format specifier warning. This dodges a warning emitted by the FreeBSD system gcc when compiling libc for architectures which don't use clang as the system compiler.	2018-04-16 11:07:51 -07:00
Dave Watson	8b14f3abc0	background_thread: add max thread count config Looking at the thread counts in our services, jemalloc's background thread is useful, but mostly idle. Add a config option to tune down the number of threads.	2018-04-10 14:01:45 -07:00
Qi Wang	4be74d5112	Consolidate the two memory loads in rtree_szind_slab_read(). szind and slab bits are read on fast path, where compiler generated two memory loads separately for them before this diff. Manually operate on the bits to avoid the extra memory load.	2018-04-10 10:18:46 -07:00
Qi Wang	d3e0976a2c	Fix type warning on Windows. Add cast since read / write has unsigned return type on windows.	2018-04-09 16:50:30 -07:00
Qi Wang	2dccf45640	Control idump and gdump with prof_active.	2018-04-09 16:35:14 -07:00
David Goldblatt	86c61d4a57	Stats printing: Move global mutex stats to use emitter.	2018-03-09 11:47:17 -08:00
David Goldblatt	ebe0b5f828	Emitter: Add support for row-based output in table mode. This is needed for things like mutex stats in table mode.	2018-03-09 11:47:17 -08:00
David Goldblatt	27a8fe6780	Introduce the emitter module. The emitter can be used to produce structured json or tabular output. For now it has no uses; in subsequent commits, I'll begin transitioning stats printing code over.	2018-03-09 11:47:17 -08:00
Qi Wang	e4f090e8df	Add opt.thp which allows explicit hugepage usage. "always" marks all user mappings as MADV_HUGEPAGE; while "never" marks all mappings as MADV_NOHUGEPAGE. The default setting "default" does not change any settings. Note that all the madvise calls are part of the default extent hooks by design, so that customized extent hooks have complete control over the mappings including hugepage settings.	2018-03-08 13:08:06 -08:00
Qi Wang	efa40532dc	Remove config.thp which wasn't in use.	2018-03-08 13:08:06 -08:00
David T. Goldblatt	dd7e283b6f	Tweak the ticker paths to help GCC generate better code. GCC on its own isn't quite able to turn the ticker subtract into a memory operation followed by a js.	2018-02-21 16:04:23 -08:00
rustyx	83aa9880b7	Make generated headers usable in both x86 and x64 mode in Visual Studio	2018-01-30 13:11:41 -08:00
Christopher Ferris	f78d4ca3fb	Modify configure to determine return value of strerror_r. On glibc and Android's bionic, strerror_r returns char* when _GNU_SOURCE is defined. Add a configure check for this rather than assume glibc is the only libc that behaves this way.	2018-01-10 21:01:18 -08:00
Qi Wang	41790f4fa4	Check tsdn_null before reading reentrancy level.	2018-01-05 13:05:17 -08:00
Qi Wang	91b247d311	In iallocztm, check lock rank only when not in reentrancy.	2018-01-05 13:05:17 -08:00
Rajeev Misra	72bdbc35e3	extent_t bitpacking logic refactoring	2018-01-04 11:11:04 -08:00
Rajeev Misra	f47e39d11a	handle 32 bit mutex counters	2018-01-04 11:08:17 -08:00
David Goldblatt	21f7c13d0b	Add the div module, which allows fast division by dynamic values.	2017-12-21 14:25:43 -08:00
David T. Goldblatt	7f1b02e3fa	Split up and standardize naming of stats code. The arena-associated stats are now all prefixed with arena_stats_, and live in their own file. Likewise, malloc_bin_stats_t -> bin_stats_t, also in its own file.	2017-12-18 16:29:10 -08:00
David T. Goldblatt	901d94a2b0	Rename cache_alloc_easy to cache_bin_alloc_easy. This lives in the cache_bin module; just a typo.	2017-12-18 16:29:10 -08:00
David T. Goldblatt	8aafa270fd	Move bin stats code from arena to bin module.	2017-12-18 16:29:10 -08:00
David T. Goldblatt	48bb4a056b	Move bin forking code from arena to bin module.	2017-12-18 16:29:10 -08:00
David T. Goldblatt	a8dd8876fb	Move bin initialization from arena module to bin module.	2017-12-18 16:29:10 -08:00
David T. Goldblatt	4bf4a1c4ea	Pull out arena_bin_info_t and arena_bin_t into their own file. In the process, kill arena_bin_index, which is unused. To follow are several diffs continuing this separation.	2017-12-18 16:29:10 -08:00
Qi Wang	740bdd68b1	Over purge by 1 extent always. When purging, large allocations are usually the ones that cross the npages_limit threshold, simply because they are "large". This means we often leave the large extent around for a while, which has the downsides of: 1) high RSS and 2) more chance of them getting fragmented. Given that they are not likely to be reused very soon (LRU), let's over purge by 1 extent (which is often large and not reused frequently).	2017-12-18 12:57:07 -08:00
Ed Schouten	749caf14ae	Also use __riscv to detect builds for RISC-V CPUs. According to the RISC-V toolchain conventions, __riscv__ is the old spelling of this definition. __riscv should be used going forward. https://github.com/riscv/riscv-toolchain-conventions#cc-preprocessor-definitions	2017-12-09 10:10:42 -08:00
Qi Wang	eb1b08daae	Fix an extent coalesce bug. When coalescing, we should take both extents off the LRU list; otherwise decay can grab the existing outer extent through extents_evict.	2017-11-16 15:32:02 -08:00
Qi Wang	fac706836f	Add opt.lg_extent_max_active_fit When allocating from dirty extents (which we always prefer if available), large active extents can get split even if the new allocation is much smaller, in which case the introduced fragmentation causes high long term damage. This new option controls the threshold to reuse and split an existing active extent. We avoid using a large extent for much smaller sizes, in order to reduce fragmentation. In some workload, adding the threshold improves virtual memory usage by >10x.	2017-11-16 15:32:02 -08:00
Dave Watson	d6feed6e66	Use tsd offset_state instead of atomic While working on #852, I noticed the prng state is atomic. This is the only atomic use of prng in all of jemalloc. Instead, use a threadlocal prng state if possible to avoid unnecessary cache line contention.	2017-11-14 08:58:18 -08:00

1 2 3 4 5 ...

831 Commits