server-skynet-source-3rd-jemalloc

project-base/server-skynet-source-3rd-jemalloc

Author	SHA1	Message	Date
David Goldblatt	bac8e2e5a6	Extent -> Ehooks: Move dalloc hook.	2019-12-20 10:18:40 -08:00
David Goldblatt	dc8b4e6e13	Extent -> Ehooks: Move alloc hook.	2019-12-20 10:18:40 -08:00
David Goldblatt	703fbc0ff5	Introduce unsafe reentrancy guards. We have to work to circumvent the safety checks in pre_reentrancy when going down extent hook pathways. Instead, let's explicitly have checked and unchecked guards.	2019-12-20 10:18:40 -08:00
David Goldblatt	ae0d8e8591	Move extent ehook calls into ehooks	2019-12-20 10:18:40 -08:00
David Goldblatt	ba8b9ecbcb	Add ehooks module	2019-12-20 10:18:40 -08:00
David Goldblatt	837119a948	base_structs.h: Remove some mid-line tabs.	2019-12-20 10:18:40 -08:00
David Goldblatt	9f6eb09585	Extents: Eagerly initialize extent hooks. When deferred initialization was added, initializing required copying sizeof(extent_hooks_t) bytes after a pointer chase. Today, it's just a single pointer loaded from the base_t. In subsequent diffs, we'll get rid of even that.	2019-12-20 10:18:40 -08:00
David Goldblatt	4278f84603	Move extent hook getters/setters to arena.c This is where they're logically scoped; they access arena data.	2019-12-20 10:18:40 -08:00
Qi Wang	d5031ea824	Allow dallocx and sdallocx after tsd destruction. After a thread turns into purgatory / reincarnated state, still allow dallocx and sdallocx to function normally.	2019-12-19 11:17:03 -08:00
Yinan Zhang	4afd709d1f	Restructure setters for profiling info Explicitly define three setters: - `prof_tctx_reset()`: set `prof_tctx` to `1U`, if we don't know in advance whether the allocation is large or not; - `prof_tctx_reset_sampled()`: set `prof_tctx` to `1U`, if we already know in advance that the allocation is large; - `prof_info_set()`: set a real `prof_tctx`, and also set other profiling info e.g. the allocation time. Code structure wise, the prof level is kept as a thin wrapper, the large level only provides low level setter APIs, and the arena level carries out the main logic.	2019-12-17 10:01:28 -08:00
Yinan Zhang	1d01e4c770	Initialization utilities for nstime	2019-12-16 16:08:56 -08:00
Qi Wang	dd649c9485	Optimize away the tsd_fast() check on fastpath. Fold the tsd_state check onto the event threshold check. The fast threshold is set to 0 when tsd switch to non-nominal. The fast_threshold can be reset by remote threads, to refect the non nominal tsd state change.	2019-12-11 23:44:20 -08:00
Yinan Zhang	45836d7fd3	Pass nstime_t pointer for profiling	2019-12-11 11:38:16 -08:00
Yinan Zhang	7d2bac5a38	Refactor destroy code path for prof_tctx	2019-12-10 16:31:05 -08:00
Yinan Zhang	055478cca8	Threshold is no longer updated before prof_realloc()	2019-12-10 16:31:05 -08:00
Yinan Zhang	7e3671911f	Get rid of old indentation style for prof	2019-12-06 09:47:51 -08:00
Yinan Zhang	dfdd46f6c1	Refactor prof_tctx_t creation	2019-12-06 09:47:51 -08:00
Yinan Zhang	aa1d71fb7a	Rename prof_tctx to alloc_tctx in prof_info_t	2019-12-06 09:47:51 -08:00
Yinan Zhang	5e0b090992	No need to pass usize to prof_tctx_set()	2019-12-06 09:47:51 -08:00
David Goldblatt	1b1e76acfe	Disable some spuriously-triggering warnings	2019-12-04 13:45:17 -08:00
Yinan Zhang	6945371778	Change tsdn to tsd for profiling code path	2019-11-22 16:31:56 -08:00
Yinan Zhang	b55419f9b9	Restructure profiling Develop new data structure and code logic for holding profiling related information stored in the extent that may be needed after the extent is released, which in particular is the case for the reallocation code path (e.g. in `rallocx()` and `xallocx()`). The data structure is a generalization of `prof_tctx_t`: we previously only copy out the `prof_tctx` before the extent is released, but we may be in need of additional fields. Currently the only additional field is the allocation time field, but there may be more fields in the future. The restructuring also resolved a bug: `prof_realloc()` mistakenly passed the new `ptr` to `prof_free_sampled_object()`, but passing in the `old_ptr` would crash because it's already been released. Now the essential profiling information is collectively copied out early and safely passed to `prof_free_sampled_object()` after the extent is released.	2019-11-22 16:31:56 -08:00
Mark Santaniello	8b2c2a596d	Support C++17 over-aligned allocation Summary: Add support for C++17 over-aligned allocation: http://www.open-std.org/jtc1/sc22/wg21/docs/papers/2016/p0035r4.html Supporting all 10 operators means we avoid thunking thru libstdc++-v3/libsupc++ and just call jemalloc directly. It's also worth noting that there is now an aligned and sized operator delete: ``` void operator delete(void* ptr, std::size_t size, std::align_val_t al) noexcept; ``` If JeMalloc did not provide this, the default implementation would ignore the size parameter entirely: https://github.com/gcc-mirror/gcc/blob/master/libstdc%2B%2B-v3/libsupc%2B%2B/del_opsa.cc#L30-L33 (I must also update ax_cxx_compile_stdcxx.m4 to a newer version with C++17 support.) Test Plan: Wrote a simple test that allocates and then deletes an over-aligned type: ``` struct alignas(32) Foo {}; Foo f; int main() { f = new Foo; delete f; } ``` Before this change, both new and delete go thru PLT, and we end up calling regular old free: ``` (gdb) disassemble Dump of assembler code for function main(): ... 0x00000000004029b7 <+55>: call 0x4022d0 <_ZnwmSt11align_val_t@plt> ... 0x00000000004029d5 <+85>: call 0x4022e0 <_ZdlPvmSt11align_val_t@plt> ... (gdb) s free (ptr=0x7ffff6408020) at /home/engshare/third-party2/jemalloc/master/src/jemalloc.git-trunk/src/jemalloc.c:2842 2842 if (!free_fastpath(ptr, 0, false)) { ``` After this change, we directly call new/delete and ultimately call sdallocx: ``` (gdb) disassemble Dump of assembler code for function main(): ... 0x0000000000402b77 <+55>: call 0x496ca0 <operator new(unsigned long, std::align_val_t)> ... 0x0000000000402b95 <+85>: call 0x496e60 <operator delete(void, unsigned long, std::align_val_t)> ... (gdb) s 116 je_sdallocx_noflags(ptr, size); ```	2019-11-22 10:14:16 -08:00
Qi Wang	9a7ae3c97f	Reduce footprint of bin_t. Avoid storing mutex_prof_data_t in bin_t. Added bin_stats_data_t which is used for reporting bin stats.	2019-11-21 11:08:36 -08:00
Yinan Zhang	73510dfd15	Revert "Fix bug in prof_realloc" This reverts commit 3b5eecf102dcc3eb9a4a50346cdfa96917683e0a.	2019-11-15 15:13:39 -08:00
Yinan Zhang	3b5eecf102	Fix bug in prof_realloc We should pass in `old_ptr` rather than the new `ptr` to `prof_free_sampled_object()` when `old_ptr` points to a sampled allocation.	2019-11-15 13:28:33 -08:00
Leonardo Santagada	c462753cc8	Use __forceinline for JEMALLOC_ALWAYS_INLINE on msvc	2019-11-12 13:50:25 -08:00
Qi Wang	da50d8ce87	Refactor and optimize prof sampling initialization. Makes the prof sample prng use the tsd prng_state. This allows us to properly initialize the sample interval event, without having to create tdata. As a result, tdata will be created on demand (when a thread reaches the sample interval bytes allocated), instead of on the first allocation.	2019-11-11 10:35:37 -08:00
Qi Wang	bc774a3519	Rename tsd->offset_state to tsd->prng_state.	2019-11-11 10:35:37 -08:00
Qi Wang	19a51abf33	Avoid arena->offset_state when tsd not available for prng. Use stack locals and remove the offset_state in arena.	2019-11-11 10:35:37 -08:00
Nick Desaulniers	d01b425e5d	Add -Wimplicit-fallthrough checks if supported Clang since r369414 (clang-10) can now check -Wimplicit-fallthrough for C code, and use the GNU C style attribute to denote fallthrough. Move the test from header only to autoconf. The previous test used brittle version detection which did not work for newer clang that supported this feature. The attribute has to be its own statement, hence the added `;`. It also can only precede case statements, so the final cases should be explicitly terminated with break statements. Fixes commit 3d29d11ac2c1 ("Clean compilation -Wextra") Link: `1e0affb6e5` Signed-off-by: Nick Desaulniers <ndesaulniers@google.com>	2019-11-08 13:03:03 -08:00
Yinan Zhang	43f0ce92d8	Define general purpose tsd_thread_event_init()	2019-11-04 16:07:56 -08:00
Yinan Zhang	97f93fa0f2	Pull tcache GC events into thread event handler	2019-11-04 16:07:56 -08:00
Yinan Zhang	198f02e797	Pull prof_accumbytes into thread event handler	2019-11-04 15:21:16 -08:00
Yinan Zhang	152c0ef954	Build a general purpose thread event handler	2019-11-04 11:15:50 -08:00
David T. Goldblatt	de81a4eada	Add stats counters for number of zero reallocs	2019-10-29 17:48:44 -07:00
David T. Goldblatt	9cfa805947	Realloc: Make behavior of realloc(ptr, 0) configurable.	2019-10-29 17:48:44 -07:00
Yinan Zhang	05681e387a	Optimize cache_bin_alloc_easy for malloc fast path `tcache_bin_info` is not accessed on malloc fast path but the compiler reserves a register for it, as well as an additional register for `tcache_bin_info[ind].stack_size`. The optimization gets rid of the need for the two registers.	2019-10-21 16:43:45 -07:00
Yinan Zhang	4fe50bc7d0	Fix amd64 MSVC warning	2019-10-18 10:16:29 -07:00
Yinan Zhang	4fbbc817c1	Simplify time setting and getting for prof log	2019-10-16 09:24:52 -07:00
Yinan Zhang	66e07f986d	Suppress tdata creation in reentrancy This change suppresses tdata initialization and prof sample threshold update in interrupting malloc calls. Interrupting calls have no need for tdata. Delaying tdata creation aligns better with our lazy tdata creation principle, and it also helps us gain control back from interrupting calls more quickly and reduces any risk of delegating tdata creation to an interrupting call.	2019-10-04 08:52:50 -07:00
Yinan Zhang	beb7c16e94	Guard prof_active reset by opt_prof Set `prof_active` to read-only when `opt_prof` is turned off.	2019-10-02 11:42:53 -07:00
David T. Goldblatt	3d84bd57f4	Arena: Add helper function arena_get_from_extent.	2019-09-23 23:06:27 -07:00
David T. Goldblatt	c97d255752	Eset: Remove temporary declaration.	2019-09-23 23:06:27 -07:00
David T. Goldblatt	ce5b128f10	Remove the undefined extent_size_quantize declarations.	2019-09-23 23:06:27 -07:00
David T. Goldblatt	821dd53a1d	Extent -> Eset: Rename arena members.	2019-09-23 23:06:27 -07:00
David T. Goldblatt	e144b21e4b	Extent -> Eset: Move fork handling.	2019-09-23 23:06:27 -07:00
David T. Goldblatt	77bbb35a92	Extent -> Eset: Move extent fit functions.	2019-09-23 23:06:27 -07:00
David T. Goldblatt	1210af9a4e	Extent -> Eset: Move insertion and removal.	2019-09-23 23:06:27 -07:00
David T. Goldblatt	a42861540e	Extents -> Eset: Convert some stats getters.	2019-09-23 23:06:27 -07:00

1 2 3 4 5 ...

1072 Commits