server-skynet-source-3rd-jemalloc

project-base/server-skynet-source-3rd-jemalloc

Author	SHA1	Message	Date
David Goldblatt	865debda22	Rename extent.h -> edata.h. This name is slightly pithier; a full-on rename will come shortly.	2019-12-20 10:18:40 -08:00
David Goldblatt	a738a66b5c	Ehooks: Add some debug zero and addr checks. These help make sure that the ehooks return properly zeroed memory when required to.	2019-12-20 10:18:40 -08:00
David Goldblatt	4b2e5ee8b9	Ehooks: Add a "zero" ehook. This is the first API expansion. It lets the hooks pick where and how to purge within themselves.	2019-12-20 10:18:40 -08:00
David Goldblatt	403f2d1664	Extents: Split out introspection functionality. This isn't really part of the core extent allocation facilities. Especially as this module grows, having it in its own place may come in handy.	2019-12-20 10:18:40 -08:00
David Goldblatt	92a511d385	Make extent module hermetic. In the form of extent2.h. The naming leaves something to be desired, but I'll leave that for a later diff.	2019-12-20 10:18:40 -08:00
David Goldblatt	39fdc690a0	Ehooks comments and cleanup.	2019-12-20 10:18:40 -08:00
David Goldblatt	c8dae890c8	Extent -> Ehooks: Move over default hooks.	2019-12-20 10:18:40 -08:00
David Goldblatt	2fe5108263	Extent -> Ehooks: Move merge hook.	2019-12-20 10:18:40 -08:00
David Goldblatt	1fff4d2ee3	Extent -> Ehooks: Move split hook.	2019-12-20 10:18:40 -08:00
David Goldblatt	a5b42a1a10	Extent -> Ehooks: Move purge_forced hook.	2019-12-20 10:18:40 -08:00
David Goldblatt	368baa42ef	Extent -> Ehooks: Move purge_lazy hook.	2019-12-20 10:18:40 -08:00
David Goldblatt	d78fe241ac	Extent -> Ehooks: Move commit and decommit hooks.	2019-12-20 10:18:40 -08:00
David Goldblatt	5459ec9dae	Extent -> Ehooks: Move destroy hook.	2019-12-20 10:18:40 -08:00
David Goldblatt	bac8e2e5a6	Extent -> Ehooks: Move dalloc hook.	2019-12-20 10:18:40 -08:00
David Goldblatt	dc8b4e6e13	Extent -> Ehooks: Move alloc hook.	2019-12-20 10:18:40 -08:00
David Goldblatt	703fbc0ff5	Introduce unsafe reentrancy guards. We have to work to circumvent the safety checks in pre_reentrancy when going down extent hook pathways. Instead, let's explicitly have checked and unchecked guards.	2019-12-20 10:18:40 -08:00
David Goldblatt	ae0d8e8591	Move extent ehook calls into ehooks	2019-12-20 10:18:40 -08:00
David Goldblatt	ba8b9ecbcb	Add ehooks module	2019-12-20 10:18:40 -08:00
David Goldblatt	837119a948	base_structs.h: Remove some mid-line tabs.	2019-12-20 10:18:40 -08:00
David Goldblatt	9f6eb09585	Extents: Eagerly initialize extent hooks. When deferred initialization was added, initializing required copying sizeof(extent_hooks_t) bytes after a pointer chase. Today, it's just a single pointer loaded from the base_t. In subsequent diffs, we'll get rid of even that.	2019-12-20 10:18:40 -08:00
David Goldblatt	4278f84603	Move extent hook getters/setters to arena.c This is where they're logically scoped; they access arena data.	2019-12-20 10:18:40 -08:00
Qi Wang	d5031ea824	Allow dallocx and sdallocx after tsd destruction. After a thread turns into purgatory / reincarnated state, still allow dallocx and sdallocx to function normally.	2019-12-19 11:17:03 -08:00
Yinan Zhang	4afd709d1f	Restructure setters for profiling info Explicitly define three setters: - `prof_tctx_reset()`: set `prof_tctx` to `1U`, if we don't know in advance whether the allocation is large or not; - `prof_tctx_reset_sampled()`: set `prof_tctx` to `1U`, if we already know in advance that the allocation is large; - `prof_info_set()`: set a real `prof_tctx`, and also set other profiling info e.g. the allocation time. Code structure wise, the prof level is kept as a thin wrapper, the large level only provides low level setter APIs, and the arena level carries out the main logic.	2019-12-17 10:01:28 -08:00
Yinan Zhang	1d01e4c770	Initialization utilities for nstime	2019-12-16 16:08:56 -08:00
Qi Wang	dd649c9485	Optimize away the tsd_fast() check on fastpath. Fold the tsd_state check onto the event threshold check. The fast threshold is set to 0 when tsd switch to non-nominal. The fast_threshold can be reset by remote threads, to refect the non nominal tsd state change.	2019-12-11 23:44:20 -08:00
Yinan Zhang	45836d7fd3	Pass nstime_t pointer for profiling	2019-12-11 11:38:16 -08:00
Yinan Zhang	7d2bac5a38	Refactor destroy code path for prof_tctx	2019-12-10 16:31:05 -08:00
Yinan Zhang	055478cca8	Threshold is no longer updated before prof_realloc()	2019-12-10 16:31:05 -08:00
Yinan Zhang	7e3671911f	Get rid of old indentation style for prof	2019-12-06 09:47:51 -08:00
Yinan Zhang	dfdd46f6c1	Refactor prof_tctx_t creation	2019-12-06 09:47:51 -08:00
Yinan Zhang	aa1d71fb7a	Rename prof_tctx to alloc_tctx in prof_info_t	2019-12-06 09:47:51 -08:00
Yinan Zhang	5e0b090992	No need to pass usize to prof_tctx_set()	2019-12-06 09:47:51 -08:00
David Goldblatt	1b1e76acfe	Disable some spuriously-triggering warnings	2019-12-04 13:45:17 -08:00
Yinan Zhang	6945371778	Change tsdn to tsd for profiling code path	2019-11-22 16:31:56 -08:00
Yinan Zhang	b55419f9b9	Restructure profiling Develop new data structure and code logic for holding profiling related information stored in the extent that may be needed after the extent is released, which in particular is the case for the reallocation code path (e.g. in `rallocx()` and `xallocx()`). The data structure is a generalization of `prof_tctx_t`: we previously only copy out the `prof_tctx` before the extent is released, but we may be in need of additional fields. Currently the only additional field is the allocation time field, but there may be more fields in the future. The restructuring also resolved a bug: `prof_realloc()` mistakenly passed the new `ptr` to `prof_free_sampled_object()`, but passing in the `old_ptr` would crash because it's already been released. Now the essential profiling information is collectively copied out early and safely passed to `prof_free_sampled_object()` after the extent is released.	2019-11-22 16:31:56 -08:00
Mark Santaniello	8b2c2a596d	Support C++17 over-aligned allocation Summary: Add support for C++17 over-aligned allocation: http://www.open-std.org/jtc1/sc22/wg21/docs/papers/2016/p0035r4.html Supporting all 10 operators means we avoid thunking thru libstdc++-v3/libsupc++ and just call jemalloc directly. It's also worth noting that there is now an aligned and sized operator delete: ``` void operator delete(void* ptr, std::size_t size, std::align_val_t al) noexcept; ``` If JeMalloc did not provide this, the default implementation would ignore the size parameter entirely: https://github.com/gcc-mirror/gcc/blob/master/libstdc%2B%2B-v3/libsupc%2B%2B/del_opsa.cc#L30-L33 (I must also update ax_cxx_compile_stdcxx.m4 to a newer version with C++17 support.) Test Plan: Wrote a simple test that allocates and then deletes an over-aligned type: ``` struct alignas(32) Foo {}; Foo f; int main() { f = new Foo; delete f; } ``` Before this change, both new and delete go thru PLT, and we end up calling regular old free: ``` (gdb) disassemble Dump of assembler code for function main(): ... 0x00000000004029b7 <+55>: call 0x4022d0 <_ZnwmSt11align_val_t@plt> ... 0x00000000004029d5 <+85>: call 0x4022e0 <_ZdlPvmSt11align_val_t@plt> ... (gdb) s free (ptr=0x7ffff6408020) at /home/engshare/third-party2/jemalloc/master/src/jemalloc.git-trunk/src/jemalloc.c:2842 2842 if (!free_fastpath(ptr, 0, false)) { ``` After this change, we directly call new/delete and ultimately call sdallocx: ``` (gdb) disassemble Dump of assembler code for function main(): ... 0x0000000000402b77 <+55>: call 0x496ca0 <operator new(unsigned long, std::align_val_t)> ... 0x0000000000402b95 <+85>: call 0x496e60 <operator delete(void, unsigned long, std::align_val_t)> ... (gdb) s 116 je_sdallocx_noflags(ptr, size); ```	2019-11-22 10:14:16 -08:00
Qi Wang	9a7ae3c97f	Reduce footprint of bin_t. Avoid storing mutex_prof_data_t in bin_t. Added bin_stats_data_t which is used for reporting bin stats.	2019-11-21 11:08:36 -08:00
Yinan Zhang	73510dfd15	Revert "Fix bug in prof_realloc" This reverts commit 3b5eecf102dcc3eb9a4a50346cdfa96917683e0a.	2019-11-15 15:13:39 -08:00
Yinan Zhang	3b5eecf102	Fix bug in prof_realloc We should pass in `old_ptr` rather than the new `ptr` to `prof_free_sampled_object()` when `old_ptr` points to a sampled allocation.	2019-11-15 13:28:33 -08:00
Leonardo Santagada	c462753cc8	Use __forceinline for JEMALLOC_ALWAYS_INLINE on msvc	2019-11-12 13:50:25 -08:00
Qi Wang	da50d8ce87	Refactor and optimize prof sampling initialization. Makes the prof sample prng use the tsd prng_state. This allows us to properly initialize the sample interval event, without having to create tdata. As a result, tdata will be created on demand (when a thread reaches the sample interval bytes allocated), instead of on the first allocation.	2019-11-11 10:35:37 -08:00
Qi Wang	bc774a3519	Rename tsd->offset_state to tsd->prng_state.	2019-11-11 10:35:37 -08:00
Qi Wang	19a51abf33	Avoid arena->offset_state when tsd not available for prng. Use stack locals and remove the offset_state in arena.	2019-11-11 10:35:37 -08:00
Nick Desaulniers	d01b425e5d	Add -Wimplicit-fallthrough checks if supported Clang since r369414 (clang-10) can now check -Wimplicit-fallthrough for C code, and use the GNU C style attribute to denote fallthrough. Move the test from header only to autoconf. The previous test used brittle version detection which did not work for newer clang that supported this feature. The attribute has to be its own statement, hence the added `;`. It also can only precede case statements, so the final cases should be explicitly terminated with break statements. Fixes commit 3d29d11ac2c1 ("Clean compilation -Wextra") Link: `1e0affb6e5` Signed-off-by: Nick Desaulniers <ndesaulniers@google.com>	2019-11-08 13:03:03 -08:00
Yinan Zhang	43f0ce92d8	Define general purpose tsd_thread_event_init()	2019-11-04 16:07:56 -08:00
Yinan Zhang	97f93fa0f2	Pull tcache GC events into thread event handler	2019-11-04 16:07:56 -08:00
Yinan Zhang	198f02e797	Pull prof_accumbytes into thread event handler	2019-11-04 15:21:16 -08:00
Yinan Zhang	152c0ef954	Build a general purpose thread event handler	2019-11-04 11:15:50 -08:00
David T. Goldblatt	de81a4eada	Add stats counters for number of zero reallocs	2019-10-29 17:48:44 -07:00
David T. Goldblatt	9cfa805947	Realloc: Make behavior of realloc(ptr, 0) configurable.	2019-10-29 17:48:44 -07:00

... 8 9 10 11 12 ...

1492 Commits