server-skynet-source-3rd-jemalloc

project-base/server-skynet-source-3rd-jemalloc

Author	SHA1	Message	Date
Qi Wang	94ace05832	Fix the prof thread_name reference in prof_recent dump. As pointed out in #2434, the thread_name in prof_tdata_t was changed in #2407. This also requires an update for the prof_recent dump, specifically the emitter expects a "char **" which is fixed in this commit.	2023-05-11 09:10:57 -07:00
Qi Wang	6ea8a7e928	Add config detection for JEMALLOC_HAVE_PTHREAD_SET_NAME_NP. and use it on the background thread name setting.	2023-05-11 09:10:57 -07:00
auxten	5bac384970	If ptr present check if alloc_ctx.edata == NULL	2023-05-10 17:18:22 -07:00
auxten	019cccc293	Make arenas_lookup_ctl triable	2023-05-10 17:18:22 -07:00
Kevin Svetlitski	dc0a184f8d	Fix possible `NULL` pointer dereference in `VERIFY_READ` Static analysis flagged this. Fixed by simply checking `oldlenp` before dereferencing it.	2023-05-09 10:57:09 -07:00
Kevin Svetlitski	12311fe6c3	Fix segfault in `extent_try_coalesce_impl` Static analysis flagged this. `extent_record` was passing `NULL` as the value for `coalesced` to `extent_try_coalesce`, which in turn passes that argument to `extent_try_coalesce_impl`, where it is written to without checking if it is `NULL`. I can confirm from reviewing the fleetwide coredump data that this was in fact being hit in production.	2023-05-09 10:55:44 -07:00
Kevin Svetlitski	70344a2d38	Make eligible functions `static` The codebase is already very disciplined in making any function which can be `static`, but there are a few that appear to have slipped through the cracks.	2023-05-08 15:00:02 -07:00
Kevin Svetlitski	6841110bd6	Make `edata_cmp_summary_comp` 30% faster `edata_cmp_summary_comp` is one of the very hottest functions, taking up 3% of all time spent inside Jemalloc. I noticed that all existing callsites rely only on the sign of the value returned by this function, so I came up with this equivalent branchless implementation which preserves this property. After empirical measurement, I have found that this implementation is 30% faster, therefore representing a 1% speed-up to the allocator as a whole. At @interwq's suggestion, I've applied the same optimization to `edata_esnead_comp` in case this function becomes hotter in the future.	2023-05-04 09:59:17 -07:00
Amaury Séchet	f2b28906e6	Some nits in cache_bin.h	2023-05-01 10:21:17 -07:00
Kevin Svetlitski	fc680128e0	Remove errant `assert` in `arena_extent_alloc_large` This codepath may generate deferred work when the HPA is enabled. See also [@davidtgoldblatt's relevant comment on the PR which introduced this](https://github.com/jemalloc/jemalloc/pull/2107#discussion_r699770967) which prevented a similarly incorrect `assert` from being added elsewhere.	2023-05-01 10:00:30 -07:00
Eric Mueller	521970fb2e	Check for equality instead of assigning in asserts in hpa_from_pai. It appears like a simple typo means we're unconditionally overwriting some fields in hpa_from_pai when asserts are enabled. From hpa_shard_init, it looks like these fields have these values anyway, so this shouldn't cause bugs, but if something is wrong it seems better to have these asserts in place. See issue #2412.	2023-04-17 20:57:48 -07:00
guangli-dai	5f64ad60cd	Remove locked flag set in malloc_mutex_trylock As a hint flag of the lock, parameter locked should be set only when the lock is gained or freed.	2023-04-06 10:57:04 -07:00
Qi Wang	434a68e221	Disallow decay during reentrancy. Decay should not be triggered during reentrant calls (may cause lock order reversal / deadlocks). Added a delay_trigger flag to the tickers to bypass decay when rentrancy_level is not zero.	2023-04-05 10:16:37 -07:00
Qi Wang	e62aa478c7	Rearrange the bools in prof_tdata_t to save some bytes. This lowered the sizeof(prof_tdata_t) from 200 to 192 which is a round size class. Afterwards the tdata_t size remain unchanged with the last commit, which effectively inlined the storage of thread names for free.	2023-04-05 10:03:12 -07:00
Qi Wang	ce0b7ab6c8	Inline the storage for thread name in prof_tdata_t. The previous approach managed the thread name in a separate buffer, which causes races because the thread name update (triggered by new samples) can happen at the same time as prof dumping (which reads the thread names) -- these two operations are under separate locks to avoid blocking each other. Implemented the thread name storage as part of the tdata struct, which resolves the lifetime issue and also avoids internal alloc / dalloc during prof_sample.	2023-04-05 10:03:12 -07:00
Qi Wang	6cab460a45	Add a multithreaded test for prof_sys_thread_name. Verified that this catches the issue being fixed in 5fd5583.	2023-04-05 10:03:12 -07:00
Amaury Séchet	5266152d79	Simplify the logic in ph_remove	2023-03-31 14:35:31 -07:00
Amaury Séchet	be6da4f663	Do not maintain root->prev in ph_remove.	2023-03-31 14:34:57 -07:00
Amaury Séchet	543e2d61e6	Simplify the logic in ph_insert Also fixes what looks like an off by one error in the lazy aux list merge part of the code that previously never touched the last node in the aux list.	2023-03-31 14:34:24 -07:00
guangli-dai	31e01a98f1	Fix the rdtscp detection bug and add prefix for the macro.	2023-03-23 11:16:19 -07:00
Qi Wang	8b64be3441	Explicit arena assignment in test_tcache_max. Otherwise the associated arena could change with percpu arena enabled.	2023-03-22 15:16:43 -07:00
Qi Wang	8e7353a19b	Explicit arena assignment in test_thread_idle. Otherwise the associated arena could change with percpu arena enabled.	2023-03-22 15:16:43 -07:00
Marvin Schmidt	45249cf5a9	Fix exception specification error for hosts using musl libc It turns out that the previous commit did not suffice since the JEMALLOC_SYS_NOTHROW definition also causes the same exception specification errors as JEMALLOC_USE_CXX_THROW did: ``` x86_64-pc-linux-musl-cc -std=gnu11 -Werror=unknown-warning-option -Wall -Wextra -Wshorten-64-to-32 -Wsign-compare -Wundef -Wno-format-zero-length -Wpointer- arith -Wno-missing-braces -Wno-missing-field-initializers -pipe -g3 -fvisibility=hidden -Wimplicit-fallthrough -O3 -funroll-loops -march=native -O2 -pipe -c -march=native -O2 -pipe -D_GNU_SOURCE -D_REENTRANT -Iinclude -Iinclude -o src/background_thread.o src/background_thread.c In file included from src/jemalloc_cpp.cpp:9: In file included from include/jemalloc/internal/jemalloc_preamble.h:27: include/jemalloc/internal/../jemalloc.h:254:32: error: exception specification in declaration does not match previous declaration void JEMALLOC_SYS_NOTHROW je_malloc(size_t size) ^ include/jemalloc/internal/../jemalloc.h:75:21: note: expanded from macro 'je_malloc' ^ /usr/x86_64-pc-linux-musl/include/stdlib.h:40:7: note: previous declaration is here void malloc (size_t); ^ ``` On systems using the musl C library we have to omit the exception specification on malloc function family like it's done for MacOS, FreeBSD and OpenBSD.	2023-03-16 12:11:40 -07:00
Marvin Schmidt	aba1645f2d	configure: Handle -linux-musl hosts properly This is the same as the `--linux*` case with the two exceptions that we don't set glibc=1 and don't define JEMALLOC_USE_CXX_THROW	2023-03-16 12:11:40 -07:00
Qi Wang	d503d72129	Add the missing descriptions in AC_DEFINE	2023-03-14 16:47:00 -07:00
Qi Wang	71bc1a3d91	Avoid assuming the arena id in test when percpu_arena is used.	2023-03-13 10:50:10 -07:00
Amaury Séchet	f743690739	Remove unused mutex from hpa_central	2023-03-10 11:25:47 -08:00
Chris Seymour	4edea8eb8e	switch to https	2023-03-09 11:44:02 -08:00
guangli-dai	09e4b38fb1	Use asm volatile during benchmarks.	2023-02-24 11:17:48 -08:00
Fernando Pelliccioni	e8b28908de	[MSVC] support for Visual Studio 2019 and 2022	2023-02-21 13:39:25 -08:00
barracuda156	4422f88d17	Makefile.in: link with g++ when cxx enabled	2023-02-21 13:26:58 -08:00
Qi Wang	c7805f1eb5	Add a header in HPA stats for the nonfull slabs.	2023-02-17 13:31:27 -08:00
Qi Wang	b6125120ac	Add an explicit name to the dedicated oversize arena.	2023-02-17 13:31:09 -08:00
Qi Wang	97b313c7d4	More conservative setting for /test/unit/background_thread_enable. Lower the thread and arena count to avoid resource exhaustion on 32-bit.	2023-02-16 14:42:21 -08:00
Qi Wang	5fd55837bb	Fix thread_name updating for heap profiling. The current thread name reading path updates the name every time, which requires both alloc and dalloc -- and the temporary NULL value in the middle causes races where the prof dump read path gets NULLed in the middle. Minimize the changes in this commit to isolate the bugfix testing; will also refactor the whole thread name paths later.	2023-02-15 17:49:40 -08:00
Qi Wang	8580c65f81	Implement prof sample hooks "experimental.hooks.prof_sample(_free)". The added hooks hooks.prof_sample and hooks.prof_sample_free are intended to allow advanced users to track additional information, to enable new ways of profiling on top of the jemalloc heap profile and sample features. The sample hook is invoked after the allocation and backtracing, and forwards the both the allocation and backtrace to the user hook; the sample_free hook happens before the actual deallocation, and forwards only the ptr and usz to the hook.	2022-12-07 16:06:49 -08:00
guangli-dai	a74acb57e8	Fix dividing 0 error in stress/cpp/microbench Summary: Per issue #2356, some CXX compilers may optimize away the new/delete operation in stress/cpp/microbench.cpp. Thus, this commit (1) bumps the time interval to 1 if it is 0, and (2) modifies the pointers in the microbench to volatile.	2022-12-06 10:46:14 -08:00
Guangli Dai	e8f9f13811	Inline free and sdallocx into operator delete	2022-11-21 11:14:05 -08:00
guangli-dai	06374d2a6a	Benchmark operator delete Added the microbenchmark for operator delete. Also modified bench.h so that it can be used in C++.	2022-11-21 11:14:05 -08:00
guangli-dai	14ad8205bf	Update the ratio display in benchmark In bench.h, specify the ratio as the time consumption ratio and modify the display of the ratio.	2022-11-21 11:14:05 -08:00
Qi Wang	481bbfc990	Add a configure option --enable-force-getenv. Allows the use of getenv() rather than secure_getenv() to read MALLOC_CONF. This helps in situations where hosts are under full control, and setting MALLOC_CONF is needed while also setuid. Disabled by default.	2022-11-04 13:37:14 -07:00
Qi Wang	143e9c4a2f	Enable fast thread locals for dealloc-only threads. Previously if a thread does only allocations, it stays on the slow path / minimal initialized state forever. However, dealloc-only is a valid pattern for dedicated reclamation threads -- this means thread cache is disabled (no batched flush) for them, which causes high overhead and contention. Added the condition to fully initialize TSD when a fair amount of dealloc activities are observed.	2022-10-25 09:54:38 -07:00
Paul Smith	be65438f20	jemalloc_internal_types.h: Use alloca if __STDC_NO_VLA__ is defined No currently-available version of Visual Studio C compiler supports variable length arrays, even if it defines __STDC_VERSION__ >= C99. As far as I know Microsoft has no plans to ever support VLAs in MSVC. The C11 standard requires that the __STDC_NO_VLA__ macro be defined if the compiler doesn't support VLAs, so fall back to alloca() if so.	2022-10-14 15:48:32 -07:00
divanorama	1897f185d2	Fix safety_check segfault in double free test	2022-10-03 10:55:10 -07:00
Jordan Rome	b04e7666f2	update PROFILING_INTERNALS.md Expand the bad example of summing before unbiasing.	2022-10-03 10:48:29 -07:00
David Carlier	4c95c953e2	fix build for non linux/BSD platforms.	2022-10-03 10:42:09 -07:00
divanorama	3de0c24859	Disable builtin malloc in tests With `--with-jemalloc-prefix=` and without `-fno-builtin` or `-O1` both clang and gcc may optimize out `malloc` calls whose result is unused. Comparing result to NULL also doesn't necessarily count as being used. This won't be a problem in most client programs as this only concerns really unused pointers, but in tests it's important to actually execute allocations. `-fno-builtin` should disable this optimization for both gcc and clang, and applying it only to tests code shouldn't hopefully be an issue. Another alternative is to force "use" of result but that'd require more changes and may miss some other optimization-related issues. This should resolve https://github.com/jemalloc/jemalloc/issues/2091	2022-10-03 10:39:13 -07:00
Lily Wang	c0c9783ec9	Add vcpkg installation instructions	2022-09-19 15:15:28 -07:00
Guangli Dai	c9ac1f4701	Fix a bug in C++ integration test.	2022-09-16 15:04:59 -07:00
Guangli Dai	ba19d2cb78	Add arena-level name. An arena-level name can help identify manual arenas.	2022-09-16 15:04:59 -07:00

1 2 3 4 5 ...

3353 Commits