server-skynet-source-3rd-jemalloc

project-base/server-skynet-source-3rd-jemalloc

Author	SHA1	Message	Date
Qi Wang	06aac61c4b	Split the core logic of tcache flush into a separate function. The core function takes a ptr array as input (containing items to be flushed), which will be reused to flush sanitizer-stashed items.	2021-12-29 14:44:43 -08:00
Qi Wang	d038160f3b	Fix shadowed variable usage. Verified with EXTRA_CFLAGS=-Wshadow.	2021-12-23 10:55:08 -08:00
Qi Wang	bd70d8fc0f	Add the profiling settings for tests explicit. Many profiling related tests make assumptions on the profiling settings, e.g. opt_prof is off by default, and prof_active is default on when opt_prof is on. However the default settings can be changed via --with-malloc-conf at build time. Fixing the tests by adding the assumed settings explicitly.	2021-12-22 20:10:28 -08:00
Joshua Watt	e491df1d2f	Fix warnings when using autoheader.	2021-12-22 13:57:41 -08:00
Qi Wang	60b9637cc0	Only invoke malloc_cpu_count_is_deterministic() when necessary. Also refactor the handling of the non-deterministic case. Notably allow the case with narenas set to proceed w/o warnings, to not affect existing valid use cases.	2021-12-22 13:52:12 -08:00
Qi Wang	837b37c4ce	Fix the time-since computation in HPA. nstime module guarantees monotonic clock update within a single nstime_t. This means, if two separate nstime_t variables are read and updated separately, nstime_subtract between them may result in underflow. Fixed by switching to the time since utility provided by nstime.	2021-12-21 23:37:22 -08:00
Qi Wang	310af725b0	Add nstime_ns_since which obtains the duration since the input time.	2021-12-21 23:37:22 -08:00
Azat Khuzhin	cafe9a3158	Disable percpu arena in case of non deterministic CPU count Determinitic number of CPUs is important for percpu arena to work correctly, since it uses cpu index - sched_getcpu(), and if it will greater then number of CPUs bad thing will happen, or assertion will be failed in debug build: <jemalloc>: ../contrib/jemalloc/src/jemalloc.c:321: Failed assertion: "ind <= narenas_total_get()" Aborted (core dumped) Number of CPUs can be obtained from the following places: - sched_getaffinity() - sysconf(_SC_NPROCESSORS_ONLN) - sysconf(_SC_NPROCESSORS_CONF) For the sched_getaffinity() you may simply use taskset(1) to run program on a different cpu, and in case it will be not first, percpu will work incorrectly, i.e.: $ taskset --cpu-list $(( $(getconf _NPROCESSORS_ONLN)-1 )) <your_program> _SC_NPROCESSORS_ONLN uses /sys/devices/system/cpu/online, LXD/LXC virtualize /sys/devices/system/cpu/online file [1], and so when you run container with limited limits.cpus it will bind randomly selected CPU to it [1]: https://github.com/lxc/lxcfs/issues/301 _SC_NPROCESSORS_CONF uses /sys/devices/system/cpu/cpu*, and AFAIK nobody playing with dentries there. So if all three of these are equal, percpu arenas should work correctly. And a small note regardless _SC_NPROCESSORS_ONLN/_SC_NPROCESSORS_CONF, musl uses sched_getaffinity() for both. So this will also increase the entropy. Also note, that you can check is percpu arena really applied using abort_conf:true. Refs: https://github.com/jemalloc/jemalloc/pull/1939 Refs: https://github.com/ClickHouse/ClickHouse/issues/32806 v2: move malloc_cpu_count_is_deterministic() into malloc_init_hard_recursible() since _SC_NPROCESSORS_CONF does allocations for readdir() v3: - mark cpu_count_is_deterministic static - check only if percpu arena is enabled - check narenas	2021-12-21 11:53:09 -08:00
mweisgut	bb5052ce90	Fix base_ehooks_get_for_metadata	2021-12-20 15:37:53 -08:00
Alex Lapenkov	9015e129bd	Update visual studio projects Add relevant source files to the projects.	2021-12-15 10:39:17 -08:00
Alex Lapenkou	d90655390f	San: Create a function for committing and zeroing Committing and zeroing an extent is usually done together, hence a new function.	2021-12-15 10:39:17 -08:00
Alex Lapenkou	800ce49c19	San: Bump alloc frequently reused guarded allocations To utilize a separate retained area for guarded extents, use bump alloc to allocate those extents.	2021-12-15 10:39:17 -08:00
Alex Lapenkou	f56f5b9930	Pass 'frequent_reuse' hint to PAI Currently used only for guarding purposes, the hint is used to determine if the allocation is supposed to be frequently reused. For example, it might urge the allocator to ensure the allocation is cached.	2021-12-15 10:39:17 -08:00
Alex Lapenkou	2c70e8d351	Rename 'arena_decay' to 'arena_util' While initially this file contained helper functions for one particular test, now its usage spread across different test files. Purpose has shifted towards a collection of handy arena ctl wrappers.	2021-12-15 10:39:17 -08:00
Alex Lapenkou	0f6da1257d	San: Implement bump alloc The new allocator will be used to allocate guarded extents used as slabs for guarded small allocations.	2021-12-15 10:39:17 -08:00
Alex Lapenkou	34b00f8969	San: Avoid running san tests with prof enabled With prof enabled, number of page aligned allocations doesn't match the number of slab "ends" because prof allocations skew the addresses. It leads to 'pages' array overflow and hard to debug failures.	2021-12-15 10:39:17 -08:00
Alex Lapenkou	62f9c54d2a	San: Rename 'guard' to 'san' This prepares the foundation for more sanitizer-related work in the future.	2021-12-15 10:39:17 -08:00
Alex Lapenkou	d9bbf539ff	CI: Refactor gen_travis.py The CI consolidation project adds more operating systems to Travis. This refactoring is aimed to decouple the configuration of each individual OS from the actual job matrix generation and formatting. Otherwise, format_job function would turn into a huge collection of ad-hoc conditions.	2021-12-06 15:11:14 -08:00
Qi Wang	7dcf77809c	Mark slab as true on sized dealloc fast path. For sized dealloc, fastpath only handles lookup-able sizes, which must be slabs.	2021-12-06 14:28:34 -08:00
Qi Wang	af6ee27c0d	Enforce abort_conf:true when malloc_conf is not fully recognized. Ensures the malloc_conf "ends with key", "ends with comma" and "malform conf string" cases abort under abort_conf:true.	2021-12-06 14:27:25 -08:00
David CARLIER	113e8e68e1	freebsd 14 build fix proposal. seems to have introduced finally more linux api cpu affinity (sched_* family) compatibility detected at configure time thus adjusting accordingly.	2021-12-06 13:15:21 -08:00
Alex Lapenkou	3b3257a709	Correct opt.prof_leak documentation The option has been misleading, because it stays disabled unless prof_final is also specified. In practice it's impossible to detect that the option is silently disabled, because it just doesn't provide any output as if there are no memory leaks detected.	2021-11-23 15:10:21 -08:00
Qi Wang	cdabe908d0	Track the initialized state of nstime_t on debug build. Some nstime_t operations require and assume the input nstime is initialized (e.g. nstime_update) -- uninitialized input may cause silent failures which is difficult to reproduce / debug. Add an explicit flag to track the state (limited to debug build only). Also fixed an use case in hpa (time of last_purge).	2021-11-17 15:49:27 -08:00
Qi Wang	400c59895a	Fix uninitialized nstime reading / updating on the stack in hpa. In order for nstime_update to handle non-monotonic clocks, it requires the input nstime to be initialized -- when reading for the first time, zero init has to be done. Otherwise random stack value may be seen as clocks and returned.	2021-11-16 16:54:12 -08:00
Qi Wang	8b81d3f214	Fix the initialization of last_event in thread event init. The event counters maintain a relationship with the current bytes: last_event <= current < next_event. When a reinit happens (e.g. reincarnated tsd), the last event needs progressing because all events start fresh from the current bytes.	2021-11-16 10:28:00 -08:00
Qi Wang	6bdb4f5ab0	Check prof_active in addtion to opt_prof during batch_alloc().	2021-11-12 09:20:18 -08:00
Qi Wang	37342a4d32	Add ctl interface for experimental_infallible_new.	2021-11-05 13:20:09 -07:00
Alex Lapenkou	6cb585b13a	San: Unguard guarded slabs during arena destruction When opt_retain is on, slab extents remain guarded in all states, even retained. This works well if arena is never destroyed, because we anticipate those slabs will be eventually reused. But if the arena is destroyed, the slabs must be unguarded to prevent leaking guard pages.	2021-11-03 17:55:50 -07:00
Qi Wang	b6a7a535b3	Optimize away a branch on the free fastpath. On the rtree metadata lookup fast path, there will never be a NULL returned when the cache key matches (which is unknown to the compiler). The previous logic was checking for NULL return value, resulting in the extra branch (in addition to the cache key match checking). Make the lookup_fast return a bool to indicate cache miss / match, so that the extra branch is avoided.	2021-10-28 16:55:54 -07:00
Qi Wang	4d56aaeca5	Optimize away the tsd_fast() check on free fastpath. To ensure that the free fastpath can tolerate uninitialized tsd, improved the static initializer for rtree_ctx in tsd.	2021-10-28 10:05:59 -07:00
Ashutosh Grewal	26f5257b88	Remove declaration of an undefined function	2021-10-18 11:10:22 -07:00
Wang JinLong	2159615419	Add new architecture loongarch. Signed-off-by: Wang JinLong <wangjinlong@uniontech.com>	2021-10-18 10:57:34 -07:00
Alex Lapenkou	8daac7958f	Redefine functions with test hooks only for tests Android build has issues with these defines, this will allow the build to succeed if it doesn't need to build the tests.	2021-10-15 15:25:36 -07:00
Alex Lapenkou	c9ebff0fd6	Initialize deferred_work_generated As the code evolves, some code paths that have previously assigned deferred_work_generated may cease being reached. This would leave the value uninitialized. This change initializes the value for safety.	2021-10-07 11:50:38 -07:00
Stan Angelov	912324a1ac	Add debug check outside of the loop in hpa_alloc_batch. This optimizes the whole loop away for non-debug builds.	2021-10-01 14:40:43 -07:00
David CARLIER	cf9724531a	Darwin malloc_size override support proposal. Darwin has similar api than Linux/FreeBSD's malloc_usable_size.	2021-10-01 14:32:40 -07:00
Qi Wang	ab0f1604b4	Delay the atexit call to prof_log_start(). So that atexit() is only done when prof_log is used.	2021-09-29 13:35:50 -07:00
David Carlier	11b6db7448	CPU affinity on BSD platforms support.	2021-09-28 11:40:21 -07:00
Qi Wang	83f3294027	Small refactors around 7bb05e0.	2021-09-27 16:05:13 -07:00
Qi Wang	3c4b717ffc	Remove unused header base_structs.h.	2021-09-27 16:05:13 -07:00
Qi Wang	deb8e62a83	Implement guard pages. Adding guarded extents, which are regular extents surrounded by guard pages (mprotected). To reduce syscalls, small guarded extents are cached as a separate eset in ecache, and decay through the dirty / muzzy / retained pipeline as usual.	2021-09-26 16:30:15 -07:00
Piotr Balcer	7bb05e04be	add experimental.arenas_create_ext mallctl This mallctl accepts an arena_config_t structure which can be used to customize the behavior of the arena. Right now it contains extent_hooks and a new option, metadata_use_hooks, which controls whether the extent hooks are also used for metadata allocation. The medata_use_hooks option has two main use cases: 1. In heterogeneous memory systems, to avoid metadata being placed on potentially slower memory. 2. Avoiding virtual memory from being leaked as a result of metadata allocation failure originating in an extent hook.	2021-09-24 13:43:18 -07:00
Alex Lapenkou	a9031a0970	Allow setting a dump hook If users want to be notified when a heap dump occurs, they can set this hook.	2021-09-22 15:04:01 -07:00
Alex Lapenkou	f7d46b8119	Allow setting custom backtrace hook Existing backtrace implementations skip native stack frames from runtimes like Python. The hook allows to augment the backtraces to attribute allocations to native functions in heap profiles.	2021-09-22 15:04:01 -07:00
Qi Wang	523cfa55c5	Guard prof related mallctl with opt_prof. The prof initialization is done only when opt_prof is true. This change makes sure the prof_* mallctls only have limited read access (i.e. no access to prof internals) when opt_prof is false. In addition, initialize the global prof mutexes even if opt_prof is false. This makes sure the mutex stats are set properly.	2021-09-20 10:42:16 -07:00
Alex Lapenkou	6e848a005e	Remove opt_background_thread_hpa_interval_max_ms Now that HPA can communicate the time until its deferred work should be done, this option is not used anymore.	2021-09-17 16:56:41 -07:00
Alex Lapenkou	8229cc77c5	Wake up background threads on demand This change allows every allocator conforming to PAI communicate that it deferred some work for the future. Without it if a background thread goes into indefinite sleep, there is no way to notify it about upcoming deferred work.	2021-09-17 16:56:41 -07:00
Alex Lapenkou	97da57c13a	HPA: Add min_purge_interval_ms option This rate limiting option is required to avoid purging too often.	2021-09-17 16:56:41 -07:00
Alex Lapenkou	b8b8027f19	Allow PAI to calculate time until deferred work Previously the calculation of sleep time between wakeups was implemented within background_thread. This resulted in some parts of decay and hpa specific logic mixing with background thread implementation. In this change, background thread delegates this calculation to arena and it, in turn, delegates it to PAI. The next step is to implement the actual calculation of time until deferred work in HPA.	2021-09-17 16:56:41 -07:00
Alex Lapenkou	26140dd246	Reject --enable-prof-libunwind without --enable-prof Prior to the change you could specify --enable-prof-libunwind without --enable-prof which would do effectively nothing. This was confusing as I expected --enable-prof-libunwind to act like --enable-prof, but use libunwind.	2021-09-13 14:02:40 -07:00

... 2 3 4 5 6 ...

3375 Commits