server-skynet-source-3rd-jemalloc

project-base/server-skynet-source-3rd-jemalloc

Author	SHA1	Message	Date
David CARLIER	cf9724531a	Darwin malloc_size override support proposal. Darwin has similar api than Linux/FreeBSD's malloc_usable_size.	2021-10-01 14:32:40 -07:00
Qi Wang	ab0f1604b4	Delay the atexit call to prof_log_start(). So that atexit() is only done when prof_log is used.	2021-09-29 13:35:50 -07:00
David Carlier	11b6db7448	CPU affinity on BSD platforms support.	2021-09-28 11:40:21 -07:00
Qi Wang	83f3294027	Small refactors around 7bb05e0.	2021-09-27 16:05:13 -07:00
Qi Wang	3c4b717ffc	Remove unused header base_structs.h.	2021-09-27 16:05:13 -07:00
Qi Wang	deb8e62a83	Implement guard pages. Adding guarded extents, which are regular extents surrounded by guard pages (mprotected). To reduce syscalls, small guarded extents are cached as a separate eset in ecache, and decay through the dirty / muzzy / retained pipeline as usual.	2021-09-26 16:30:15 -07:00
Piotr Balcer	7bb05e04be	add experimental.arenas_create_ext mallctl This mallctl accepts an arena_config_t structure which can be used to customize the behavior of the arena. Right now it contains extent_hooks and a new option, metadata_use_hooks, which controls whether the extent hooks are also used for metadata allocation. The medata_use_hooks option has two main use cases: 1. In heterogeneous memory systems, to avoid metadata being placed on potentially slower memory. 2. Avoiding virtual memory from being leaked as a result of metadata allocation failure originating in an extent hook.	2021-09-24 13:43:18 -07:00
Alex Lapenkou	a9031a0970	Allow setting a dump hook If users want to be notified when a heap dump occurs, they can set this hook.	2021-09-22 15:04:01 -07:00
Alex Lapenkou	f7d46b8119	Allow setting custom backtrace hook Existing backtrace implementations skip native stack frames from runtimes like Python. The hook allows to augment the backtraces to attribute allocations to native functions in heap profiles.	2021-09-22 15:04:01 -07:00
Qi Wang	523cfa55c5	Guard prof related mallctl with opt_prof. The prof initialization is done only when opt_prof is true. This change makes sure the prof_* mallctls only have limited read access (i.e. no access to prof internals) when opt_prof is false. In addition, initialize the global prof mutexes even if opt_prof is false. This makes sure the mutex stats are set properly.	2021-09-20 10:42:16 -07:00
Alex Lapenkou	6e848a005e	Remove opt_background_thread_hpa_interval_max_ms Now that HPA can communicate the time until its deferred work should be done, this option is not used anymore.	2021-09-17 16:56:41 -07:00
Alex Lapenkou	8229cc77c5	Wake up background threads on demand This change allows every allocator conforming to PAI communicate that it deferred some work for the future. Without it if a background thread goes into indefinite sleep, there is no way to notify it about upcoming deferred work.	2021-09-17 16:56:41 -07:00
Alex Lapenkou	97da57c13a	HPA: Add min_purge_interval_ms option This rate limiting option is required to avoid purging too often.	2021-09-17 16:56:41 -07:00
Alex Lapenkou	b8b8027f19	Allow PAI to calculate time until deferred work Previously the calculation of sleep time between wakeups was implemented within background_thread. This resulted in some parts of decay and hpa specific logic mixing with background thread implementation. In this change, background thread delegates this calculation to arena and it, in turn, delegates it to PAI. The next step is to implement the actual calculation of time until deferred work in HPA.	2021-09-17 16:56:41 -07:00
Alex Lapenkou	26140dd246	Reject --enable-prof-libunwind without --enable-prof Prior to the change you could specify --enable-prof-libunwind without --enable-prof which would do effectively nothing. This was confusing as I expected --enable-prof-libunwind to act like --enable-prof, but use libunwind.	2021-09-13 14:02:40 -07:00
Mingli Yu	e5062e9fb9	Makefile.in: make sure doc generated before install There is a race between the doc generation and the doc installation, so make the install depend on the build for doc. Signed-off-by: Mingli Yu <mingli.yu@windriver.com>	2021-09-13 13:40:39 -07:00
Qi Wang	8b24cb8fdf	Don't assume initialized arena in the default alloc hook. Specifically, this change allows the default alloc hook to used during arenas.create. One use case is to invoke the default alloc hook in a customized hook arena, i.e. the default hooks can be read out of a default arena, then create customized ones based on these hooks. Note that mixing the default with customized hooks is not recommended, and should only be considered when the customization is simple and straightforward.	2021-08-25 14:19:25 -07:00
Alex Lapenkou	c01a885e94	HPA: Correctly calculate retained pages Retained pages are those which haven't been touched and are unbacked from OS perspective. For a pageslab their number should equal "total pages in slab" minus "touched pages".	2021-08-20 18:06:17 -07:00
Alex Lapenkou	2c625d5cd9	Fix warnings when compiled with clang When clang sees an unknown warning option, unlike gcc it doesn't fail the build with error. It issues a warning. Hence JE_CFLAGS_ADD with warning options that didnt't exist in clang would still mark those options as available. This led to several warnings when built with clang or "gcc" on OSX. This change fixes those warnings by simply making clang fail builds with non-existent warning options.	2021-08-13 14:14:46 -07:00
Alex Lapenkou	9d02bdc883	Port gen_run_tests.py to python3 Insignificant changes to make the script runnable on python3.	2021-08-13 10:59:32 -07:00
Qi Wang	5884a076fb	Rename prof.dump_prefix to prof.prefix This better aligns with our naming convention. The option has not been included in any upstream release yet.	2021-08-12 23:04:29 -07:00
Qi Wang	6a01600712	Add Cirrus CI testing matrix Contains 16 testing configs -- a mix of debug, prof, -m32 and a few uncommon options.	2021-08-10 09:59:10 -07:00
Alex Lapenkou	f58064b932	Verify that HPA is used before calling its functions This change eliminates the possibility of PA calling functions of uninitialized HPA.	2021-08-05 16:43:28 -07:00
David Goldblatt	27f71242b7	Mutex: Tweak internal spin count. The recent pairing heap optimizations flattened the lock hold time profile. This was a win for raw cycle counts, but ended up causing us to "just miss" acquiring the mutex before sleeping more often. Bump those counts.	2021-08-05 14:33:16 -07:00
David Goldblatt	6f41ba55ee	Mutex: Make spin count configurable. Don't document it since we don't want to support this as a "real" setting, but it's handy for testing.	2021-08-05 10:13:53 -07:00
David Goldblatt	dae24589bc	PH: Insert-below-min fast-path.	2021-08-02 15:02:49 -07:00
David Goldblatt	40d53e007c	ph: Add aux-list counting and pre-merging.	2021-08-02 15:02:49 -07:00
David Goldblatt	dcb7b83fac	Eset: Cache summary information for heap edatas. This lets us do a single array scan to find first fits, instead of taking a cache miss per examined size class.	2021-08-02 15:02:49 -07:00
David Goldblatt	252e0942d0	Eset: Pull per-pszind data into structs. We currently have one for stats and one for the data. The data struct is just a wrapper around the edata_heap_t, but this will change shortly.	2021-08-02 15:02:49 -07:00
David Goldblatt	dc0a4b8b2f	Edata: Pull out comparison fields into a summary. For now, this is a no-op; eventually, it will allow some caching in the eset.	2021-08-02 15:02:49 -07:00
David Goldblatt	0170dd198a	Edata: Fix a couple typos. Some readability-enhancing whitespace, and a spelling error.	2021-08-02 15:02:49 -07:00
David Goldblatt	08a4cc0969	Pairing heap: inline functions instead of macros. By force-inlining everything that would otherwise be a macro, we get the same effect (it's not clear in the first place that this is actually a good idea, but it avoids making any changes to the existing performance profile). This makes the code more maintainable (in anticipation of subsequent changes), as well as making performance profiles and debug info more readable (we get "real" line numbers, instead of making everything point to the macro definition of all associated functions).	2021-08-02 15:02:49 -07:00
David Goldblatt	92a1e38f52	edata_cache: Allow unbounded fast caching. The edata_cache_small had a fill/flush heuristic. In retrospect, this was a premature optimization; more testing indicates that an unbounded cache is effectively fine here, and moreover we spend a nontrivial amount of time doing unnecessary filling/flushing. As the HPA takes on a larger and larger fraction of all allocations, any theoretical differences in allocation patterns should shrink. The HPA is more efficient with its metadata in general, so it still comes out ahead on metadata usage anyways.	2021-07-26 15:14:37 -07:00
David Goldblatt	d93eef2f40	HPA: Introduce a redesigned hpa_central_t. For now, this only handles allocating virtual address space to shards, with no reuse. This is framework, though; it will change over time.	2021-07-23 21:59:59 -07:00
David Goldblatt	e09eac1d4e	Remove hpa_central. This is now dead code.	2021-07-23 21:59:59 -07:00
Alex Lapenkou	c88fe355e6	Add unit tests for decay After slight changes in the interface, it's an opportunity to enhance unit tests.	2021-07-22 23:19:09 -07:00
Alex Lapenkou	aaea4fd1e6	Add more documentation to decay.c It took me a while to understand why some things are implemented the way they are, so hopefully it will help future readers.	2021-07-22 23:19:09 -07:00
Alex Lapenkou	4b633b9a81	Clean up background thread sleep computation Isolate the computation of purge interval from background thread logic and move into more suitable file.	2021-07-22 23:19:09 -07:00
David Goldblatt	6630c59896	HPA: Hugification hysteresis. We wait a while after deciding a huge extent should get hugified to see if it gets purged before long. This avoids hugifying extents that might shortly get dehugified for purging. Rename and use the hpa_dehugification_threshold option support code for this, since it's now ignored.	2021-07-12 17:59:18 -07:00
David Goldblatt	113938b6f4	HPA: Pull out a hooks type. For now, this is a no-op change. In a subsequent commit, it will be useful for testing.	2021-07-12 17:59:18 -07:00
David Goldblatt	1d4a7666d5	HPA: Do deferred operations on background threads.	2021-07-12 17:59:18 -07:00
David Goldblatt	583284f2d9	Add HPA deferral functionality.	2021-07-12 17:59:18 -07:00
David Goldblatt	ace329d11b	HPA batch dalloc: Just do one deferred work check. We only need to do one check per batch dalloc, not one check per dalloc in the batch.	2021-07-12 17:59:18 -07:00
David Goldblatt	47d8a7e6b0	psset: Purge empty slabs first. These are particularly good candidates for purging (listed in the diff).	2021-07-12 17:59:18 -07:00
David Goldblatt	41fd56605e	HPA: Purge across retained extents. This lets us cut down on the number of expensive system calls we perform.	2021-07-12 17:59:18 -07:00
David Goldblatt	347523517b	PAI: Fix a typo.	2021-07-12 17:59:11 -07:00
David Goldblatt	9c42ed2d14	Travis: Don't test "clang" on OS X. On OS X, "gcc" is really just clang anyways, so this combination gets tested by the gcc test. This is purely redundant, and (since it runs early in the output) increases time to signal for real breakages further down in the list.	2021-07-08 09:53:28 -07:00
David Goldblatt	d202218e86	HPA: Fix typos with big performance implications. This fixes two simple but significant typos in the HPA: - The conf string parsing accidentally set a min value of PAGE for hpa_sec_batch_fill_extra; i.e. allocating 4096 extra pages every time we attempted to allocate a single page. This puts us over the SEC flush limit, so we then immediately flush all but one of them (probably triggering purging). - The HPA was using the default PAI batch alloc implementation, which meant it did not actually get any locking advantages. This snuck by because I did all the performance testing without using the PAI interface or config settings. When I cleaned it up and put everything behind nice interfaces, I only did correctness checks, and didn't try any performance ones.	2021-06-24 16:26:55 -07:00
David Goldblatt	de033f56c0	mpsc_queue: Add module. This is a simple multi-producer, single-consumer queue. The intended use case is in the HPA, as we begin supporting hpdatas that move between hpa_shards. We take just a single CAS as the cost to send a message (or a batch of messages) in the low-contention case, and lock-freedom lets us avoid some lock-ordering issues.	2021-06-24 14:55:49 -07:00
David Goldblatt	4452a4812f	Add opt.experimental_infallible_new. This allows a guarantee that operator new never throws. Fix the .gitignore rules to include test/integration/cpp while we're here.	2021-06-24 12:22:51 -07:00

... 2 3 4 5 6 ...

3340 Commits