server-skynet-source-3rd-jemalloc

project-base/server-skynet-source-3rd-jemalloc

Author	SHA1	Message	Date
Qi Wang	83f3294027	Small refactors around `7bb05e0`.	2021-09-27 16:05:13 -07:00
Qi Wang	deb8e62a83	Implement guard pages. Adding guarded extents, which are regular extents surrounded by guard pages (mprotected). To reduce syscalls, small guarded extents are cached as a separate eset in ecache, and decay through the dirty / muzzy / retained pipeline as usual.	2021-09-26 16:30:15 -07:00
Piotr Balcer	7bb05e04be	add experimental.arenas_create_ext mallctl This mallctl accepts an arena_config_t structure which can be used to customize the behavior of the arena. Right now it contains extent_hooks and a new option, metadata_use_hooks, which controls whether the extent hooks are also used for metadata allocation. The medata_use_hooks option has two main use cases: 1. In heterogeneous memory systems, to avoid metadata being placed on potentially slower memory. 2. Avoiding virtual memory from being leaked as a result of metadata allocation failure originating in an extent hook.	2021-09-24 13:43:18 -07:00
Alex Lapenkou	a9031a0970	Allow setting a dump hook If users want to be notified when a heap dump occurs, they can set this hook.	2021-09-22 15:04:01 -07:00
Alex Lapenkou	f7d46b8119	Allow setting custom backtrace hook Existing backtrace implementations skip native stack frames from runtimes like Python. The hook allows to augment the backtraces to attribute allocations to native functions in heap profiles.	2021-09-22 15:04:01 -07:00
Qi Wang	523cfa55c5	Guard prof related mallctl with opt_prof. The prof initialization is done only when opt_prof is true. This change makes sure the prof_* mallctls only have limited read access (i.e. no access to prof internals) when opt_prof is false. In addition, initialize the global prof mutexes even if opt_prof is false. This makes sure the mutex stats are set properly.	2021-09-20 10:42:16 -07:00
Alex Lapenkou	6e848a005e	Remove opt_background_thread_hpa_interval_max_ms Now that HPA can communicate the time until its deferred work should be done, this option is not used anymore.	2021-09-17 16:56:41 -07:00
Alex Lapenkou	8229cc77c5	Wake up background threads on demand This change allows every allocator conforming to PAI communicate that it deferred some work for the future. Without it if a background thread goes into indefinite sleep, there is no way to notify it about upcoming deferred work.	2021-09-17 16:56:41 -07:00
Alex Lapenkou	b8b8027f19	Allow PAI to calculate time until deferred work Previously the calculation of sleep time between wakeups was implemented within background_thread. This resulted in some parts of decay and hpa specific logic mixing with background thread implementation. In this change, background thread delegates this calculation to arena and it, in turn, delegates it to PAI. The next step is to implement the actual calculation of time until deferred work in HPA.	2021-09-17 16:56:41 -07:00
Qi Wang	8b24cb8fdf	Don't assume initialized arena in the default alloc hook. Specifically, this change allows the default alloc hook to used during arenas.create. One use case is to invoke the default alloc hook in a customized hook arena, i.e. the default hooks can be read out of a default arena, then create customized ones based on these hooks. Note that mixing the default with customized hooks is not recommended, and should only be considered when the customization is simple and straightforward.	2021-08-25 14:19:25 -07:00
Qi Wang	5884a076fb	Rename prof.dump_prefix to prof.prefix This better aligns with our naming convention. The option has not been included in any upstream release yet.	2021-08-12 23:04:29 -07:00
David Goldblatt	08a4cc0969	Pairing heap: inline functions instead of macros. By force-inlining everything that would otherwise be a macro, we get the same effect (it's not clear in the first place that this is actually a good idea, but it avoids making any changes to the existing performance profile). This makes the code more maintainable (in anticipation of subsequent changes), as well as making performance profiles and debug info more readable (we get "real" line numbers, instead of making everything point to the macro definition of all associated functions).	2021-08-02 15:02:49 -07:00
David Goldblatt	92a1e38f52	edata_cache: Allow unbounded fast caching. The edata_cache_small had a fill/flush heuristic. In retrospect, this was a premature optimization; more testing indicates that an unbounded cache is effectively fine here, and moreover we spend a nontrivial amount of time doing unnecessary filling/flushing. As the HPA takes on a larger and larger fraction of all allocations, any theoretical differences in allocation patterns should shrink. The HPA is more efficient with its metadata in general, so it still comes out ahead on metadata usage anyways.	2021-07-26 15:14:37 -07:00
David Goldblatt	d93eef2f40	HPA: Introduce a redesigned hpa_central_t. For now, this only handles allocating virtual address space to shards, with no reuse. This is framework, though; it will change over time.	2021-07-23 21:59:59 -07:00
David Goldblatt	e09eac1d4e	Remove hpa_central. This is now dead code.	2021-07-23 21:59:59 -07:00
Alex Lapenkou	c88fe355e6	Add unit tests for decay After slight changes in the interface, it's an opportunity to enhance unit tests.	2021-07-22 23:19:09 -07:00
David Goldblatt	6630c59896	HPA: Hugification hysteresis. We wait a while after deciding a huge extent should get hugified to see if it gets purged before long. This avoids hugifying extents that might shortly get dehugified for purging. Rename and use the hpa_dehugification_threshold option support code for this, since it's now ignored.	2021-07-12 17:59:18 -07:00
David Goldblatt	113938b6f4	HPA: Pull out a hooks type. For now, this is a no-op change. In a subsequent commit, it will be useful for testing.	2021-07-12 17:59:18 -07:00
David Goldblatt	1d4a7666d5	HPA: Do deferred operations on background threads.	2021-07-12 17:59:18 -07:00
David Goldblatt	47d8a7e6b0	psset: Purge empty slabs first. These are particularly good candidates for purging (listed in the diff).	2021-07-12 17:59:18 -07:00
David Goldblatt	41fd56605e	HPA: Purge across retained extents. This lets us cut down on the number of expensive system calls we perform.	2021-07-12 17:59:18 -07:00
David Goldblatt	de033f56c0	mpsc_queue: Add module. This is a simple multi-producer, single-consumer queue. The intended use case is in the HPA, as we begin supporting hpdatas that move between hpa_shards. We take just a single CAS as the cost to send a message (or a batch of messages) in the low-contention case, and lock-freedom lets us avoid some lock-ordering issues.	2021-06-24 14:55:49 -07:00
David Goldblatt	4452a4812f	Add opt.experimental_infallible_new. This allows a guarantee that operator new never throws. Fix the .gitignore rules to include test/integration/cpp while we're here.	2021-06-24 12:22:51 -07:00
David Goldblatt	0689448b1e	Travis: Unbreak the builds. In the hopes of future-proofing as much as possible, jump to the latest distribution Travis supports.	2021-06-24 07:40:28 -07:00
David Goldblatt	36c6bfb963	SEC: Allow arbitrarily many shards, cached sizes.	2021-05-22 08:17:41 -07:00
Qi Wang	08089589f7	Fix an interaction between the oversize_threshold test and bgthds. Also added the shared utility to check if background_thread is enabled.	2021-05-13 16:19:14 -07:00
David Goldblatt	5417938215	Red-black tree: add summarize/filter. This allows tracking extra information in the nodes of an red-black tree to filter searches in the tree to just those that match some property.	2021-05-12 11:14:23 -07:00
David Goldblatt	b2c08ef2e6	RB unit tests: don't test reentrantly. The RB code doesn't do any allocation, and takes a little bit of time to run. There's no sense in doing everything three times.	2021-05-12 11:14:23 -07:00
David Goldblatt	aea91b8c33	Clean up some minor data structure inconsistencies Namely, unify the include guard styling with the majority of the project, and do flat_bitmap -> fb, to match its naming convention.	2021-05-12 11:14:23 -07:00
David Goldblatt	12cd13cd41	Fix thread.name/prof_sys_thread_name interaction When prof_sys_thread_name is true, we don't allow setting the thread name. Teach the unit test this.	2021-03-31 14:45:12 -07:00
David Goldblatt	304cdbb132	Fix a prof_recent/prof_sys_thread_name interaction When both of these are enabled, the output format changes slightly. Teach the unit test about the interaction.	2021-03-31 14:45:12 -07:00
Qi Wang	7c964b0352	Add rtree_write_range(): writing the same content to multiple leaf elements. Apply to emap_(de)register_interior which became noticeable in perf profiles.	2021-03-29 17:19:53 -07:00
Qi Wang	4d8c22f9a5	Store edata->state in rtree leaf and make edata_t 128B aligned. Verified that this doesn't result in any real increase of edata_t bytes allocated.	2021-03-29 17:19:53 -07:00
Qi Wang	70d1541c5b	Track extent is_head state in rtree leaf.	2021-03-29 17:19:53 -07:00
David Goldblatt	73ca4b8ef8	HPA: Use dirtiest-first purging. This seems to be practically beneficial, despite some pathological corner cases.	2021-02-19 15:10:54 -08:00
David Goldblatt	d21d5b46b6	Edata: Move sn into its own field. This lets the bins use a fragmentation avoidance policy that matches the HPA's (without affecting the PAC).	2021-02-19 15:10:54 -08:00
David Goldblatt	fb327368db	SEC: Expand option configurability. This change pulls the SEC options into a struct, which simplifies their handling across various modules (e.g. PA needs to forward on SEC options from the malloc_conf string, but it doesn't really need to know their names). While we're here, make some of the fixed constants configurable, and unify naming from the configuration options to the internals.	2021-02-19 15:10:54 -08:00
David Goldblatt	ce9386370a	HPA: Implement batch allocation.	2021-02-19 15:10:54 -08:00
David Goldblatt	cdae6706a6	SEC: Use batch fills. Currently, this doesn't help much, since no PAI implementation supports flushing. This will change in subsequent commits.	2021-02-19 15:10:54 -08:00
David Goldblatt	480f3b11cd	Add a batch allocation interface to the PAI. For now, no real allocator actually implements this interface; this will change in subsequent diffs.	2021-02-19 15:10:54 -08:00
David Goldblatt	bf448d7a5a	SEC: Reduce lock hold times. Only flush a subset of extents during flushing, and drop the lock while doing so.	2021-02-19 15:10:54 -08:00
David Goldblatt	f47b4c2cd8	PAI/SEC: Add a dalloc_batch function. This lets the SEC flush all of its items in a single call, rather than flushing everything at once.	2021-02-19 15:10:54 -08:00
Qi Wang	a11be50332	Implement opt.cache_oblivious. Keep config.cache_oblivious for now to remain backward-compatible.	2021-02-11 11:32:01 -08:00
David Goldblatt	b3df80bc79	Pull HPA options into a containing struct. Currently that just means max_alloc, but we're about to add more. While we're touching these lines anyways, tweak things to be more in line with testing.	2021-02-04 20:58:31 -08:00
David Goldblatt	bdb7307ff2	fxp: Add FXP_INIT_PERCENT This lets us specify fxp values easily in source.	2021-02-04 20:58:31 -08:00
David Goldblatt	caef4c2868	FXP: add fxp_mul_frac. This can multiply size_ts by a fraction without the risk of overflow.	2021-02-04 20:58:31 -08:00
David Goldblatt	dc886e5608	hpdata: Return the number of pages to be purged. We'll use this in the next commit.	2021-02-04 20:58:31 -08:00
David Goldblatt	9fd9c876bb	psset: keep aggregate stats. This will let us quickly query these stats to make purging decisions quickly.	2021-02-04 20:58:31 -08:00
David Goldblatt	da63f23e68	HPA: Track pending purges/hugifies in the psset. This finishes the refactoring of the HPA/psset interactions the past few commits have been building towards. Rather than the HPA removing and then reinserting hpdatas, it simply begins updates and ends them. These updates can set flags on the hpdata that prevent it from being returned for certain types of requests. For example, it can call hpdata_alloc_allowed_set(hpdata, false) during an update, at which point the given hpdata will no longer be returned for psset_pick_alloc requests. This has various of benefits: - It maintains stats correctness during purges and hugifies. - It allows simpler and more explicit concurrency control for the various special cases (e.g. allocations are disallowed during purge, but not during hugify). - It lets allocations and deallocations avoid disturbing the purging and hugification orderings. If an hpdata "loses its place" in one of the queues just do to an alloc / dalloc, it can result in pathological edge cases where very hot, very full hugepages never get hugified (and cold extents on the same hugepage as hot ones never get purged). The key benefit though is that tracking hpdatas to be purged / hugified in a principled way will let us do delayed purging and hugification. Eventually this will let us move these operations to background threads, but in the short term the benefit is that it will let us have global purging policies (e.g. purge when the entire arena has too many dirty pages, rather than any particular hugepage).	2021-02-04 20:58:31 -08:00
David Goldblatt	bf64557ed6	Move empty slab tracking to the psset. We're moving towards a world in which purging decisions are less rigidly enforced at a single-hugepage level. In that world, it makes sense to keep around some hpdatas which are not completely purged, in which case we'll need to track them.	2021-02-04 20:58:31 -08:00

1 2 3 4 5 ...

714 Commits