server-skynet-source-3rd-jemalloc

project-base/server-skynet-source-3rd-jemalloc

Author	SHA1	Message	Date
Alex Lapenkou	f58064b932	Verify that HPA is used before calling its functions This change eliminates the possibility of PA calling functions of uninitialized HPA.	2021-08-05 16:43:28 -07:00
David Goldblatt	27f71242b7	Mutex: Tweak internal spin count. The recent pairing heap optimizations flattened the lock hold time profile. This was a win for raw cycle counts, but ended up causing us to "just miss" acquiring the mutex before sleeping more often. Bump those counts.	2021-08-05 14:33:16 -07:00
David Goldblatt	6f41ba55ee	Mutex: Make spin count configurable. Don't document it since we don't want to support this as a "real" setting, but it's handy for testing.	2021-08-05 10:13:53 -07:00
David Goldblatt	dae24589bc	PH: Insert-below-min fast-path.	2021-08-02 15:02:49 -07:00
David Goldblatt	40d53e007c	ph: Add aux-list counting and pre-merging.	2021-08-02 15:02:49 -07:00
David Goldblatt	dcb7b83fac	Eset: Cache summary information for heap edatas. This lets us do a single array scan to find first fits, instead of taking a cache miss per examined size class.	2021-08-02 15:02:49 -07:00
David Goldblatt	252e0942d0	Eset: Pull per-pszind data into structs. We currently have one for stats and one for the data. The data struct is just a wrapper around the edata_heap_t, but this will change shortly.	2021-08-02 15:02:49 -07:00
David Goldblatt	dc0a4b8b2f	Edata: Pull out comparison fields into a summary. For now, this is a no-op; eventually, it will allow some caching in the eset.	2021-08-02 15:02:49 -07:00
David Goldblatt	0170dd198a	Edata: Fix a couple typos. Some readability-enhancing whitespace, and a spelling error.	2021-08-02 15:02:49 -07:00
David Goldblatt	08a4cc0969	Pairing heap: inline functions instead of macros. By force-inlining everything that would otherwise be a macro, we get the same effect (it's not clear in the first place that this is actually a good idea, but it avoids making any changes to the existing performance profile). This makes the code more maintainable (in anticipation of subsequent changes), as well as making performance profiles and debug info more readable (we get "real" line numbers, instead of making everything point to the macro definition of all associated functions).	2021-08-02 15:02:49 -07:00
David Goldblatt	92a1e38f52	edata_cache: Allow unbounded fast caching. The edata_cache_small had a fill/flush heuristic. In retrospect, this was a premature optimization; more testing indicates that an unbounded cache is effectively fine here, and moreover we spend a nontrivial amount of time doing unnecessary filling/flushing. As the HPA takes on a larger and larger fraction of all allocations, any theoretical differences in allocation patterns should shrink. The HPA is more efficient with its metadata in general, so it still comes out ahead on metadata usage anyways.	2021-07-26 15:14:37 -07:00
David Goldblatt	d93eef2f40	HPA: Introduce a redesigned hpa_central_t. For now, this only handles allocating virtual address space to shards, with no reuse. This is framework, though; it will change over time.	2021-07-23 21:59:59 -07:00
David Goldblatt	e09eac1d4e	Remove hpa_central. This is now dead code.	2021-07-23 21:59:59 -07:00
Alex Lapenkou	c88fe355e6	Add unit tests for decay After slight changes in the interface, it's an opportunity to enhance unit tests.	2021-07-22 23:19:09 -07:00
Alex Lapenkou	aaea4fd1e6	Add more documentation to decay.c It took me a while to understand why some things are implemented the way they are, so hopefully it will help future readers.	2021-07-22 23:19:09 -07:00
Alex Lapenkou	4b633b9a81	Clean up background thread sleep computation Isolate the computation of purge interval from background thread logic and move into more suitable file.	2021-07-22 23:19:09 -07:00
David Goldblatt	6630c59896	HPA: Hugification hysteresis. We wait a while after deciding a huge extent should get hugified to see if it gets purged before long. This avoids hugifying extents that might shortly get dehugified for purging. Rename and use the hpa_dehugification_threshold option support code for this, since it's now ignored.	2021-07-12 17:59:18 -07:00
David Goldblatt	113938b6f4	HPA: Pull out a hooks type. For now, this is a no-op change. In a subsequent commit, it will be useful for testing.	2021-07-12 17:59:18 -07:00
David Goldblatt	1d4a7666d5	HPA: Do deferred operations on background threads.	2021-07-12 17:59:18 -07:00
David Goldblatt	583284f2d9	Add HPA deferral functionality.	2021-07-12 17:59:18 -07:00
David Goldblatt	ace329d11b	HPA batch dalloc: Just do one deferred work check. We only need to do one check per batch dalloc, not one check per dalloc in the batch.	2021-07-12 17:59:18 -07:00
David Goldblatt	47d8a7e6b0	psset: Purge empty slabs first. These are particularly good candidates for purging (listed in the diff).	2021-07-12 17:59:18 -07:00
David Goldblatt	41fd56605e	HPA: Purge across retained extents. This lets us cut down on the number of expensive system calls we perform.	2021-07-12 17:59:18 -07:00
David Goldblatt	347523517b	PAI: Fix a typo.	2021-07-12 17:59:11 -07:00
David Goldblatt	9c42ed2d14	Travis: Don't test "clang" on OS X. On OS X, "gcc" is really just clang anyways, so this combination gets tested by the gcc test. This is purely redundant, and (since it runs early in the output) increases time to signal for real breakages further down in the list.	2021-07-08 09:53:28 -07:00
David Goldblatt	d202218e86	HPA: Fix typos with big performance implications. This fixes two simple but significant typos in the HPA: - The conf string parsing accidentally set a min value of PAGE for hpa_sec_batch_fill_extra; i.e. allocating 4096 extra pages every time we attempted to allocate a single page. This puts us over the SEC flush limit, so we then immediately flush all but one of them (probably triggering purging). - The HPA was using the default PAI batch alloc implementation, which meant it did not actually get any locking advantages. This snuck by because I did all the performance testing without using the PAI interface or config settings. When I cleaned it up and put everything behind nice interfaces, I only did correctness checks, and didn't try any performance ones.	2021-06-24 16:26:55 -07:00
David Goldblatt	de033f56c0	mpsc_queue: Add module. This is a simple multi-producer, single-consumer queue. The intended use case is in the HPA, as we begin supporting hpdatas that move between hpa_shards. We take just a single CAS as the cost to send a message (or a batch of messages) in the low-contention case, and lock-freedom lets us avoid some lock-ordering issues.	2021-06-24 14:55:49 -07:00
David Goldblatt	4452a4812f	Add opt.experimental_infallible_new. This allows a guarantee that operator new never throws. Fix the .gitignore rules to include test/integration/cpp while we're here.	2021-06-24 12:22:51 -07:00
David Goldblatt	0689448b1e	Travis: Unbreak the builds. In the hopes of future-proofing as much as possible, jump to the latest distribution Travis supports.	2021-06-24 07:40:28 -07:00
David Carlier	4fb93a18ee	extent_can_acquire_neighbor typo fix	2021-06-19 08:13:11 -07:00
Vineet Gupta	2381efab57	ARC: add Minimum allocation alignment Signed-off-by: Vineet Gupta <vgupta@synopsys.com>	2021-06-03 13:43:38 -07:00
Ondřej Surý	2c0f4c2ac3	Fix typo in configure.ac: experimetal -> experimental	2021-05-25 08:20:37 -07:00
David Goldblatt	36c6bfb963	SEC: Allow arbitrarily many shards, cached sizes.	2021-05-22 08:17:41 -07:00
Deanna Gelbart	11beab38bc	Added --debug-syms-by-id option	2021-05-17 10:00:40 -07:00
Qi Wang	08089589f7	Fix an interaction between the oversize_threshold test and bgthds. Also added the shared utility to check if background_thread is enabled.	2021-05-13 16:19:14 -07:00
David Goldblatt	5417938215	Red-black tree: add summarize/filter. This allows tracking extra information in the nodes of an red-black tree to filter searches in the tree to just those that match some property.	2021-05-12 11:14:23 -07:00
David Goldblatt	b2c08ef2e6	RB unit tests: don't test reentrantly. The RB code doesn't do any allocation, and takes a little bit of time to run. There's no sense in doing everything three times.	2021-05-12 11:14:23 -07:00
David Goldblatt	aea91b8c33	Clean up some minor data structure inconsistencies Namely, unify the include guard styling with the majority of the project, and do flat_bitmap -> fb, to match its naming convention.	2021-05-12 11:14:23 -07:00
David Goldblatt	1f688490e1	Stats: Fix a printing bug when hpa_dirty_mult = -1 Missed a layer of indirection.	2021-05-05 19:45:25 -07:00
David Goldblatt	4f7cb3a413	Sized deallocation: fix a typo. dealloction -> deallocation.	2021-05-04 16:46:15 -07:00
David Goldblatt	12cd13cd41	Fix thread.name/prof_sys_thread_name interaction When prof_sys_thread_name is true, we don't allow setting the thread name. Teach the unit test this.	2021-03-31 14:45:12 -07:00
David Goldblatt	304cdbb132	Fix a prof_recent/prof_sys_thread_name interaction When both of these are enabled, the output format changes slightly. Teach the unit test about the interaction.	2021-03-31 14:45:12 -07:00
Qi Wang	9b523c6c15	Refactor the locking in extent_recycle(). Hold the ecache lock across extent_recycle_extract() and extent_recycle_split(), so that the extent_deactivate after split can avoid re-take the ecache mutex.	2021-03-31 14:42:33 -07:00
Qi Wang	ce68f326b0	Avoid the release & re-acquire of the ecache locks around the merge hook.	2021-03-31 14:42:33 -07:00
Qi Wang	7dc77527ba	Delete the mutex_pool module.	2021-03-29 17:19:53 -07:00
Qi Wang	03d95cba88	Remove the unnecessary arena_ind_set in base_alloc_edata(). All edata alloc sites are already followed with proper edata_init().	2021-03-29 17:19:53 -07:00
Qi Wang	3093d9455e	Move the edata mergeability related functions to extent.h.	2021-03-29 17:19:53 -07:00
Qi Wang	7c964b0352	Add rtree_write_range(): writing the same content to multiple leaf elements. Apply to emap_(de)register_interior which became noticeable in perf profiles.	2021-03-29 17:19:53 -07:00
Qi Wang	add636596a	Stop checking head state in the merge hook. Now that all merging go through try_acquire_edata_neighbor, the mergeablility checks (including head state checking) are done before reaching the merge hook. In other words, merge hook will never be called if the head state doesn't agree.	2021-03-29 17:19:53 -07:00
Qi Wang	49b7d7f0a4	Passing down the original edata on the expand path. Instead of passing down the new_addr, pass down the active edata which allows us to always use a neighbor-acquiring semantic. In other words, this tells us both the original edata and neighbor address. With this change, only neighbors of a "known" edata can be acquired, i.e. acquiring an edata based on an arbitrary address isn't possible anymore.	2021-03-29 17:19:53 -07:00

1 2 3 4 5 ...

3168 Commits