server-skynet-source-3rd-jemalloc

project-base/server-skynet-source-3rd-jemalloc

Author	SHA1	Message	Date
Qi Wang	72cfdce718	Allocate tcache stack from base allocator When using metadata_thp, allocate tcache bin stacks from base0, which means they will be placed on huge pages along with other metadata, instead of mixed with other regular allocations. In order to do so, modified the base allocator to support limited reuse: freed tcached stacks (from thread termination) will be returned to base0 and made available for reuse, but no merging will be attempted since they were bump allocated out of base blocks. These reused base extents are managed using separately allocated base edata_t -- they are cached in base->edata_avail when the extent is all allocated. One tricky part is, stats updating must be skipped for such reused extents (since they were accounted for already, and there is no purging for base). This requires tracking the "if is reused" state explicitly and bypass the stats updates when allocating from them.	2023-09-18 12:18:32 -07:00
Kevin Svetlitski	3e82f357bb	Fix all optimization-inhibiting integer-to-pointer casts Following from PR #2481, we replace all integer-to-pointer casts [which hide pointer provenance information (and thus inhibit optimizations)](https://clang.llvm.org/extra/clang-tidy/checks/performance/no-int-to-ptr.html) with equivalent operations that preserve this information. I have enabled the corresponding clang-tidy check in our static analysis CI so that we do not get bitten by this again in the future.	2023-07-24 14:40:42 -07:00
Kevin Svetlitski	41e0b857be	Make headers self-contained by fixing `#include`s Header files are now self-contained, which makes the relationships between the files clearer, and crucially allows LSP tools like `clangd` to function correctly in all of our header files. I have verified that the headers are self-contained (aside from the various Windows shims) by compiling them as if they were C files – in a follow-up commit I plan to add this to CI to ensure we don't regress on this front.	2023-07-14 09:06:32 -07:00
Kevin Svetlitski	6841110bd6	Make `edata_cmp_summary_comp` 30% faster `edata_cmp_summary_comp` is one of the very hottest functions, taking up 3% of all time spent inside Jemalloc. I noticed that all existing callsites rely only on the sign of the value returned by this function, so I came up with this equivalent branchless implementation which preserves this property. After empirical measurement, I have found that this implementation is 30% faster, therefore representing a 1% speed-up to the allocator as a whole. At @interwq's suggestion, I've applied the same optimization to `edata_esnead_comp` in case this function becomes hotter in the future.	2023-05-04 09:59:17 -07:00
Yuriy Chernyshov	70d4102f48	Fix compiling edata.h with MSVC At the time an attempt to compile jemalloc 5.3.0 with MSVC 2019 results in the followin error message: > jemalloc/include/jemalloc/internal/edata.h:660: error C4576: a parenthesized type followed by an initializer list is a non-standard explicit type conversion syntax	2022-05-09 14:51:07 -07:00
Qi Wang	deb8e62a83	Implement guard pages. Adding guarded extents, which are regular extents surrounded by guard pages (mprotected). To reduce syscalls, small guarded extents are cached as a separate eset in ecache, and decay through the dirty / muzzy / retained pipeline as usual.	2021-09-26 16:30:15 -07:00
David Goldblatt	dc0a4b8b2f	Edata: Pull out comparison fields into a summary. For now, this is a no-op; eventually, it will allow some caching in the eset.	2021-08-02 15:02:49 -07:00
David Goldblatt	0170dd198a	Edata: Fix a couple typos. Some readability-enhancing whitespace, and a spelling error.	2021-08-02 15:02:49 -07:00
David Goldblatt	08a4cc0969	Pairing heap: inline functions instead of macros. By force-inlining everything that would otherwise be a macro, we get the same effect (it's not clear in the first place that this is actually a good idea, but it avoids making any changes to the existing performance profile). This makes the code more maintainable (in anticipation of subsequent changes), as well as making performance profiles and debug info more readable (we get "real" line numbers, instead of making everything point to the macro definition of all associated functions).	2021-08-02 15:02:49 -07:00
Qi Wang	49b7d7f0a4	Passing down the original edata on the expand path. Instead of passing down the new_addr, pass down the active edata which allows us to always use a neighbor-acquiring semantic. In other words, this tells us both the original edata and neighbor address. With this change, only neighbors of a "known" edata can be acquired, i.e. acquiring an edata based on an arbitrary address isn't possible anymore.	2021-03-29 17:19:53 -07:00
Qi Wang	1784939688	Use rtree tracked states to protect edata outside of ecache locks. This avoids the addr-based mutexes (i.e. the mutex_pool), and instead relies on the metadata tracked in rtree leaf: the head state and extent_state. Before trying to access the neighbor edata (e.g. for coalescing), the states will be verified first -- only neighbor edatas from the same arena and with the same state will be accessed.	2021-03-29 17:19:53 -07:00
Qi Wang	4d8c22f9a5	Store edata->state in rtree leaf and make edata_t 128B aligned. Verified that this doesn't result in any real increase of edata_t bytes allocated.	2021-03-29 17:19:53 -07:00
Qi Wang	70d1541c5b	Track extent is_head state in rtree leaf.	2021-03-29 17:19:53 -07:00
David Goldblatt	d21d5b46b6	Edata: Move sn into its own field. This lets the bins use a fragmentation avoidance policy that matches the HPA's (without affecting the PAC).	2021-02-19 15:10:54 -08:00
Yinan Zhang	afa489c3c5	Record request size in prof info	2021-01-07 20:39:49 -08:00
David Goldblatt	ca30b5db2b	Introduce hpdata_t. Using an edata_t both for hugepages and the allocations within those hugepages was convenient at first, but has outlived its usefulness. Representing hugepages explicitly, with their own data structure, will make future development easier.	2020-12-07 06:21:08 -08:00
David Goldblatt	43af63fff4	HPA: Manage whole hugepages at a time. This redesigns the HPA implementation to allow us to manage hugepages all at once, locally, without relying on a global fallback.	2020-12-07 06:21:08 -08:00
David Goldblatt	b4c37a6e81	Rename edata_tree_t -> edata_avail_t. This isn't a tree any more, and it mildly irritates me any time I see it.	2020-11-13 13:42:11 -08:00
David Goldblatt	634ec6f50a	Edata: add an "age" field.	2020-10-23 11:14:34 -07:00
David Goldblatt	259c5e3e8f	psset: Add stats	2020-09-18 12:39:25 -07:00
David Goldblatt	018b162d67	Add psset: a set of pageslabs. This introduces a new sort of edata_t; a pageslab, and a set to manage them. This is part of a series of a commits to implement a hugepage allocator; the pageset will be per-arena, and track small page allocations requests within a larger extent allocated from a centralized hugepage allocator.	2020-09-18 12:39:25 -07:00
David Goldblatt	e034500698	Edata: rename "ranged" bit to "pai". This better represents its intended purpose; the hugepage allocator design evolved away from needing contiguity of hugepage virtual address space.	2020-09-18 12:39:25 -07:00
David Goldblatt	ae541d3fab	Edata: Reserve some space for hugepages.	2020-07-08 13:20:59 -07:00
David Goldblatt	392f645f4d	Edata: split up different list linkage uses.	2020-07-08 13:20:59 -07:00
David Goldblatt	129b727058	Add typed-list module. This gives some named convenience wrappers.	2020-07-08 13:20:59 -07:00
David Goldblatt	eda9c2858f	Edata: zero stack edatas before initializing. This avoids some UB. No compilers take advantage of it for now, but no sense in tempting fate.	2020-05-14 10:30:20 -07:00
David Goldblatt	12eb888e54	Edata: Add a ranged bit. We steal the dumpable bit, which we ended up not needing.	2020-04-10 13:12:47 -07:00
David Goldblatt	bd4fdf295e	Rtree: Pull leaf contents into their own struct.	2020-04-10 13:12:47 -07:00
Yinan Zhang	9a60cf54ec	Last-N profiling mode	2019-12-30 15:58:57 -08:00
David Goldblatt	0704516245	Ehooks: Add head tracking.	2019-12-20 10:18:40 -08:00
David Goldblatt	a7862df616	Rename extent_t to edata_t. This frees us up from the unfortunate extent/extent2 naming collision.	2019-12-20 10:18:40 -08:00
David Goldblatt	865debda22	Rename extent.h -> edata.h. This name is slightly pithier; a full-on rename will come shortly.	2019-12-20 10:18:40 -08:00

32 Commits