server-skynet-source-3rd-jemalloc

project-base/server-skynet-source-3rd-jemalloc

Author	SHA1	Message	Date
David Goldblatt	0f6c420f83	HPA: Make purging/hugifying more principled. Before this change, purge/hugify decisions had several sharp edges that could lead to pathological behavior if tuning parameters weren't carefully chosen. It's the first of a series; this introduces basic "make every hugepage with dirty pages purgeable" functionality, and the next commit expands that functionality to have a smarter policy for picking hugepages to purge. Previously, the dehugify logic would never dehugify a hugepage unless it was dirtier than the dehugification threshold. This can lead to situations in which these pages (which themselves could never be purged) would push us above the maximum allowed dirty pages in the shard. This forces immediate purging of any pages deallocated in non-hugified hugepages, which in turn places nonobvious practical limitations on the relationships between various config settings. Instead, we make our preference not to dehugify to purge a soft one rather than a hard one. We'll avoid purging them, but only so long as we can do so by purging non-hugified pages. If we need to purge them to satisfy our dirty page limits, or to hugify other, more worthy candidates, we'll still do so.	2021-02-19 15:10:54 -08:00
David Goldblatt	6bddb92ad6	psset: Rename "bitmap" to "pageslab_bitmap". It tracks pageslabs. Soon, we'll have another bitmap (to track dirty pages) that we want to disambiguate. While we're here, fix an out-of-date comment.	2021-02-19 15:10:54 -08:00
David Goldblatt	154aa5fcc1	Use the flat bitmap for eset and psset bitmaps. This is simpler (note that the eset field comment was actually incorrect!), and slightly faster.	2021-02-19 15:10:54 -08:00
David Goldblatt	d21d5b46b6	Edata: Move sn into its own field. This lets the bins use a fragmentation avoidance policy that matches the HPA's (without affecting the PAC).	2021-02-19 15:10:54 -08:00
David Goldblatt	fb327368db	SEC: Expand option configurability. This change pulls the SEC options into a struct, which simplifies their handling across various modules (e.g. PA needs to forward on SEC options from the malloc_conf string, but it doesn't really need to know their names). While we're here, make some of the fixed constants configurable, and unify naming from the configuration options to the internals.	2021-02-19 15:10:54 -08:00
David Goldblatt	cdae6706a6	SEC: Use batch fills. Currently, this doesn't help much, since no PAI implementation supports flushing. This will change in subsequent commits.	2021-02-19 15:10:54 -08:00
David Goldblatt	480f3b11cd	Add a batch allocation interface to the PAI. For now, no real allocator actually implements this interface; this will change in subsequent diffs.	2021-02-19 15:10:54 -08:00
David Goldblatt	bf448d7a5a	SEC: Reduce lock hold times. Only flush a subset of extents during flushing, and drop the lock while doing so.	2021-02-19 15:10:54 -08:00
David Goldblatt	1944ebbe7f	HPA: Implement batch deallocation. This saves O(n) mutex locks/unlocks during SEC flush.	2021-02-19 15:10:54 -08:00
David Goldblatt	f47b4c2cd8	PAI/SEC: Add a dalloc_batch function. This lets the SEC flush all of its items in a single call, rather than flushing everything at once.	2021-02-19 15:10:54 -08:00
David Goldblatt	4b8870c7db	SEC: Fix a comment typo.	2021-02-19 15:10:54 -08:00
Qi Wang	a11be50332	Implement opt.cache_oblivious. Keep config.cache_oblivious for now to remain backward-compatible.	2021-02-11 11:32:01 -08:00
Qi Wang	041145c272	Report the correct and wrong sizes on sized dealloc bug detection.	2021-02-08 14:42:27 -08:00
Qi Wang	f3b2668b32	Report the offending pointer on sized dealloc bug detection.	2021-02-08 14:42:27 -08:00
David Goldblatt	edbfe6912c	Inline malloc fastpath into operator new. This saves a small but non-negligible amount of CPU in C++ programs.	2021-02-08 14:17:47 -08:00
David Goldblatt	79f81a3732	HPA: Make dirty_mult configurable.	2021-02-04 20:58:31 -08:00
David Goldblatt	32dd153796	HPA: Make dehugification threshold configurable.	2021-02-04 20:58:31 -08:00
David Goldblatt	4790db15ed	HPA: make the hugification threshold configurable.	2021-02-04 20:58:31 -08:00
David Goldblatt	b3df80bc79	Pull HPA options into a containing struct. Currently that just means max_alloc, but we're about to add more. While we're touching these lines anyways, tweak things to be more in line with testing.	2021-02-04 20:58:31 -08:00
David Goldblatt	bdb7307ff2	fxp: Add FXP_INIT_PERCENT This lets us specify fxp values easily in source.	2021-02-04 20:58:31 -08:00
David Goldblatt	caef4c2868	FXP: add fxp_mul_frac. This can multiply size_ts by a fraction without the risk of overflow.	2021-02-04 20:58:31 -08:00
David Goldblatt	56e85c0e47	HPA: Use a whole-shard purging heuristic. Previously, we used only hpdata-local information to decide whether to purge.	2021-02-04 20:58:31 -08:00
David Goldblatt	dc886e5608	hpdata: Return the number of pages to be purged. We'll use this in the next commit.	2021-02-04 20:58:31 -08:00
David Goldblatt	9fd9c876bb	psset: keep aggregate stats. This will let us quickly query these stats to make purging decisions quickly.	2021-02-04 20:58:31 -08:00
David Goldblatt	da63f23e68	HPA: Track pending purges/hugifies in the psset. This finishes the refactoring of the HPA/psset interactions the past few commits have been building towards. Rather than the HPA removing and then reinserting hpdatas, it simply begins updates and ends them. These updates can set flags on the hpdata that prevent it from being returned for certain types of requests. For example, it can call hpdata_alloc_allowed_set(hpdata, false) during an update, at which point the given hpdata will no longer be returned for psset_pick_alloc requests. This has various of benefits: - It maintains stats correctness during purges and hugifies. - It allows simpler and more explicit concurrency control for the various special cases (e.g. allocations are disallowed during purge, but not during hugify). - It lets allocations and deallocations avoid disturbing the purging and hugification orderings. If an hpdata "loses its place" in one of the queues just do to an alloc / dalloc, it can result in pathological edge cases where very hot, very full hugepages never get hugified (and cold extents on the same hugepage as hot ones never get purged). The key benefit though is that tracking hpdatas to be purged / hugified in a principled way will let us do delayed purging and hugification. Eventually this will let us move these operations to background threads, but in the short term the benefit is that it will let us have global purging policies (e.g. purge when the entire arena has too many dirty pages, rather than any particular hugepage).	2021-02-04 20:58:31 -08:00
David Goldblatt	bf64557ed6	Move empty slab tracking to the psset. We're moving towards a world in which purging decisions are less rigidly enforced at a single-hugepage level. In that world, it makes sense to keep around some hpdatas which are not completely purged, in which case we'll need to track them.	2021-02-04 20:58:31 -08:00
David Goldblatt	99fc0717e6	psset: Reconceptualize insertion/removal. Really, this isn't a functional change, just a naming change. We start thinking of pageslabs as being always in the psset. What we used to think of as removal is now thought of as being in the psset, but in the process of being updated (and therefore, unavalable for serving new allocations). This is in preparation of subsequent changes to support deferred purging; allocations will still be in the psset for the purposes of choosing when to purge, but not for purposes of allocation/deallocation.	2021-02-04 20:58:31 -08:00
David Goldblatt	d3e5ea03c5	HPA: Track dirty stats.	2021-02-04 20:58:31 -08:00
David Goldblatt	68a1666e91	hpdata: Rename "dirty" to "touched". This matches the usage in the rest of the codebase.	2021-02-04 20:58:31 -08:00
David Goldblatt	be0d7a53f3	HPA: Don't track inactive pages. This is really only useful for human consumption. Correspondingly, emit it only in the human-readable stats, and let everybody else compute from the hugepage size and nactive.	2021-02-04 20:58:31 -08:00
David Goldblatt	55e0f60ca1	psset stats: Simplify handling. We can treat the huge and nonhuge cases uniformly using huge state as an array index.	2021-02-04 20:58:31 -08:00
David Goldblatt	94cd9444c5	HPA: Some minor reformattings.	2021-02-04 20:58:31 -08:00
David Goldblatt	b25ee5d88e	HPA: Add purge stats.	2021-02-04 20:58:31 -08:00
David Goldblatt	746ea3de6f	HPA stats: Allow some derived stats. However, we put them in their own struct, to avoid the messiness that the arena has (mixing derived and non-derived stats in the arena_stats_t).	2021-02-04 20:58:31 -08:00
David Goldblatt	30b9e8162b	HPA: Generalize purging. Previously, we would purge a hugepage only when it's completely empty. With this change, we can purge even when only partially empty. Although the heuristic here is still fairly primitive, this infrastructure can scale to become more advanced.	2021-02-04 20:58:31 -08:00
David Goldblatt	70692cfb13	hpdata: Add state changing helpers. We're about to allow hugepage subextent purging; get as much of our metadata handling ready as possible.	2021-02-04 20:58:31 -08:00
David Goldblatt	9b75808be1	flat bitmap: Add a bitwise and/or/not. We're about to need them.	2021-02-04 20:58:31 -08:00
David Goldblatt	2ae966222f	hpdata: track per-page dirty state.	2021-02-04 20:58:31 -08:00
David Goldblatt	ff4086aa6b	hpdata: count active pages instead of free ones. This will be more consistent with later naming choices.	2021-02-04 20:58:31 -08:00
David Goldblatt	3624dd42ff	hpdata: Add a comment for hpdata_consistent.	2021-02-04 20:58:31 -08:00
David Goldblatt	20140629b4	Bin: Move stats closer to the mutex. This is a slight cache locality optimization.	2021-02-04 14:10:43 -08:00
David Goldblatt	c259323ab3	Use ticker_geom_t for arena tcache decay.	2021-02-04 14:10:43 -08:00
David Goldblatt	8edfc5b170	Add ticker_geom_t. This lets a single ticker object drive events across a large number of different tick streams while sharing state.	2021-02-04 14:10:43 -08:00
David Goldblatt	3967329813	Arena: share bin offsets in a global. This saves us a cache miss when lookup up the arena bin offset in a remote arena during tcache flush. All arenas share the base offset, and so we don't need to look it up repeatedly for each arena. Secondarily, it shaves 288 bytes off the arena on, e.g., x86-64.	2021-02-04 14:10:43 -08:00
David Goldblatt	2fcbd18115	Cache bin: Don't reverse flush order. The items we pick to flush matter a lot, but the order in which they get flushed doesn't; just use forward scans. This simplifies the accessing code, both in terms of the C and the generated assembly (i.e. this speeds up the flush pathways).	2021-02-04 14:10:43 -08:00
David Goldblatt	4c46e11365	Cache an arena's index in the arena. This saves us a pointer hop down some perf-sensitive paths.	2021-02-04 14:10:43 -08:00
David Goldblatt	229994a204	Tcache flush: keep common path state in registers. By carefully force-inlining the division constants and the operation sum count, we can eliminate redundant operations in the arena-level dalloc function. Do so.	2021-02-04 14:10:43 -08:00
David Goldblatt	31a629c3de	Tcache flush: prefetch edata contents. This frontloads more of the miss latency. It also moves it to a pathway where we have not yet acquired any locks, so that it should (hopefully) reduce hold times.	2021-02-04 14:10:43 -08:00
David Goldblatt	9f9247a62e	Tcache fluhing: increase cache miss parallelism. In practice, many rtree_leaf_elm accesses are cache misses. By restructuring, we can make it more likely that these misses occur without blocking us from starting later lookups, taking more of those misses in parallel.	2021-02-04 14:10:43 -08:00
David Goldblatt	181ba7fd4d	Tcache flush: Add an emap "batch lookup" path. For now this is a no-op; but the interface is a little more flexible for our purposes.	2021-02-04 14:10:43 -08:00

1 2 3 4 5 ...

1448 Commits