server-skynet-source-3rd-jemalloc

project-base/server-skynet-source-3rd-jemalloc

Author	SHA1	Message	Date
David Goldblatt	2fcbd18115	Cache bin: Don't reverse flush order. The items we pick to flush matter a lot, but the order in which they get flushed doesn't; just use forward scans. This simplifies the accessing code, both in terms of the C and the generated assembly (i.e. this speeds up the flush pathways).	2021-02-04 14:10:43 -08:00
David Goldblatt	229994a204	Tcache flush: keep common path state in registers. By carefully force-inlining the division constants and the operation sum count, we can eliminate redundant operations in the arena-level dalloc function. Do so.	2021-02-04 14:10:43 -08:00
David Goldblatt	31a629c3de	Tcache flush: prefetch edata contents. This frontloads more of the miss latency. It also moves it to a pathway where we have not yet acquired any locks, so that it should (hopefully) reduce hold times.	2021-02-04 14:10:43 -08:00
David Goldblatt	9f9247a62e	Tcache fluhing: increase cache miss parallelism. In practice, many rtree_leaf_elm accesses are cache misses. By restructuring, we can make it more likely that these misses occur without blocking us from starting later lookups, taking more of those misses in parallel.	2021-02-04 14:10:43 -08:00
David Goldblatt	181ba7fd4d	Tcache flush: Add an emap "batch lookup" path. For now this is a no-op; but the interface is a little more flexible for our purposes.	2021-02-04 14:10:43 -08:00
David Goldblatt	c007c537ff	Tcache flush: Unify edata lookup path.	2021-02-04 14:10:43 -08:00
David Goldblatt	a011c4c22d	cache_bin: Separate out local and remote accesses. This fixes an incorrect debug-mode assert: - T1 starts an arena stats update and reads stack_head from another thread's cache bin, when that cache bin has 1 item in it. - T2 allocates from that cache bin. The cache_bin's stack_head now points to a NULL pointer, since the cache bin is empty. - T1 Re-reads the cache_bin's stack_head to perform an assertion check (since it previously saw that the bin was empty, whatever stack_head points to should be non-NULL).	2021-01-08 14:18:08 -08:00
Qi Wang	bf72188f80	Allow opt.tcache_max to accept small size classes. Previously all the small size classes were cached. However this has downsides -- particularly when page size is greater than 4K (e.g. iOS), which will result in much higher SMALL_MAXCLASS. This change allows tcache_max to be set to lower values, to better control resources taken by tcache.	2020-10-24 20:43:44 -07:00
David Goldblatt	6599651aee	PA: Use an SEC in fron of the HPA shard.	2020-10-23 11:14:34 -07:00
Qi Wang	c8209150f9	Switch from opt.lg_tcache_max to opt.tcache_max Though for convenience, keep parsing lg_tcache_max.	2020-10-22 20:40:41 -07:00
Qi Wang	5e41ff9b74	Add a hard limit on tcache max size class. For locality reasons, tcache bins are integrated in TSD. Allowing all size classes to be cached has little benefit, but takes up much thread local storage. In addition, it complicates the layout which we try hard to optimize.	2020-10-16 13:49:51 -07:00
Qi Wang	3de19ba401	Eagerly detect double free and sized dealloc bugs for large sizes.	2020-10-15 10:03:16 -07:00
David Goldblatt	be9548f2be	Tcaches: Fix a subtle race condition. Without a lock held continuously between checking tcaches_past and incrementing it, it's possible for two threads to go down manual creation path simultaneously. If the number of tcaches is one less than the maximum, it's possible for both to create a tcache and increment tcaches_past, with the second thread returning a value larger than TCACHES_MAX.	2020-10-13 15:06:16 -07:00
Yinan Zhang	f28cc2bc87	Extract bin shard selection out of bin locking	2020-07-31 09:16:50 -07:00
David Goldblatt	f1f4ec315a	Tcache: Tweak nslots_max tuning parameter. In making these settings configurable, `634afc4124` unintentially changed a tuning parameter (reducing the "goal" max by a factor of 4). This commit undoes that change.	2020-07-09 08:58:05 -07:00
Yinan Zhang	a795b19327	Remove beginning define in source files ``` sed -i "/^#define JEMALLOC_[A-Z_]_C_$/d" src/.c; ```	2020-06-19 12:15:44 -07:00
David Goldblatt	8da0896b79	Tcache: Make an integer conversion explicit.	2020-05-28 15:52:40 -07:00
David Goldblatt	6cdac3c573	Tcache: Make flush fractions configurable.	2020-05-16 13:34:23 -07:00
David Goldblatt	7503b5b33a	Stats, CTL: Expose new tcache settings.	2020-05-16 13:34:23 -07:00
David Goldblatt	ee72bf1cfd	Tcache: Add tcache gc delay option. This can reduce flushing frequency for small size classes.	2020-05-16 13:34:23 -07:00
David Goldblatt	d338dd45d7	Tcache: Make incremental gc bytes configurable.	2020-05-16 13:34:23 -07:00
David Goldblatt	ec0b579563	Tcache: Privatize opt_lg_tcache_max default.	2020-05-16 13:34:23 -07:00
David Goldblatt	181093173d	Tcache: make slot sizing configurable.	2020-05-16 13:34:23 -07:00
David Goldblatt	634afc4124	Tcache: Make size computation configurable.	2020-05-16 13:34:23 -07:00
Yinan Zhang	b06dfb9ccc	Push event handlers to constituent modules	2020-05-12 09:16:16 -07:00
Yinan Zhang	abd4674931	Extract out per event postponed wait time fetching	2020-05-12 09:16:16 -07:00
Yinan Zhang	733ae918f0	Extract out per event new wait time fetching	2020-05-12 09:16:16 -07:00
David Goldblatt	cd29ebefd0	Tcache: treat small and large cache bins uniformly	2020-04-14 15:20:19 -07:00
David Goldblatt	a13fbad374	Tcache: split up fast and slow path data.	2020-04-14 15:20:19 -07:00
David Goldblatt	7099c66205	Arena: fill in terms of cache_bins.	2020-04-14 15:20:19 -07:00
David Goldblatt	294b276fc7	PA: Parameterize emap. Move emap_global to arena. This lets us test the PA module without interfering with the global emap used by the real allocator (the one not under test).	2020-04-10 13:12:47 -07:00
David Goldblatt	d701a085c2	Fast path: allow low-water mark changes. This lets us put more allocations on an "almost as fast" path after a flush. This results in around a 4% reduction in malloc cycles in prod workloads (corresponding to about a 0.1% reduction in overall cycles).	2020-03-12 11:54:19 -07:00
David Goldblatt	fef0b1ffe4	Cache bin: Remove last internals accesses.	2020-03-12 11:54:19 -07:00
David Goldblatt	0a2fcfac01	Tcache: Hold cache bin allocation explicitly.	2020-03-12 11:54:19 -07:00
David Goldblatt	d498a4bb08	Cache bin: Add an emptiness assertion.	2020-03-12 11:54:19 -07:00
David Goldblatt	7f5ebd211c	Cache bin: set low-water internally.	2020-03-12 11:54:19 -07:00
David Goldblatt	60113dfe3b	Cache bin: Move in initialization code.	2020-03-12 11:54:19 -07:00
David Goldblatt	44529da852	Cache-bin: Make flush modifications internal I.e. the tcache code just calls a cache-bin function to finish flush (and move pointers around, etc.). It doesn't directly access the cache-bin's owned memory any more.	2020-03-12 11:54:19 -07:00
David Goldblatt	ff6acc6ed5	Cache bin: simplify names and argument ordering. We always start with the cache bin, then its info (if necessary).	2020-03-12 11:54:19 -07:00
David Goldblatt	e1dcc557d6	Cache bin: Only take the relevant cache_bin_info_t Previously, we took an array of cache_bin_info_ts and an index, and dereferenced ourselves. But infos for other cache_bins aren't relevant to any particular cache bin, so that should be the caller's job.	2020-03-12 11:54:19 -07:00
David Goldblatt	d303f30796	cache_bin nflush -> n. We're going to use it on the fill pathway as well.	2020-03-12 11:54:19 -07:00
David Goldblatt	74d36d78ef	Cache bin: Make ncached_max a query on the info_t.	2020-03-12 11:54:19 -07:00
David Goldblatt	b66c0973cc	cache_bin: Don't allow direct internals access.	2020-03-12 11:54:19 -07:00
David Goldblatt	909c501b07	Cache_bin: Shouldn't know about tcache. Instead, have it take the cache_bin_info_ts to use by pointer. While we're here, add a src file for the cache bin.	2020-03-12 11:54:19 -07:00
David Goldblatt	79f1ee2fc0	Move junking out of arena/tcache code. This is debug only and we keep it off the fast path. Moving it here simplifies the internal logic. This never tries to junk on regions that were shrunk via xallocx. I think this is fine for two reasons: - The shrunk-with-xallocx case is rare. - We don't always do that anyway before this diff (it depends on the opt settings and extent hooks in effect).	2020-03-12 11:54:19 -07:00
David T. Goldblatt	6c3491ad31	Tcache: Unify bin flush logic. The small and large pathways share most of their logic, even if some of the individual operations are different. We pull out the common logic into a force-inlined function, and then specialize twice, once for each value of "small".	2020-02-25 10:21:03 -08:00
David T. Goldblatt	29436fa056	Break prof and tcache knowledge of b0.	2020-02-18 11:22:09 -08:00
David Goldblatt	7e6c8a7286	Emap: Standardize naming. Namespace everything under emap_, always specify what it is we're looking up (emap_lookup -> emap_edata_lookup), and use "ctx" over "info".	2020-02-17 10:50:51 -08:00
David Goldblatt	ac50c1e44b	Emap: Remove direct access to emap internals. In the process, we do a few local cleanups and optimizations. In particular, the size safety check on tcache flush no longer does a redundant load.	2020-02-17 10:50:51 -08:00
David Goldblatt	9b5d105fc3	Emap: Move in iealloc. This is logically scoped to the emap.	2020-02-17 10:50:51 -08:00

1 2 3 4

183 Commits