server-skynet-source-3rd-jemalloc

project-base/server-skynet-source-3rd-jemalloc

Author	SHA1	Message	Date
David Goldblatt	2ae966222f	hpdata: track per-page dirty state.	2021-02-04 20:58:31 -08:00
David Goldblatt	ff4086aa6b	hpdata: count active pages instead of free ones. This will be more consistent with later naming choices.	2021-02-04 20:58:31 -08:00
David Goldblatt	20140629b4	Bin: Move stats closer to the mutex. This is a slight cache locality optimization.	2021-02-04 14:10:43 -08:00
David Goldblatt	c259323ab3	Use ticker_geom_t for arena tcache decay.	2021-02-04 14:10:43 -08:00
David Goldblatt	8edfc5b170	Add ticker_geom_t. This lets a single ticker object drive events across a large number of different tick streams while sharing state.	2021-02-04 14:10:43 -08:00
David Goldblatt	3967329813	Arena: share bin offsets in a global. This saves us a cache miss when lookup up the arena bin offset in a remote arena during tcache flush. All arenas share the base offset, and so we don't need to look it up repeatedly for each arena. Secondarily, it shaves 288 bytes off the arena on, e.g., x86-64.	2021-02-04 14:10:43 -08:00
David Goldblatt	2fcbd18115	Cache bin: Don't reverse flush order. The items we pick to flush matter a lot, but the order in which they get flushed doesn't; just use forward scans. This simplifies the accessing code, both in terms of the C and the generated assembly (i.e. this speeds up the flush pathways).	2021-02-04 14:10:43 -08:00
David Goldblatt	4c46e11365	Cache an arena's index in the arena. This saves us a pointer hop down some perf-sensitive paths.	2021-02-04 14:10:43 -08:00
David Goldblatt	229994a204	Tcache flush: keep common path state in registers. By carefully force-inlining the division constants and the operation sum count, we can eliminate redundant operations in the arena-level dalloc function. Do so.	2021-02-04 14:10:43 -08:00
David Goldblatt	31a629c3de	Tcache flush: prefetch edata contents. This frontloads more of the miss latency. It also moves it to a pathway where we have not yet acquired any locks, so that it should (hopefully) reduce hold times.	2021-02-04 14:10:43 -08:00
David Goldblatt	9f9247a62e	Tcache fluhing: increase cache miss parallelism. In practice, many rtree_leaf_elm accesses are cache misses. By restructuring, we can make it more likely that these misses occur without blocking us from starting later lookups, taking more of those misses in parallel.	2021-02-04 14:10:43 -08:00
David Goldblatt	181ba7fd4d	Tcache flush: Add an emap "batch lookup" path. For now this is a no-op; but the interface is a little more flexible for our purposes.	2021-02-04 14:10:43 -08:00
David Goldblatt	c007c537ff	Tcache flush: Unify edata lookup path.	2021-02-04 14:10:43 -08:00
David CARLIER	35a8552605	Mac OS: Tag mapped pages. This can be used to help profiling tools (e.g. vmmap) identify the sources of mappings more specifically.	2021-02-03 15:05:53 -08:00
Yinan Zhang	f6699803e2	Fix duration in prof log	2021-01-25 16:38:38 -08:00
Azat Khuzhin	a943172b73	Add runtime detection for MADV_DONTNEED zeroes pages (mostly for qemu) qemu does not support this, yet [1], and you can get very tricky assert if you will run program with jemalloc in use under qemu: <jemalloc>: ../contrib/jemalloc/src/extent.c:1195: Failed assertion: "p[i] == 0" [1]: https://patchwork.kernel.org/patch/10576637/ Here is a simple example that shows the problem [2]: // Gist to check possible issues with MADV_DONTNEED // For example it does not supported by qemu user // There is a patch for this [1], but it hasn't been applied. // [1]: https://lists.gnu.org/archive/html/qemu-devel/2018-08/msg05422.html #include <sys/mman.h> #include <stdio.h> #include <stddef.h> #include <assert.h> #include <string.h> int main(int argc, char *argv) { void addr = mmap(NULL, 1<<16, PROT_READ\|PROT_WRITE, MAP_PRIVATE\|MAP_ANONYMOUS, -1, 0); if (addr == MAP_FAILED) { perror("mmap"); return 1; } memset(addr, 'A', 1<<16); if (!madvise(addr, 1<<16, MADV_DONTNEED)) { puts("MADV_DONTNEED does not return error. Check memory."); for (int i = 0; i < 1<<16; ++i) { assert(((unsigned char )addr)[i] == 0); } } else { perror("madvise"); } if (munmap(addr, 1<<16)) { perror("munmap"); return 1; } return 0; } ### unpatched qemu $ qemu-x86_64-static /tmp/test-MADV_DONTNEED MADV_DONTNEED does not return error. Check memory. test-MADV_DONTNEED: /tmp/test-MADV_DONTNEED.c:19: main: Assertion `((unsigned char )addr)[i] == 0' failed. qemu: uncaught target signal 6 (Aborted) - core dumped Aborted (core dumped) ### patched qemu (by returning ENOSYS error) $ qemu-x86_64 /tmp/test-MADV_DONTNEED madvise: Success ### patch for qemu to return ENOSYS diff --git a/linux-user/syscall.c b/linux-user/syscall.c index 897d20c076..5540792e0e 100644 --- a/linux-user/syscall.c +++ b/linux-user/syscall.c @@ -11775,7 +11775,7 @@ static abi_long do_syscall1(void cpu_env, int num, abi_long arg1, turns private file-backed mappings into anonymous mappings. This will break MADV_DONTNEED. This is a hint, so ignoring and returning success is ok. / - return 0; + return ENOSYS; #endif #ifdef TARGET_NR_fcntl64 case TARGET_NR_fcntl64: [2]: https://gist.github.com/azat/12ba2c825b710653ece34dba7f926ece v2: - review fixes - add opt_dont_trust_madvise v3: - review fixes - rename opt_dont_trust_madvise to opt_trust_madvise	2021-01-20 20:08:30 -08:00
David Goldblatt	a011c4c22d	cache_bin: Separate out local and remote accesses. This fixes an incorrect debug-mode assert: - T1 starts an arena stats update and reads stack_head from another thread's cache bin, when that cache bin has 1 item in it. - T2 allocates from that cache bin. The cache_bin's stack_head now points to a NULL pointer, since the cache bin is empty. - T1 Re-reads the cache_bin's stack_head to perform an assertion check (since it previously saw that the bin was empty, whatever stack_head points to should be non-NULL).	2021-01-08 14:18:08 -08:00
Yinan Zhang	14d689c0f9	Add prof stats mutex stats	2021-01-07 20:39:49 -08:00
Yinan Zhang	9f71b5779b	Output prof stats in stats print	2021-01-07 20:39:49 -08:00
Yinan Zhang	1f1a0231ed	Split macros for initializing stats headers	2021-01-07 20:39:49 -08:00
Yinan Zhang	54f3351f1f	Add mallctl for prof stats fetching	2021-01-07 20:39:49 -08:00
Yinan Zhang	40fa4d29d3	Track per size class internal fragmentation	2021-01-07 20:39:49 -08:00
Yinan Zhang	afa489c3c5	Record request size in prof info	2021-01-07 20:39:49 -08:00
David Goldblatt	f9bb8dedef	Un-force-inline do_rallocx. The additional overhead of the function-call setup and flags checking is relatively small, but costs us the replication of the entire realloc pathway in terms of size.	2021-01-04 14:55:49 -08:00
David Goldblatt	a9fa2defdb	Add JEMALLOC_COLD, and mark some functions cold. This hints to the compiler that it should care more about space than CPU (among other things). In cases where the compiler lacks profile-guided information, this can be a substantial space savings. For now, we mark the mallctl or atexit driven profiling and stats functions that take up the most space.	2021-01-04 14:55:49 -08:00
David Goldblatt	5d8e70ab26	prof_recent: cassert(config_prof) more often. This tells the compiler that these functions are never called, which lets them be optimized away in builds where profiling is disabled.	2021-01-04 14:55:49 -08:00
David Goldblatt	83cad746ae	prof_log: cassert(config_prof) in public functions This lets the compiler infer that the code is dead in builds where profiling is enabled, saving on space there.	2021-01-04 14:55:49 -08:00
David Goldblatt	526180b76d	Extent.c: Avoid an rtree NULL-check. The edge case in which pages_map returns (void *)PAGE can trigger an incorrect assertion failure. Avoid it.	2021-01-04 14:50:49 -08:00
Yinan Zhang	b35ac00d58	Do not bump to large size for page aligned request	2020-12-29 17:09:58 -08:00
Yinan Zhang	8a56d6b636	Add last-N mutex stats	2020-12-29 09:44:19 -08:00
Yinan Zhang	22d62d8cbd	Handle ending gap properly for HPA stats	2020-12-18 16:40:57 -08:00
Yinan Zhang	6c5a3a24dd	Omit bin stats rows with no data	2020-12-18 16:40:57 -08:00
Yinan Zhang	ea013d8fa4	Enforce realloc sizing stability	2020-12-18 11:41:52 -08:00
Yinan Zhang	74bd63b203	Optimize stats print using partial name-to-mib	2020-12-18 10:39:58 -08:00
Yinan Zhang	4557c0a67d	Enable ctl on partial mib and partial name	2020-12-18 10:39:58 -08:00
Yinan Zhang	006dd0414e	Add partial name-to-mib functionality	2020-12-18 10:39:58 -08:00
Yinan Zhang	f2e1a5be77	Do not fail on partial ctl path for ctl_nametomib() We do not fail on partial ctl path when the given `mib` array is shorter than the given name, and we should keep the behavior the same in the reverse case, which I feel is also the more natural way.	2020-12-18 10:39:58 -08:00
Yinan Zhang	6ab181d2b7	Extract node lookup given mib input	2020-12-18 10:39:58 -08:00
Yinan Zhang	3a627b9674	No need to record all nodes in ctl_lookup()	2020-12-18 10:39:58 -08:00
Yinan Zhang	91e006c4c2	Enable ctl_lookup() to start from arbitrary node	2020-12-18 10:39:58 -08:00
Jin Qian	4e3fe218e9	Use posix_madvise to purge pages when available	2020-12-18 10:05:59 -08:00
David Goldblatt	1e3b8636ff	HPA: Remove unused malloc_conf options.	2020-12-08 12:10:48 -08:00
Aditya Kumar	9522ae41d6	Move n_search outside of assert as reported by static analyzer	2020-12-07 06:49:27 -08:00
David Goldblatt	a559caf74a	hpdata: Strengthen assertions. Now that we have flat bitmap bit counting functions, we can easily assert that nfree is always correct. While we're tightening up this code, enforce consistency on API boundaries as well.	2020-12-07 06:21:08 -08:00
David Goldblatt	3ed0b4e8a3	HPA: Add an nevictions counter. I.e. the number of times we've purged a hugepage-sized region.	2020-12-07 06:21:08 -08:00
David Goldblatt	fffcefed33	malloc_conf: Clarify HPA options.	2020-12-07 06:21:08 -08:00
David Goldblatt	f7cf23aa4d	psset: Relegate alloc/dalloc to test code. This is no longer part of the "core" functionality; we only need the stub implementations as an end-to-end test of hpdata + psset interactions when metadata is being modified. Treat them accordingly.	2020-12-07 06:21:08 -08:00
David Goldblatt	f9299ca572	HPA: Use psset fit/insert/remove. This will let us remove alloc_new and alloc_reuse functions from the psset.	2020-12-07 06:21:08 -08:00
David Goldblatt	0971e1e4e3	hpdata: Use addr/size instead of begin/npages. This is easier for the users of the hpdata.	2020-12-07 06:21:08 -08:00
David Goldblatt	5228d869ee	psset: Use fit/insert/remove as basis functions. All other functionality can be implemented in terms of these; doing so (while retaining the same API) will be convenient for subsequent refactors.	2020-12-07 06:21:08 -08:00

1 2 3 4 5 ...

1665 Commits