server-skynet-source-3rd-jemalloc

project-base/server-skynet-source-3rd-jemalloc

Author	SHA1	Message	Date
David Goldblatt	9f9247a62e	Tcache fluhing: increase cache miss parallelism. In practice, many rtree_leaf_elm accesses are cache misses. By restructuring, we can make it more likely that these misses occur without blocking us from starting later lookups, taking more of those misses in parallel.	2021-02-04 14:10:43 -08:00
David Goldblatt	181ba7fd4d	Tcache flush: Add an emap "batch lookup" path. For now this is a no-op; but the interface is a little more flexible for our purposes.	2021-02-04 14:10:43 -08:00
David CARLIER	35a8552605	Mac OS: Tag mapped pages. This can be used to help profiling tools (e.g. vmmap) identify the sources of mappings more specifically.	2021-02-03 15:05:53 -08:00
Azat Khuzhin	a943172b73	Add runtime detection for MADV_DONTNEED zeroes pages (mostly for qemu) qemu does not support this, yet [1], and you can get very tricky assert if you will run program with jemalloc in use under qemu: <jemalloc>: ../contrib/jemalloc/src/extent.c:1195: Failed assertion: "p[i] == 0" [1]: https://patchwork.kernel.org/patch/10576637/ Here is a simple example that shows the problem [2]: // Gist to check possible issues with MADV_DONTNEED // For example it does not supported by qemu user // There is a patch for this [1], but it hasn't been applied. // [1]: https://lists.gnu.org/archive/html/qemu-devel/2018-08/msg05422.html #include <sys/mman.h> #include <stdio.h> #include <stddef.h> #include <assert.h> #include <string.h> int main(int argc, char *argv) { void addr = mmap(NULL, 1<<16, PROT_READ\|PROT_WRITE, MAP_PRIVATE\|MAP_ANONYMOUS, -1, 0); if (addr == MAP_FAILED) { perror("mmap"); return 1; } memset(addr, 'A', 1<<16); if (!madvise(addr, 1<<16, MADV_DONTNEED)) { puts("MADV_DONTNEED does not return error. Check memory."); for (int i = 0; i < 1<<16; ++i) { assert(((unsigned char )addr)[i] == 0); } } else { perror("madvise"); } if (munmap(addr, 1<<16)) { perror("munmap"); return 1; } return 0; } ### unpatched qemu $ qemu-x86_64-static /tmp/test-MADV_DONTNEED MADV_DONTNEED does not return error. Check memory. test-MADV_DONTNEED: /tmp/test-MADV_DONTNEED.c:19: main: Assertion `((unsigned char )addr)[i] == 0' failed. qemu: uncaught target signal 6 (Aborted) - core dumped Aborted (core dumped) ### patched qemu (by returning ENOSYS error) $ qemu-x86_64 /tmp/test-MADV_DONTNEED madvise: Success ### patch for qemu to return ENOSYS diff --git a/linux-user/syscall.c b/linux-user/syscall.c index 897d20c076..5540792e0e 100644 --- a/linux-user/syscall.c +++ b/linux-user/syscall.c @@ -11775,7 +11775,7 @@ static abi_long do_syscall1(void cpu_env, int num, abi_long arg1, turns private file-backed mappings into anonymous mappings. This will break MADV_DONTNEED. This is a hint, so ignoring and returning success is ok. / - return 0; + return ENOSYS; #endif #ifdef TARGET_NR_fcntl64 case TARGET_NR_fcntl64: [2]: https://gist.github.com/azat/12ba2c825b710653ece34dba7f926ece v2: - review fixes - add opt_dont_trust_madvise v3: - review fixes - rename opt_dont_trust_madvise to opt_trust_madvise	2021-01-20 20:08:30 -08:00
David Goldblatt	a011c4c22d	cache_bin: Separate out local and remote accesses. This fixes an incorrect debug-mode assert: - T1 starts an arena stats update and reads stack_head from another thread's cache bin, when that cache bin has 1 item in it. - T2 allocates from that cache bin. The cache_bin's stack_head now points to a NULL pointer, since the cache bin is empty. - T1 Re-reads the cache_bin's stack_head to perform an assertion check (since it previously saw that the bin was empty, whatever stack_head points to should be non-NULL).	2021-01-08 14:18:08 -08:00
Yinan Zhang	14d689c0f9	Add prof stats mutex stats	2021-01-07 20:39:49 -08:00
Yinan Zhang	40fa4d29d3	Track per size class internal fragmentation	2021-01-07 20:39:49 -08:00
Yinan Zhang	afa489c3c5	Record request size in prof info	2021-01-07 20:39:49 -08:00
Yinan Zhang	b35ac00d58	Do not bump to large size for page aligned request	2020-12-29 17:09:58 -08:00
Yinan Zhang	8a56d6b636	Add last-N mutex stats	2020-12-29 09:44:19 -08:00
Yinan Zhang	74bd63b203	Optimize stats print using partial name-to-mib	2020-12-18 10:39:58 -08:00
Yinan Zhang	4557c0a67d	Enable ctl on partial mib and partial name	2020-12-18 10:39:58 -08:00
Yinan Zhang	006dd0414e	Add partial name-to-mib functionality	2020-12-18 10:39:58 -08:00
Jin Qian	26c1dc5a3a	Support AutoConf for posix_madvise and POSIX_MADV_DONTNEED	2020-12-18 10:05:59 -08:00
Jin Qian	96a59c3bb5	Fix recursive malloc during bootstrap on QNX pthread_key_create on QNX triggers recursive allocation during tsd bootstrapping. Using tsd_init_check_recursion to detect that. Before pthread_key_create, the address of tsd_boot_wrapper is returned from tsd_get_wrapper instead of using TLS to store the pointer. tsd_set_wrapper becomes a no-op. After that, the address of tsd_boot_wrapper is written to TLS and bootstrap continues as before. Signed-off-by: Jin Qian <jqian@aurora.tech>	2020-12-18 10:05:59 -08:00
David Goldblatt	1e3b8636ff	HPA: Remove unused malloc_conf options.	2020-12-08 12:10:48 -08:00
David Goldblatt	a559caf74a	hpdata: Strengthen assertions. Now that we have flat bitmap bit counting functions, we can easily assert that nfree is always correct. While we're tightening up this code, enforce consistency on API boundaries as well.	2020-12-07 06:21:08 -08:00
David Goldblatt	54c94c1679	flat bitmap: add scount / ucount functions. These can compute the number or set or unset bits in a subrange of the bitmap.	2020-12-07 06:21:08 -08:00
David Goldblatt	e6c057ad35	fb: implement assign in terms of a visitor. We'll reuse this visitor in the next commit.	2020-12-07 06:21:08 -08:00
David Goldblatt	734e72ce8f	bit_util: Guarantee popcount's presence. Implement popcount generically, so that we can rely on it being present.	2020-12-07 06:21:08 -08:00
David Goldblatt	d9f7e6c668	hpdata: Add a test. We're about to make the functionality here more complicated; testing hpdata directly (rather than relying on user's tests) will make debugging easier.	2020-12-07 06:21:08 -08:00
David Goldblatt	3ed0b4e8a3	HPA: Add an nevictions counter. I.e. the number of times we've purged a hugepage-sized region.	2020-12-07 06:21:08 -08:00
David Goldblatt	f7cf23aa4d	psset: Relegate alloc/dalloc to test code. This is no longer part of the "core" functionality; we only need the stub implementations as an end-to-end test of hpdata + psset interactions when metadata is being modified. Treat them accordingly.	2020-12-07 06:21:08 -08:00
David Goldblatt	0971e1e4e3	hpdata: Use addr/size instead of begin/npages. This is easier for the users of the hpdata.	2020-12-07 06:21:08 -08:00
David Goldblatt	5228d869ee	psset: Use fit/insert/remove as basis functions. All other functionality can be implemented in terms of these; doing so (while retaining the same API) will be convenient for subsequent refactors.	2020-12-07 06:21:08 -08:00
David Goldblatt	089f8fa442	Move hpdata bitmap logic out of the psset.	2020-12-07 06:21:08 -08:00
David Goldblatt	ca30b5db2b	Introduce hpdata_t. Using an edata_t both for hugepages and the allocations within those hugepages was convenient at first, but has outlived its usefulness. Representing hugepages explicitly, with their own data structure, will make future development easier.	2020-12-07 06:21:08 -08:00
David Goldblatt	43af63fff4	HPA: Manage whole hugepages at a time. This redesigns the HPA implementation to allow us to manage hugepages all at once, locally, without relying on a global fallback.	2020-12-07 06:21:08 -08:00
David Goldblatt	63677dde63	Pages: Statically detect if pages_huge may succeed	2020-12-07 06:21:08 -08:00
David Goldblatt	c1b2a77933	psset: Move in stats. A later change will benefit from having these functions pulled into a psset-module set of functions.	2020-12-07 06:21:08 -08:00
David Goldblatt	d0a991d47b	psset: Add insert/remove functions. These will allow us to (for instance) move pageslabs from a psset dedicated to not-yet-hugeified pages to one dedicated to hugeified ones.	2020-12-07 06:21:08 -08:00
David Goldblatt	ecd39418ac	Add fxp: A fixed-point math library. This will be used in the next commit to allow non-integer values for narenas_ratio.	2020-12-04 23:48:19 -08:00
David Carlier	520b75fa2d	utrace support with label based signature.	2020-11-30 11:43:00 -08:00
Yinan Zhang	be5e49f4fa	Add a batch mode for cache_bin_alloc()	2020-11-16 20:58:01 -08:00
Yinan Zhang	566c4a8594	Slight changes to cache bin internal functions	2020-11-16 20:58:01 -08:00
David Goldblatt	cf2549a149	Add a per-arena oversize_threshold. This can let manual arenas trade off memory and CPU the way auto arenas do.	2020-11-13 13:45:35 -08:00
David Goldblatt	4ca3d91e96	Rename geom_grow -> exp_grow. This was promised in the review of the introduction of geom_grow, but would have been painful to do there because of the series that introduced it. Now that those are comitted, renaming is easier.	2020-11-13 13:42:33 -08:00
David Goldblatt	b4c37a6e81	Rename edata_tree_t -> edata_avail_t. This isn't a tree any more, and it mildly irritates me any time I see it.	2020-11-13 13:42:11 -08:00
David Carlier	95f0a77fde	Detect pthread_getname_np explicitly. At least one libc (musl) defines pthread_setname_np without defining pthread_getname_np. Detect the presence of each individually, rather than inferring both must be defined if set is.	2020-11-11 17:31:22 -08:00
David Goldblatt	589638182a	Use the edata_cache_small_t in the HPA.	2020-11-05 12:34:43 -08:00
David Goldblatt	03a6047111	Edata cache small: rewrite. In previous designs, this was intended to be a sort of cache that couldn't fail. In the current design, we want to use it just as a contention reduction mechanism. Rewrite it with those goals in mind.	2020-11-05 12:34:43 -08:00
David Goldblatt	1b3ee75667	Add experimental.thread.activity_callback. This (experimental, undocumented) functionality can be used by users to track various statistics of interest at a finer level of granularity than the thread.	2020-11-05 12:33:25 -08:00
David Carlier	d2d941017b	MADV_DO[NOT]DUMP support equivalence on FreeBSD.	2020-11-02 09:15:15 -08:00
DC	ef6d51ed44	DragonFlyBSD build support.	2020-10-27 12:35:19 -07:00
Qi Wang	bf72188f80	Allow opt.tcache_max to accept small size classes. Previously all the small size classes were cached. However this has downsides -- particularly when page size is greater than 4K (e.g. iOS), which will result in much higher SMALL_MAXCLASS. This change allows tcache_max to be set to lower values, to better control resources taken by tcache.	2020-10-24 20:43:44 -07:00
David Goldblatt	ea32060f9c	SEC: Implement thread affinity. For now, just have every thread pick a shard once and stick with it.	2020-10-23 11:14:34 -07:00
David Goldblatt	d16849c91d	psset: Do first-fit based on slab age. This functions more like the serial number strategy of the ecache and hpa_central_t. Longer-lived slabs are more likely to continue to live for longer in the future.	2020-10-23 11:14:34 -07:00
David Goldblatt	634ec6f50a	Edata: add an "age" field.	2020-10-23 11:14:34 -07:00
David Goldblatt	6599651aee	PA: Use an SEC in fron of the HPA shard.	2020-10-23 11:14:34 -07:00
David Goldblatt	ea51e97bb8	Add SEC module: a small extent cache. This can be used to take pressure off a more centralized, worse-sharded allocator without requiring a full break of the arena abstraction.	2020-10-23 11:14:34 -07:00

... 2 3 4 5 6 ...

1500 Commits