server-skynet-source-3rd-jemalloc

project-base/server-skynet-source-3rd-jemalloc

Author	SHA1	Message	Date
Elliot Ronaghan	8a1a794b0c	Don't use compact red-black trees with the pgi compiler Some bug (either in the red-black tree code, or in the pgi compiler) seems to cause red-black trees to become unbalanced. This issue seems to go away if we don't use compact red-black trees. Since red-black trees don't seem to be used much anymore, I opted for what seems to be an easy fix here instead of digging in and trying to find the root cause of the bug. Some context in case it's helpful: I experienced a ton of segfaults while using pgi as Chapel's target compiler with jemalloc 4.0.4. The little bit of debugging I did pointed me somewhere deep in red-black tree manipulation, but I didn't get a chance to investigate further. It looks like 4.2.0 replaced most uses of red-black trees with pairing-heaps, which seems to avoid whatever bug I was hitting. However, `make check_unit` was still failing on the rb test, so I figured the core issue was just being masked. Here's the `make check_unit` failure: ```sh === test/unit/rb === test_rb_empty: pass tree_recurse:test/unit/rb.c:90: Failed assertion: (((_Bool) (((uintptr_t) (left_node)->link.rbn_right_red) & ((size_t)1)))) == (false) --> true != false: Node should be black test_rb_random:test/unit/rb.c:274: Failed assertion: (imbalances) == (0) --> 1 != 0: Tree is unbalanced tree_recurse:test/unit/rb.c:90: Failed assertion: (((_Bool) (((uintptr_t) (left_node)->link.rbn_right_red) & ((size_t)1)))) == (false) --> true != false: Node should be black test_rb_random:test/unit/rb.c:274: Failed assertion: (imbalances) == (0) --> 1 != 0: Tree is unbalanced node_remove:test/unit/rb.c:190: Failed assertion: (imbalances) == (0) --> 2 != 0: Tree is unbalanced <jemalloc>: test/unit/rb.c:43: Failed assertion: "pathp[-1].cmp < 0" test/test.sh: line 22: 12926 Aborted Test harness error ``` While starting to debug I saw the RB_COMPACT option and decided to check if turning that off resolved the bug. It seems to have fixed it (`make check_unit` passes and the segfaults under Chapel are gone) so it seems like on okay work-around. I'd imagine this has performance implications for red-black trees under pgi, but if they're not going to be used much anymore it's probably not a big deal.	2016-06-08 14:48:55 -07:00
Jason Evans	dd752c1ffd	Fix potential VM map fragmentation regression. Revert 245ae6036c09cc11a72fab4335495d95cddd5beb (Support --with-lg-page values larger than actual page size.), because it could cause VM map fragmentation if the kernel grows mmap()ed memory downward. This resolves #391.	2016-06-07 14:15:49 -07:00
Jason Evans	4e910fc958	Fix extent_alloc_dss() regressions. Page-align the gap, if any, and add/use extent_dalloc_gap(), which registers the gap extent before deallocation.	2016-06-05 21:00:02 -07:00
Jason Evans	04942c3d90	Remove a stray memset(), and fix a junk filling test regression.	2016-06-05 21:00:02 -07:00
Jason Evans	f8f0542194	Modify extent hook functions to take an (extent_t *) argument. This facilitates the application accessing its own extent allocator metadata during hook invocations. This resolves #259.	2016-06-05 21:00:02 -07:00
Jason Evans	6f29a83924	Add rtree lookup path caching. rtree-based extent lookups remain more expensive than chunk-based run lookups, but with this optimization the fast path slowdown is ~3 CPU cycles per metadata lookup (on Intel Core i7-4980HQ), versus ~11 cycles prior. The path caching speedup tends to degrade gracefully unless allocated memory is spread far apart (as is the case when using a mixture of sbrk() and mmap()).	2016-06-05 20:59:57 -07:00
Jason Evans	7be2ebc23f	Make tsd cleanup functions optional, remove noop cleanup functions.	2016-06-05 20:42:24 -07:00
Jason Evans	b14fdaaca0	Add a missing prof_alloc_rollback() call. In the case where prof_alloc_prep() is called with an over-estimate of allocation size, and sampling doesn't end up being triggered, the tctx must be discarded.	2016-06-05 20:42:24 -07:00
Jason Evans	c8c3cbdf47	Miscellaneous s/chunk/extent/ updates.	2016-06-05 20:42:24 -07:00
Jason Evans	a43db1c608	Relax NBINS constraint (max 255 --> max 256).	2016-06-05 20:42:24 -07:00
Jason Evans	751f2c332d	Remove obsolete stats.arenas.<i>.metadata.mapped mallctl. Rename stats.arenas.<i>.metadata.allocated mallctl to stats.arenas.<i>.metadata .	2016-06-05 20:42:24 -07:00
Jason Evans	03eea4fb8b	Better document --enable-ivsalloc.	2016-06-05 20:42:24 -07:00
Jason Evans	22588dda6e	Rename most remaining chunk APIs to extent.	2016-06-05 20:42:23 -07:00
Jason Evans	0c4932eb1e	s/chunk_lookup/extent_lookup/g, s/chunks_rtree/extents_rtree/g	2016-06-05 20:42:23 -07:00
Jason Evans	4a55daa363	s/CHUNK_HOOKS_INITIALIZER/EXTENT_HOOKS_INITIALIZER/g	2016-06-05 20:42:23 -07:00
Jason Evans	c9a76481d8	Rename chunks_{cached,retained,mtx} to extents_{cached,retained,mtx}.	2016-06-05 20:42:23 -07:00
Jason Evans	9c305c9e5c	s/chunk_hook/extent_hook/g	2016-06-05 20:42:23 -07:00
Jason Evans	7d63fed0fd	Rename huge to large.	2016-06-05 20:42:23 -07:00
Jason Evans	714d1640f3	Update private symbols.	2016-06-05 20:42:23 -07:00
Jason Evans	498856f44a	Move slabs out of chunks.	2016-06-05 20:42:23 -07:00
Jason Evans	d28e5a6696	Improve interval-based profile dump triggering. When an allocation is large enough to trigger multiple dumps, use modular math rather than subtraction to reset the interval counter. Prior to this change, it was possible for a single allocation to cause many subsequent allocations to all trigger profile dumps. When updating usable size for a sampled object, try to cancel out the difference between LARGE_MINCLASS and usable size from the interval counter.	2016-06-05 20:42:23 -07:00
Jason Evans	ed2c2427a7	Use huge size class infrastructure for large size classes.	2016-06-05 20:42:18 -07:00
Jason Evans	b46261d58b	Implement cache-oblivious support for huge size classes.	2016-06-03 12:27:41 -07:00
Jason Evans	4731cd47f7	Allow chunks to not be naturally aligned. Precisely size extents for huge size classes that aren't multiples of chunksize.	2016-06-03 12:27:41 -07:00
Jason Evans	741967e79d	Remove CHUNK_ADDR2BASE() and CHUNK_ADDR2OFFSET().	2016-06-03 12:27:41 -07:00
Jason Evans	23c52c895f	Make extent_prof_tctx_[gs]et() atomic.	2016-06-03 12:27:41 -07:00
Jason Evans	760bf11b23	Add extent_dirty_[gs]et().	2016-06-03 12:27:41 -07:00
Jason Evans	47613afc34	Convert rtree from per chunk to per page. Refactor [de]registration to maintain interior rtree entries for slabs.	2016-06-03 12:27:41 -07:00
Jason Evans	5c6be2bdd3	Refactor chunk_purge_wrapper() to take extent argument.	2016-06-03 12:27:41 -07:00
Jason Evans	0eb6f08959	Refactor chunk_[de]commit_wrapper() to take extent arguments.	2016-06-03 12:27:41 -07:00
Jason Evans	6c94470822	Refactor chunk_dalloc_{cache,wrapper}() to take extent arguments. Rename arena_extent_[d]alloc() to extent_[d]alloc(). Move all chunk [de]registration responsibility into chunk.c.	2016-06-03 12:27:41 -07:00
Jason Evans	de0305a7f3	Add/use chunk_split_wrapper(). Remove redundant ptr/oldsize args from huge_*(). Refactor huge/chunk/arena code boundaries.	2016-06-03 12:27:41 -07:00
Jason Evans	1ad060584f	Add/use chunk_merge_wrapper().	2016-06-03 12:27:41 -07:00
Jason Evans	384e88f451	Add/use chunk_commit_wrapper().	2016-06-03 12:27:41 -07:00
Jason Evans	56e0031d7d	Add/use chunk_decommit_wrapper().	2016-06-03 12:27:41 -07:00
Jason Evans	4d2d9cec5a	Merge chunk_alloc_base() into its only caller.	2016-06-03 12:27:41 -07:00
Jason Evans	fc0372a15e	Replace extent_tree_szad_* with extent_heap_*.	2016-06-03 12:27:41 -07:00
Jason Evans	ffa45a5331	Use rtree rather than [sz]ad trees for chunk split/coalesce operations.	2016-06-03 12:27:41 -07:00
Jason Evans	93e79c5c3f	Remove redundant chunk argument from chunk_{,de,re}register().	2016-06-03 12:27:41 -07:00
Jason Evans	9aea58d9a2	Add extent_past_get().	2016-06-03 12:27:41 -07:00
Jason Evans	d78846c989	Replace extent_achunk_[gs]et() with extent_slab_[gs]et().	2016-06-03 12:27:41 -07:00
Jason Evans	fae8344098	Add extent_active_[gs]et(). Always initialize extents' runs_dirty and chunks_cache linkage.	2016-06-03 12:27:41 -07:00
Jason Evans	6f71844659	Move PAGE definitions to pages.h.	2016-06-03 12:27:41 -07:00
Jason Evans	e75e9be130	Add rtree element witnesses.	2016-06-03 12:27:41 -07:00
Jason Evans	8c9be3e837	Refactor rtree to always use base_alloc() for node allocation.	2016-06-03 12:27:41 -07:00
Jason Evans	db72272bef	Use rtree-based chunk lookups rather than pointer bit twiddling. Look up chunk metadata via the radix tree, rather than using CHUNK_ADDR2BASE(). Propagate pointer's containing extent. Minimize extent lookups by doing a single lookup (e.g. in free()) and propagating the pointer's extent into nearly all the functions that may need it.	2016-06-03 12:27:41 -07:00
Jason Evans	2d2b4e98c9	Add element acquire/release capabilities to rtree. This makes it possible to acquire short-term "ownership" of rtree elements so that it is possible to read an extent pointer and read the extent's contents with a guarantee that the element will not be modified until the ownership is released. This is intended as a mechanism for resolving rtree read/write races rather than as a way to lock extents.	2016-06-03 12:27:33 -07:00
Jason Evans	a7a6f5bc96	Rename extent_node_t to extent_t.	2016-05-16 12:21:28 -07:00
Jason Evans	3aea827f5e	Simplify run quantization.	2016-05-16 12:21:27 -07:00
Jason Evans	7bb00ae9d6	Refactor runs_avail. Use pszind_t size classes rather than szind_t size classes, and always reserve space for NPSIZES elements. This removes unused heaps that are not multiples of the page size, and adds (currently) unused heaps for all huge size classes, with the immediate benefit that the size of arena_t allocations is constant (no longer dependent on chunk size).	2016-05-16 12:21:21 -07:00

1 2 3 4 5 ...

474 Commits