server-skynet-source-3rd-jemalloc

project-base/server-skynet-source-3rd-jemalloc

Author	SHA1	Message	Date
Jason Evans	3763d3b5f9	Refactor arena_cactive_update() into arena_cactive_{add,sub}(). This removes an implicit conversion from size_t to ssize_t. For cactive decreases, the size_t value was intentionally underflowed to generate "negative" values (actually positive values above the positive range of ssize_t), and the conversion to ssize_t was undefined according to C language semantics. This regression was perpetuated by 1522937e9cbcfa24c881dc439cc454f9a34a7e88 (Fix the cactive statistic.) and first release in 4.0.0, which in retrospect only fixed one of two problems introduced by aa5113b1fdafd1129c22512837c6c3d66c295fc8 (Refactor overly large/complex functions) and first released in 3.5.0.	2016-02-26 17:29:35 -08:00
Jason Evans	a62e94cabb	Remove invalid tests. Remove invalid tests that were intended to be tests of (hugemax+1) OOM, for which tests already exist.	2016-02-26 16:27:52 -08:00
buchgr	d412624b25	Move retaining out of default chunk hooks This fixes chunk allocation to reuse retained memory even if an application-provided chunk allocation function is in use. This resolves #307.	2016-02-26 15:24:13 -08:00
Jason Evans	20fad3430c	Refactor some bitmap cpp logic.	2016-02-26 14:43:39 -08:00
Dave Watson	b8823ab026	Use linear scan for small bitmaps For small bitmaps, a linear scan of the bitmap is slightly faster than a tree search - bitmap_t is more compact, and there are fewer writes since we don't have to propogate state transitions up the tree. On x86_64 with the current settings, I'm seeing ~.5%-1% CPU improvement in production canaries with this change. The old tree code is left since 32bit sizes are much larger (and ffsl smaller), and maybe the run sizes will change in the future. This resolves #339.	2016-02-26 14:21:10 -08:00
Jason Evans	01ecdf32d6	Miscellaneous bitmap refactoring.	2016-02-26 14:21:10 -08:00
rustyx	4c4ee292e4	Improve test_threads performance	2016-02-26 17:18:58 +01:00
rustyx	ebd00e95b8	Fix MSVC project	2016-02-26 17:18:48 +01:00
Jason Evans	42ce80e15a	Silence miscellaneous 64-to-32-bit data loss warnings. This resolves #341.	2016-02-25 20:51:00 -08:00
Jason Evans	8282a2ad97	Remove a superfluous comment.	2016-02-25 16:44:48 -08:00
Jason Evans	9d2c10f2e8	Add more HUGE_MAXCLASS overflow checks. Add HUGE_MAXCLASS overflow checks that are specific to heap profiling code paths. This fixes test failures that were introduced by 0c516a00c4cb28cff55ce0995f756b5aae074c9e (Make *allocx() size class overflow behavior defined.).	2016-02-25 16:42:15 -08:00
Jason Evans	e3195fa4a5	Cast PTRDIFF_MAX to size_t before adding 1. This fixes compilation warnings regarding integer overflow that were introduced by 0c516a00c4cb28cff55ce0995f756b5aae074c9e (Make *allocx() size class overflow behavior defined.).	2016-02-25 16:40:24 -08:00
Jason Evans	0c516a00c4	Make *allocx() size class overflow behavior defined. Limit supported size and alignment to HUGE_MAXCLASS, which in turn is now limited to be less than PTRDIFF_MAX. This resolves #278 and #295.	2016-02-25 15:29:49 -08:00
Jason Evans	767d85061a	Refactor arenas array (fixes deadlock). Refactor the arenas array, which contains pointers to all extant arenas, such that it starts out as a sparse array of maximum size, and use double-checked atomics-based reads as the basis for fast and simple arena_get(). Additionally, reduce arenas_lock's role such that it only protects against arena initalization races. These changes remove the possibility for arena lookups to trigger locking, which resolves at least one known (fork-related) deadlock. This resolves #315.	2016-02-24 23:58:10 -08:00
Dave Watson	3812729167	Fix arena_size computation. Fix arena_size arena_new() computation to incorporate runs_avail_nclasses elements for runs_avail, rather than (runs_avail_nclasses - 1) elements. Since offsetof(arena_t, runs_avail) is used rather than sizeof(arena_t) for the first term of the computation, all of the runs_avail elements must be added into the second term. This bug was introduced (by Jason Evans) while merging pull request #330 as 3417a304ccde61ac1f68b436ec22c03f1d6824ec (Separate arena_avail trees).	2016-02-24 20:10:02 -08:00
Dave Watson	cd86c1481a	Fix arena_run_first_best_fit Merge of 3417a304ccde61ac1f68b436ec22c03f1d6824ec looks like a small bug: first_best_fit doesn't scan through all the classes, since ind is offset from runs_avail_nclasses by run_avail_bias.	2016-02-24 17:50:02 -08:00
Jason Evans	c7a9a6c86b	Attempt mmap-based in-place huge reallocation. Attempt mmap-based in-place huge reallocation by plumbing new_addr into chunk_alloc_mmap(). This can dramatically speed up incremental huge reallocation. This resolves #335.	2016-02-24 17:23:18 -08:00
Jason Evans	5ec703dd33	Document the heap profile format. This resolves #258.	2016-02-24 15:35:24 -08:00
Jason Evans	f591d2611a	Update manual to reflect removal of global huge object tree. This resolves #323.	2016-02-24 14:38:54 -08:00
Jason Evans	aa63d5d377	Fix ffs_zu() compilation error on MinGW. This regression was caused by 9f4ee6034c3ac6a8c8b5f9a0d76822fb2fd90c41 (Refactor jemalloc_ffs() into ffs_().).	2016-02-24 14:01:47 -08:00
Jason Evans	ca8fffb5c1	Silence miscellaneous 64-to-32-bit data loss warnings.	2016-02-24 13:16:51 -08:00
Jason Evans	b3d0070b14	Compile with -Wshorten-64-to-32. This will prevent accidental creation of potential integer truncation bugs when developing on LP64 systems.	2016-02-24 13:03:48 -08:00
Jason Evans	9e1810ca9d	Silence miscellaneous 64-to-32-bit data loss warnings.	2016-02-24 13:03:48 -08:00
Jason Evans	1c42a04cc6	Change lg_floor() return type from size_t to unsigned.	2016-02-24 13:03:48 -08:00
Jason Evans	0931cecbfa	Use ssize_t for readlink() rather than int.	2016-02-24 13:03:48 -08:00
Jason Evans	8f683b94a7	Make opt_narenas unsigned rather than size_t.	2016-02-24 13:03:48 -08:00
Jason Evans	603b3bd413	Make nhbins unsigned rather than size_t.	2016-02-24 13:03:48 -08:00
Jason Evans	8dd5115ede	Explicitly cast mib[] elements to unsigned where appropriate.	2016-02-24 13:03:48 -08:00
Jason Evans	9f4ee6034c	Refactor jemalloc_ffs() into ffs_(). Use appropriate versions to resolve 64-to-32-bit data loss warnings.	2016-02-24 13:03:48 -08:00
Dmitri Smirnov	b41a07c31a	Fix Windows build issues This resolves #333.	2016-02-23 18:55:45 -08:00
Jason Evans	ae45142adc	Collapse arena_avail_tree_* into arena_run_tree_*. These tree types converged to become identical, yet they still had independently generated red-black tree implementations.	2016-02-23 18:27:24 -08:00
Dave Watson	3417a304cc	Separate arena_avail trees Separate run trees by index, replacing the previous quantize logic. Quantization by index is now performed only on insertion / removal from the tree, and not on node comparison, saving some cpu. This also means we don't have to dereference the miscelm* pointers, saving half of the memory loads from miscelms/mapbits that have fallen out of cache. A linear scan of the indicies appears to be fast enough. The only cost of this is an extra tree array in each arena.	2016-02-23 18:09:36 -08:00
Dave Watson	2b1fc90b7b	Remove rbt_nil Since this is an intrusive tree, rbt_nil is the whole size of the node and can be quite large. For example, miscelm is ~100 bytes.	2016-02-23 18:09:25 -08:00
Jason Evans	0da8ce1e96	Use table lookup for run_quantize_{floor,ceil}(). Reduce run quantization overhead by generating lookup tables during bootstrapping, and using the tables for all subsequent run quantization.	2016-02-22 16:47:34 -08:00
Jason Evans	08551eee58	Fix run_quantize_ceil(). In practice this bug had limited impact (and then only by increasing chunk fragmentation) because run_quantize_ceil() returned correct results except for inputs that could only arise from aligned allocation requests that required more than page alignment. This bug existed in the original run quantization implementation, which was introduced by 8a03cf039cd06f9fa6972711195055d865673966 (Implement cache index randomization for large allocations.).	2016-02-22 16:28:00 -08:00
Jason Evans	a9a4684792	Test run quantization. Also rename run_quantize_*() to improve clarity. These tests demonstrate that run_quantize_ceil() is flawed.	2016-02-22 14:58:05 -08:00
Jason Evans	817d9030a5	Indentation style cleanup.	2016-02-22 10:44:58 -08:00
Jason Evans	9bad079039	Refactor time_* into nstime_*. Use a single uint64_t in nstime_t to store nanoseconds rather than using struct timespec. This reduces fragility around conversions between long and uint64_t, especially missing casts that only cause problems on 32-bit platforms.	2016-02-21 21:39:05 -08:00
Jason Evans	788d29d397	Fix Windows-specific prof-related compilation portability issues.	2016-02-20 23:46:14 -08:00
Jason Evans	fd9cd7a6cc	Fix time_update() to compile and work on MinGW.	2016-02-20 23:45:22 -08:00
Jason Evans	56139dc403	Remove _WIN32-specific struct timespec declaration. struct timespec is already defined by the system (at least on MinGW).	2016-02-20 23:43:17 -08:00
Jason Evans	ecae12323d	Fix overflow in prng_range(). Add jemalloc_ffs64() and use it instead of jemalloc_ffsl() in prng_range(), since long is not guaranteed to be a 64-bit type.	2016-02-20 23:41:33 -08:00
Jason Evans	aac93f414e	Add symbol mangling for prng_[lg_]range().	2016-02-20 11:26:00 -08:00
rustyx	984c64f724	Add MS Visual Studio 2015 support	2016-02-20 10:55:23 -08:00
rustyx	3c2c5a5071	Fix warning in ipalloc	2016-02-20 10:55:18 -08:00
rustyx	efbee86278	Prevent MSVC from optimizing away tls_callback (resolves #318 )	2016-02-20 10:52:53 -08:00
rustyx	7f283980f0	getpid() fix for Win32	2016-02-20 10:52:53 -08:00
rustyx	90c7269c05	Add CPU "pause" intrinsic for MSVC	2016-02-20 10:52:48 -08:00
rustyx	bc49863fb5	Fix error "+ 2")syntax error: invalid arithmetic operator (error token is " in Cygwin x64	2016-02-20 10:50:24 -08:00
rustyx	46e0b2301c	Detect LG_SIZEOF_PTR depending on MSVC platform target	2016-02-20 10:50:24 -08:00

1 2 3 4 5 ...

1244 Commits