server-skynet-source-3rd-jemalloc

Author	SHA1	Message	Date
zhxchen17	4b76c684bb	Add "prof.dump_prefix" to override filename prefixes for dumps.	2019-09-12 22:26:03 -07:00
Qi Wang	22bc75ee3e	Workaround the stringop-overflow check false positives.	2019-09-09 11:35:04 -07:00
Qi Wang	0043e68d4c	Track low_water == -1 case explicitly. The -1 value of low_water indicates if the cache has been depleted and refilled. Track the status explicitly in the tcache struct. This allows the fast path to check if (cur_ptr > low_water), instead of >=, which avoids reaching slow path when the last item is allocated.	2019-08-21 16:00:38 -07:00
Qi Wang	937ca1db9f	Store ncached_max * ptr_size in tcache_bin_info. With the cache bin metadata switched to pointers, ncached_max is usually accessed and multiplied by sizeof(ptr). Store the results in tcache_bin_info for direct access, and add a helper function for the ncached_max value.	2019-08-19 12:23:24 -07:00
Qi Wang	7599c82d48	Redesign the cache bin metadata for fast path. Implement pointer-based metadata for tcache bins: 3 pointers are maintained to represent each bin; 2 of the pointers are compressed on 64-bit; is_full / is_empty are done through pointer comparison. Compared to the previous counter-based design: fast path speeds up ~15% in benchmarks; direct pointer comparison and dereference; no need to access tcache_bin_info in the common case.	2019-08-19 12:21:44 -07:00
Yinan Zhang	ad3f7dbfa0	Buffer prof_log_stop. Make use of the new buffered writer for the output of `prof_log_stop`.	2019-08-12 09:06:01 -07:00
Yinan Zhang	8c8466fa6e	Add compact json option for emitter. JSON format is largely meant for machine-machine communication, so add the option to the emitter. According to local testing, the savings in bytes output are around 50% for stats printing and around 25% for prof log printing.	2019-08-09 09:53:41 -07:00
Yinan Zhang	7fc6b1b259	Add buffered writer. The buffered writer adopts a signature identical to `write_cb`, so that it can be plugged in anywhere `write_cb` appears.	2019-08-09 09:44:29 -07:00
Qi Wang	10fcff6c38	Lower nthreads in test/unit/retained on 32-bit to avoid OOM.	2019-07-25 13:10:03 -07:00
Qi Wang	9a86c65abc	Implement retain on Windows. The VirtualAlloc and VirtualFree APIs are different because MEM_DECOMMIT cannot be used across multiple VirtualAlloc regions. To properly support decommit, only allow merge / split within the same region -- this is done by tracking the "is_head" state of extents and not merging cross-region. Add a new state is_head (only relevant for retain && !maps_coalesce), which is true for the first extent in each VirtualAlloc region. Determine if two extents can be merged based on the head state, and use serial numbers for sanity checks.	2019-07-23 22:18:55 -07:00
Yinan Zhang	c92ac30601	Add confirm_conf option. If the confirm_conf option is set, when the program starts, each of the four malloc_conf strings will be printed, and each option will be printed when being set.	2019-05-22 09:38:39 -07:00
Yinan Zhang	4c63b0e76a	Improve memory utilization tests. Added tests for large size classes and expanded the tests to cover a wider range of allocation sizes.	2019-05-21 12:57:06 -07:00
Doron Roberts-Kedes	7fc4f2a32c	Add nonfull_slabs to bin_stats_t. When config_stats is enabled track the size of bin->slabs_nonfull in the new nonfull_slabs counter in bin_stats_t. This metric should be useful for establishing an upper ceiling on the savings possible by meshing.	2019-04-29 13:35:02 -07:00
David Goldblatt	33e1dad680	Safety checks: Add a redzoning feature.	2019-04-15 16:48:12 -07:00
Yinan Zhang	7ee3897740	Separate tests for extent utilization API. As title.	2019-04-10 13:03:20 -07:00
Qi Wang	c2a3a7cd3f	Fix test/unit/prof_log. Compiler optimizations may produce more traces than expected. Instead, verify the lower bound only.	2019-04-05 13:47:10 -07:00
Yinan Zhang	9aab3f2be0	Add memory utilization analytics to mallctl. The analytics tool is put under the experimental.utilization namespace in mallctl. Input is one pointer or an array of pointers and the output is a list of memory utilization statistics.	2019-04-04 13:48:39 -07:00
Qi Wang	6fe11633b0	Fix the binshard unit test. The test attempts to trigger usage of multiple sharded bins, which percpu_arena makes less reliable.	2019-04-02 16:53:00 -07:00
Qi Wang	fb56766ca9	Eagerly purge oversized merged extents. This change improves memory usage slightly, at virtually no CPU cost.	2019-03-14 17:34:55 -07:00
Qi Wang	e3db480f6f	Rename huge_threshold to oversize_threshold. The keyword huge tends to remind people of huge pages, which are not relevant to the feature.	2019-01-25 13:15:45 -08:00
Qi Wang	7a815c1b7c	Un-experimental the huge_threshold feature.	2019-01-16 12:28:57 -08:00
Qi Wang	441335d924	Add unit test for producer-consumer pattern.	2018-12-18 15:09:53 -08:00
Qi Wang	711a61f3b4	Add unit test for sharded bins.	2018-12-03 17:17:03 -08:00
Qi Wang	45bb4483ba	Add stats for arenas.bin.i.nshards.	2018-12-03 17:17:03 -08:00
Tyler Etzel	5e23f96dd4	Add unit tests for logging	2018-08-01 13:27:11 -07:00
Tyler Etzel	eb261e53a6	Small refactoring of emitter: make the API clearer for use as a standalone JSON emitter; support cases that weren't possible before, e.g. emitting primitive values in an array and emitting nested arrays.	2018-08-01 13:27:11 -07:00
David Goldblatt	3aba072cef	SC: Remove global data. The global data is mostly only used at initialization, or for easy access to values we could compute statically. Instead of consuming that space (and risking TLB misses), we can just pass around a pointer to stack data during bootstrapping.	2018-07-23 13:37:08 -07:00
David Goldblatt	55e5cc1341	SC: Make some key size classes static. The largest small class, smallest large class, and largest large class may all be needed down fast paths; to avoid the risk of touching another cache line, we can make them available as constants.	2018-07-12 20:53:06 -07:00
David T. Goldblatt	a7f68aed3e	SC: Add page customization functionality.	2018-07-12 20:53:06 -07:00
David Goldblatt	2f07e92adb	Add lg_ceil to bit_util. Also, add the bit_util test back to the Makefile.	2018-07-12 20:53:06 -07:00
David Goldblatt	e904f813b4	Hide size class computation behind a layer of indirection. This change removes almost all the dependencies on size_classes.h, accessing the data there only via the new module sc.h, which does not depend on any configuration options. In a subsequent commit, we'll remove the configure-time size class computations, doing them at boot time, instead.	2018-07-12 20:53:06 -07:00
gnzlbg	3d29d11ac2	Clean compilation -Wextra. Before this commit jemalloc produced many warnings when compiled with -Wextra with both Clang and GCC. This commit fixes the issues raised by these warnings or suppresses them if they were spurious, at least for the Clang and GCC versions covered by CI. This commit: * adds `JEMALLOC_DIAGNOSTIC` macros: `JEMALLOC_DIAGNOSTIC_{PUSH,POP}` are used to modify the stack of enabled diagnostics. The `JEMALLOC_DIAGNOSTIC_IGNORE_...` macros are used to ignore a concrete diagnostic. * adds `JEMALLOC_FALLTHROUGH` macro to explicitly state that falling through `case` labels in a `switch` statement is intended * removes all UNUSED annotations on function parameters. The warning -Wunused-parameter is now disabled globally in `jemalloc_internal_macros.h` for all translation units that include that header. It is never re-enabled since that header cannot be included by users. * locally suppresses some -Wextra diagnostics: * `-Wmissing-field-initializers` is buggy in older Clang and GCC versions, where it does not understand that `= {0}` is a common C idiom to initialize a struct to zero * `-Wtype-limits` is suppressed in a particular situation where a generic macro, used in multiple different places, compares an unsigned integer for smaller than zero, which is always false. * `-Walloc-size-larger-than=` diagnostics warn when an allocation function is called with a size that is too large (out-of-range). These are suppressed in the parts of the tests where `jemalloc` explicitly does this to test that the allocation functions fail properly. * adds a new CI build bot that runs the log unit test on CI. Closes #1196.	2018-07-09 21:40:42 -07:00
Qi Wang	cdf15b458a	Rename huge_threshold to experimental, and tweak documentation.	2018-06-29 10:35:02 -07:00
Qi Wang	ff622eeab5	Add unit test for opt.huge_threshold.	2018-06-29 10:35:02 -07:00
Qi Wang	1302af4c43	Add ctl and stats for opt.huge_threshold.	2018-06-29 10:35:02 -07:00
Qi Wang	94a88c26f4	Implement huge arena: opt.huge_threshold. The feature allows using a dedicated arena for huge allocations. We want the additional arena to separate huge allocations because: 1) mixing small extents with huge ones causes fragmentation over the long run (this feature reduces VM size significantly); 2) with many arenas, huge extents rarely get reused across threads; and 3) huge allocations happen far less frequently, so there are no concerns about lock contention.	2018-06-29 10:35:02 -07:00
David Goldblatt	a7f749c9af	Hooks: Protect against reentrancy. Previously, we made the user deal with this themselves, but that's not good enough; if hooks may allocate, we should test the allocation pathways down hooks. If we're doing that, we might as well actually implement the protection for the user.	2018-05-18 11:43:03 -07:00
David Goldblatt	59e371f463	Hooks: Add a hook exhaustion test. When we run out of space in which to store hooks, we should return EAGAIN from the mallctl, but not otherwise misbehave.	2018-05-18 11:43:03 -07:00
David Goldblatt	bb071db92e	Mallctl: Add experimental.hooks.[install|remove].	2018-05-18 11:43:03 -07:00
David Goldblatt	126e9a84a5	Hooks: move the "extra" pointer into the hook_t itself. This simplifies the mallctl call to install a hook, which should only take a single argument.	2018-05-18 11:43:03 -07:00
David Goldblatt	cb0707c0fc	Hooks: hook the realloc pathways that move/expand.	2018-05-18 11:43:03 -07:00
David Goldblatt	67270040a5	Hooks: hook the realloc paths that act as pure malloc/free.	2018-05-18 11:43:03 -07:00
David Goldblatt	83e516154c	Hooks: hook the pure-expand function.	2018-05-18 11:43:03 -07:00
David Goldblatt	c154f5881b	Hooks: hook the pure-deallocation functions.	2018-05-18 11:43:03 -07:00
David Goldblatt	226327cf66	Hooks: hook the pure-allocation functions.	2018-05-18 11:43:03 -07:00
David Goldblatt	5ae6e7cbfa	Add "hook" module. The hook module allows a low-reader-overhead way of finding hooks to invoke and calling them. For now, none of the allocation pathways are tied into the hooks; this will come later.	2018-05-18 11:43:03 -07:00
David Goldblatt	06a8c40b36	Add the Seq module, a simple seqlock implementation. This allows fast reader-writer concurrency in cases where writers are rare. The immediate use case is for the hooking implementation.	2018-05-18 11:43:03 -07:00
David Goldblatt	c7a87e0e0b	Rename hooks module to test_hooks. "Hooks" is really the best name for the module that will contain the publicly exposed hooks. So let's rename the current "hooks" module (that hooks external dependencies, for reentrancy testing) to "test_hooks".	2018-05-18 11:43:03 -07:00
David Goldblatt	e870829e64	TSD: Add the ability to enter a global slow path. This gives any thread the ability to send other threads down slow paths the next time they fetch tsd.	2018-05-18 11:43:03 -07:00
David Goldblatt	982c10de35	TSD: Make all state access happen through a function. Shortly, tsd state will be atomic and have some complicated enough logic down the state-setting path that we should be aware of it.	2018-05-18 11:43:03 -07:00

590 Commits