server-skynet-source-3rd-jemalloc

project-base/server-skynet-source-3rd-jemalloc

Author	SHA1	Message	Date
Mike Hommey	c9db461ffb	Use InterlockedCompareExchange instead of non-existing InterlockedCompareExchange32	2015-03-17 12:09:30 +09:00
Jason Evans	04211e2266	Fix heap profiling regressions. Remove the prof_tctx_state_destroying transitory state and instead add the tctx_uid field, so that the tuple <thr_uid, tctx_uid> uniquely identifies a tctx. This assures that tctx's are well ordered even when more than two with the same thr_uid coexist. A previous attempted fix based on prof_tctx_state_destroying was only sufficient for protecting against two coexisting tctx's, but it also introduced a new dumping race. These regressions were introduced by `602c8e0971` (Implement per thread heap profiling.) and `764b00023f` (Fix a heap profiling regression.).	2015-03-16 15:11:06 -07:00
Jason Evans	262146dfc4	Eliminate innocuous compiler warnings.	2015-03-14 14:34:16 -07:00
Jason Evans	764b00023f	Fix a heap profiling regression. Add the prof_tctx_state_destroying transitionary state to fix a race between a thread destroying a tctx and another thread creating a new equivalent tctx. This regression was introduced by `602c8e0971` (Implement per thread heap profiling.).	2015-03-14 14:01:35 -07:00
Daniel Micay	d6384b09e1	use CLOCK_MONOTONIC in the timer if it's available Linux sets _POSIX_MONOTONIC_CLOCK to 0 meaning it might be available, so a sysconf check is necessary at runtime with a fallback to the mandatory CLOCK_REALTIME clock.	2015-03-13 14:07:35 -07:00
Mike Hommey	f69e2f6fda	Use the error code given to buferror on Windows `a14bce85` made buferror not take an error code, and make the Windows code path for buferror use GetLastError, while the alternative code paths used errno. Then `2a83ed02` made buferror take an error code again, and while it changed the non-Windows code paths to use that error code, the Windows code path was not changed accordingly.	2015-03-13 13:54:02 -07:00
Jason Evans	d69964bd2d	Fix a heap profiling regression. Fix prof_tctx_comp() to incorporate tctx state into the comparison. During a dump it is possible for both a purgatory tctx and an otherwise equivalent nominal tctx to reside in the tree at the same time. This regression was introduced by `602c8e0971` (Implement per thread heap profiling.).	2015-03-12 16:25:18 -07:00
Jason Evans	fbd8d773ad	Fix unsigned comparison underflow. These bugs only affected tests and debug builds.	2015-03-11 23:14:50 -07:00
Jason Evans	bc45d41d23	Fix a declaration-after-statement regression.	2015-03-11 16:50:40 -07:00
Jason Evans	f5c8f37259	Normalize rdelm/rd structure field naming.	2015-03-10 18:29:49 -07:00
Jason Evans	38e42d311c	Refactor dirty run linkage to reduce sizeof(extent_node_t).	2015-03-10 18:15:40 -07:00
Jason Evans	54673fd8d7	Update ChangeLog.	2015-03-09 16:02:40 -07:00
Jason Evans	04ca7580db	Fix a chunk_recycle() regression. This regression was introduced by `97c04a9383` (Use first-fit rather than first-best-fit run/chunk allocation.).	2015-03-06 23:25:13 -08:00
Jason Evans	97c04a9383	Use first-fit rather than first-best-fit run/chunk allocation. This tends to more effectively pack active memory toward low addresses. However, additional tree searches are required in many cases, so whether this change stands the test of time will depend on real-world benchmarks.	2015-03-06 20:21:41 -08:00
Jason Evans	5707d6f952	Quantize szad trees by size class. Treat sizes that round down to the same size class as size-equivalent in trees that are used to search for first best fit, so that there are only as many "firsts" as there are size classes. This comes closer to the ideal of first fit.	2015-03-06 20:21:41 -08:00
Jason Evans	f044bb219e	Change default chunk size from 4 MiB to 256 KiB. Recent changes have improved huge allocation scalability, which removes upward pressure to set the chunk size so large that huge allocations are rare. Smaller chunks are more likely to completely drain, so set the default to the smallest size that doesn't leave excessive unusable trailing space in chunk headers.	2015-03-06 20:18:34 -08:00
Mike Hommey	4d871f73af	Preserve LastError when calling TlsGetValue TlsGetValue has a semantic difference with pthread_getspecific, in that it can return a non-error NULL value, so it always sets the LastError. But allocator callers may not be expecting calling e.g. free() to change the value of the last error, so preserve it.	2015-03-04 09:50:33 -08:00
Mike Hommey	7c46fd59cc	Make --without-export actually work `9906660` added a --without-export configure option to avoid exporting jemalloc symbols, but the option didn't actually work.	2015-03-04 21:49:15 +09:00
Dave Huseby	970fcfbca5	adding support for bitrig	2015-02-25 20:36:01 -05:00
Jason Evans	35e3fd9a63	Fix a compilation error and an incorrect assertion.	2015-02-18 16:51:51 -08:00
Jason Evans	99bd94fb65	Fix chunk cache races. These regressions were introduced by `ee41ad409a` (Integrate whole chunks into unused dirty page purging machinery.).	2015-02-18 16:40:53 -08:00
Jason Evans	738e089a2e	Rename "dirty chunks" to "cached chunks". Rename "dirty chunks" to "cached chunks", in order to avoid overloading the term "dirty". Fix the regression caused by `339c2b23b2` (Fix chunk_unmap() to propagate dirty state.), and actually address what that change attempted, which is to only purge chunks once, and propagate whether zeroed pages resulted into chunk_record().	2015-02-18 01:15:50 -08:00
Jason Evans	339c2b23b2	Fix chunk_unmap() to propagate dirty state. Fix chunk_unmap() to propagate whether a chunk is dirty, and modify dirty chunk purging to record this information so it can be passed to chunk_unmap(). Since the broken version of chunk_unmap() claimed that all chunks were clean, this resulted in potential memory corruption for purging implementations that do not zero (e.g. MADV_FREE). This regression was introduced by `ee41ad409a` (Integrate whole chunks into unused dirty page purging machinery.).	2015-02-17 22:25:56 -08:00
Jason Evans	47701b22ee	arena_chunk_dirty_node_init() --> extent_node_dirty_linkage_init()	2015-02-17 22:23:10 -08:00
Jason Evans	eafebfdfbe	Remove obsolete type arena_chunk_miscelms_t.	2015-02-17 16:12:31 -08:00
Jason Evans	a4e1888d1a	Simplify extent_node_t and add extent_node_init().	2015-02-17 15:13:52 -08:00
Jason Evans	ee41ad409a	Integrate whole chunks into unused dirty page purging machinery. Extend per arena unused dirty page purging to manage unused dirty chunks in aaddtion to unused dirty runs. Rather than immediately unmapping deallocated chunks (or purging them in the --disable-munmap case), store them in a separate set of trees, chunks_[sz]ad_dirty. Preferrentially allocate dirty chunks. When excessive unused dirty pages accumulate, purge runs and chunks in ingegrated LRU order (and unmap chunks in the --enable-munmap case). Refactor extent_node_t to provide accessor functions.	2015-02-16 21:02:17 -08:00
Jason Evans	40ab8f98e4	Remove more obsolete (incorrect) assertions. This regression was introduced by `88fef7ceda` (Refactor huge_*() calls into arena internals.), and went undetected because of the --enable-debug regression.	2015-02-15 20:26:45 -08:00
Jason Evans	cb9b44914e	Remove obsolete (incorrect) assertions. This regression was introduced by `88fef7ceda` (Refactor huge_*() calls into arena internals.), and went undetected because of the --enable-debug regression.	2015-02-15 20:13:28 -08:00
Jason Evans	02e5dcf39d	Fix --enable-debug regression. Fix --enable-debug to actually enable debug mode. This regression was introduced by `cbf3a6d703` (Move centralized chunk management into arenas.).	2015-02-15 20:12:06 -08:00
Jason Evans	2195ba4e1f	Normalize _link and link_ fields to all be *_link.	2015-02-15 16:43:52 -08:00
Jason Evans	b01186cebd	Remove redundant tcache_boot() call.	2015-02-15 14:04:55 -08:00
Jason Evans	41cfe03f39	If MALLOCX_ARENA(a) is specified, use it during tcache fill.	2015-02-13 15:28:56 -08:00
Abhishek Kulkarni	feaaa3df0d	Take into account the install suffix that jemalloc was built with in the pkg-config file. Signed-off-by: Abhishek Kulkarni <adkulkar@umail.iu.edu>	2015-02-13 12:46:19 -08:00
Dan McGregor	f8880310eb	Put VERSION file in object directory Also allow for the possibility that there exists a VERSION file in the srcroot, in case of building from a release tarball out of tree.	2015-02-13 12:36:14 -08:00
Dan McGregor	ab5e3790f6	Build docs in object directory	2015-02-13 12:14:34 -08:00
Jason Evans	5f7140b045	Make prof_tctx accesses atomic. Although exceedingly unlikely, it appears that writes to the prof_tctx field of arena_chunk_map_misc_t could be reordered such that a stale value could be read during deallocation, with profiler metadata corruption and invalid pointer dereferences being the most likely effects.	2015-02-12 15:54:53 -08:00
Jason Evans	88fef7ceda	Refactor huge_() calls into arena internals. Make redirects to the huge_() API the arena code's responsibility, since arenas now take responsibility for all allocation sizes.	2015-02-12 14:06:37 -08:00
Daniel Micay	1eaf3b6f34	add missing check for new_addr chunk size `8ddc93293c` switched this to over using the address tree in order to avoid false negatives, so it now needs to check that the size of the free extent is large enough to satisfy the request.	2015-02-12 15:46:30 -05:00
Jason Evans	cbf3a6d703	Move centralized chunk management into arenas. Migrate all centralized data structures related to huge allocations and recyclable chunks into arena_t, so that each arena can manage huge allocations and recyclable virtual memory completely independently of other arenas. Add chunk node caching to arenas, in order to avoid contention on the base allocator. Use chunks_rtree to look up huge allocations rather than a red-black tree. Maintain a per arena unsorted list of huge allocations (which will be needed to enumerate huge allocations during arena reset). Remove the --enable-ivsalloc option, make ivsalloc() always available, and use it for size queries if --enable-debug is enabled. The only practical implications to this removal are that 1) ivsalloc() is now always available during live debugging (and the underlying radix tree is available during core-based debugging), and 2) size query validation can no longer be enabled independent of --enable-debug. Remove the stats.chunks.{current,total,high} mallctls, and replace their underlying statistics with simpler atomically updated counters used exclusively for gdump triggering. These statistics are no longer very useful because each arena manages chunks independently, and per arena statistics provide similar information. Simplify chunk synchronization code, now that base chunk allocation cannot cause recursive lock acquisition.	2015-02-12 00:15:56 -08:00
Jason Evans	f30e261c5b	Update ckh to support metadata allocation tracking.	2015-02-12 00:15:24 -08:00
Jason Evans	064dbfbaf7	Fix a regression in tcache_bin_flush_small(). Fix a serious regression in tcache_bin_flush_small() that was introduced by `1cb181ed63` (Implement explicit tcache support.).	2015-02-12 00:15:16 -08:00
Jason Evans	051eae8cc5	Remove unnecessary xchg* lock prefixes.	2015-02-10 16:05:52 -08:00
Jason Evans	9e561e8d3f	Test and fix tcache ID recycling.	2015-02-10 09:03:48 -08:00
Jason Evans	1cb181ed63	Implement explicit tcache support. Add the MALLOCX_TCACHE() and MALLOCX_TCACHE_NONE macros, which can be used in conjunction with the *allocx() API. Add the tcache.create, tcache.flush, and tcache.destroy mallctls. This resolves #145.	2015-02-09 17:44:48 -08:00
Jason Evans	23694b0745	Fix arena_get() for (!init_if_missing && refresh_if_missing) case. Fix arena_get() to refresh the cache as needed in the (!init_if_missing && refresh_if_missing) case. This flaw was introduced by the initial arena_get() implementation, which was part of `8bb3198f72` (Refactor/fix arenas manipulation.).	2015-02-09 17:43:10 -08:00
Jason Evans	8d0e04d42f	Refactor rtree to be lock-free. Recent huge allocation refactoring associates huge allocations with arenas, but it remains necessary to quickly look up huge allocation metadata during reallocation/deallocation. A global radix tree remains a good solution to this problem, but locking would have become the primary bottleneck after (upcoming) migration of chunk management from global to per arena data structures. This lock-free implementation uses double-checked reads to traverse the tree, so that in the steady state, each read or write requires only a single atomic operation. This implementation also assures that no more than two tree levels actually exist, through a combination of careful virtual memory allocation which makes large sparse nodes cheap, and skipping the root node on x64 (possible because the top 16 bits are all 0 in practice).	2015-02-04 16:51:53 -08:00
Jason Evans	c810fcea1f	Add (x != 0) assertion to lg_floor(x). lg_floor(0) is undefined, but depending on compiler options may not cause a crash. This assertion makes it harder to accidentally abuse lg_floor().	2015-02-04 16:51:53 -08:00
Jason Evans	f500a10b2e	Refactor base_alloc() to guarantee demand-zeroed memory. Refactor base_alloc() to guarantee that allocations are carved from demand-zeroed virtual memory. This supports sparse data structures such as multi-page radix tree nodes. Enhance base_alloc() to keep track of fragments which were too small to support previous allocation requests, and try to consume them during subsequent requests. This becomes important when request sizes commonly approach or exceed the chunk size (as could radix tree node allocations).	2015-02-04 16:51:53 -08:00
Jason Evans	918a1a5b3f	Reduce extent_node_t size to fit in one cache line.	2015-02-04 16:51:53 -08:00

1 2 3 4 5 ...

1007 Commits