server-skynet-source-3rd-jemalloc


Author	SHA1	Message	Date
Jason Evans	c1baa0a9b7	Add huge page configuration and pages_[no]huge(). Add the --with-lg-hugepage configure option, but automatically configure LG_HUGEPAGE even if it isn't specified. Add the pages_[no]huge() functions, which toggle huge page state via madvise(..., MADV_[NO]HUGEPAGE) calls.	2016-12-26 17:59:34 -08:00
Jason Evans	eab3b180e5	Fix JSON-mode output for !config_stats and/or !config_prof cases. These bugs were introduced by 0ba5b9b6189e16a983d8922d8c5cb6ab421906e8 (Add "J" (JSON) support to malloc_stats_print().), which was backported as b599b32280e1142856b0b96293a71e1684b1ccfb (with the same bugs except the inapplicable "metatata" misspelling) and first released in 4.3.0.	2016-12-23 11:15:44 -08:00
Jason Evans	bacb6afc6c	Simplify arena_slab_regind(). Rewrite arena_slab_regind() to provide sufficient constant data for the compiler to perform division strength reduction. This replaces more general manual strength reduction that was implemented before arena_bin_info was compile-time-constant. It would be possible to slightly improve on the compiler-generated division code by taking advantage of range limits that the compiler doesn't know about.	2016-12-23 10:34:34 -08:00
Jason Evans	194d6f9de8	Restructure CFLAGS/CXXFLAGS configuration. Convert CFLAGS/CXXFLAGS to be concatenations: CFLAGS := CONFIGURE_CFLAGS SPECIFIED_CFLAGS EXTRA_CFLAGS CXXFLAGS := CONFIGURE_CXXFLAGS SPECIFIED_CXXFLAGS EXTRA_CXXFLAGS This ordering makes it possible to override the flags set by the configure script both during and after configuration, with CFLAGS/CXXFLAGS and EXTRA_CFLAGS/EXTRA_CXXFLAGS, respectively. This resolves #504.	2016-12-16 07:24:36 -08:00
Jason Evans	a965a9cb12	Re-expand the Travis-CI build matrix.	2016-12-13 16:19:20 -08:00
Jason Evans	590ee2a6e0	Update Travis-CI config for C++ integration.	2016-12-13 14:53:10 -08:00
Jason Evans	69c26cdb01	Add some missing explicit casts.	2016-12-13 13:38:11 -08:00
Dave Watson	2319152d9f	jemalloc cpp new/delete bindings Adds cpp bindings for jemalloc, along with necessary autoconf settings. This is mostly to add sized deallocation support, which can't be added from C directly. Sized deallocation is ~10% microbench improvement. * Import ax_cxx_compile_stdcxx.m4 from the autoconf repo, seems like the easiest way to get c++14 detection. * Adds various other changes, like CXXFLAGS, to configure.ac. * Adds new rules to Makefile.in for src/jemalloc-cpp.cpp, and a basic unittest. * Both new and delete are overridden, to ensure jemalloc is used for both. * TODO future enhancement of avoiding extra PLT thunks for new and delete - sdallocx and malloc are publicly exported jemalloc symbols, using an alias would link them directly. Unfortunately, was having trouble getting it to play nice with jemalloc's namespace support. Testing: Tested gcc 4.8, gcc 5, gcc 5.2, clang 4.0. Only gcc >= 5 has sized deallocation support, verified that the rest build correctly. Tested mac osx and Centos. Tested --with-jemalloc-prefix and --without-export. This resolves #202.	2016-12-12 18:36:06 -08:00
Jason Evans	d4c5aceb7c	Add a_type parameter to qr_{meld,split}().	2016-12-12 18:16:51 -08:00
Jason Evans	fbe3015818	Update ChangeLog for 4.4.0.	2016-12-03 18:35:23 -08:00
Jason Evans	acb7b1f53e	Add --disable-syscall. This resolves #517.	2016-12-03 16:50:58 -08:00
Jason Evans	7179351a45	Update configure cache file example.	2016-11-30 09:57:12 -08:00
John Szakmeister	eb29d7ec0e	Implement a more reliable detection scheme for os_unfair_lock. The core issue here is the weak linking of the symbol, and in certain environments--for instance, using the latest Xcode (8.1) with the latest SDK (10.12)--os_unfair_lock may resolve even though you're compiling on a host that doesn't support it (10.11). We can use the availability macros to circumvent this problem, and detect that we're not compiling for a target that is going to support them and error out at compile time. The other alternative is to do a runtime check, but that presents issues for cross-compiling.	2016-11-23 15:32:35 -05:00
Jason Evans	32127949a3	Enable overriding JEMALLOC_{ALLOC,FREE}_JUNK. This resolves #509.	2016-11-22 10:58:58 -08:00
Jason Evans	c3b85f2585	Style fixes.	2016-11-22 10:58:23 -08:00
Jason Evans	5234be2133	Add pthread_atfork(3) feature test. Some versions of Android provide a pthreads library without providing pthread_atfork(), so in practice a separate feature test is necessary for the latter.	2016-11-17 15:14:57 -08:00
Jason Evans	fda60be799	Update a comment.	2016-11-17 11:50:52 -08:00
Jason Evans	a64123ce13	Refactor madvise(2) configuration. Add feature tests for the MADV_FREE and MADV_DONTNEED flags to madvise(2), so that MADV_FREE is detected and used for Linux kernel versions 4.5 and newer. Refactor pages_purge() so that on systems which support both flags, MADV_FREE is preferred over MADV_DONTNEED. This resolves #387.	2016-11-17 10:31:57 -08:00
Jason Evans	f7ca1c9bc3	Remove a residual comment.	2016-11-16 19:41:09 -08:00
Jason Evans	aec5a051e8	Avoid gcc type-limits warnings.	2016-11-16 18:28:38 -08:00
Maks Naumov	95974c0440	Remove size_t -> unsigned -> size_t conversion.	2016-11-16 11:23:31 -08:00
Jason Evans	9b94c015af	Document how to use --cache configure option. This resolves #494.	2016-11-16 10:56:40 -08:00
Jason Evans	4066b4ef57	Revert "Add JE_RUNNABLE() and use it for os_unfair_lock_*() test." This reverts commit a2e601a2236315fb6f994ff364ea442ed0aed07b. JE_RUNNABLE() causes general cross-compilation issues.	2016-11-16 10:40:00 -08:00
Jason Evans	8a4528bdd1	Uniformly cast mallctl[bymib]() oldp/newp arguments to (void *). This avoids warnings in some cases, and is otherwise generally good hygiene.	2016-11-15 15:01:03 -08:00
Jason Evans	2c95154501	Add packing test, which verifies stable layout policy.	2016-11-15 13:08:33 -08:00
Jason Evans	a38acf716e	Add extent serial numbers. Add extent serial numbers and use them where appropriate as a sort key that is higher priority than address, so that the allocation policy prefers older extents. This resolves #147.	2016-11-15 13:08:33 -08:00
Jason Evans	c0a667112c	Fix arena_reset() crashing bug. This regression was caused by 498856f44a30b31fe713a18eb2fc7c6ecf3a9f63 (Move slabs out of chunks.).	2016-11-15 10:34:02 -08:00
Jason Evans	a2e601a223	Add JE_RUNNABLE() and use it for os_unfair_lock_*() test. This resolves #494.	2016-11-12 09:48:06 -08:00
Jason Evans	c25e711cf9	Reduce memory usage for sdallocx() test_alignment_and_size.	2016-11-11 23:50:35 -08:00
Jason Evans	32d69e967e	Add configure support for --linux-android. This is tailored to Android, i.e. more specific than the --linux* configuration. This resolves #471.	2016-11-10 15:36:17 -08:00
Jason Evans	c233dd5e40	Update config.{guess,sub} from upstream.	2016-11-10 15:02:05 -08:00
Jason Evans	85dae2ff49	Update ChangeLog for 4.3.1.	2016-11-07 16:22:02 -08:00
Jason Evans	5e0373c815	Fix test_prng_lg_range_zu() to work on 32-bit systems.	2016-11-07 11:50:11 -08:00
Jason Evans	cda59f9970	Rename atomic_*_{uint32,uint64,u}() to atomic_*_{u32,u64,zu}(). This change conforms to naming conventions throughout the codebase.	2016-11-07 11:27:48 -08:00
Jason Evans	2e46b13ad5	Revert "Define 64-bits atomics unconditionally" This reverts commit c2942e2c0e097e7c75a3addd0b9c87758f91692e. This resolves #495.	2016-11-07 10:53:35 -08:00
Jason Evans	04b463546e	Refactor prng to not use 64-bit atomics on 32-bit platforms. This resolves #495.	2016-11-07 10:52:44 -08:00
Jason Evans	e0a9e78374	Update ChangeLog for 4.3.0.	2016-11-04 15:15:24 -07:00
Matthew Parkinson	d30b3ea51a	Fixes to Visual Studio Project files	2016-11-04 09:59:05 -07:00
Jason Evans	6d2a57cfbb	Use -std=gnu11 if available. This supersedes -std=gnu99, and enables C11 atomics.	2016-11-03 21:57:17 -07:00
Jason Evans	0760876927	Update ChangeLog for 4.3.0.	2016-11-04 00:02:43 -07:00
Jason Evans	a967fae362	Fix/simplify extent_recycle() allocation size computations. Do not call s2u() during alloc_size computation, since any necessary ceiling increase is taken care of later by extent_first_best_fit() --> extent_size_quantize_ceil(), and the s2u() call may erroneously cause a higher quantization result. Remove an overly strict overflow check that was added in 4a7852137d8b6598fdb90ea8e1fd3bc8a8b94a3a (Fix extent_recycle()'s cache-oblivious padding support.).	2016-11-03 23:49:21 -07:00
Jason Evans	4a7852137d	Fix extent_recycle()'s cache-oblivious padding support. Add padding after computing the size class, so that the optimal size class isn't skipped during search for a usable extent. This regression was caused by b46261d58b449cc4c099ed2384451a2499688f0e (Implement cache-oblivious support for huge size classes.).	2016-11-03 22:33:35 -07:00
Jason Evans	ea9961acdb	Fix psz/pind edge cases. Add an "over-size" extent heap in which to store extents which exceed the maximum size class (plus cache-oblivious padding, if enabled). Remove psz2ind_clamp() and use psz2ind() instead so that trying to allocate the maximum size class can in principle succeed. In practice, this allows assertions to hold so that OOM errors can be successfully generated.	2016-11-03 22:33:34 -07:00
Jason Evans	8dd5ea87ca	Fix extent_alloc_cache[_locked]() to support decommitted allocation. Fix extent_alloc_cache[_locked]() to support decommitted allocation, and use this ability in arena_stash_dirty(), so that decommitted extents are not needlessly committed during purging. In practice this does not happen on any currently supported systems, because both extent merging and decommit must be implemented; all supported systems implement one xor the other.	2016-11-03 22:33:23 -07:00
Jason Evans	4f7d8c2dee	Update symbol mangling.	2016-11-03 15:00:02 -07:00
Jason Evans	04e1328ef1	Update ChangeLog for 4.3.0.	2016-11-02 21:39:24 -07:00
Samuel Moritz	69f027b855	Support Debian GNU/kFreeBSD. Treat it exactly like Linux since they both use GNU libc.	2016-11-02 20:36:37 -07:00
Dave Watson	25f7bbcf28	Fix long spinning in rtree_node_init. rtree_node_init spinlocks the node, allocates, and then sets the node. This is under heavy contention at the top of the tree if many threads start to allocate at the same time. Instead, take a per-rtree sleeping mutex to reduce spinning. Tested both pthreads and osx OSSpinLock, and both reduce spinning adequately. Previous benchmark time: ./ttest1 500 100 ~15s. New benchmark time: ./ttest1 500 100 .57s.	2016-11-02 20:30:53 -07:00
Dave Watson	712fde79fd	Check for existence of CPU_COUNT macro before using it. This resolves #485.	2016-11-02 20:05:40 -07:00
Jason Evans	83ebf2fda5	Fix syscall(2) configure test for Linux.	2016-11-02 19:50:44 -07:00
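
Commit a38acf716e in the log above describes using extent serial numbers as a sort key with higher priority than address, so that the allocation policy prefers older extents. The following is a minimal sketch of that ordering idea only, not jemalloc's actual code. The names demo_extent_t, demo_extent_snad_comp and demo_extent_sort are hypothetical, and the sketch assumes that a smaller serial number means an older extent.

```c
#include <stddef.h>
#include <stdint.h>
#include <stdlib.h>

/* Hypothetical stand-in for an extent record; not jemalloc's extent_t. */
typedef struct {
	size_t		sn;	/* Serial number; assumed: smaller means older. */
	uintptr_t	addr;	/* Base address; used only to break ties. */
} demo_extent_t;

/* Compare by serial number first, then by address (qsort-compatible). */
static int
demo_extent_snad_comp(const void *pa, const void *pb)
{
	const demo_extent_t *a = (const demo_extent_t *)pa;
	const demo_extent_t *b = (const demo_extent_t *)pb;
	int sn_cmp = (a->sn > b->sn) - (a->sn < b->sn);

	if (sn_cmp != 0)
		return (sn_cmp);
	return ((a->addr > b->addr) - (a->addr < b->addr));
}

/* Sort so that the oldest extent (lowest serial number) comes first. */
static void
demo_extent_sort(demo_extent_t *extents, size_t n)
{
	qsort(extents, n, sizeof(*extents), demo_extent_snad_comp);
}
```

With this comparison, two extents that share a serial number are ordered by address, while an older extent always sorts ahead of a newer one regardless of where it sits in memory.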


1708 Commits