server-skynet-source-3rd-jemalloc

project-base/server-skynet-source-3rd-jemalloc

Author	SHA1	Message	Date
Jason Evans	7ca0fdfb85	Disable munmap() if it causes VM map holes. Add a configure test to determine whether common mmap()/munmap() patterns cause VM map holes, and only use munmap() to discard unused chunks if the problem does not exist. Unify the chunk caching for mmap and dss. Fix options processing to limit lg_chunk to be large enough that redzones will always fit.	2012-04-12 20:20:58 -07:00
Jason Evans	d6abcbb14b	Always disable redzone by default. Always disable redzone by default, even when --enable-debug is specified. The memory overhead for redzones can be substantial, which makes this feature something that should only be opted into.	2012-04-12 17:09:54 -07:00
Mike Hommey	b8325f9cb0	Call base_boot before chunk_boot0 Chunk_boot0 calls rtree_new, which calls base_alloc, which locks the base_mtx mutex. That mutex is initialized in base_boot.	2012-04-12 11:42:20 -07:00
Mike Hommey	83c324acd8	Use a stub replacement and disable dss when sbrk is not supported	2012-04-12 08:43:08 -07:00
Jason Evans	5ff709c264	Normalize aligned allocation algorithms. Normalize arena_palloc(), chunk_alloc_mmap_slow(), and chunk_recycle_dss() to use the same algorithm for trimming over-allocation. Add the ALIGNMENT_ADDR2BASE(), ALIGNMENT_ADDR2OFFSET(), and ALIGNMENT_CEILING() macros, and use them where appropriate. Remove the run_size_p parameter from sa2u(). Fix a potential deadlock in chunk_recycle_dss() that was introduced by `eae269036c` (Add alignment support to chunk_alloc()).	2012-04-11 18:13:45 -07:00
Jason Evans	122449b073	Implement Valgrind support, redzones, and quarantine. Implement Valgrind support, as well as the redzone and quarantine features, which help Valgrind detect memory errors. Redzones are only implemented for small objects because the changes necessary to support redzones around large and huge objects are complicated by in-place reallocation, to the point that it isn't clear that the maintenance burden is worth the incremental improvement to Valgrind support. Merge arena_salloc() and arena_salloc_demote(). Refactor i[v]salloc() to expose the 'demote' option.	2012-04-11 11:46:18 -07:00
Jason Evans	a1ee7838e1	Rename labels. Rename labels from FOO to label_foo in order to avoid system macro definitions, in particular OUT and ERROR on mingw. Reported by Mike Hommey.	2012-04-10 15:07:44 -07:00
Mike Hommey	eae269036c	Add alignment support to chunk_alloc().	2012-04-10 14:51:39 -07:00
Mike Hommey	c5851eaf6e	Remove MAP_NORESERVE support It was only used by the swap feature, and that is gone.	2012-04-10 12:05:27 -07:00
Jason Evans	3701367e4c	Always initialize tcache data structures. Always initialize tcache data structures if the tcache configuration option is enabled, regardless of opt_tcache. This fixes "thread.tcache.enabled" mallctl manipulation in the case when opt_tcache is false.	2012-04-06 12:41:55 -07:00
Jason Evans	fad100bc35	Remove arena_malloc_prechosen(). Remove arena_malloc_prechosen(), now that arena_malloc() can be invoked in a way that is semantically equivalent.	2012-04-06 12:24:46 -07:00
Jason Evans	b147611b52	Add utrace(2)-based tracing (--enable-utrace).	2012-04-05 13:36:17 -07:00
Jason Evans	02b231205e	Fix threaded initialization and enable it on Linux. Reported by Mike Hommey.	2012-04-05 11:06:23 -07:00
Jason Evans	f3ca7c8386	Add missing "opt.lg_tcache_max" mallctl implementation.	2012-04-04 16:16:09 -07:00
Jason Evans	01b3fe55ff	Add a0malloc(), a0calloc(), and a0free(). Add a0malloc(), a0calloc(), and a0free(), which are used by FreeBSD's libc to allocate/deallocate TLS in static binaries.	2012-04-03 19:25:48 -07:00
Jason Evans	633aaff967	Postpone mutex initialization on FreeBSD. Postpone mutex initialization on FreeBSD until after base allocation is safe.	2012-04-03 19:25:30 -07:00
Jason Evans	9d4d76874d	Finish renaming "arenas.pagesize" to "arenas.page".	2012-04-02 07:15:42 -07:00
Jason Evans	ae4c7b4b40	Clean up PAGE macros. s/PAGE_SHIFT/LG_PAGE/g and s/PAGE_SIZE/PAGE/g. Remove remnants of the dynamic-page-shift code. Rename the "arenas.pagesize" mallctl to "arenas.page". Remove the "arenas.chunksize" mallctl, which is redundant with "opt.lg_chunk".	2012-04-02 07:04:34 -07:00
Jason Evans	f004737267	Revert "Avoid NULL check in free() and malloc_usable_size()." This reverts commit `96d4120ac0`. ivsalloc() depends on chunks_rtree being initialized. This can be worked around via a NULL pointer check. However, thread_allocated_tsd_get() also depends on initialization having occurred, and there is no way to guard its call in free() that is cheaper than checking whether ptr is NULL.	2012-04-02 15:18:24 -07:00
Jason Evans	96d4120ac0	Avoid NULL check in free() and malloc_usable_size(). Generalize isalloc() to handle NULL pointers in such a way that the NULL checking overhead is only paid when introspecting huge allocations (or NULL). This allows free() and malloc_usable_size() to no longer check for NULL. Submitted by Igor Bukanov and Mike Hommey.	2012-04-02 14:50:03 -07:00
Mike Hommey	80b25932ca	Move last bit of zone initialization in zone.c, and lazy-initialize	2012-04-02 14:15:20 -07:00
Jason Evans	4eeb52f080	Remove vsnprintf() and strtoumax() validation. Remove code that validates malloc_vsnprintf() and malloc_strtoumax() against their namesakes. The validation code has adequately served its usefulness at this point, and it isn't worth dealing with the different formatting for %p with glibc versus other implementations for NULL pointers ("(nil)" vs. "0x0"). Reported by Mike Hommey.	2012-04-02 02:30:24 -07:00
Mike Hommey	3c2ba0dcbc	Avoid crashes when system libraries use the purgeable zone allocator	2012-03-30 11:03:20 -07:00
Mike Hommey	71a93b8725	Move zone registration to zone.c	2012-03-30 10:53:00 -07:00
Mike Hommey	1a0e777024	Add a SYS_write definition on systems where it is not defined in headers Namely, in the Android NDK headers, SYS_write is not defined; but __NR_write is.	2012-03-30 10:21:41 -07:00
Mike Hommey	e77fa59ece	Don't use pthread_atfork to register prefork/postfork handlers on OSX OSX libc calls zone allocators' force_lock/force_unlock already.	2012-03-28 16:17:21 -07:00
Jason Evans	d4be8b7b6e	Add the "thread.tcache.enabled" mallctl.	2012-03-26 19:02:49 -07:00
Jason Evans	2465bdf493	Check for NULL ptr in malloc_usable_size(). Check for NULL ptr in malloc_usable_size(), rather than just asserting that ptr is non-NULL. This matches behavior of other implementations (e.g., glibc and tcmalloc).	2012-03-26 13:13:55 -07:00
Mike Hommey	5b3db098f7	Make zone_{free, realloc, free_definite_size} fallback to the system allocator if they are called with a pointer that jemalloc didn't allocate It turns out some OSX system libraries (like CoreGraphics on 10.6) like to call malloc_zone_* functions, but giving them pointers that weren't allocated with the zone they are using. Possibly, they do malloc_zone_malloc(malloc_default_zone()) before we register the jemalloc zone, and malloc_zone_realloc(malloc_default_zone()) after. malloc_default_zone() returning a different value in both cases.	2012-03-26 12:54:25 -07:00
Mike Hommey	5c89c50d18	Fix glibc hooks when using both --with-jemalloc-prefix and --with-mangling	2012-03-26 12:43:05 -07:00
Jason Evans	41b6afb834	Port to FreeBSD. Use FreeBSD-specific functions (_pthread_mutex_init_calloc_cb(), _malloc_{pre,post}fork()) to avoid bootstrapping issues due to allocation in libc and libthr. Add malloc_strtoumax() and use it instead of strtoul(). Disable validation code in malloc_vsnprintf() and malloc_strtoumax() until jemalloc is initialized. This is necessary because locale initialization causes allocation for both vsnprintf() and strtoumax(). Force the lazy-lock feature on in order to avoid pthread_self(), because it causes allocation. Use syscall(SYS_write, ...) rather than write(...), because libthr wraps write() and causes allocation. Without this workaround, it would not be possible to print error messages in malloc_conf_init() without substantially reworking bootstrapping. Fix choose_arena_hard() to look at how many threads are assigned to the candidate choice, rather than checking whether the arena is uninitialized. This bug potentially caused more arenas to be initialized than necessary.	2012-02-02 23:09:53 -08:00
Jason Evans	6da5418ded	Remove ephemeral mutexes. Remove ephemeral mutexes from the prof machinery, and remove malloc_mutex_destroy(). This simplifies mutex management on systems that call malloc()/free() inside pthread_mutex_{create,destroy}(). Add atomic_*_u() for operation on unsigned values. Fix prof_printf() to call malloc_vsnprintf() rather than malloc_snprintf().	2012-03-23 18:05:51 -07:00
Jason Evans	9225a1991a	Add JEMALLOC_CC_SILENCE_INIT(). Add JEMALLOC_CC_SILENCE_INIT(), which provides succinct syntax for initializing a variable to avoid a spurious compiler warning.	2012-03-23 15:39:07 -07:00
Jason Evans	cd9a1346e9	Implement tsd. Implement tsd, which is a TLS/TSD abstraction that uses one or both internally. Modify bootstrapping such that no tsd's are utilized until allocation is safe. Remove malloc_[v]tprintf(), and use malloc_snprintf() instead. Fix %p argument size handling in malloc_vsnprintf(). Fix a long-standing statistics-related bug in the "thread.arena" mallctl that could cause crashes due to linked list corruption.	2012-03-23 15:14:55 -07:00
Mike Hommey	154829d256	Improve zone support for OSX I tested a build from 10.7 run on 10.7 and 10.6, and a build from 10.6 run on 10.6. The AC_COMPILE_IFELSE limbo is to avoid running a program during configure, which presumably makes it work when cross compiling for iOS.	2012-03-20 11:52:50 -07:00
Mike Hommey	f4d0fc310e	Unbreak mac after commit `4e2e3dd`	2012-03-20 11:48:42 -07:00
Jason Evans	e24c7af35d	Invert NO_TLS to JEMALLOC_TLS.	2012-03-19 10:21:17 -07:00
Jason Evans	e7b8fa18d2	Rename the "tcache.flush" mallctl to "thread.tcache.flush".	2012-03-16 17:09:32 -07:00
Jason Evans	4e2e3dd9cf	Fix fork-related bugs. Acquire/release arena bin locks as part of the prefork/postfork. This bug made deadlock in the child between fork and exec a possibility. Split jemalloc_postfork() into jemalloc_postfork_{parent,child}() so that the child can reinitialize mutexes rather than unlocking them. In practice, this bug tended not to cause problems.	2012-03-13 16:31:41 -07:00
Jason Evans	824d34e5b7	Modify malloc_vsnprintf() validation code. Modify malloc_vsnprintf() validation code to verify that output is identical to vsnprintf() output, even if both outputs are truncated due to buffer exhaustion.	2012-03-13 13:19:04 -07:00
Jason Evans	0a0bbf63e5	Implement aligned_alloc(). Implement aligned_alloc(), which was added in the C11 standard. The function is weakly specified to the point that a minimally compliant implementation would be painful to use (size must be an integral multiple of alignment!), which in practice makes posix_memalign() a safer choice.	2012-03-13 12:55:21 -07:00
Jason Evans	4c2faa8a7c	Fix a regression in JE_COMPILABLE(). Revert JE_COMPILABLE() so that it detects link errors. Cross-compiling should still work as long as a valid configure cache is provided. Clean up some comments/whitespace.	2012-03-13 11:09:23 -07:00
Jason Evans	eb2398106f	Fix malloc_stats_print() option support. Fix malloc_stats_print() to honor 'b' and 'l' in the opts parameter.	2012-03-13 08:46:12 -07:00
Jason Evans	2bb6c7a632	s/PRIx64/PRIxPTR/ for uintptr_t printf() argument.	2012-03-12 13:38:55 -07:00
Jason Evans	7cca608575	Remove extra '}'.	2012-03-12 11:31:54 -07:00
Jason Evans	d81e4bdd5c	Implement malloc_vsnprintf(). Implement malloc_vsnprintf() (a subset of vsnprintf(3)) as well as several other printing functions based on it, so that formatted printing can be relied upon without concern for inducing a dependency on floating point runtime support. Replace malloc_write() calls with malloc_printf() where doing so simplifies the code. Add name mangling for library-private symbols in the data and BSS sections. Adjust CONF_HANDLE_() macros in malloc_conf_init() to expose all opt_* variable use to cpp so that proper mangling occurs.	2012-03-07 16:19:19 -08:00
Jason Evans	4507f34628	Remove the lg_tcache_gc_sweep option. Remove the lg_tcache_gc_sweep option, because it is no longer very useful. Prior to the addition of dynamic adjustment of tcache fill count, it was possible for fill/flush overhead to be a problem, but this problem no longer occurs.	2012-03-05 14:34:37 -08:00
Jason Evans	b8c8be7f8a	Use UINT64_C() rather than LLU for 64-bit constants.	2012-03-05 12:26:26 -08:00
Jason Evans	7e77eaffff	Add the --disable-experimental option.	2012-03-02 17:47:37 -08:00
Jason Evans	84f7cdb0c5	Rename prn to prng. Rename prn to prng so that Windows doesn't choke when trying to create a file named prn.h.	2012-03-02 15:59:45 -08:00
Jason Evans	0a5489e37d	Add --with-mangling. Add the --with-mangling configure option, which can be used to specify name mangling on a per public symbol basis that takes precedence over --with-jemalloc-prefix. Expose the memalign() and valloc() overrides even if --with-jemalloc-prefix is specified. This change does no real harm, and simplifies the code.	2012-03-01 17:19:20 -08:00
Jason Evans	166a745b39	Simplify zone_good_size(). Simplify zone_good_size() to avoid memory allocation. Submitted by Mike Hommey.	2012-02-29 13:03:29 -08:00
Jason Evans	7e15dab94d	Add nallocm(). Add nallocm(), which computes the real allocation size that would result from the corresponding allocm() call. nallocm() is a functional superset of OS X's malloc_good_size(), in that it takes alignment constraints into account.	2012-02-29 12:56:37 -08:00
Jason Evans	4bb0983013	Use glibc allocator hooks. When jemalloc is used as a libc malloc replacement (i.e. not prefixed), some particular setups may end up inconsistently calling malloc from libc and free from jemalloc, or the other way around. glibc provides hooks to make its functions use alternative implementations. Use them. Submitted by Karl Tomlinson and Mike Hommey.	2012-02-29 10:37:27 -08:00
Jason Evans	5965631636	Do not enforce minimum alignment in memalign(). Do not enforce minimum alignment in memalign(). This is a non-standard function, and there is disagreement over whether to enforce minimum alignment. Solaris documentation (whence memalign() originated) says that minimum alignment is required: The value of alignment must be a power of two and must be greater than or equal to the size of a word. However, Linux's manual page says in its NOTES section: memalign() may not check that the boundary parameter is correct. This is descriptive rather than prescriptive, but applications with bad assumptions about memalign() exist, so be as forgiving as possible. Reported by Mike Hommey.	2012-02-28 21:37:38 -08:00
Jason Evans	93c023d181	Remove unused variables in stats_print(). Submitted by Mike Hommey.	2012-02-28 21:13:12 -08:00
Jason Evans	bdcadf41e9	Remove unused variable in arena_run_split(). Submitted by Mike Hommey.	2012-02-28 21:11:03 -08:00
Jason Evans	d073a32109	Enable the stats configuration option by default.	2012-02-28 20:41:16 -08:00
Jason Evans	c90ad71237	Remove the sysv option.	2012-02-28 20:31:37 -08:00
Jason Evans	f081b88dfb	Fix realloc(p, 0) to act like free(p). Reported by Yoni Londer.	2012-02-28 20:24:05 -08:00
Jason Evans	b172610317	Simplify small size class infrastructure. Program-generate small size class tables for all valid combinations of LG_TINY_MIN, LG_QUANTUM, and PAGE_SHIFT. Use the appropriate table to generate all relevant data structures, and remove the distinction between tiny/quantum/cacheline/subpage bins. Remove --enable-dynamic-page-shift. This option didn't prove useful in practice, and it prevented optimizations. Add Tilera architecture support.	2012-02-28 16:50:47 -08:00
Jason Evans	5389146191	Remove the opt.lg_prof_bt_max option. Remove opt.lg_prof_bt_max, and hard code it to 7. The original intention of this option was to enable faster backtracing by limiting backtrace depth. However, this makes graphical pprof output very difficult to interpret. In practice, decreasing sampling frequency is a better mechanism for limiting profiling overhead.	2012-02-13 18:41:36 -08:00
Jason Evans	0b526ff94d	Remove the opt.lg_prof_tcmax option. Remove the opt.lg_prof_tcmax option and hard-code a cache size of 1024. This setting is something that users just shouldn't have to worry about. If lock contention actually ends up being a problem, the simple solution available to the user is to reduce sampling frequency.	2012-02-13 18:04:26 -08:00
Jason Evans	e7a1058aaa	Fix bin->runcur management. Fix an interaction between arena_dissociate_bin_run() and arena_bin_lower_run() that made it possible for bin->runcur to point to a run other than the lowest non-full run. This bug violated jemalloc's layout policy, but did not affect correctness.	2012-02-13 17:36:52 -08:00
Jason Evans	746868929a	Remove highruns statistics.	2012-02-13 15:18:19 -08:00
Jason Evans	ef8897b4b9	Make 8-byte tiny size class non-optional. When tiny size class support was first added, it was intended to support truly tiny size classes (even 2 bytes). However, this wasn't very useful in practice, so the minimum tiny size class has been limited to sizeof(void *) for a long time now. This is too small to be standards compliant, but other commonly used malloc implementations do not even bother using a 16-byte quantum on systems with vector units (SSE2+, AltiVEC, etc.). As such, it is safe in practice to support an 8-byte tiny size class on 64-bit systems that support 16-byte types.	2012-02-13 15:03:59 -08:00
Jason Evans	6ffbbeb5d6	Silence compiler warnings.	2012-02-13 12:31:30 -08:00
Jason Evans	962463d9b5	Streamline tcache-related malloc/free fast paths. tcache_get() is inlined, so do the config_tcache check inside tcache_get() and simplify its callers. Make arena_malloc() an inline function, since it is part of the malloc() fast path. Remove conditional logic that cause build issues if --disable-tcache was specified.	2012-02-13 12:29:49 -08:00
Jason Evans	4162627757	Remove the swap feature. Remove the swap feature, which enabled per application swap files. In practice this feature has not proven itself useful to users.	2012-02-13 10:56:17 -08:00
Jason Evans	fd56043c53	Remove magic. Remove structure magic, because 1) it is no longer conditional, and 2) it stopped being very effective at detecting memory corruption several years ago.	2012-02-13 10:24:43 -08:00
Jason Evans	7372b15a31	Reduce cpp conditional logic complexity. Convert configuration-related cpp conditional logic to use static constant variables, e.g.: #ifdef JEMALLOC_DEBUG [...] #endif becomes: if (config_debug) { [...] } The advantage is clearer, more concise code. The main disadvantage is that data structures no longer have conditionally defined fields, so they pay the cost of all fields regardless of whether they are used. In practice, this is only a minor concern; config_stats will go away in an upcoming change, and config_prof is the only other major feature that depends on more than a few special-purpose fields.	2012-02-10 20:22:09 -08:00
Jason Evans	334cc02142	Fix malloc_stats_print(..., "a") output. Fix the logic in stats_print() such that if the "a" flag is passed in without the "m" flag, merged statistics will be printed even if only one arena is initialized.	2011-11-11 14:46:04 -08:00
Jason Evans	12a4887826	Fix huge_ralloc to maintain chunk statistics. Fix huge_ralloc() to properly maintain chunk statistics when using mremap(2).	2011-11-11 14:41:59 -08:00
Jason Evans	fa351d9fdc	Fix huge_ralloc() race when using mremap(2). Fix huge_ralloc() to remove the old memory region from tree of huge allocations before calling mremap(2), in order to make sure that no other thread acquires the old memory region via mmap() and encounters stale metadata in the tree. Reported by: Rich Prohaska	2011-11-09 11:55:19 -08:00
Jason Evans	30fbef8aea	Fix rallocm() test to support >4KiB pages.	2011-11-05 21:06:55 -07:00
Jason Evans	8e6f8b490d	Initialize arenas_tsd before setting it. Reported by: Ethan Burns, Rich Prohaska, Tudor Bosman	2011-11-03 18:40:03 -07:00
Jason Evans	a9076c9483	Fix a prof-related race condition. Fix prof_lookup() to artificially raise curobjs for all paths through the code that creates a new entry in the per thread bt2cnt hash table. This fixes a race condition that could corrupt memory if prof_accum were false, and a non-default lg_prof_tcmax were used and/or threads were destroyed.	2011-08-30 23:40:11 -07:00
Jason Evans	46405e670f	Fix a prof-related bug in realloc(). Fix realloc() such that it only records the object passed in as freed if no OOM error occurs.	2011-08-30 23:37:29 -07:00
Jason Evans	749c2a0ab6	Add missing prof_malloc() call in allocm(). Add a missing prof_malloc() call in allocm(). Before this fix, negative object/byte counts could be observed in heap profiles for applications that use allocm().	2011-08-12 18:37:54 -07:00
Jason Evans	a507004d29	Fix off-by-one backtracing issues. Rewrite prof_alloc_prep() as a cpp macro, PROF_ALLOC_PREP(), in order to remove any doubt as to whether an additional stack frame is created. Prior to this change, it was assumed that inlining would reduce the total number of frames in the backtrace, but in practice behavior wasn't completely predictable. Create imemalign() and call it from posix_memalign(), memalign(), and valloc(), so that all entry points require the same number of stack frames to be ignored during backtracing.	2011-08-12 13:48:27 -07:00
Jason Evans	b493ce22a4	Conditionalize an isalloc() call in rallocm(). Conditionalize an isalloc() call in rallocm() that be unnecessary.	2011-08-12 11:28:47 -07:00
Jason Evans	183ba50c19	Fix two prof-related bugs in rallocm(). Properly handle boundary conditions for sampled region promotion in rallocm(). Prior to this fix, some combinations of 'size' and 'extra' values could cause erroneous behavior. Additionally, size class recording for promoted regions was incorrect.	2011-08-11 23:00:25 -07:00
Jason Evans	0cdd42eb32	Clean up prof-related comments. Clean up some prof-related comments to more accurately reflect how the code works. Simplify OOM handling code in a couple of prof-related error paths.	2011-08-09 19:06:06 -07:00
Jason Evans	41b954ed36	Use prof_tdata_cleanup() argument. Use the argument to prof_tdata_cleanup(), rather than calling PROF_TCACHE_GET(). This fixes a bug in the NO_TLS case.	2011-08-08 17:10:07 -07:00
Jason Evans	f9a8edbb50	Fix assertions in arena_purge(). Fix assertions in arena_purge() to accurately reflect the constraints in arena_maybe_purge(). There were two bugs here, one of which merely weakened the assertion, and the other of which referred to an uninitialized variable (typo; used npurgatory instead of arena->npurgatory).	2011-06-12 17:13:39 -07:00
Jason Evans	f0b22cf932	Use LLU suffix for all 64-bit constants. Add the LLU suffix for all 0x... 64-bit constants. Reported by Jakob Blomer.	2011-05-22 10:49:44 -07:00
Jason Evans	7427525c28	Move repo contents in jemalloc/ to top level.	2011-03-31 20:36:17 -07:00

... 2 3 4 5 6

287 Commits