server-skynet-source-3rd-jemalloc

project-base/server-skynet-source-3rd-jemalloc

Author	SHA1	Message	Date
Jason Evans	d6abcbb14b	Always disable redzone by default. Always disable redzone by default, even when --enable-debug is specified. The memory overhead for redzones can be substantial, which makes this feature something that should only be opted into.	2012-04-12 17:09:54 -07:00
Mike Hommey	b8325f9cb0	Call base_boot before chunk_boot0 Chunk_boot0 calls rtree_new, which calls base_alloc, which locks the base_mtx mutex. That mutex is initialized in base_boot.	2012-04-12 11:42:20 -07:00
Jason Evans	5ff709c264	Normalize aligned allocation algorithms. Normalize arena_palloc(), chunk_alloc_mmap_slow(), and chunk_recycle_dss() to use the same algorithm for trimming over-allocation. Add the ALIGNMENT_ADDR2BASE(), ALIGNMENT_ADDR2OFFSET(), and ALIGNMENT_CEILING() macros, and use them where appropriate. Remove the run_size_p parameter from sa2u(). Fix a potential deadlock in chunk_recycle_dss() that was introduced by `eae269036c` (Add alignment support to chunk_alloc()).	2012-04-11 18:13:45 -07:00
Jason Evans	122449b073	Implement Valgrind support, redzones, and quarantine. Implement Valgrind support, as well as the redzone and quarantine features, which help Valgrind detect memory errors. Redzones are only implemented for small objects because the changes necessary to support redzones around large and huge objects are complicated by in-place reallocation, to the point that it isn't clear that the maintenance burden is worth the incremental improvement to Valgrind support. Merge arena_salloc() and arena_salloc_demote(). Refactor i[v]salloc() to expose the 'demote' option.	2012-04-11 11:46:18 -07:00
Jason Evans	a1ee7838e1	Rename labels. Rename labels from FOO to label_foo in order to avoid system macro definitions, in particular OUT and ERROR on mingw. Reported by Mike Hommey.	2012-04-10 15:07:44 -07:00
Jason Evans	b147611b52	Add utrace(2)-based tracing (--enable-utrace).	2012-04-05 13:36:17 -07:00
Jason Evans	02b231205e	Fix threaded initialization and enable it on Linux. Reported by Mike Hommey.	2012-04-05 11:06:23 -07:00
Jason Evans	01b3fe55ff	Add a0malloc(), a0calloc(), and a0free(). Add a0malloc(), a0calloc(), and a0free(), which are used by FreeBSD's libc to allocate/deallocate TLS in static binaries.	2012-04-03 19:25:48 -07:00
Jason Evans	633aaff967	Postpone mutex initialization on FreeBSD. Postpone mutex initialization on FreeBSD until after base allocation is safe.	2012-04-03 19:25:30 -07:00
Jason Evans	ae4c7b4b40	Clean up PAGE macros. s/PAGE_SHIFT/LG_PAGE/g and s/PAGE_SIZE/PAGE/g. Remove remnants of the dynamic-page-shift code. Rename the "arenas.pagesize" mallctl to "arenas.page". Remove the "arenas.chunksize" mallctl, which is redundant with "opt.lg_chunk".	2012-04-02 07:04:34 -07:00
Jason Evans	f004737267	Revert "Avoid NULL check in free() and malloc_usable_size()." This reverts commit `96d4120ac0`. ivsalloc() depends on chunks_rtree being initialized. This can be worked around via a NULL pointer check. However, thread_allocated_tsd_get() also depends on initialization having occurred, and there is no way to guard its call in free() that is cheaper than checking whether ptr is NULL.	2012-04-02 15:18:24 -07:00
Jason Evans	96d4120ac0	Avoid NULL check in free() and malloc_usable_size(). Generalize isalloc() to handle NULL pointers in such a way that the NULL checking overhead is only paid when introspecting huge allocations (or NULL). This allows free() and malloc_usable_size() to no longer check for NULL. Submitted by Igor Bukanov and Mike Hommey.	2012-04-02 14:50:03 -07:00
Mike Hommey	80b25932ca	Move last bit of zone initialization in zone.c, and lazy-initialize	2012-04-02 14:15:20 -07:00
Jason Evans	4eeb52f080	Remove vsnprintf() and strtoumax() validation. Remove code that validates malloc_vsnprintf() and malloc_strtoumax() against their namesakes. The validation code has adequately served its usefulness at this point, and it isn't worth dealing with the different formatting for %p with glibc versus other implementations for NULL pointers ("(nil)" vs. "0x0"). Reported by Mike Hommey.	2012-04-02 02:30:24 -07:00
Mike Hommey	3c2ba0dcbc	Avoid crashes when system libraries use the purgeable zone allocator	2012-03-30 11:03:20 -07:00
Mike Hommey	71a93b8725	Move zone registration to zone.c	2012-03-30 10:53:00 -07:00
Mike Hommey	e77fa59ece	Don't use pthread_atfork to register prefork/postfork handlers on OSX OSX libc calls zone allocators' force_lock/force_unlock already.	2012-03-28 16:17:21 -07:00
Jason Evans	2465bdf493	Check for NULL ptr in malloc_usable_size(). Check for NULL ptr in malloc_usable_size(), rather than just asserting that ptr is non-NULL. This matches behavior of other implementations (e.g., glibc and tcmalloc).	2012-03-26 13:13:55 -07:00
Mike Hommey	5c89c50d18	Fix glibc hooks when using both --with-jemalloc-prefix and --with-mangling	2012-03-26 12:43:05 -07:00
Jason Evans	41b6afb834	Port to FreeBSD. Use FreeBSD-specific functions (_pthread_mutex_init_calloc_cb(), _malloc_{pre,post}fork()) to avoid bootstrapping issues due to allocation in libc and libthr. Add malloc_strtoumax() and use it instead of strtoul(). Disable validation code in malloc_vsnprintf() and malloc_strtoumax() until jemalloc is initialized. This is necessary because locale initialization causes allocation for both vsnprintf() and strtoumax(). Force the lazy-lock feature on in order to avoid pthread_self(), because it causes allocation. Use syscall(SYS_write, ...) rather than write(...), because libthr wraps write() and causes allocation. Without this workaround, it would not be possible to print error messages in malloc_conf_init() without substantially reworking bootstrapping. Fix choose_arena_hard() to look at how many threads are assigned to the candidate choice, rather than checking whether the arena is uninitialized. This bug potentially caused more arenas to be initialized than necessary.	2012-02-02 23:09:53 -08:00
Jason Evans	6da5418ded	Remove ephemeral mutexes. Remove ephemeral mutexes from the prof machinery, and remove malloc_mutex_destroy(). This simplifies mutex management on systems that call malloc()/free() inside pthread_mutex_{create,destroy}(). Add atomic_*_u() for operation on unsigned values. Fix prof_printf() to call malloc_vsnprintf() rather than malloc_snprintf().	2012-03-23 18:05:51 -07:00
Jason Evans	9225a1991a	Add JEMALLOC_CC_SILENCE_INIT(). Add JEMALLOC_CC_SILENCE_INIT(), which provides succinct syntax for initializing a variable to avoid a spurious compiler warning.	2012-03-23 15:39:07 -07:00
Jason Evans	cd9a1346e9	Implement tsd. Implement tsd, which is a TLS/TSD abstraction that uses one or both internally. Modify bootstrapping such that no tsd's are utilized until allocation is safe. Remove malloc_[v]tprintf(), and use malloc_snprintf() instead. Fix %p argument size handling in malloc_vsnprintf(). Fix a long-standing statistics-related bug in the "thread.arena" mallctl that could cause crashes due to linked list corruption.	2012-03-23 15:14:55 -07:00
Mike Hommey	154829d256	Improve zone support for OSX I tested a build from 10.7 run on 10.7 and 10.6, and a build from 10.6 run on 10.6. The AC_COMPILE_IFELSE limbo is to avoid running a program during configure, which presumably makes it work when cross compiling for iOS.	2012-03-20 11:52:50 -07:00
Jason Evans	e24c7af35d	Invert NO_TLS to JEMALLOC_TLS.	2012-03-19 10:21:17 -07:00
Jason Evans	4e2e3dd9cf	Fix fork-related bugs. Acquire/release arena bin locks as part of the prefork/postfork. This bug made deadlock in the child between fork and exec a possibility. Split jemalloc_postfork() into jemalloc_postfork_{parent,child}() so that the child can reinitialize mutexes rather than unlocking them. In practice, this bug tended not to cause problems.	2012-03-13 16:31:41 -07:00
Jason Evans	0a0bbf63e5	Implement aligned_alloc(). Implement aligned_alloc(), which was added in the C11 standard. The function is weakly specified to the point that a minimally compliant implementation would be painful to use (size must be an integral multiple of alignment!), which in practice makes posix_memalign() a safer choice.	2012-03-13 12:55:21 -07:00
Jason Evans	4c2faa8a7c	Fix a regression in JE_COMPILABLE(). Revert JE_COMPILABLE() so that it detects link errors. Cross-compiling should still work as long as a valid configure cache is provided. Clean up some comments/whitespace.	2012-03-13 11:09:23 -07:00
Jason Evans	d81e4bdd5c	Implement malloc_vsnprintf(). Implement malloc_vsnprintf() (a subset of vsnprintf(3)) as well as several other printing functions based on it, so that formatted printing can be relied upon without concern for inducing a dependency on floating point runtime support. Replace malloc_write() calls with malloc_printf() where doing so simplifies the code. Add name mangling for library-private symbols in the data and BSS sections. Adjust CONF_HANDLE_() macros in malloc_conf_init() to expose all opt_* variable use to cpp so that proper mangling occurs.	2012-03-07 16:19:19 -08:00
Jason Evans	4507f34628	Remove the lg_tcache_gc_sweep option. Remove the lg_tcache_gc_sweep option, because it is no longer very useful. Prior to the addition of dynamic adjustment of tcache fill count, it was possible for fill/flush overhead to be a problem, but this problem no longer occurs.	2012-03-05 14:34:37 -08:00
Jason Evans	7e77eaffff	Add the --disable-experimental option.	2012-03-02 17:47:37 -08:00
Jason Evans	0a5489e37d	Add --with-mangling. Add the --with-mangling configure option, which can be used to specify name mangling on a per public symbol basis that takes precedence over --with-jemalloc-prefix. Expose the memalign() and valloc() overrides even if --with-jemalloc-prefix is specified. This change does no real harm, and simplifies the code.	2012-03-01 17:19:20 -08:00
Jason Evans	7e15dab94d	Add nallocm(). Add nallocm(), which computes the real allocation size that would result from the corresponding allocm() call. nallocm() is a functional superset of OS X's malloc_good_size(), in that it takes alignment constraints into account.	2012-02-29 12:56:37 -08:00
Jason Evans	4bb0983013	Use glibc allocator hooks. When jemalloc is used as a libc malloc replacement (i.e. not prefixed), some particular setups may end up inconsistently calling malloc from libc and free from jemalloc, or the other way around. glibc provides hooks to make its functions use alternative implementations. Use them. Submitted by Karl Tomlinson and Mike Hommey.	2012-02-29 10:37:27 -08:00
Jason Evans	5965631636	Do not enforce minimum alignment in memalign(). Do not enforce minimum alignment in memalign(). This is a non-standard function, and there is disagreement over whether to enforce minimum alignment. Solaris documentation (whence memalign() originated) says that minimum alignment is required: The value of alignment must be a power of two and must be greater than or equal to the size of a word. However, Linux's manual page says in its NOTES section: memalign() may not check that the boundary parameter is correct. This is descriptive rather than prescriptive, but applications with bad assumptions about memalign() exist, so be as forgiving as possible. Reported by Mike Hommey.	2012-02-28 21:37:38 -08:00
Jason Evans	d073a32109	Enable the stats configuration option by default.	2012-02-28 20:41:16 -08:00
Jason Evans	c90ad71237	Remove the sysv option.	2012-02-28 20:31:37 -08:00
Jason Evans	f081b88dfb	Fix realloc(p, 0) to act like free(p). Reported by Yoni Londer.	2012-02-28 20:24:05 -08:00
Jason Evans	b172610317	Simplify small size class infrastructure. Program-generate small size class tables for all valid combinations of LG_TINY_MIN, LG_QUANTUM, and PAGE_SHIFT. Use the appropriate table to generate all relevant data structures, and remove the distinction between tiny/quantum/cacheline/subpage bins. Remove --enable-dynamic-page-shift. This option didn't prove useful in practice, and it prevented optimizations. Add Tilera architecture support.	2012-02-28 16:50:47 -08:00
Jason Evans	5389146191	Remove the opt.lg_prof_bt_max option. Remove opt.lg_prof_bt_max, and hard code it to 7. The original intention of this option was to enable faster backtracing by limiting backtrace depth. However, this makes graphical pprof output very difficult to interpret. In practice, decreasing sampling frequency is a better mechanism for limiting profiling overhead.	2012-02-13 18:41:36 -08:00
Jason Evans	0b526ff94d	Remove the opt.lg_prof_tcmax option. Remove the opt.lg_prof_tcmax option and hard-code a cache size of 1024. This setting is something that users just shouldn't have to worry about. If lock contention actually ends up being a problem, the simple solution available to the user is to reduce sampling frequency.	2012-02-13 18:04:26 -08:00
Jason Evans	6ffbbeb5d6	Silence compiler warnings.	2012-02-13 12:31:30 -08:00
Jason Evans	4162627757	Remove the swap feature. Remove the swap feature, which enabled per application swap files. In practice this feature has not proven itself useful to users.	2012-02-13 10:56:17 -08:00
Jason Evans	7372b15a31	Reduce cpp conditional logic complexity. Convert configuration-related cpp conditional logic to use static constant variables, e.g.: #ifdef JEMALLOC_DEBUG [...] #endif becomes: if (config_debug) { [...] } The advantage is clearer, more concise code. The main disadvantage is that data structures no longer have conditionally defined fields, so they pay the cost of all fields regardless of whether they are used. In practice, this is only a minor concern; config_stats will go away in an upcoming change, and config_prof is the only other major feature that depends on more than a few special-purpose fields.	2012-02-10 20:22:09 -08:00
Jason Evans	30fbef8aea	Fix rallocm() test to support >4KiB pages.	2011-11-05 21:06:55 -07:00
Jason Evans	8e6f8b490d	Initialize arenas_tsd before setting it. Reported by: Ethan Burns, Rich Prohaska, Tudor Bosman	2011-11-03 18:40:03 -07:00
Jason Evans	46405e670f	Fix a prof-related bug in realloc(). Fix realloc() such that it only records the object passed in as freed if no OOM error occurs.	2011-08-30 23:37:29 -07:00
Jason Evans	749c2a0ab6	Add missing prof_malloc() call in allocm(). Add a missing prof_malloc() call in allocm(). Before this fix, negative object/byte counts could be observed in heap profiles for applications that use allocm().	2011-08-12 18:37:54 -07:00
Jason Evans	a507004d29	Fix off-by-one backtracing issues. Rewrite prof_alloc_prep() as a cpp macro, PROF_ALLOC_PREP(), in order to remove any doubt as to whether an additional stack frame is created. Prior to this change, it was assumed that inlining would reduce the total number of frames in the backtrace, but in practice behavior wasn't completely predictable. Create imemalign() and call it from posix_memalign(), memalign(), and valloc(), so that all entry points require the same number of stack frames to be ignored during backtracing.	2011-08-12 13:48:27 -07:00
Jason Evans	b493ce22a4	Conditionalize an isalloc() call in rallocm(). Conditionalize an isalloc() call in rallocm() that be unnecessary.	2011-08-12 11:28:47 -07:00
Jason Evans	183ba50c19	Fix two prof-related bugs in rallocm(). Properly handle boundary conditions for sampled region promotion in rallocm(). Prior to this fix, some combinations of 'size' and 'extra' values could cause erroneous behavior. Additionally, size class recording for promoted regions was incorrect.	2011-08-11 23:00:25 -07:00
Jason Evans	7427525c28	Move repo contents in jemalloc/ to top level.	2011-03-31 20:36:17 -07:00

... 3 4 5 6 7

302 Commits