server-skynet-source-3rd-jemalloc

project-base/server-skynet-source-3rd-jemalloc

Author	SHA1	Message	Date
Jason Evans	dd0438ee6b	Specify 'inline' in addition to always_inline attribute. Specify both inline and __attribute__((always_inline)), in order to avoid warnings when using newer versions of gcc.	2013-01-22 20:43:04 -08:00
Jason Evans	ae03bf6a57	Update hash from MurmurHash2 to MurmurHash3. Update hash from MurmurHash2 to MurmurHash3, primarily because the latter generates 128 bits in a single call for no extra cost, which simplifies integration with cuckoo hashing.	2013-01-22 12:02:08 -08:00
Jason Evans	88393cb0eb	Add and use JEMALLOC_ALWAYS_INLINE. Add JEMALLOC_ALWAYS_INLINE and use it to guarantee that the entire fast paths of the primary allocation/deallocation functions are inlined.	2013-01-22 08:45:43 -08:00
Jason Evans	38067483c5	Tighten valgrind integration. Tighten valgrind integration such that immediately after memory is validated or zeroed, valgrind is told to forget the memory's 'defined' state. The only place newly allocated memory should be left marked as 'defined' is in the public functions (e.g. calloc() and realloc()).	2013-01-21 20:04:42 -08:00
Garrett Cooper	13e4e24c42	Fix build break on *BSD Linux uses alloca.h; many other operating systems define alloca(3) in stdlib.h. Signed-off-by: Garrett Cooper <yanegomi@gmail.com>	2012-12-24 10:32:16 -08:00
Jason Evans	a3b3386ddd	Avoid arena_prof_accum()-related locking when possible. Refactor arena_prof_accum() and its callers to avoid arena locking when prof_interval is 0 (as when profiling is disabled). Reported by Ben Maurer.	2012-11-13 13:47:53 -08:00
Jason Evans	e3d13060c8	Purge unused dirty pages in a fragmentation-reducing order. Purge unused dirty pages in an order that first performs clean/dirty run defragmentation, in order to mitigate available run fragmentation. Remove the limitation that prevented purging unless at least one chunk worth of dirty pages had accumulated in an arena. This limitation was intended to avoid excessive purging for small applications, but the threshold was arbitrary, and the effect of questionable utility. Relax opt_lg_dirty_mult from 5 to 3. This compensates for increased likelihood of allocating clean runs, given the same ratio of clean:dirty runs, and reduces the potential for repeated purging in pathological large malloc/free loops that push the active:dirty page ratio just over the purge threshold.	2012-11-06 00:59:53 -08:00
Jason Evans	609ae595f0	Add arena-specific and selective dss allocation. Add the "arenas.extend" mallctl, so that it is possible to create new arenas that are outside the set that jemalloc automatically multiplexes threads onto. Add the ALLOCM_ARENA() flag for {,r,d}allocm(), so that it is possible to explicitly allocate from a particular arena. Add the "opt.dss" mallctl, which controls the default precedence of dss allocation relative to mmap allocation. Add the "arena.<i>.dss" mallctl, which makes it possible to set the default dss precedence on a per arena or global basis. Add the "arena.<i>.purge" mallctl, which obsoletes "arenas.purge". Add the "stats.arenas.<i>.dss" mallctl.	2012-10-12 18:26:16 -07:00
Jason Evans	20f1fc95ad	Fix fork(2)-related deadlocks. Add a library constructor for jemalloc that initializes the allocator. This fixes a race that could occur if threads were created by the main thread prior to any memory allocation, followed by fork(2), and then memory allocation in the child process. Fix the prefork/postfork functions to acquire/release the ctl, prof, and rtree mutexes. This fixes various fork() child process deadlocks, but one possible deadlock remains (intentionally) unaddressed: prof backtracing can acquire runtime library mutexes, so deadlock is still possible if heap profiling is enabled during fork(). This deadlock is known to be a real issue in at least the case of libgcc-based backtracing. Reported by tfengjun.	2012-10-09 15:21:46 -07:00
Jason Evans	7de92767c2	Fix mlockall()/madvise() interaction. mlockall(2) can cause purging via madvise(2) to fail. Fix purging code to check whether madvise() succeeded, and base zeroed page metadata on the result. Reported by Olivier Lecomte.	2012-10-08 18:04:49 -07:00
Jason Evans	dd03a2e377	Define LG_QUANTUM for hppa. Submitted by Jory Pratt.	2012-10-08 15:41:06 -07:00
Jason Evans	781fe75e0a	Auto-detect whether running inside Valgrind. Auto-detect whether running inside Valgrind, thus removing the need to manually specify MALLOC_CONF=valgrind:true.	2012-05-15 14:48:14 -07:00
Jason Evans	3860eac170	Fix heap profiling crash for realloc(p, 0) case. Fix prof_realloc() to not call prof_ctx_set() if a sampled object is being freed via realloc(p, 0).	2012-05-15 13:56:28 -07:00
Jason Evans	d8ceef6c55	Fix large calloc() zeroing bugs. Refactor code such that arena_mapbits_{large,small}_set() always preserves the unzeroed flag, and manually manipulate the unzeroed flag in the one case where it actually gets reset (in arena_chunk_purge()). This fixes unzeroed preservation bugs in arena_run_split() and arena_ralloc_large_grow(). These bugs caused large calloc() to return non-zeroed memory under some circumstances.	2012-05-10 21:49:43 -07:00
Jason Evans	53bd42c1fe	Update a comment.	2012-05-10 00:18:46 -07:00
Jason Evans	2e671ffbad	Add the --enable-mremap option. Add the --enable-mremap option, and disable the use of mremap(2) by default, for the same reason that freeing chunks via munmap(2) is disabled by default on Linux: semi-permanent VM map fragmentation.	2012-05-09 16:12:00 -07:00
Jason Evans	80737c3323	Further optimize and harden arena_salloc(). Further optimize arena_salloc() to only look at the binind chunk map bits in the common case. Add more sanity checks to arena_salloc() that detect chunk map inconsistencies for large allocations (whether due to allocator bugs or application bugs).	2012-05-02 16:11:03 -07:00
Jason Evans	9a7944f8ab	Update private namespace mangling.	2012-05-02 02:16:51 -07:00
Jason Evans	889ec59bd3	Make malloc_write() non-inline. Make malloc_write() non-inline, in order to resolve its dependency on je_malloc_write().	2012-05-02 02:08:03 -07:00
Jason Evans	8d5865eb57	Make CACHELINE a raw constant. Make CACHELINE a raw constant in order to work around a __declspec(align()) limitation. Submitted by Mike Hommey.	2012-05-02 01:22:16 -07:00
Jason Evans	203484e2ea	Optimize malloc() and free() fast paths. Embed the bin index for small page runs into the chunk page map, in order to omit [...] in the following dependent load sequence: ptr-->mapelm-->[run-->bin-->]bin_info Move various non-critcal code out of the inlined function chain into helper functions (tcache_event_hard(), arena_dalloc_small(), and locking).	2012-05-02 00:30:36 -07:00
Mike Hommey	fd97b1dfc7	Add support for MSVC Tested with MSVC 8 32 and 64 bits.	2012-05-01 11:32:11 -07:00
Mike Hommey	da99e31105	Replace JEMALLOC_ATTR with various different macros when it makes sense Theses newly added macros will be used to implement the equivalent under MSVC. Also, move the definitions to headers, where they make more sense, and for some, are even more useful there (e.g. malloc).	2012-04-30 17:57:31 -07:00
Mike Hommey	7cdea3973c	Few configure.ac adjustments - Use the extensions autoconf finds for object and executable files. - Remove the sorev variable, and replace SOREV definition with sorev's. - Default to je_ prefix on win32.	2012-04-30 17:13:45 -07:00
Mike Hommey	a14bce85e8	Use Get/SetLastError on Win32 Using errno on win32 doesn't quite work, because the value set in a shared library can't be read from e.g. an executable calling the function setting errno. At the same time, since buferror always uses errno/GetLastError, don't pass it.	2012-04-30 16:50:55 -07:00
Mike Hommey	8b49971d0c	Avoid variable length arrays and remove declarations within code MSVC doesn't support C99, and building as C++ to be able to use them is dangerous, as C++ and C99 are incompatible. Introduce a VARIABLE_ARRAY macro that either uses VLA when supported, or alloca() otherwise. Note that using alloca() inside loops doesn't quite work like VLAs, thus the use of VARIABLE_ARRAY there is discouraged. It might be worth investigating ways to check whether VARIABLE_ARRAY is used in such context at runtime in debug builds and bail out if that happens.	2012-04-29 00:25:34 -07:00
Jason Evans	f278994029	Fix more prof_tdata resurrection corner cases.	2012-04-28 23:27:13 -07:00
Jason Evans	0050a0f7e6	Handle prof_tdata resurrection. Handle prof_tdata resurrection during thread shutdown, similarly to how tcache and quarantine handle resurrection.	2012-04-28 18:14:24 -07:00
Jason Evans	3fb50b0407	Fix a PROF_ALLOC_PREP() error path. Fix a PROF_ALLOC_PREP() error path to initialize the return value to NULL.	2012-04-25 13:13:44 -07:00
Jason Evans	65f343a632	Fix ctl regression. Fix ctl to correctly compute the number of children at each level of the ctl tree.	2012-04-23 19:31:45 -07:00
Mike Hommey	461ad5c87a	Avoid using a union for ctl_node_s MSVC doesn't support C99, and as such doesn't support designated initialization of structs and unions. As there is never a mix of indexed and named nodes, it is pretty straightforward to use a different type for each.	2012-04-23 11:43:44 -07:00
Jason Evans	52386b2dc6	Fix heap profiling bugs. Fix a potential deadlock that could occur during interval- and growth-triggered heap profile dumps. Fix an off-by-one heap profile statistics bug that could be observed in interval- and growth-triggered heap profiles. Fix heap profile dump filename sequence numbers (regression during conversion to malloc_snprintf()).	2012-04-22 16:00:11 -07:00
Mike Hommey	a5288ca934	Remove unused #includes	2012-04-21 21:32:09 -07:00
Mike Hommey	a19e87fbad	Add support for Mingw	2012-04-21 21:27:46 -07:00
Jason Evans	a8f8d7540d	Remove mmap_unaligned. Remove mmap_unaligned, which was used to heuristically decide whether to optimistically call mmap() in such a way that could reduce the total number of system calls. If I remember correctly, the intention of mmap_unaligned was to avoid always executing the slow path in the presence of ASLR. However, that reasoning seems to have been based on a flawed understanding of how ASLR actually works. Although ASLR apparently causes mmap() to ignore address requests, it does not cause total placement randomness, so there is a reasonable expectation that iterative mmap() calls will start returning chunk-aligned mappings once the first chunk has been properly aligned.	2012-04-21 19:17:21 -07:00
Jason Evans	7ad54c1c30	Fix chunk allocation/deallocation bugs. Fix chunk_alloc_dss() to zero memory when requested. Fix chunk_dealloc() to avoid chunk_dealloc_mmap() for dss-allocated memory. Fix huge_palloc() to always junk fill when requested. Improve chunk_recycle() to report that memory is zeroed as a side effect of pages_purge().	2012-04-21 16:04:51 -07:00
Jason Evans	8f0e0eb1c0	Fix a memory corruption bug in chunk_alloc_dss(). Fix a memory corruption bug in chunk_alloc_dss() that was due to claiming newly allocated memory is zeroed. Reverse order of preference between mmap() and sbrk() to prefer mmap(). Clean up management of 'zero' parameter in chunk_alloc*().	2012-04-21 13:33:48 -07:00
Jason Evans	bedceea2a8	Fix isthreaded-related build breakage.	2012-04-20 14:12:30 -07:00
Jason Evans	918d6e20b7	Add missing private namespace mangling.	2012-04-20 13:42:21 -07:00
Jason Evans	7d20fbc44a	Don't mangle pthread_create(). Don't mangle pthread_create(); it's an exported symbol when defined.	2012-04-20 13:06:39 -07:00
Jason Evans	f7088e6c99	Make arena_salloc() an inline function.	2012-04-19 18:28:03 -07:00
Mike Hommey	13067ec835	Remove extra argument for malloc_tsd_cleanup_register Bookkeeping an extra argument that actually only stores a function pointer for a function we already have is not very useful.	2012-04-18 19:25:01 -07:00
Mike Hommey	8ad483fe60	Remove initialization of the non-TLS tsd wrapper from static memory Using static memory when malloc_tsd_malloc fails means all threads share the same wrapper and thus the same wrapped value. This defeats the purpose of TSD.	2012-04-18 19:23:53 -07:00
Mike Hommey	7ff1ce4131	Initialize all members of non-TLS tsd wrapper when creating it Not setting the initialized member leads to randomly calling the cleanup function in cases it shouldn't be called (and isn't called in other implementations).	2012-04-18 19:23:32 -07:00
Jason Evans	86e58583bb	Make special FreeBSD function overrides visible. Make special FreeBSD libc/libthr function overrides for _malloc_prefork(), _malloc_postfork(), and _malloc_thread_cleanup() visible.	2012-04-18 19:01:00 -07:00
Mike Hommey	666c5bf7a8	Add a pages_purge function to wrap madvise(JEMALLOC_MADV_PURGE) calls This will be used to implement the feature on mingw, which doesn't have madvise.	2012-04-18 18:57:48 -07:00
Jason Evans	0b25fe79aa	Update prof defaults to match common usage. Change the "opt.lg_prof_sample" default from 0 to 19 (1 B to 512 KiB). Change the "opt.prof_accum" default from true to false. Add the "opt.prof_final" mallctl, so that "opt.prof_prefix" need not be abused to disable final profile dumping.	2012-04-17 16:39:33 -07:00
Jason Evans	b57d3ec571	Add atomic(9) implementations of atomic operations. Add atomic(9) implementations of atomic operations. These are used on FreeBSD for non-x86 architectures.	2012-04-17 13:27:39 -07:00
Mike Hommey	45f208e112	Replace fprintf with malloc_printf in tests.	2012-04-16 23:05:39 -07:00
Mike Hommey	72ca7220f2	Use echo instead of cat in loops in size_classes.sh This avoids fork/exec()ing in loops, as echo is a builtin, and makes size_classes.sh much faster (from > 10s to < 0.2s on mingw on my machine).	2012-04-16 22:45:09 -07:00

1 2 3 4

161 Commits