freebsd-dev

Author	SHA1	Message	Date
Konstantin Belousov	9be6046a47	Some third-party malloc(3) implementations use pthread_setspecific(3) to handle per-thread information. Since our pthread_setspecific() implementation calls calloc(3) to allocate per-thread specific data storage, things get complicated. Switch the allocator to use bare mmap(2). There is some loss of the allocated page, since e.g. on amd64, PTHREAD_KEYS_MAX * sizeof(struct pthread_specific_elem) is 3K (it actually spans whole page due to padding), but I believe it is more acceptable than additional code for specialized allocator(). The alternatives would be either to make the specific data array be part of the struct thread, or use internal bindings to call the libc malloc, avoiding interposing. Also do the style pass over the thr_spec.c, esp. simplify the conditionals nesting by returning early when an error is detected. Remove trivial comments. Found by: yuri@rawbw.com PR: 200138 Sponsored by: The FreeBSD Foundation MFC after: 2 weeks	2015-05-15 08:40:17 +00:00
Andrew Turner	ae99516732	Disable the tests that use makecontext on arm64, it still needs to be written.	2015-04-27 13:56:20 +00:00
Enji Cooper	1119ece4d3	Build/install libc, librt, libthr, and msun NetBSD test suites on all architectures MFC after: 1 week	2015-04-27 06:49:27 +00:00
Pedro F. Giffuni	9acf5917d3	_pthread_cleanup_push: fix allocator sizeof operand mismatch Same fix appears to be in DragonFly's libthread_xu. Found by: Clang Static Analyzer MFC after: 1 week	2015-04-22 16:51:21 +00:00
Konstantin Belousov	0538aafc41	The lseek(2), mmap(2), truncate(2), ftruncate(2), pread(2), and pwrite(2) syscalls are wrapped to provide compatibility with pre-7.x kernels which required padding before the off_t parameter. The fcntl(2) contains compatibility code to handle kernels before the struct flock was changed during the 8.x CURRENT development. The shims were reasonable to allow easier revert to the older kernel at that time. Now, two or three major releases later, shims do not serve any purpose. Such old kernels cannot handle current libc, so revert the compatibility code. Make padded syscalls support conditional under the COMPAT6 config option. For COMPAT32, the syscalls were under COMPAT6 already. Remove WITHOUT_SYSCALL_COMPAT build option, whose only purpose was to (partially) disable the removed shims. Reviewed by: jhb, imp (previous versions) Discussed with: peter Sponsored by: The FreeBSD Foundation MFC after: 1 week	2015-04-18 21:50:13 +00:00
Konstantin Belousov	3d0045bb2b	Make wait6(2), waitid(3) and ppoll(2) cancellation points. The waitid() function is required to be cancellable by the standard. The wait6() and ppoll() follow the other syscalls in their groups. Reviewed by: jhb, jilles (previous versions) Sponsored by: The FreeBSD Foundation MFC after: 1 week	2015-04-18 21:35:41 +00:00
Andrew Turner	a7dfee7ab1	Add pthread_md.h for arm64. Differential Revision: https://reviews.freebsd.org/D2137 Reviewed by: kib Sponsored by: The FreeBSD Foundation	2015-03-30 19:10:09 +00:00
Konstantin Belousov	b072e86d09	Make kevent(2) a cancellation point. Note that to cancel blocked kevent(2) call, changelist must be empty, since we cannot cancel a call which already made changes to the process state. And in reverse, call which only makes changes to the kqueue state, without waiting for an event, is not cancellable. This makes a natural usage model to migrate kqueue loop to support cancellation, where existing single kevent(2) call must be split into two: first uncancellable update of kqueue, then cancellable wait for events. Note that this is ABI-incompatible change, but it is believed that there is no cancel-safe code that relies on kevent(2) not being a cancellation point. Option to preserve the ABI would be to keep kevent(2) as is, but add new call with flags to specify cancellation behaviour, which only value seems to add complications. Suggested and reviewed by: jilles Sponsored by: The FreeBSD Foundation MFC after: 2 weeks	2015-03-29 19:14:41 +00:00
Andrew Turner	8daa81674e	Start to import support for the AArch64 architecture from ARM. This change only adds support for kernel-toolchain, however it is expected further changes to add kernel and userland support will be committed as they are reviewed. As our copy of binutils is too old the devel/aarch64-binutils port needs to be installed to pull in a linker. To build either TARGET needs to be set to arm64, or TARGET_ARCH set to aarch64. The latter is set so uname -p will return aarch64 as existing third party software expects this. Differential Revision: https://reviews.freebsd.org/D2005 Relnotes: Yes Sponsored by: The FreeBSD Foundation	2015-03-19 13:53:47 +00:00
Jung-uk Kim	be070eb896	Fix a typo in comment and explain the reason.	2015-03-09 20:26:42 +00:00
Konstantin Belousov	3e6d2e9b4e	Propagate errors from _thr_umutex_unlock2 through mutex_unlock_common. Errors from _thr_umutex_unlock2 should "never happen" in normal circumstances. If they do, however, return them to the application so it can fail early and loudly. Hiding the errors will only delay the inevitable failure, making it harder to find and diagnose. Submitted by: Eric van Gyzen <eric_van_gyzen@dell.com> Obtained from: Dell Inc. PR: 198914 MFC after: 1 week	2015-02-25 16:18:26 +00:00
Konstantin Belousov	45468c5356	Properly interpose libc spinlocks, was missed in r276630. In particular, stdio locking was affected. Reported and tested by: "Matthew D. Fuller" <fullermd@over-yonder.net> Sponsored by: The FreeBSD Foundation MFC after: 3 days	2015-02-14 11:47:40 +00:00
Konstantin Belousov	e50def75bd	Update libthr(3) man page to reflect the work done to support dlopen. Noted and reviewed by: bdrewery Sponsored by: The FreeBSD Foundation MFC after: 1 week	2015-02-12 17:16:54 +00:00
Konstantin Belousov	83d74204c8	Fully initialize allocated memory for the new barrier. The b_destroying member was left uninitialized, which caused spurious EBUSY. PR: 197365 Noted by: Florent Guiliani <fguiliani@verisign.com> Sponsored by: The FreeBSD Foundation MFC after: 1 week	2015-02-06 12:18:38 +00:00
Andrew Turner	20fe2c9465	Merge all the copies of _tcb_ctor and _tcb_dtor. The amd64, i386, and sparc64 versions were identical, with the one difference where the former two used inline asm instead of _tcb_get. I have compared the function before and after replacing the asm with _tcb_get and found the object files to be identical. The arm, mips, and powerpc versions were almost identical. The only difference was the powerpc version used an alignment of 1 where arm and mips used 16. As this is an increase in alignment it will be safe. Along with this arm, mips, and powerpc all passed, when initial was true, the value returned from _tcb_get as the first argument to _rtld_allocate_tls. This would then return this pointer back to the caller. We can remove these extra calls by checking if initial is set and setting the thread control block directly. As this is what the sparc64 code does we can use it directly. As after these observations all the architectures can now have identical code we can merge them into a common file. Differential Revision: https://reviews.freebsd.org/D1556 Reviewed by: kib Sponsored by: The FreeBSD Foundation	2015-01-21 16:41:05 +00:00
Konstantin Belousov	9e8bff64cb	Fix bug in r276630. Do not allow pthread_sigmask() to block SIGCANCEL. Reported and tested by: royger Sponsored by: The FreeBSD Foundation MFC after: 3 days	2015-01-21 16:13:37 +00:00
Konstantin Belousov	397d851d66	Reduce the size of the interposing table and amount of cancellation-handling code in the libthr. Translate some syscalls into their more generic counterpart, and remove translated syscalls from the table. List of the affected syscalls: creat, open -> openat raise -> thr_kill sleep, usleep -> nanosleep pause -> sigsuspend wait, wait3, waitpid -> wait4 Suggested and reviewed by: jilles (previous version) Sponsored by: The FreeBSD Foundation MFC after: 1 week	2015-01-11 22:16:31 +00:00
Justin Hibbits	85eda151ff	Avoid use of register variables. Clang 3.5 treats this as undefined behavior, and bad things happen. MFC after: 1 week	2015-01-06 03:50:43 +00:00
Konstantin Belousov	1a744fefc2	Avoid calling internal libc function through PLT or accessing data through GOT, by staticizing and hiding. Add setter for __error_selector to hide it as well. Suggested and reviewed by: jilles Sponsored by: The FreeBSD Foundation MFC after: 1 week	2015-01-05 01:06:54 +00:00
Konstantin Belousov	8495e8b1e9	Fix known issues which blow up the process after dlopen("libthr.so") (or loading a dso linked to libthr.so into process which was not linked against threading library). - Remove libthr interposers of the libc functions, including __error(). Instead, functions calls are indirected through the interposing table, similar to how pthread stubs in libc are already done. Libc by default points either to syscall trampolines or to existing libc implementations. On libthr load, libthr rewrites the pointers to the cancellable implementations already in libthr. The interposition table is separate from pthreads stubs indirection table to not pull pthreads stubs into static binaries. - Postpone the malloc(3) internal mutexes initialization until libthr is loaded. This avoids recursion between calloc(3) and static pthread_mutex_t initialization. - Reinstall signal handlers with wrapper on libthr load. The _rtld_is_dlopened(3) is used to avoid useless calls to sigaction(2) when libthr is statically referenced from the main binary. In the process, fix openat(2), swapcontext(2) and setcontext(2) interposing. The libc symbols were exported at different versions than libthr interposers. Export both libc and libthr versions from libc now, with default set to the higher version from libthr. Remove unused and disconnected swapcontext(3) userspace implementation from libc/gen. No objections from: deischen Tested by: pho, antoine (exp-run) (previous versions) Sponsored by: The FreeBSD Foundation MFC after: 1 week	2015-01-03 18:38:46 +00:00
Ed Maste	294246bb7d	Revert r274772: it is not valid on MIPS Reported by: sbruno	2014-11-25 03:50:31 +00:00
Ed Maste	688fd61ae8	Use canonical __PIC__ flag It is automatically set when -fPIC is passed to the compiler. Reviewed by: dim, kib Sponsored by: The FreeBSD Foundation Differential Revision: https://reviews.freebsd.org/D1179	2014-11-21 02:05:48 +00:00
Simon J. Gerraty	9268022b74	Merge from head@274682	2014-11-19 01:07:58 +00:00
Enji Cooper	3eee258dfb	Add reachover Makefiles for contrib/netbsd-tests/lib/libpthread as lib/libthr/tests A variant of this code has been tested on amd64/i386 for some time by EMC/Isilon on 10-STABLE/11-CURRENT. It builds on other architectures, but the code will remain off until it's proven it works on virtual hardware or real hardware on other architectures Original work by: pho Sponsored by: EMC / Isilon Storage Division	2014-11-16 06:35:20 +00:00
Sergey Kandaurov	663222b9f6	Fix description of mutex acquisition. Reviewed by: kib X-MFC with: r272070 Sponsored by: Nginx, Inc.	2014-09-26 04:33:27 +00:00
Konstantin Belousov	2f02abc196	Expand the libthr(3) manpage to document knobs accepted by libthr.so and explain some internal working of the library, necessary to understand the knobs' effects. Reviewed by: bjk, pluknet Sponsored by: The FreeBSD Foundation MFC after: 3 weeks	2014-09-24 12:41:39 +00:00
Konstantin Belousov	36bcb07ab5	Switch the defaults to not split the RLIMIT_STACK-sized initial thread stack into the stacks of the created threads. Add knob LIBPTHREAD_SPLITSTACK_MAIN to restore the older behaviour. Sponsored by: The FreeBSD Foundation MFC after: 3 weeks	2014-09-24 12:39:12 +00:00
Rui Paulo	585bf8ae67	Fix typo in a comment.	2014-09-02 18:21:19 +00:00
Simon J. Gerraty	ee7b0571c2	Merge head from 7/28	2014-08-19 06:50:54 +00:00
Konstantin Belousov	6c8ce3bfce	Add a knob LIBPTHREAD_BIGSTACK_MAIN, which instructs libthr to leave the whole RLIMIT_STACK-sized region of the kernel-allocated stack as the stack of main thread. By default, the main thread stack is clamped at 2MB (4MB on 64bit ABIs) and the rest is used for other threads stack allocation. Since there is no programmatic way to adjust the size of the main thread stack, pthread_attr_setstacksize() is too late, the knob allows user to manage the main stack size both for single-threaded and multi-threaded processes with the rlimit. Reported by: "Ivan A. Kosarev" <ivan@ivan-labs.com> Tested by: dim Sponsored by: The FreeBSD Foundation MFC after: 3 days	2014-08-13 05:53:41 +00:00
Konstantin Belousov	f6abec6c64	Style. Sponsored by: The FreeBSD Foundation MFC after: 3 days	2014-08-13 05:47:49 +00:00
Marcel Moolenaar	e7d939bda2	Remove ia64. This includes: o All directories named ia64 o All files named ia64 o All ia64-specific code guarded by __ia64__ o All ia64-specific makefile logic o Mention of ia64 in comments and documentation This excludes: o Everything under contrib/ o Everything under crypto/ o sys/xen/interface o sys/sys/elf_common.h Discussed at: BSDcan	2014-07-07 00:27:09 +00:00
Rui Paulo	45d79bdba1	Add the DTrace probe definitions for plockstat support. This will be connected to the system later. Sponsored by: The FreeBSD Foundation	2014-07-05 19:49:31 +00:00
Baptiste Daroussin	2b7af31cf5	use .Mt to mark up email addresses consistently (part3) PR: 191174 Submitted by: Franco Fichtner <franco at lastsummer.de>	2014-06-23 08:23:05 +00:00
Konstantin Belousov	1c70d00733	Right now, the rtld prefork hook locks the rtld bind lock in the read mode. This allows the binder to be functional in the child after the fork (assuming no lazy loading of a filter is needed), but other rtld services which require write lock on rtld_bind_lock cause deadlock, if called by child. Change the _rtld_atfork() to lock the bind lock in write mode, making the rtld fully functional after the fork. Pre-resolve the symbols which are called by the libthr's fork() interposer, since dynamic resolution causes deadlock due to the rtld_bind_lock already owned in the write mode. Reported and tested by: pho Sponsored by: The FreeBSD Foundation MFC after: 2 weeks	2014-05-24 10:23:06 +00:00
Simon J. Gerraty	fae50821ae	Updated dependencies	2014-05-16 14:09:51 +00:00
Simon J. Gerraty	76b28ad6ab	Updated dependencies	2014-05-10 05:16:28 +00:00
Simon J. Gerraty	cc3f4b9965	Merge from head	2014-05-08 23:54:15 +00:00
Warner Losh	c6063d0da8	Use src.opts.mk in preference to bsd.own.mk except where we need stuff from the latter.	2014-05-06 04:22:01 +00:00
Simon J. Gerraty	9d2ab4a62d	Merge head	2014-04-27 08:13:43 +00:00
Warner Losh	a5fc5b6223	Convert from WITHOUT_SYSCALL_COMPAT to MK_SYSCALL_COMPAT.	2014-04-05 17:54:43 +00:00
Konstantin Belousov	082aa03e4b	In _pthread_kill(), if passed pthread is current thread, do not send the signal second time, by adding the missed else before if statement. While there, postpone initializing local curthread variable until passed signal number is checked for validity. Submitted by: John Wolfe <jlw@xinuos.com> PR: threads/186309 MFC after: 1 week	2014-02-01 18:13:18 +00:00
Konstantin Belousov	0a9655a082	If check_deferred_signal() execution needs binding of PLT symbol, unlocking the rtld bind lock results in the processing of ast and recursing into the check_deferred_signal(). Nested execution of check_deferred_signal() delivers the signal to user code and clears si_signo. On return, top-level check_deferred_signal() frame continues delivering the same signal one more time, but now with zero si_signo. Fix this by adding a flag to indicate that deferred delivery is running, so check_deferred_signal() should avoid doing anything. Since user signal handler is allowed to modify the passed machine context to make return from the signal handler to cause arbitrary jump, or do longjmp(). For this case, also clear the flag in thr_sighandler(), since kernel signal delivery means that nested delivery code should not run right now. Reported by: Vitaly Magerya <vmagerya@gmail.com> Reviewed by: davidxu, jilles Tested by: pho Sponsored by: The FreeBSD Foundation MFC after: 1 week	2013-11-23 15:48:17 +00:00
Simon J. Gerraty	d1d0158641	Merge from head	2013-09-05 20:18:59 +00:00
Konstantin Belousov	a0b9cbc8a2	The SUSv4tc1 requires that pthread_setcancelstate() shall be not a cancellation point. When enabling the cancellation, only process the pending cancellation for asynchronous mode. Reported and reviewed by: Kohji Okuno <okuno.kohji@jp.panasonic.com> Sponsored by: The FreeBSD Foundation MFC after: 1 week	2013-06-19 04:47:41 +00:00
Konstantin Belousov	91ddaeb725	Since the cause of the problems with the __fillcontextx() was identified, unify the code of check_deferred_signal() for all architectures, making the variant under #ifdef x86 common. Tested by: marius (sparc64) Sponsored by: The FreeBSD Foundation MFC after: 2 weeks	2013-06-03 04:22:42 +00:00
Konstantin Belousov	55a1911ef2	The getcontext() from the __fillcontextx() call in the check_deferred_signal() returns twice, since handle_signal() emulates the return from the normal signal handler by sigreturn(2)ing the passed context. Second return is performed on the destroyed stack frame, because __fillcontextx() has already returned. This causes undefined and bad behaviour, usually the victim thread gets SIGSEGV. Avoid nested frame and the need to return from it by doing direct call to getcontext() in the check_deferred_signal() and using a new private libc helper __fillcontextx2() to complement the context with the extended CPU state if the deferred signal is still present. The __fillcontextx() is now unused, but is kept to allow older libthr.so to be used with the new libc. Mark __fillcontextx() as returning twice [1]. Reported by: pgj Pointy hat to: kib Discussed with: dim Tested by: pgj, dim Suggested by: jilles [1] MFC after: 1 week	2013-05-28 04:54:16 +00:00
Konstantin Belousov	5b1dd97092	Partially apply the capitalization of the heading word of the sequence and fix typo. Sponsored by: The FreeBSD Foundation MFC after: 1 week	2013-05-27 18:45:45 +00:00
David Xu	8096915018	Return one-based key so that user can check if the key is ever allocated in the first place. Initial patch submitted by: phk	2013-05-16 03:01:04 +00:00
David Xu	66f6c2721d	Fix return value for setcontext and swapcontext.	2013-05-09 04:41:03 +00:00
Jilles Tjoelker	da7d2afb6d	Add accept4() system call. The accept4() function, compared to accept(), allows setting the new file descriptor atomically close-on-exec and explicitly controlling the non-blocking status on the new socket. (Note that the latter point means that accept() is not equivalent to any form of accept4().) The linuxulator's accept4 implementation leaves a race window where the new file descriptor is not close-on-exec because it calls sys_accept(). This implementation leaves no such race window (by using falloc() flags). The linuxulator could be fixed and simplified by using the new code. Like accept(), accept4() is async-signal-safe, a cancellation point and permitted in capability mode.	2013-05-01 20:10:21 +00:00
David Xu	9ae844e124	Remove extra code for SA_RESETHAND, it is not needed because kernel has already done this.	2013-04-28 03:13:45 +00:00
Jilles Tjoelker	3cb14a8923	libthr: Fix a parameter name in an internal header file.	2013-04-27 14:21:36 +00:00
David Xu	31e9d5b85e	Remove debug code.	2013-04-18 05:58:07 +00:00
David Xu	8bbeb7e9e0	Avoid copying memory if SIGCANCEL is not masked.	2013-04-18 05:56:00 +00:00
David Xu	acad2b1e22	Revert revision 249323, the PR/177624 is confusing, that bug is caused by using buggy getcontext/setcontext on same stack, while swapcontext normally works on different stack, there is no such a problem.	2013-04-18 05:12:11 +00:00
Simon J. Gerraty	69e6d7b75e	sync from head	2013-04-12 20:48:55 +00:00
Jilles Tjoelker	706b04b66f	libthr: Remove _thr_rtld_fini(), unused since r245630.	2013-04-12 19:47:32 +00:00
David Xu	31c18e29cc	swapcontext wrapper can not be implemented in C, the stack pointer saved in the context becomes invalid when the function returns, same as setjmp, it must be implemented in assembly language, see discussions in PR misc/177624.	2013-04-10 02:40:03 +00:00
Simon J. Gerraty	7cf3a1c6b2	Updated dependencies	2013-03-11 17:21:52 +00:00
Simon J. Gerraty	f5f7c05209	Updated dependencies	2013-02-16 01:23:54 +00:00
David E. O'Brien	d9a447559b	Sync with HEAD.	2013-02-08 16:10:16 +00:00
Jilles Tjoelker	b18943f3b4	libthr: Always use the threaded rtld lock implementation. The threaded rtld lock implementation is faster even in the single-threaded case because it postpones signal handlers via THR_CRITICAL_ENTER and THR_CRITICAL_LEAVE instead of calling sigprocmask(2). As a result, exception handling becomes faster in single-threaded applications linked with libthr. Reviewed by: kib	2013-01-18 23:08:40 +00:00
Simon J. Gerraty	7cd2dcf076	Updated/new Makefile.depend	2012-11-08 21:24:17 +00:00
Simon J. Gerraty	23090366f7	Sync from head	2012-11-04 02:52:03 +00:00
David Xu	a7b84c6512	In suspend_common(), don't wait for a thread which is in creation, because pthread_suspend_all_np() may have already suspended its parent thread. Add locking code in pthread_suspend_all_np() to only allow one thread to suspend other threads, this eliminates a deadlock where two or more threads try to suspend each others.	2012-08-27 03:09:39 +00:00
David Xu	0aa81bff0b	Eliminate redundant code, _thr_spinlock_init() has already been called in init_private(), don't call it again in fork() wrapper.	2012-08-23 05:15:15 +00:00
Marcel Moolenaar	7750ad47a9	Sync FreeBSD's bmake branch with Juniper's internal bmake branch. Requested by: Simon Gerraty <sjg@juniper.net>	2012-08-22 19:25:57 +00:00
David Xu	d65f1abca7	Implement syscall clock_getcpuclockid2, so we can get a clock id for process, thread or others we want to support. Use the syscall to implement POSIX API clock_getcpuclockid and pthread_getcpuclockid. PR: 168417	2012-08-17 02:26:31 +00:00
Oleksandr Tymoshenko	89e757fe6f	Merging of projects/armv6, part 2 Handle TLS for ARMv6 and ARMv7	2012-08-15 03:08:29 +00:00
David Xu	aa75bc577a	Do deferred mutex wakeup once.	2012-08-12 00:56:56 +00:00
David Xu	e220a13ab9	MFp4: Further decreases unexpected context switches by deferring mutex wakeup until internal sleep queue lock is released.	2012-08-11 23:17:02 +00:00
David Xu	5674256c7f	Don't forget to initialize return value.	2012-07-20 05:47:12 +00:00
David Xu	ec225efc58	Simplify code by replacing _thr_ref_add() with _thr_find_thread().	2012-07-20 03:37:19 +00:00
David Xu	340e384de9	Eliminate duplicated code.	2012-07-20 03:27:07 +00:00
David Xu	30dd4f448c	Don't assign same value.	2012-07-20 03:22:17 +00:00
David Xu	670bc18dfe	Eliminate duplicated code.	2012-07-20 03:16:52 +00:00
David Xu	7e0cf81bc9	Eliminate duplicated code.	2012-07-20 03:00:41 +00:00
David Xu	12dbbf86f8	Don't forget to release a thread reference count, replace _thr_ref_add() with _thr_find_thread(), so reference count is no longer needed. MFC after: 3 days	2012-07-20 01:56:14 +00:00
David Xu	e3b090f037	Return EBUSY for PTHREAD_MUTEX_ADAPTIVE_NP too when the mutex could not be acquired. PR: 168317 MFC after: 3 days	2012-05-27 01:24:51 +00:00
David Xu	fa782a2611	Create a common function lookup() to search a chan, this eliminates redundant SC_LOOKUP() calling.	2012-05-10 09:30:37 +00:00
David Xu	173943ace3	Fix mis-merged line, move SC_LOOKUP() call to upper level.	2012-05-05 23:51:24 +00:00
David Xu	84ac0fb8ca	MFp4: Enqueue thread in LIFO, this can cause starvation, but it gives better performance. Use _thr_queuefifo to control the frequency of FIFO vs LIFO, you can use environment string LIBPTHREAD_QUEUE_FIFO to configure the variable.	2012-05-03 09:17:31 +00:00
George V. Neville-Neil	6e047a2426	Set SIGCANCEL to SIGTHR as part of some cleanup of DTrace code. Reviewed by: davidxu@ MFC after: 1 week	2012-04-18 16:29:55 +00:00
David Xu	17ce606321	umtx operation UMTX_OP_MUTEX_WAKE has a side-effect that it accesses a mutex after a thread has unlocked it, it even writes data to the mutex memory to clear contention bit, there is a race that other threads can lock it and unlock it, then destroy it, so it should not write data to the mutex memory if there isn't any waiter. The new operation UMTX_OP_MUTEX_WAKE2 tries to fix the problem. It requires thread library to clear the lock word entirely, then call the WAKE2 operation to check if there is any waiter in kernel, and try to wake up a thread, if necessary, the contention bit is set again by the operation. This also mitigates the chance that other threads find the contention bit and try to enter kernel to compete with each other to wake up sleeping thread, this is unnecessary. With this change, the mutex owner is no longer holding the mutex until it reaches a point where kernel umtx queue is locked, it releases the mutex as soon as possible. Performance is improved when the mutex is contended heavily. On Intel i3-2310M, the runtime of a benchmark program is reduced from 26.87 seconds to 2.39 seconds, it even is better than UMTX_OP_MUTEX_WAKE which is deprecated now. http://people.freebsd.org/~davidxu/bench/mutex_perf.c	2012-04-05 02:24:08 +00:00
Jilles Tjoelker	91792417bb	libthr: In the atfork handlers for signals, do not skip the last signal. _SIG_MAXSIG works a bit unexpectedly: signals 1 till _SIG_MAXSIG are valid, both bounds inclusive. Reviewed by: davidxu MFC after: 1 week	2012-03-26 17:05:26 +00:00
David Xu	81cd726a95	Use clockid parameter instead of hard-coded CLOCK_REALTIME. Reported by: pjd	2012-03-19 00:07:10 +00:00
David Xu	1b008f5e51	Some software think a mutex can be destroyed after it owned it, for example, it uses a serialization point like following: pthread_mutex_lock(&mutex); pthread_mutex_unlock(&mutex); pthread_mutex_destroy(&mutex); They think a previous lock holder should have already left the mutex and is no longer referencing it, so they destroy it. To be maximally compatible with such code, we use IA64 version to unlock the mutex in kernel, remove the two steps unlocking code.	2012-03-18 00:22:29 +00:00
David Xu	e70bf9d5eb	When destroying a barrier, wait for all threads to exit the barrier; this makes it possible for a thread that received PTHREAD_BARRIER_SERIAL_THREAD to immediately free the memory area of the barrier.	2012-03-16 04:35:52 +00:00
Oleksandr Tymoshenko	34e3f7e717	- Switch to saving non-offset pointer to TLS block in order to keep things simple	2012-03-06 03:27:58 +00:00
David Xu	24c209494a	Follow changes made in revision 232144, pass absolute timeout to kernel, this eliminates a clock_gettime() syscall.	2012-02-27 13:38:52 +00:00
David Xu	df1f1bae9e	In revision 231989, we pass a 16-bit clock ID into kernel, however according to POSIX document, the clock ID may be dynamically allocated, it unlikely will be in 64K forever. To make it future compatible, we pack all timeout information into a new structure called _umtx_time, and use fourth argument as a size indication, a zero means it is old code using timespec as timeout value, but the new structure also includes flags and a clock ID, so the size argument is different than before, and it is non-zero. With this change, it is possible that a thread can sleep on any supported clock, though current kernel code does not have such a POSIX clock driver system.	2012-02-25 02:12:17 +00:00
David Xu	b13a8fa78f	Use unused fourth argument of umtx_op to pass flags to kernel for operation UMTX_OP_WAIT. Upper 16bits is enough to hold a clock id, and lower 16bits is used to pass flags. The change saves a clock_gettime() syscall from libthr.	2012-02-22 03:22:49 +00:00
David Xu	879d152454	Check both seconds and nanoseconds are zero, only checking nanoseconds is zero may trigger timeout too early. It seems a copy&paste bug.	2012-02-19 08:17:14 +00:00
Oleksandr Tymoshenko	8ecdc98b5b	Add thread-local storage support for arm: - Switch to Variant I TCB layout - Use function from rtld for TCB allocation/deallocation	2012-02-14 00:17:43 +00:00
David Xu	4c91ddd690	Make code more stable by checking NULL pointers.	2012-02-11 04:12:12 +00:00
Oleksandr Tymoshenko	dda3ee8770	Switch MIPS TLS implementation to Variant I: Save pointer to the TLS structure taking into account TP_OFFSET and TCB structure size.	2012-02-10 06:53:25 +00:00
David Xu	e7004bf44d	Plug a memory leak. When a cached thread is reused, don't clear sleep queue pointers, just reuse it. PR: 164828 MFC after: 1 week	2012-02-07 02:57:36 +00:00
Konstantin Belousov	10280ca601	Use getcontextx(3) internal API instead of getcontext(2) to provide the signal handlers with the context information in the deferred case. Only enable the use of getcontextx(3) in the deferred signal delivery code on amd64 and i386. Sparc64 seems to have some undetermined issues with interaction of alloca(3) and signal delivery. Tested by: flo (who also provided sparc64 hardware access for me), pho Discussed with: marius MFC after: 1 month	2012-01-21 18:06:18 +00:00
Dimitry Andric	b34d83a709	The TCB_GET32() and TCB_GET64() macros in the i386 and amd64-specific versions of pthread_md.h have a special case of dereferencing a null pointer. Clang warns about this with: In file included from lib/libthr/arch/i386/i386/pthread_md.c:36: lib/libthr/arch/i386/include/pthread_md.h:96:10: error: indirection of non-volatile null pointer will be deleted, not trap [-Werror,-Wnull-dereference] return (TCB_GET32(tcb_self)); ^~~~~~~~~~~~~~~~~~~ lib/libthr/arch/i386/include/pthread_md.h:73:13: note: expanded from: : "m" ((u_int )(__tcb_offset(name)))); \ ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ lib/libthr/arch/i386/include/pthread_md.h:96:10: note: consider using __builtin_trap() or qualifying pointer with 'volatile' Since this indirection is done relative to the fs or gs segment, to retrieve thread-specific data, it is an exception to the rule. Therefore, add a volatile qualifier to tell the compiler we really want to dereference a zero address. MFC after: 1 week	2011-12-15 19:42:25 +00:00
David Xu	7859df8e67	Pass CVWAIT flags to kernel, this should handle timeout correctly for pthread_cond_timedwait when it uses kernel-based condition variable. PR: 162403 Submitted by: jilles MFC after: 3 days	2011-11-17 01:43:50 +00:00
Alexander Kabaev	a805bbe21a	Do not set thread name to less than informative 'initial thread'.	2011-06-19 13:35:36 +00:00
Marius Strobl	6ce2f878d3	Merge from r161730: o Set TP using inline assembly to avoid dead code elimination. o Eliminate _tcb. Merge from r161840: Stylize: avoid using a global register variable. Merge from r157461: Simplify _get_curthread() and _tcb_ctor because libc and rtld now already allocate thread pointer space in tls block for initial thread. Merge from r177853: Replace function _umtx_op with _umtx_op_err, the latter function directly returns errno, because errno can be mucked by user's signal handler and most of pthread api heavily depends on errno to be correct, this change should improve stability of the thread library. MFC after: 1 week	2011-06-18 11:07:09 +00:00
Ryan Stone	aad93b043a	r179417 introduced a bug into pthread_once(). Previously pthread_once() used a global pthread_mutex_t for synchronization. r179417 replaced that with an implementation that directly used atomic instructions and thr_* syscalls to synchronize callers to pthread_once. However, calling pthread_mutex_lock on the global mutex implicitly ensured that _thr_check_init() had been called but with r179417 this was no longer guaranteed. This meant that if you were unlucky enough to have your first call into libthr be a call to pthread_once(), you would segfault when trying to access the pointer returned by _get_curthread(). The fix is to explicitly call _thr_check_init() from pthread_once(). Reviewed by: davidxu Approved by: emaste (mentor) MFC after: 1 week	2011-04-20 14:19:34 +00:00
Jung-uk Kim	678b238c85	Introduce a non-portable function pthread_getthreadid_np(3) to retrieve calling thread's unique integral ID, which is similar to AIX function of the same name. Bump __FreeBSD_version to note its introduction. Reviewed by: kib	2011-02-07 21:26:46 +00:00
David Xu	65a6aaf1f3	Fix a typo. Submitted by: avg	2011-01-11 01:57:02 +00:00
Konstantin Belousov	fad128db86	For the process that already loaded libthr but still not initialized threading, fall back to libc method of performing __pthread_map_stacks_exec() job. Reported and tested by: Mykola Dzham <i levsha me>	2011-01-10 16:10:25 +00:00
Konstantin Belousov	da2fcff746	Implement the __pthread_map_stacks_exec() for libthr. Stack creation code is changed to call _rtld_get_stack_prot() to get the stack protection right. There is a race where thread is created during dlopen() of dso that requires executable stacks. Then, _rtld_get_stack_prot() may return PROT_READ | PROT_WRITE, but thread is still not linked into the thread list. In this case, the callback misses the thread stack, and rechecks the required protection afterward. Reviewed by: davidxu	2011-01-09 12:38:40 +00:00
Konstantin Belousov	6c69d05232	Add section .note.GNU-stack for assembly files used by 386 and amd64.	2011-01-07 16:09:33 +00:00
David Xu	ebc8e8fd7f	Return 0 instead of garbage value. Found by: clang static analyzer	2011-01-06 08:13:30 +00:00
David Xu	1f6f22dfec	Because the sleepqueue may still be in use, we should always check wchan with the queue locked.	2011-01-04 05:35:19 +00:00
David Xu	e29ba4c2db	Always clear flag PMUTEX_FLAG_DEFERED when unlocking, as it is only significant for lock owner.	2010-12-24 07:41:39 +00:00
David Xu	0126aea6ad	Add sleep queue code.	2010-12-22 05:03:24 +00:00
David Xu	d1078b0b03	MFp4: - Add flags CVWAIT_ABSTIME and CVWAIT_CLOCKID for umtx kernel based condition variable, this should eliminate an extra system call to get current time. - Add sub-function UMTX_OP_NWAKE_PRIVATE to wake up N channels in single system call. Create userland sleep queue for condition variable, in most cases, thread will wait in the queue, the pthread_cond_signal will defer thread wakeup until the mutex is unlocked, it tries to avoid an extra system call and an extra context switch in the time window between pthread_cond_signal and pthread_mutex_unlock. The changes are part of process-shared mutex project.	2010-12-22 05:01:52 +00:00
David Xu	1d1486408b	Use sysctl kern.sched.cpusetsize to retrieve size of kernel cpuset.	2010-11-02 02:13:13 +00:00
David Xu	6ed79f06f4	Return previous sigaction correctly. Submitted by: avg	2010-10-29 09:35:36 +00:00
David Xu	322a8adaa3	Remove local variable 'first', instead check signal number in memory, because the variable can be in register, second checking the variable may still return true, however this is unexpected.	2010-10-29 07:04:45 +00:00
David Xu	67753965a8	Check small set and reject it, this is how kernel did. Always use the size kernel is using.	2010-10-27 09:59:43 +00:00
David Xu	4a5478709b	- Revert r214409. - Use long word to figure out sizeof kernel cpuset, hope it works.	2010-10-27 09:29:03 +00:00
David Xu	e96b4de80e	Remove locking and unlock in pthread_mutex_destroy, because it can not fix race condition in application code, as a result, the problem described in PR threads/151767 is avoided.	2010-10-27 04:19:07 +00:00
David Xu	65df457797	Fix typo.	2010-10-25 11:16:50 +00:00
David Xu	7f25f6c72d	Get cpuset in pthread_attr_get_np() and free it in pthread_attr_destroy(). MFC after: 7 days	2010-10-25 09:16:04 +00:00
David Xu	de1e74c6a5	Revert revision 214007, I realized that MySQL wants to resolve a silly rwlock deadlock problem, the deadlock is caused by writer waiters, if a thread has already locked a reader lock, and wants to acquire another reader lock, it will be blocked by writer waiters, but we had already fixed it years ago.	2010-10-20 02:34:02 +00:00
David Xu	a24bcc04b2	Set default type to PTHREAD_RWLOCK_PREFER_WRITER_NONRECURSIVE_NP, this is the type we are using.	2010-10-18 23:37:56 +00:00
David Xu	bc15e58058	sort function name.	2010-10-18 05:16:44 +00:00
David Xu	7047ff7588	s/||/&&	2010-10-18 05:15:26 +00:00
David Xu	a6b9b59e04	Add pthread_rwlockattr_setkind_np and pthread_rwlockattr_getkind_np, the functions set or get pthread_rwlock type, currently supported types are: PTHREAD_RWLOCK_PREFER_READER_NP, PTHREAD_RWLOCK_PREFER_WRITER_NONRECURSIVE_NP, PTHREAD_RWLOCK_PREFER_WRITER_NP, default is PTHREAD_RWLOCK_PREFER_WRITER_NONRECURSIVE_NP, this maintains binary compatibility with old code.	2010-10-18 05:09:22 +00:00
David Xu	0935fc89af	Oops, don't remove -fexceptions flag.	2010-10-08 01:53:33 +00:00
David Xu	0c3c9625a0	unwind.h was imported, gcc directory is no longer needed.	2010-10-08 01:47:14 +00:00
David Xu	722488013d	change code to use unwind.h.	2010-09-30 12:59:56 +00:00
David Xu	ec92603cf9	Check invalid mutex in _mutex_cv_unlock.	2010-09-29 06:06:58 +00:00
David Xu	bbb64c2143	In current code, statically initialized and destroyed object have same null value, the code can not distinguish between them, to fix the problem, now a destroyed object is assigned to a non-null value, and it will be rejected by some pthread functions. PTHREAD_ADAPTIVE_MUTEX_INITIALIZER_NP is changed to number 1, so that adaptive mutex can be statically initialized correctly.	2010-09-28 04:57:56 +00:00
David Xu	1d5b5089aa	Report death event to debugger before moving to gc list, otherwise the debugger may not find it on the thread list.	2010-09-26 06:45:24 +00:00
David Xu	8be6abcdc6	Only access unwind_disabled when _PTHREAD_FORCED_UNWIND is defined.	2010-09-25 09:43:24 +00:00
David Xu	9f1dc4c107	Add missing field.	2010-09-25 08:36:46 +00:00
David Xu	8690b9f6dd	Because old _pthread_cleanup_push/pop do not have frame address, it is incompatible with stack unwinding code, if they are invoked, disable stack unwinding for current thread, and when thread is exiting, print a warning message.	2010-09-25 06:27:09 +00:00
David Xu	6f066bb387	Simplify code, and in while loop, fix operator to match the unwinding direction.	2010-09-25 04:21:31 +00:00
David Xu	f4213b9006	To support stack unwinding for cancellation points, add -fexceptions flag for them, two functions _pthread_cancel_enter and _pthread_cancel_leave are added to let thread enter and leave a cancellation point, it also makes it possible that other functions can be cancellation points in libraries without having to be rewritten in libthr.	2010-09-25 01:57:47 +00:00
David Xu	e5c66a0d9e	Inline testcancel() into thr_cancel_leave(); because cancel_pending is almost always false, this gives slightly better branch prediction.	2010-09-24 13:01:01 +00:00
David Xu	93ea4a71bf	In most cases, cancel_point and cancel_async needn't be checked again, because cancellation is almost always checked at cancellation points.	2010-09-24 07:52:07 +00:00
David Xu	81f3e99c56	If we are at cancellation point, always work as deferred mode despite whether asynchronous mode is turned on or not, this always gives us a chance to decide whether thread should be canceled or not in cancellation points.	2010-09-21 06:47:04 +00:00
David Xu	4173ebef4f	Because atfork lock is held while forking, a thread cancellation triggered by atfork handler is unsafe, use internal flag no_cancel to disable it.	2010-09-19 09:03:11 +00:00
David Xu	7c243121b7	Fix typo.	2010-09-19 08:55:36 +00:00
David Xu	a5793db975	- _Unwind_Resume function is not used, remove it. - Use a store barrier to make sure uwl_forcedunwind is the last thing other threads can see. - Add some comments.	2010-09-19 05:42:29 +00:00
David Xu	4da1da4b6e	Fix a race condition when finding stack unwinding functions.	2010-09-19 05:19:47 +00:00
David Xu	3832fd24f1	Add code to support stack unwinding when a thread exits. Note that only deferred-mode cancellation works; asynchronous mode does not work because it lacks libunwind support. Stack unwinding is not enabled unless LIBTHR_UNWIND_STACK is defined in the Makefile.	2010-09-15 02:56:32 +00:00
David Xu	707ee8154d	Move back IN_GCLIST flag into field tlflags, since thread list and gc list still share same lock.	2010-09-15 01:21:30 +00:00
David Xu	7820a71113	Don't compare thread pointers again.	2010-09-13 11:58:42 +00:00
David Xu	cbadc1d7ad	Fix copy&paste problem.	2010-09-13 11:57:46 +00:00
David Xu	fa1efe5efd	Update symbol.	2010-09-13 09:23:38 +00:00

706 Commits