freebsd-dev

Author	SHA1	Message	Date
David Xu	24c209494a	Follow changes made in revision 232144, pass absolute timeout to kernel, this eliminates a clock_gettime() syscall.	2012-02-27 13:38:52 +00:00
David Xu	df1f1bae9e	In revision 231989, we pass a 16-bit clock ID into kernel, however according to POSIX document, the clock ID may be dynamically allocated, so it is unlikely to stay within 64K forever. To make it future compatible, we pack all timeout information into a new structure called _umtx_time, and use fourth argument as a size indication, a zero means it is old code using timespec as timeout value, but the new structure also includes flags and a clock ID, so the size argument is different than before, and it is non-zero. With this change, it is possible that a thread can sleep on any supported clock, though current kernel code does not have such a POSIX clock driver system.	2012-02-25 02:12:17 +00:00
David Xu	b13a8fa78f	Use unused fourth argument of umtx_op to pass flags to kernel for operation UMTX_OP_WAIT. Upper 16bits is enough to hold a clock id, and lower 16bits is used to pass flags. The change saves a clock_gettime() syscall from libthr.	2012-02-22 03:22:49 +00:00
David Xu	879d152454	Check both seconds and nanoseconds are zero, only checking nanoseconds is zero may trigger timeout too early. It seems a copy&paste bug.	2012-02-19 08:17:14 +00:00
David Xu	4c91ddd690	Make code more stable by checking NULL pointers.	2012-02-11 04:12:12 +00:00
David Xu	e7004bf44d	Plug a memory leak. When a cached thread is reused, don't clear sleep queue pointers, just reuse it. PR: 164828 MFC after: 1 week	2012-02-07 02:57:36 +00:00
Konstantin Belousov	10280ca601	Use getcontextx(3) internal API instead of getcontext(2) to provide the signal handlers with the context information in the deferred case. Only enable the use of getcontextx(3) in the deferred signal delivery code on amd64 and i386. Sparc64 seems to have some undetermined issues with interaction of alloca(3) and signal delivery. Tested by: flo (who also provided sparc64 hardware access for me), pho Discussed with: marius MFC after: 1 month	2012-01-21 18:06:18 +00:00
David Xu	7859df8e67	Pass CVWAIT flags to kernel, this should handle timeout correctly for pthread_cond_timedwait when it uses kernel-based condition variable. PR: 162403 Submitted by: jilles MFC after: 3 days	2011-11-17 01:43:50 +00:00
Alexander Kabaev	a805bbe21a	Do not set thread name to less than informative 'initial thread'.	2011-06-19 13:35:36 +00:00
Ryan Stone	aad93b043a	r179417 introduced a bug into pthread_once(). Previously pthread_once() used a global pthread_mutex_t for synchronization. r179417 replaced that with an implementation that directly used atomic instructions and thr_* syscalls to synchronize callers to pthread_once. However, calling pthread_mutex_lock on the global mutex implicitly ensured that _thr_check_init() had been called but with r179417 this was no longer guaranteed. This meant that if you were unlucky enough to have your first call into libthr be a call to pthread_once(), you would segfault when trying to access the pointer returned by _get_curthread(). The fix is to explicitly call _thr_check_init() from pthread_once(). Reviewed by: davidxu Approved by: emaste (mentor) MFC after: 1 week	2011-04-20 14:19:34 +00:00
Jung-uk Kim	678b238c85	Introduce a non-portable function pthread_getthreadid_np(3) to retrieve calling thread's unique integral ID, which is similar to AIX function of the same name. Bump __FreeBSD_version to note its introduction. Reviewed by: kib	2011-02-07 21:26:46 +00:00
David Xu	65a6aaf1f3	Fix a typo. Submitted by: avg	2011-01-11 01:57:02 +00:00
Konstantin Belousov	fad128db86	For the process that already loaded libthr but still not initialized threading, fall back to libc method of performing __pthread_map_stacks_exec() job. Reported and tested by: Mykola Dzham <i levsha me>	2011-01-10 16:10:25 +00:00
Konstantin Belousov	da2fcff746	Implement the __pthread_map_stacks_exec() for libthr. Stack creation code is changed to call _rtld_get_stack_prot() to get the stack protection right. There is a race where thread is created during dlopen() of dso that requires executable stacks. Then, _rtld_get_stack_prot() may return PROT_READ | PROT_WRITE, but thread is still not linked into the thread list. In this case, the callback misses the thread stack, and rechecks the required protection afterward. Reviewed by: davidxu	2011-01-09 12:38:40 +00:00
David Xu	ebc8e8fd7f	Return 0 instead of garbage value. Found by: clang static analyzer	2011-01-06 08:13:30 +00:00
David Xu	1f6f22dfec	Because sleepqueue may still be in use, we should always check wchan with queue locked.	2011-01-04 05:35:19 +00:00
David Xu	e29ba4c2db	Always clear flag PMUTEX_FLAG_DEFERED when unlocking, as it is only significant for lock owner.	2010-12-24 07:41:39 +00:00
David Xu	0126aea6ad	Add sleep queue code.	2010-12-22 05:03:24 +00:00
David Xu	d1078b0b03	MFp4: - Add flags CVWAIT_ABSTIME and CVWAIT_CLOCKID for umtx kernel based condition variable, this should eliminate an extra system call to get current time. - Add sub-function UMTX_OP_NWAKE_PRIVATE to wake up N channels in single system call. Create userland sleep queue for condition variable, in most cases, thread will wait in the queue, the pthread_cond_signal will defer thread wakeup until the mutex is unlocked, it tries to avoid an extra system call and an extra context switch in time window of pthread_cond_signal and pthread_mutex_unlock. The changes are part of process-shared mutex project.	2010-12-22 05:01:52 +00:00
David Xu	1d1486408b	Use sysctl kern.sched.cpusetsize to retrieve size of kernel cpuset.	2010-11-02 02:13:13 +00:00
David Xu	6ed79f06f4	Return previous sigaction correctly. Submitted by: avg	2010-10-29 09:35:36 +00:00
David Xu	322a8adaa3	Remove local variable 'first', instead check signal number in memory, because the variable can be in register, second checking the variable may still return true, however this is unexpected.	2010-10-29 07:04:45 +00:00
David Xu	67753965a8	Check small set and reject it, this is how kernel did. Always use the size kernel is using.	2010-10-27 09:59:43 +00:00
David Xu	4a5478709b	- Revert r214409. - Use long word to figure out sizeof kernel cpuset, hope it works.	2010-10-27 09:29:03 +00:00
David Xu	e96b4de80e	Remove locking and unlock in pthread_mutex_destroy, because it can not fix race condition in application code, as a result, the problem described in PR threads/151767 is avoided.	2010-10-27 04:19:07 +00:00
David Xu	65df457797	Fix typo.	2010-10-25 11:16:50 +00:00
David Xu	7f25f6c72d	Get cpuset in pthread_attr_get_np() and free it in pthread_attr_destroy(). MFC after: 7 days	2010-10-25 09:16:04 +00:00
David Xu	de1e74c6a5	Revert revision 214007, I realized that MySQL wants to resolve a silly rwlock deadlock problem, the deadlock is caused by writer waiters, if a thread has already locked a reader lock, and wants to acquire another reader lock, it will be blocked by writer waiters, but we had already fixed it years ago.	2010-10-20 02:34:02 +00:00
David Xu	a24bcc04b2	Set default type to PTHREAD_RWLOCK_PREFER_WRITER_NONRECURSIVE_NP, this is the type we are using.	2010-10-18 23:37:56 +00:00
David Xu	7047ff7588	s/||/&&	2010-10-18 05:15:26 +00:00
David Xu	a6b9b59e04	Add pthread_rwlockattr_setkind_np and pthread_rwlockattr_getkind_np, the functions set or get pthread_rwlock type, current supported types are: PTHREAD_RWLOCK_PREFER_READER_NP, PTHREAD_RWLOCK_PREFER_WRITER_NONRECURSIVE_NP, PTHREAD_RWLOCK_PREFER_WRITER_NP, default is PTHREAD_RWLOCK_PREFER_WRITER_NONRECURSIVE_NP, this maintains binary compatibility with old code.	2010-10-18 05:09:22 +00:00
David Xu	722488013d	change code to use unwind.h.	2010-09-30 12:59:56 +00:00
David Xu	ec92603cf9	Check invalid mutex in _mutex_cv_unlock.	2010-09-29 06:06:58 +00:00
David Xu	bbb64c2143	In current code, statically initialized and destroyed object have same null value, the code can not distinguish between them, to fix the problem, now a destroyed object is assigned to a non-null value, and it will be rejected by some pthread functions. PTHREAD_ADAPTIVE_MUTEX_INITIALIZER_NP is changed to number 1, so that adaptive mutex can be statically initialized correctly.	2010-09-28 04:57:56 +00:00
David Xu	1d5b5089aa	Report death event to debugger before moving to gc list, otherwise debugger may not be able to find it on thread list.	2010-09-26 06:45:24 +00:00
David Xu	8be6abcdc6	Only access unwind_disabled when _PTHREAD_FORCED_UNWIND is defined.	2010-09-25 09:43:24 +00:00
David Xu	9f1dc4c107	Add missing field.	2010-09-25 08:36:46 +00:00
David Xu	8690b9f6dd	Because old _pthread_cleanup_push/pop do not have frame address, it is incompatible with stack unwinding code, if they are invoked, disable stack unwinding for current thread, and when thread is exiting, print a warning message.	2010-09-25 06:27:09 +00:00
David Xu	6f066bb387	Simplify code, and in while loop, fix operator to match the unwinding direction.	2010-09-25 04:21:31 +00:00
David Xu	f4213b9006	To support stack unwinding for cancellation points, add -fexceptions flag for them, two functions _pthread_cancel_enter and _pthread_cancel_leave are added to let thread enter and leave a cancellation point, it also makes it possible that other functions can be cancellation points in libraries without having to be rewritten in libthr.	2010-09-25 01:57:47 +00:00
David Xu	e5c66a0d9e	Inline testcancel() into thr_cancel_leave(), because cancel_pending is almost always false, this makes for slightly better branch prediction.	2010-09-24 13:01:01 +00:00
David Xu	93ea4a71bf	In most cases, cancel_point and cancel_async needn't be checked again, because cancellation is almost always checked at cancellation points.	2010-09-24 07:52:07 +00:00
David Xu	81f3e99c56	If we are at cancellation point, always work as deferred mode despite whether asynchronous mode is turned on or not, this always gives us a chance to decide whether thread should be canceled or not in cancellation points.	2010-09-21 06:47:04 +00:00
David Xu	4173ebef4f	Because atfork lock is held while forking, a thread cancellation triggered by atfork handler is unsafe, use internal flag no_cancel to disable it.	2010-09-19 09:03:11 +00:00
David Xu	7c243121b7	Fix typo.	2010-09-19 08:55:36 +00:00
David Xu	a5793db975	- _Unwind_Resume function is not used, remove it. - Use a store barrier to make sure uwl_forcedunwind is the last thing other threads can see. - Add some comments.	2010-09-19 05:42:29 +00:00
David Xu	4da1da4b6e	Fix a race condition when finding stack unwinding functions.	2010-09-19 05:19:47 +00:00
David Xu	3832fd24f1	Add code to support stack unwinding when thread exits. Note that only deferred-mode cancellation works, asynchronous mode does not work because it lacks libunwind's support. Stack unwinding is not enabled unless LIBTHR_UNWIND_STACK is defined in Makefile.	2010-09-15 02:56:32 +00:00
David Xu	707ee8154d	Move back IN_GCLIST flag into field tlflags, since thread list and gc list still share same lock.	2010-09-15 01:21:30 +00:00
David Xu	7820a71113	Don't compare thread pointers again.	2010-09-13 11:58:42 +00:00
David Xu	cbadc1d7ad	Fix copy&paste problem.	2010-09-13 11:57:46 +00:00
David Xu	b749a04db3	PS_DEAD state needs not be checked because _thr_find_thread() has already checked it.	2010-09-13 07:18:00 +00:00
David Xu	a9b764e218	Convert thread list lock from mutex to rwlock.	2010-09-13 07:03:01 +00:00
David Xu	83c9e0893f	Because POSIX does not allow EINTR to be returned from sigwait(), add a wrapper for it in libc and rework the code in libthr, the system call still can return EINTR, we keep this feature. Discussed on: thread Reviewed by: jilles	2010-09-10 01:47:37 +00:00
David Xu	17dce7e108	To avoid possible race condition, SIGCANCEL is always sent except the thread is dead.	2010-09-08 02:18:20 +00:00
David Xu	cb4a1047ce	Fix off-by-one error in function _thr_sigact_unload, also disable the function, it seems some gnome application tends to crash if we unregister sigaction automatically.	2010-09-06 03:00:54 +00:00
David Xu	21a9296f63	Remove incorrect comments, also make sure signal is disabled when unregistering sigaction.	2010-09-01 13:22:55 +00:00
David Xu	12c61c22ce	In function __pthread_cxa_finalize(), also make code for removing atfork handler be async-signal safe.	2010-09-01 07:09:46 +00:00
David Xu	a523216bc6	pthread_atfork should acquire writer lock and protect the code with critical region.	2010-09-01 03:55:10 +00:00
David Xu	ada33a6e36	Change atfork lock from mutex to rwlock, also make mutexes used by malloc() module private type, when private type mutex is locked/unlocked, thread critical region is entered or left. These changes make fork() async-signal safe, which is required by POSIX. Note that user's atfork handler still needs to be async-signal safe, but it is not a problem of libthr, it is the user's responsibility.	2010-09-01 03:11:21 +00:00
David Xu	02c3c85869	Add signal handler wrapper, the reason to add it is that there are some cases we want to improve: 1) if a thread got a signal while in cancellation point, it is possible the TDP_WAKEUP may be eaten by signal handler if the handler called some interruptible system calls. 2) In signal handler, we want to disable cancellation. 3) When thread is holding some low level locks, it is better to disable signal, that code need not worry about reentrancy, sigprocmask system call is avoided because it is a bit expensive. The signal handler wrapper works in this way: 1) libthr installs its signal handler if user code invokes sigaction to install its handler, the user handler is recorded in internal array. 2) when a signal is delivered, libthr's signal handler is invoked, libthr checks if thread holds some low level lock or is in critical region, if it is true, the signal is buffered, and all signals are masked, once the thread leaves critical region, correct signal mask is restored and buffered signal is processed. 3) before user signal handler is invoked, cancellation is temporarily disabled, after user signal handler has returned, cancellation state is restored, and pending cancellation is rescheduled.	2010-09-01 02:18:33 +00:00
David Xu	ed0ee6af2e	Unregister thread specific data destructor when a corresponding dso is unloaded.	2010-08-27 05:20:22 +00:00
David Xu	8e60ce996b	clear lock to zero state if it is destroyed.	2010-08-27 03:23:07 +00:00
David Xu	1ac3d5022c	eliminate unused code.	2010-08-26 09:04:27 +00:00
David Xu	6b932eca79	Decrease rdlock count only when thread unlocked a reader lock. MFC after: 3 days	2010-08-26 07:09:48 +00:00
Konstantin Belousov	247a32fac5	Remove unused source. MFC after: 2 weeks	2010-08-24 11:55:25 +00:00
Konstantin Belousov	47536ff629	The __hidden definition is provided by sys/cdefs.h. MFC after: 2 weeks	2010-08-24 11:54:48 +00:00
David Xu	5cf2219535	Add wrapper for setcontext() and swapcontext(), the wrappers unblock SIGCANCEL which is needed by thread cancellation.	2010-08-24 09:57:06 +00:00
Konstantin Belousov	ea246b6369	On shared object unload, in __cxa_finalize, call and clear all installed atexit and __cxa_atexit handlers that are either installed by unloaded dso, or points to the functions provided by the dso. Use _rtld_addr_phdr to locate segment information from the address of private variable belonging to the dso, supplied by crtstuff.c. Provide utility function __elf_phdr_match_addr to do the match of address against dso executable segment. Call back into libthr from __cxa_finalize using weak __pthread_cxa_finalize symbol to remove any atfork handler which function points into unloaded object. The rtld needs private __pthread_cxa_finalize symbol to not require resolution of the weak undefined symbol at initialization time. This cannot work, since rtld is relocated before sym_zero is set up. Idea by: kan Reviewed by: kan (previous version) MFC after: 3 weeks	2010-08-23 15:38:02 +00:00
David Xu	82746ea546	Reduce redundant code. Submitted by: kib	2010-08-20 13:42:48 +00:00
David Xu	635f917a9d	In current implementation, thread cancellation is done in signal handler, which does not know what is the state of interrupted system call, for example, open() system call opened a file and the thread is still cancelled, result is descriptor leak, there are other problems which can cause resource leak or undeterminable side effect when a thread is cancelled. However, this is no longer true in new implementation. In deferred mode, a thread is canceled if cancellation request is pending and later the thread enters a cancellation point, otherwise, a later pthread_cancel() just causes SIGCANCEL to be sent to the target thread, and causes target thread to abort system call, userland code in libthr then checks cancellation state, and cancels the thread if needed. For example, the cancellation point open(), the thread may be canceled at start, but later, if it opened a file descriptor, it is not canceled, this avoids file handle leak. Another example is read(), a thread may be canceled at start of the function, but later, if it read some bytes from a socket, the thread is not canceled, the caller then can decide if it should still enable cancelling or disable it and continue reading data until it thinks it has read all bytes of a packet, and keeps a protocol stream in healthy state, if user ignores a partial read of a packet without disabling cancellation, then second iteration of read loop causes the thread to be cancelled. An exception is that the close() cancellation point always closes a file handle despite whether the thread is cancelled or not. The old mechanism is still kept; for functions where a cancellation problem is not so easy to fix, the rough mechanism is used. Reviewed by: kib@	2010-08-20 05:15:39 +00:00
David Xu	719863239e	According to specification, function fcntl() is a cancellation point only when cmd argument is F_SETLKW.	2010-08-20 04:15:05 +00:00
David Xu	cdcffc3f1c	Tweak code a bit to be POSIX compatible, when a cancellation request is acted upon, or when a thread calls pthread_exit(), the thread first disables cancellation by setting its cancelability state to PTHREAD_CANCEL_DISABLE and its cancelability type to PTHREAD_CANCEL_DEFERRED. The cancelability state remains set to PTHREAD_CANCEL_DISABLE until the thread has terminated. It has no effect if a cancellation cleanup handler or thread-specific data destructor routine changes the cancelability state to PTHREAD_CANCEL_ENABLE.	2010-08-17 02:50:12 +00:00
Konstantin Belousov	b144e48b2a	Use _SIG_VALID instead of expanded form of the macro. Submitted by: Garrett Cooper <yanegomi gmail com> MFC after: 1 week	2010-07-12 10:15:33 +00:00
Daniel Eischen	1cfc8fc759	Coalesce one more broken line.	2010-05-24 13:44:39 +00:00
Daniel Eischen	9ed8360e53	Coalesce a couple of broken lines since they can fit within 80 characters. Little nit found while looking at a bug report.	2010-05-24 13:43:11 +00:00
David Xu	60e9cdf158	remove file thr_sem_new.c.	2010-01-05 07:50:31 +00:00
David Xu	791f7a99e2	Remove extra new semaphore stubs, because libc already has them, and ld can find the newest version which is default. Poked by: kan@	2010-01-05 06:21:29 +00:00
David Xu	9b0f1823b5	Use umtx to implement process sharable semaphore, to make this work, now type sema_t is a structure which can be put in a shared memory area, and multiple processes can operate it concurrently. User can either use mmap(MAP_SHARED) + sem_init(pshared=1) or use sem_open() to initialize a shared semaphore. Named semaphore uses file system and is located in /tmp directory, and its file name is prefixed with 'SEMD', so now it is chroot or jail friendly. In simplest cases, both for named and un-named semaphore, userland code does not have to enter kernel to reduce/increase semaphore's count. The semaphore is designed to be crash-safe, it means even if an application is crashed in the middle of operating semaphore, the semaphore state is still safely recovered by later use, there is no waiter counter maintained by userland code. The main semaphore code is in libc and libthr only has some necessary stubs, this makes it possible that a non-threaded application can use semaphore without linking to thread library. Old semaphore implementation is kept in libc to maintain binary compatibility. The kernel ksem API is no longer used in the new implementation. Discussed on: threads@	2010-01-05 02:37:59 +00:00
Marcel Moolenaar	e4f141b546	Work-around a race condition on ia64 while unlocking a contested lock. The race condition is believed to be in UMTX_OP_MUTEX_WAKE. On ia64, we simply go to the kernel to unlock. The big question is why this is only a race condition on ia64... MFC after: 3 days	2009-12-14 01:26:01 +00:00
Konstantin Belousov	066d836b02	Current pselect(3) is implemented in usermode and thus vulnerable to well-known race condition, which elimination was the reason for the function appearance in first place. If sigmask supplied as argument to pselect() enables a signal, the signal might be delivered before thread called select(2), causing lost wakeup. Reimplement pselect() in kernel, making change of sigmask and sleep atomic. Since signal shall be delivered to the usermode, but sigmask restored, set TDP_OLDMASK and save old mask in td_oldsigmask. The TDP_OLDMASK should be cleared by ast() in case signal was not delivered during syscall execution. Reviewed by: davidxu Tested by: pho MFC after: 1 month	2009-10-27 10:55:34 +00:00
Jilles Tjoelker	29670497af	Make openat(2) a cancellation point. This is required by POSIX and matches open(2). Reviewed by: kib, jhb MFC after: 1 month	2009-10-11 20:19:45 +00:00
David Xu	daf3ced72b	don't report error if key was deleted. PR: threads/135462	2009-09-25 00:15:30 +00:00
Attilio Rao	b13c5f2883	rwlock implemented from libthr need to fall through the 'hard path' and query umtx also if the shared waiters bit is set on a shared lock. The writer starvation avoidance technique, in fact, can lead to shared waiters on a shared lock which can bring to a missed wakeup and thus to a deadlock if the right bit is not checked (a notable case is the writers counterpart to be handled through expired timeouts). Fix that by checking for the shared waiters bit also when unlocking the shared locks. That bug was causing a reported MySQL deadlock. Many thanks go to Nick Esborn and his employer DesertNet which provided time and machines to identify and fix this issue. PR: thread/135673 Reported by: Nick Esborn <nick at desert dot net> Tested by: Nick Esborn <nick at desert dot net> Reviewed by: jeff	2009-09-23 21:38:57 +00:00
Attilio Rao	137ae5d291	In the current code, rdlock_count is not correctly handled for some cases. The most notable is that it is not bumped in rwlock_rdlock_common() when the hard path (__thr_rwlock_rdlock()) returns successfully. This can lead to deadlocks in libthr when rwlocks recursion in read mode happens. Fix the interested parts by correctly handling rdlock_count. PR: threads/136345 Reported by: rink Tested by: rink Reviewed by: jeff Approved by: re (kib) MFC: 2 weeks	2009-07-06 09:31:04 +00:00
Brian Feldman	43af51a2b5	These are some cosmetic changes to improve the clarity of libthr's fork implementation.	2009-05-11 16:45:53 +00:00
Robert Watson	d1f2f1c3f3	Now that the kernel defines CACHE_LINE_SIZE in machine/param.h, use that definition in the custom locking code for the run-time linker rather than local definitions. Pointed out by: tinderbox MFC after: 2 weeks	2009-04-19 23:02:50 +00:00
Konstantin Belousov	29986e1bac	Forcibly unlock the malloc() locks in the child process after fork(), by temporary pretending that the process is still multithreaded. Current malloc lock primitives do nothing for singlethreaded process. Reviewed by: davidxu, deischen	2009-03-19 10:32:25 +00:00
David Xu	5b71b82e70	Don't ignore other fcntl functions, directly call __sys_fcntl if WITHOUT_SYSCALL_COMPAT is not defined. Reviewed by: deischen	2009-03-09 05:54:43 +00:00
David Xu	c30c187d60	Don't reference non-existent __fcntl_compat if WITHOUT_SYSCALL_COMPAT is defined. Submitted by: Pawel Worach "pawel dot worach at gmail dot com"	2009-03-09 02:34:02 +00:00
Peter Wemm	70ba1e8fc1	When libthr and rtld start up, there are a number of magic spells cast in order to get the symbol binding state "just so". This is to allow locking to be activated and not run into recursion problems later. However, one of the magic bits involves an explicit call to _umtx_op() to force symbol resolution. It does a wakeup operation on a fake, uninitialized (ie: random contents) umtx. Since libthr isn't active, this is harmless. Nothing can match the random wakeup. However, valgrind finds this and is not amused. Normally I'd just write a suppression record for it, but the idea of passing random args to syscalls (on purpose) just doesn't feel right.	2008-12-07 02:32:49 +00:00
Konstantin Belousov	10b4034657	Provide custom simple allocator for rtld locks in libthr. The allocator does not use any external symbols, thus avoiding possible recursion into rtld to resolve symbols, when called. Reviewed by: kan, davidxu Tested by: rink MFC after: 1 month	2008-12-02 11:58:31 +00:00
Alexander Kabaev	97df383415	Invoke _rtld_atfork_post earlier, before we reinitialize rtld locks by switching into single-thread mode. libthr ignores broken use of lock bitmaps used by default rtld locking implementation, this in turn turns lock handoff in _rtld_thread_init into NOP. This in turn makes child processes of forked multi-threaded programs to run with _thr_signal_block still in effect, with most signals blocked. Reported by: phk, kib	2008-12-01 21:00:25 +00:00
Konstantin Belousov	e711c6f0d1	Unlock the malloc() locks in the child process after fork(). This gives us working malloc in the fork child of the multithreaded process. Although POSIX requires that only async-signal safe functions shall be operable after fork in multithreaded process, not having malloc lowers the quality of our implementation. Tested by: rink Discussed with: kan, davidxu Reviewed by: kan MFC after: 1 month	2008-11-29 21:46:28 +00:00
Konstantin Belousov	cb5c4b10ba	Add two rtld exported symbols, _rtld_atfork_pre and _rtld_atfork_post. Threading library calls _pre before the fork, allowing the rtld to lock itself to ensure that other threads of the process are out of dynamic linker. _post releases the locks. This allows the rtld to have consistent state in the child. Although child may legitimately call only async-safe functions, the call may need plt relocation resolution, and this requires working rtld. Reported and debugging help by: rink Reviewed by: kan, davidxu MFC after: 1 month (anyway, not before 7.1 is out)	2008-11-27 11:27:59 +00:00
Marcel Moolenaar	03fad2ad5f	Allow psaddr_t to be widened by using thr_pread_{int,long,ptr}, where critical. Some places still use ps_pread/ps_pwrite directly, but only need changed when byte-order comes into the picture. Also, change th_p in td_event_msg_t from a pointer type to psaddr_t, so that events also work when psaddr_t is widened.	2008-09-14 16:07:21 +00:00
Jason Evans	5b3842aefa	Move call to _malloc_thread_cleanup() so that if this is the last thread, the call never happens. This is necessary because malloc may be used during exit handler processing. Submitted by: davidxu	2008-09-09 17:14:32 +00:00
Jason Evans	d6742bfbd3	Add thread-specific caching for small size classes, based on magazines. This caching allows for completely lock-free allocation/deallocation in the steady state, at the expense of likely increased memory use and fragmentation. Reduce the default number of arenas to 2*ncpus, since thread-specific caching typically reduces arena contention. Modify size class spacing to include ranges of 2^n-spaced, quantum-spaced, cacheline-spaced, and subpage-spaced size classes. The advantages are: fewer size classes, reduced false cacheline sharing, and reduced internal fragmentation for allocations that are slightly over 512, 1024, etc. Increase RUN_MAX_SMALL, in order to limit fragmentation for the subpage-spaced size classes. Add a size-->bin lookup table for small sizes to simplify translating sizes to size classes. Include a hard-coded constant table that is used unless custom size class spacing is specified at run time. Add the ability to disable tiny size classes at compile time via MALLOC_TINY.	2008-08-27 02:00:53 +00:00
David Xu	fc45432be6	In function pthread_condattr_getpshared, store result correctly. PR: kern/126128	2008-08-01 01:21:49 +00:00
David Xu	7de1ecef2d	Add two commands to _umtx_op system call to allow a simple mutex to be locked and unlocked completely in userland. by locking and unlocking mutex in userland, it reduces the total time a mutex is locked by a thread, in some application code, a mutex only protects a small piece of code, the code's execution time is less than a simple system call, if a lock contention happens, however in current implementation, the lock holder has to extend its locking time and enter kernel to unlock it, the change avoids this disadvantage, it first sets mutex to free state and then enters kernel and wake one waiter up. This improves performance dramatically in some sysbench mutex tests. Tested by: kris Sounds great: jeff	2008-06-24 07:32:12 +00:00
David Xu	83a0758789	Make pthread_cleanup_push() and pthread_cleanup_pop() as a pair of macros, use stack space to keep cleanup information, this eliminates overhead of calling malloc() and free() in thread library. Discussed on: thread@	2008-06-09 01:14:10 +00:00
Doug Rabson	cd7d66a21f	Call the fcntl compatibility wrapper from the thread library fcntl wrappers so that they get the benefit of the (limited) forward ABI compatibility. MFC after: 1 week	2008-05-30 14:47:42 +00:00
David Xu	1b3418b2dc	Eliminate global mutex by using pthread_once's state field as a semaphore.	2008-05-30 00:02:59 +00:00
David Xu	850f4d66cb	- Reduce function call overhead for uncontended case. - Remove unused flags MUTEX_FLAGS_* and their code. - Check validity of the timeout parameter in mutex_self_lock().	2008-05-29 07:57:33 +00:00
David Xu	cf181aee60	Remove libc_r's remnant code.	2008-05-06 07:27:11 +00:00
David Xu	8d6a11a070	Use UMTX_OP_WAIT_UINT_PRIVATE and UMTX_OP_WAKE_PRIVATE to save time in kernel(avoid VM lookup).	2008-04-29 03:58:18 +00:00
Kris Kennaway	dd77f9f7f2	Increase the default MUTEX_ADAPTIVE_SPINS to 2000, after further testing it turns out 200 was too short to give good adaptive performance. Reviewed by: jeff MFC after: 1 week	2008-04-26 13:19:07 +00:00
Xin LI	d0aa4fd3ca	Avoid various shadowed variables. libthr is now almost WARNS=4 clean except for some const dequalifiers that need more careful investigation. Ok'ed by: davidxu	2008-04-23 21:06:51 +00:00
David Xu	fb2641d9b1	Use native rwlock.	2008-04-22 06:44:11 +00:00
David Xu	6d9517bc9f	_vfork is not in libthr, remove the reference.	2008-04-16 03:19:11 +00:00
David Xu	fa4b421a7a	don't include pthread_np.h, it is not used.	2008-04-14 08:08:40 +00:00
David Xu	caad30a422	put THR_CRITICAL_LEAVE into do .. while statement.	2008-04-03 02:47:35 +00:00
David Xu	a6cba9400a	add __hidden suffix to _umtx_op_err, this eliminates PLT.	2008-04-03 02:13:51 +00:00
David Xu	7abb97dcd8	Non-portable functions are in pthread_np.h, fix compiling problem.	2008-04-02 11:41:12 +00:00
David Xu	7a30bcf04b	Add pthread_setaffinity_np and pthread_getaffinity_np to libc namespace.	2008-04-02 08:53:18 +00:00
David Xu	8b873a2328	Remove unused functions.	2008-04-02 08:33:42 +00:00
David Xu	d6e0eb0a48	Replace function _umtx_op with _umtx_op_err, the latter function directly returns errno, because errno can be mucked by user's signal handler and most of pthread api heavily depends on errno to be correct, this change should improve stability of the thread library.	2008-04-02 07:41:25 +00:00
David Xu	8bf1a48cb3	Replace userland rwlock with a pure kernel based rwlock, the new implementation does not switch pointers when it resumes waiters. Asked by: jeff	2008-04-02 04:32:31 +00:00
David Xu	18967c1918	Restore normal pthread_cond_signal path to avoid some obscure races.	2008-04-01 06:23:08 +00:00
David Xu	f5bc4f9930	return EAGAIN early rather than running bunch of code later, micro optimize static branch prediction.	2008-04-01 00:21:49 +00:00
David Xu	5ab512bb8e	Rewrite rwlock to use atomic operations to change rwlock state, this eliminates internal mutex lock contention when most rwlock operations are read. Original patch provided by: jeff	2008-03-31 02:55:49 +00:00
Ruslan Ermilov	e03efb02bc	Compile libthr with warnings.	2008-03-25 13:28:12 +00:00
Ruslan Ermilov	7e0e78248e	Fixed mis-implementation of pthread_mutex_get{spin,yield}loops_np(). Reviewed by: davidxu	2008-03-25 09:48:10 +00:00
David Xu	9939a13667	Add POSIX pthread API pthread_getcpuclockid() to get a thread's cpu time clock id.	2008-03-22 09:59:20 +00:00
David Xu	04a57d2c83	Resolve __error()'s PLT early so that it needs not to be resolved again, otherwise rwlock is recursively called when signal happens and the __error was never resolved before.	2008-03-21 02:31:55 +00:00
Ruslan Ermilov	a1292a02d3	pthread_mutexattr_destroy() was accidentally broken in last revision, unbreak it. We should really start compiling this with warnings.	2008-03-20 11:47:08 +00:00
David Xu	8c38215f50	Preserve application code's errno in rtld locking code, it attempts to keep any case safe.	2008-03-20 09:35:44 +00:00
David Xu	48ebe2ebc4	Make pthread_mutexattr_settype to return error number directly and conformant to POSIX specification. Bug reported by: modelnine at modelnine dt org	2008-03-20 08:27:14 +00:00
David Xu	c8a4eae56f	don't reduce new thread's refcount if current thread can not set cpuset for it, since the new thread will reduce it by itself.	2008-03-19 09:33:07 +00:00
David Xu	519e8d87bb	- Trim trailing spaces. - Use a different sigmask variable name to avoid confusing.	2008-03-19 08:13:04 +00:00
David Xu	86a06c6000	if passed thread pointer is equal to current thread, pass -1 to kernel to speed up searching.	2008-03-19 06:38:21 +00:00
David Xu	2ea1f90a18	- Copy signal mask out before THR_UNLOCK(), because THR_UNLOCK() may call _thr_suspend_check() which messes sigmask saved in thread structure. - Don't suspend a thread has force_exit set. - In pthread_exit(), if there is a suspension flag set, wake up waiting- thread after setting PS_DEAD, this causes waiting-thread to break loop in suspend_common().	2008-03-18 02:06:51 +00:00
David Xu	a9a11568ff	Actually delete SIGCANCEL mask for suspended thread, so the signal will not be masked when it is resumed.	2008-03-16 03:22:38 +00:00
David Xu	150b71918c	If a thread is cancelled, it may have already consumed a umtx_wake, check waiter and semaphore counter to see if we may wake up next thread.	2008-03-11 03:26:47 +00:00
David Xu	8a18c0d3c8	Fix a bug when calculating remnant size.	2008-03-06 03:24:03 +00:00
David Xu	697b4b49be	Don't report death event to debugger if it is a forced exit.	2008-03-06 02:07:18 +00:00
David Xu	70e79fbb0d	Restore code setting new thread's scheduler parameters, I was thinking that there might be starvations, but because we have already locked the thread, the cpuset settings will always be done before the new thread does real-world work.	2008-03-06 01:59:08 +00:00
David Xu	1cb51125aa	Increase and decrease in_sigcancel_handler accordingly to avoid possible error caused by nested SIGCANCEL stack, it is a bit complex.	2008-03-05 07:04:55 +00:00
David Xu	54dff16b26	Use cpuset defined in pthread_attr for newly created thread, for now, we set scheduling parameters and cpu binding fully in userland, and because default scheduling policy is SCHED_RR (time-sharing), we set default sched_inherit to PTHREAD_SCHED_INHERIT, this saves a system call.	2008-03-05 07:01:20 +00:00
David Xu	21845eb98d	Check actual size of cpuset kernel is using and define underscore version of API.	2008-03-05 06:55:48 +00:00
David Xu	76a9679f8e	If a new thread is created, it inherits current thread's signal masks, however if current thread is executing cancellation handler, signal SIGCANCEL may have already been blocked, this is unexpected, unblock the signal in new thread if this happens. MFC after: 1 week	2008-03-04 04:28:59 +00:00
David Xu	54c9b47c2b	Include cpuset.h, unbreak compiling.	2008-03-04 03:45:11 +00:00
David Xu	a759db946a	implement pthread_attr_getaffinity_np and pthread_attr_setaffinity_np.	2008-03-04 03:03:24 +00:00
David Xu	57030e1071	Implement functions pthread_getaffinity_np and pthread_setaffinity_np to get and set thread's cpu affinity mask.	2008-03-03 09:16:29 +00:00
Dag-Erling Smørgrav	096ba44775	_pthread_mutex_isowned_np(): use a more reliable method; the current code will work in simple cases, but may fail in more complicated ones. Reviewed by: davidxu	2008-02-14 12:37:58 +00:00
Dag-Erling Smørgrav	d29b081fee	Remove unnecessary prototype.	2008-02-06 20:43:19 +00:00
Dag-Erling Smørgrav	1cbdac2668	Per discussion on -threads, rename _islocked_np() to _isowned_np().	2008-02-06 19:34:31 +00:00
Dag-Erling Smørgrav	fcd61d9141	After careful consideration (and a brief discussion with attilio@), change the semantics of pthread_mutex_islocked_np() to return true if and only if the mutex is held by the current thread. Obviously, change the regression test to match. MFC after: 2 weeks	2008-02-04 12:35:23 +00:00
Dag-Erling Smørgrav	5fd410a787	Add pthread_mutex_islocked_np(), a cheap way to verify that a mutex is locked. This is intended primarily to support the userland equivalent of the various *_ASSERT_LOCKED() macros we have in the kernel. MFC after: 2 weeks	2008-02-03 22:38:10 +00:00
David Xu	0c921dadbb	sem_post() requires to return -1 on error.	2008-01-07 02:26:29 +00:00
David Xu	9ba01c866b	call underscore version of pthread_cleanup_pop instead.	2007-12-20 04:40:12 +00:00
David Xu	06c8eb55ce	Remove vfork() overloading, it is no longer needed.	2007-12-20 04:32:28 +00:00
David Xu	c5f411515c	Add function prototypes.	2007-12-17 02:53:11 +00:00
David Xu	093fcf1694	1. Add function pthread_mutex_setspinloops_np to tune a mutex's spin loop count. 2. Add function pthread_mutex_setyieldloops_np to tune a mutex's yield loop count. 3. Make environment variables PTHREAD_SPINLOOPS and PTHREAD_YIELDLOOPS to be only used for tuning PTHREAD_MUTEX_ADAPTIVE_NP mutex.	2007-12-14 06:25:57 +00:00
David Xu	6a663207e7	Enclose all code for macro ENQUEUE_MUTEX in do while statement, and add missing brackets. MFC: after 1 day	2007-12-11 08:00:58 +00:00
Jason Evans	b6b7fd3e2a	Fix pointer dereferencing problems in _pthread_mutex_init_calloc_cb() that were obscured by pseudo-opaque pthreads API pointer casting.	2007-11-28 00:16:24 +00:00
Jason Evans	e1636e1f97	Add _pthread_mutex_init_calloc_cb() to libthr and libkse, so that malloc(3) (part of libc) can use pthreads mutexes without causing infinite recursion during initialization.	2007-11-27 03:16:44 +00:00
David Xu	da4410f25f	Simplify code, fix a thread cancellation bug in sem_wait and sem_timedwait.	2007-11-23 05:42:52 +00:00
David Xu	4877aaebc1	Reuse nwaiter member field to record number of waiters, in sem_post(), this should reduce the chance having to do a syscall when there is no waiter in the semaphore.	2007-11-21 06:01:02 +00:00
David Xu	9e1ddd5fa0	Convert ceiling type to unsigned integer before comparing, fix compiler warnings.	2007-11-21 05:25:27 +00:00
David Xu	922d56f9de	Add some function prototypes.	2007-11-21 05:23:54 +00:00
David Xu	6fdfcacb4a	Remove umtx_t definition, use type long directly, add wrapper function _thr_umtx_wait_uint() for umtx operation UMTX_OP_WAIT_UINT, use the function in semaphore operations, this fixed compiler warnings.	2007-11-21 05:21:58 +00:00
Marius Strobl	9a2706abcc	In _pthread_key_create() ensure that libthr is initialized. This fixes a NULL-dereference of curthread when libstdc++ initializes the exception handling globals on archs we can't use GNU TLS due to lack of support in binutils 2.15 (i.e. arm and sparc64), yet, thus making threaded C++ programs compiled with GCC 4.2.1 work again on these archs. Reviewed by: davidxu MFC after: 3 days	2007-11-06 21:50:43 +00:00
David Xu	56b45d9067	Avoid doing adaptive spinning for priority protected mutex, current implementation always does lock in kernel.	2007-10-31 01:50:48 +00:00
David Xu	55f18e070f	Don't do adaptive spinning if it is running on UP kernel.	2007-10-31 01:44:50 +00:00
David Xu	e8ef3c283b	Restore revision 1.55, the kris's adaptive mutex type.	2007-10-31 01:37:13 +00:00
Kris Kennaway	83941f797f	Adaptive mutexes should have the same deadlock detection properties that default (errorcheck) mutexes do. Noticed by: davidxu	2007-10-30 09:24:23 +00:00
David Xu	7416cdabcd	Add my recent work of adaptive spin mutex code. Use two environment variables to tune pthread mutex performance: 1. LIBPTHREAD_SPINLOOPS If a pthread mutex is being locked by another thread, this environment variable sets total number of spin loops before the current thread sleeps in kernel, this saves a syscall overhead if the mutex will be unlocked very soon (well written application code). 2. LIBPTHREAD_YIELDLOOPS If a pthread mutex is being locked by other threads, this environment variable sets total number of sched_yield() loops before the current thread sleeps in kernel. if a pthread mutex is locked, the current thread gives up cpu, but will not sleep in kernel, this means, current thread does not set contention bit in mutex, but let lock owner to run again if the owner is on kernel's run queue, and when lock owner unlocks the mutex, it does not need to enter kernel and do lots of work to resume mutex waiters, in some cases, this saves lots of syscall overheads for mutex owner. In my practice, sometimes LIBPTHREAD_YIELDLOOPS can massively improve performance than LIBPTHREAD_SPINLOOPS, this depends on application. These two environment variables are global to all pthread mutex, there is no interface to set them for each pthread mutex, the default values are zero, this means spinning is turned off by default.	2007-10-30 05:57:37 +00:00
Kris Kennaway	2017a7cdfe	Add a new "non-portable" mutex type, PTHREAD_MUTEX_ADAPTIVE_NP. This is also implemented in glibc and is used by a number of existing applications (mysql, firefox, etc). This mutex type is a default mutex with the additional property that it spins briefly when attempting to acquire a contested lock, doing trylock operations in userland before entering the kernel to block if eventually unsuccessful. The expectation is that applications requesting this mutex type know that the mutex is likely to be only held for very brief periods, so it is faster to spin in userland and probably succeed in acquiring the mutex, than to enter the kernel and sleep, only to be woken up almost immediately. This can help significantly in certain cases when pthread mutexes are heavily contended and held for brief durations (such as mysql). Spin up to 200 times before entering the kernel, which represents only a few us on modern CPUs. No performance degradation was observed with this value and it is sufficient to avoid a large performance drop in mysql performance in the heavily contended pthread mutex case. The libkse implementation is a NOP. Reviewed by: jeff MFC after: 3 days	2007-10-29 21:01:47 +00:00
David Xu	5150e987d2	Use macro THR_CLEANUP_PUSH/POP, they are cheaper than pthread_cleanup_push/pop.	2007-10-16 07:46:15 +00:00
David Xu	286b41104d	Reverse the logic of UP and SMP. Submitted by: jasone	2007-10-16 07:36:02 +00:00
David Xu	4aa80591b6	Output error message to STDERR_FILENO. Approved by: re (bmah)	2007-08-07 04:50:14 +00:00
David Xu	00784f8b10	backout experimental adaptive spinning mutex for product use.	2007-05-09 08:39:33 +00:00
David Xu	6839e793cf	If a thread whose name is being set is not the current thread, use macros THR_THREAD_LOCK and THR_THREAD_UNLOCK instead, this should fix wrong lock level problem. Bug reported by: ed dot maste at gmail dot com	2007-04-05 07:20:31 +00:00
Warner Losh	fed32d7544	Remove 3rd clause, renumber, ok per email	2007-01-12 07:26:21 +00:00
David Xu	03779e5c2b	Insert mutex at tail if it has highest ceiling.	2007-01-05 03:57:11 +00:00
David Xu	da20a63dbb	Oops, don't corrupt the list.	2007-01-05 03:33:47 +00:00
David Xu	5470bb56fc	Check if the PP mutex is recursive, if we have already locked it, place the mutex in right order sorted by priority ceiling.	2007-01-05 03:29:15 +00:00
David Xu	74c751131b	get LIBPTHREAD_ADAPTIVE_SPIN early, so it can be used for some global mutexes.	2006-12-20 05:05:44 +00:00
David Xu	842a092b74	Check environment variable PTHREAD_ADAPTIVE_SPIN, if it is set, use it as a default spin cycle count.	2006-12-20 04:43:34 +00:00
David Xu	d99f6dac14	- Remove variable _thr_scope_system, all threads are system scope. - Rename _thr_smp_cpus to boolean variable _thr_is_smp. - Define CPU_SPINWAIT macro for each arch, only X86 supports it.	2006-12-15 11:52:01 +00:00
David Xu	8a8178c010	Create inline function _thr_umutex_trylock2 to only try one atomic operation, if it is failed, we call syscall directly, this saves one atomic operation per lock contention.	2006-12-14 13:22:02 +00:00
David Xu	9e8a8aa551	Correctly check failed syscall.	2006-12-12 05:26:39 +00:00
David Xu	347126a2e2	Move checking for c_has_waiters into low level _thr_ucond_signal and _thr_ucond_broadcast, clear condition variable pointer in cancellation info after returning from _thr_ucond_wait, since kernel has already dropped the internal lock, so we don't need to unlock it in cancellation handler again.	2006-12-12 03:08:49 +00:00
David Xu	b774466b61	test cancel_pending to save a thr_wake call in some special cases.	2006-12-06 00:15:35 +00:00
David Xu	a8a343d2e6	_thr_ucond_wait drops lock, we should pick it up again.	2006-12-05 23:46:11 +00:00
David Xu	3b8a017442	the c_has_waiters is lazily updated, temporarily disable the false alarm code.	2006-12-05 07:23:58 +00:00
David Xu	4d617f2d10	Use ucond to implement barrier.	2006-12-05 06:54:25 +00:00
David Xu	670b44d65a	Add _thr_ucond_init().	2006-12-05 06:53:44 +00:00
David Xu	3ce4e91d4e	Tweak _thr_cancel_leave_defer a bit to fix a possible race.	2006-12-05 05:01:57 +00:00
David Xu	3c61d00ab6	Fix typo, I was using a wrong header file, and the typo is not detected by compiler.	2006-12-04 14:27:42 +00:00
David Xu	2bd2c90703	Use kernel provided userspace condition variable to implement pthread condition variable.	2006-12-04 14:20:41 +00:00
David Xu	6f54e82927	If a thread was detached, return EINVAL instead, the error code is also returned by pthread_detach() if a thread was already detached, the error code was already documented: > [EINVAL] The implementation has detected that the value speci- > fied by thread does not refer to a joinable thread.	2006-11-28 11:05:31 +00:00
David Xu	f08e1bf682	Eliminate atomic operations in thread cancellation functions, it should reduce overheads of cancellation points.	2006-11-24 09:57:38 +00:00
David Xu	58c7bab332	Move code calculating new inherited priority into single function.	2006-11-11 13:33:47 +00:00
David Xu	5656b5fafa	Don't inherit THR_FLAGS_NEED_SUSPEND for child process, child process only has one thread, setting the flag can cause the thread to be suspended and no other thread will resume it.	2006-10-14 13:40:08 +00:00
David Xu	8042f26d52	o Make _thr_umutex_init a function. o Eliminate unused parameter for some functions. o Convert type of first parameter to void * for _thr_umtx_wait and _thr_umtx_wake.	2006-10-13 22:31:00 +00:00
David Xu	0b90fa4ad0	Use type pthread_state for thread state.	2006-10-13 12:45:21 +00:00
David Xu	e6747c7ce1	use rtprio_thread system call to get or set thread priority.	2006-09-21 04:21:30 +00:00
David Xu	ddaf6689e3	Use return value of _thr_umutex_lock instead of using zero.	2006-09-08 09:29:14 +00:00
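
Several of the commits listed above (2017a7cdfe, 7416cdabcd, dd77f9f7f2) describe adaptive mutexes: try the lock a bounded number of times in userland, then fall back to blocking in the kernel. The following is a minimal sketch of that idea using only portable POSIX calls. It is not the libthr implementation; `spin_then_lock` and its `max_spins` parameter are hypothetical names introduced here for illustration. The spin count is supplied by the caller (the log mentions 200 and later 2000 as defaults).

```c
#include <assert.h>
#include <errno.h>
#include <pthread.h>

/*
 * Hypothetical helper, not part of libthr: attempt the lock up to
 * max_spins times with pthread_mutex_trylock(), then block in
 * pthread_mutex_lock() if the mutex is still busy.
 * Returns 0 on success or an errno-style error number.
 */
static int
spin_then_lock(pthread_mutex_t *m, int max_spins)
{
	int i, ret;

	assert(m != NULL && max_spins >= 0);
	for (i = 0; i < max_spins; i++) {
		ret = pthread_mutex_trylock(m);
		if (ret != EBUSY)
			return (ret);	/* acquired (0) or a real error */
	}
	/* Still contended after spinning: sleep until the owner unlocks. */
	return (pthread_mutex_lock(m));
}
```

A caller would use `spin_then_lock(&m, 200)` in place of `pthread_mutex_lock(&m)` and release with `pthread_mutex_unlock(&m)` as usual. Spinning of this kind may only pay off when the mutex is held briefly and the owner is running on another CPU, which is consistent with the commit that disables it on UP kernels.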

558 Commits