freebsd-nq

Author	SHA1	Message	Date
Konstantin Belousov	da2fcff746	Implement the __pthread_map_stacks_exec() for libthr. Stack creation code is changed to call _rtld_get_stack_prot() to get the stack protection right. There is a race where thread is created during dlopen() of dso that requires executable stacks. Then, _rtld_get_stack_prot() may return PROT_READ \| PROT_WRITE, but thread is still not linked into the thread list. In this case, the callback misses the thread stack, and rechecks the required protection afterward. Reviewed by: davidxu	2011-01-09 12:38:40 +00:00
David Xu	d1078b0b03	MFp4: - Add flags CVWAIT_ABSTIME and CVWAIT_CLOCKID for umtx kernel based condition variable, this should eliminate an extra system call to get current time. - Add sub-function UMTX_OP_NWAKE_PRIVATE to wake up N channels in single system call. Create userland sleep queue for condition variable, in most cases, thread will wait in the queue, the pthread_cond_signal will defer thread wakeup until the mutex is unlocked, it tries to avoid an extra system call and a extra context switch in time window of pthread_cond_signal and pthread_mutex_unlock. The changes are part of process-shared mutex project.	2010-12-22 05:01:52 +00:00
David Xu	7f25f6c72d	Get cpuset in pthread_attr_get_np() and free it in pthread_attr_destroy(). MFC after: 7 days	2010-10-25 09:16:04 +00:00
David Xu	de1e74c6a5	Revert revision 214007, I realized that MySQL wants to resolve a silly rwlock deadlock problem, the deadlock is caused by writer waiters, if a thread has already locked a reader lock, and wants to acquire another reader lock, it will be blocked by writer waiters, but we had already fixed it years ago.	2010-10-20 02:34:02 +00:00
David Xu	a6b9b59e04	Add pthread_rwlockattr_setkind_np and pthread_rwlockattr_getkind_np, the functions set or get pthread_rwlock type, current supported types are: PTHREAD_RWLOCK_PREFER_READER_NP, PTHREAD_RWLOCK_PREFER_WRITER_NONRECURSIVE_NP, PTHREAD_RWLOCK_PREFER_WRITER_NP, default is PTHREAD_RWLOCK_PREFER_WRITER_NONCECURSIVE_NP, this maintains binary compatible with old code.	2010-10-18 05:09:22 +00:00
David Xu	722488013d	change code to use unwind.h.	2010-09-30 12:59:56 +00:00
David Xu	bbb64c2143	In current code, statically initialized and destroyed object have same null value, the code can not distinguish between them, to fix the problem, now a destroyed object is assigned to a non-null value, and it will be rejected by some pthread functions. PTHREAD_ADAPTIVE_MUTEX_INITIALIZER_NP is changed to number 1, so that adaptive mutex can be statically initialized correctly.	2010-09-28 04:57:56 +00:00
David Xu	9f1dc4c107	Add missing field.	2010-09-25 08:36:46 +00:00
David Xu	f4213b9006	To support stack unwinding for cancellation points, add -fexceptions flag for them, two functions _pthread_cancel_enter and _pthread_cancel_leave are added to let thread enter and leave a cancellation point, it also makes it possible that other functions can be cancellation points in libraries without having to be rewritten in libthr.	2010-09-25 01:57:47 +00:00
David Xu	93ea4a71bf	In most cases, cancel_point and cancel_async needn't be checked again, because cancellation is almostly checked at cancellation points.	2010-09-24 07:52:07 +00:00
David Xu	4173ebef4f	Because atfork lock is held while forking, a thread cancellation triggered by atfork handler is unsafe, use intenal flag no_cancel to disable it.	2010-09-19 09:03:11 +00:00
David Xu	3832fd24f1	add code to support stack unwinding when thread exits. note that only defer-mode cancellation works, asynchrnous mode does not work because it lacks of libuwind's support. stack unwinding is not enabled unless LIBTHR_UNWIND_STACK is defined in Makefile.	2010-09-15 02:56:32 +00:00
David Xu	707ee8154d	Move back IN_GCLIST flag into field tlflags, since thread list and gc list still share same lock.	2010-09-15 01:21:30 +00:00
David Xu	cbadc1d7ad	Fix copy&paste problem.	2010-09-13 11:57:46 +00:00
David Xu	a9b764e218	Convert thread list lock from mutex to rwlock.	2010-09-13 07:03:01 +00:00
David Xu	ada33a6e36	Change atfork lock from mutex to rwlock, also make mutexes used by malloc() module private type, when private type mutex is locked/unlocked, thread critical region is entered or leaved. These changes makes fork() async-signal safe which required by POSIX. Note that user's atfork handler still needs to be async-signal safe, but it is not problem of libthr, it is user's responsiblity.	2010-09-01 03:11:21 +00:00
David Xu	02c3c85869	Add signal handler wrapper, the reason to add it becauses there are some cases we want to improve: 1) if a thread signal got a signal while in cancellation point, it is possible the TDP_WAKEUP may be eaten by signal handler if the handler called some interruptibly system calls. 2) In signal handler, we want to disable cancellation. 3) When thread holding some low level locks, it is better to disable signal, those code need not to worry reentrancy, sigprocmask system call is avoided because it is a bit expensive. The signal handler wrapper works in this way: 1) libthr installs its signal handler if user code invokes sigaction to install its handler, the user handler is recorded in internal array. 2) when a signal is delivered, libthr's signal handler is invoke, libthr checks if thread holds some low level lock or is in critical region, if it is true, the signal is buffered, and all signals are masked, once the thread leaves critical region, correct signal mask is restored and buffered signal is processed. 3) before user signal handler is invoked, cancellation is temporarily disabled, after user signal handler is returned, cancellation state is restored, and pending cancellation is rescheduled.	2010-09-01 02:18:33 +00:00
David Xu	ed0ee6af2e	Unregister thread specific data destructor when a corresponding dso is unloaded.	2010-08-27 05:20:22 +00:00
David Xu	1ac3d5022c	eliminate unused code.	2010-08-26 09:04:27 +00:00
Konstantin Belousov	47536ff629	The __hidden definition is provided by sys/cdefs.h. MFC after: 2 weeks	2010-08-24 11:54:48 +00:00
David Xu	5cf2219535	Add wrapper for setcontext() and swapcontext(), the wrappers unblock SIGCANCEL which is needed by thread cancellation.	2010-08-24 09:57:06 +00:00
Konstantin Belousov	ea246b6369	On shared object unload, in __cxa_finalize, call and clear all installed atexit and __cxa_atexit handlers that are either installed by unloaded dso, or points to the functions provided by the dso. Use _rtld_addr_phdr to locate segment information from the address of private variable belonging to the dso, supplied by crtstuff.c. Provide utility function __elf_phdr_match_addr to do the match of address against dso executable segment. Call back into libthr from __cxa_finalize using weak __pthread_cxa_finalize symbol to remove any atfork handler which function points into unloaded object. The rtld needs private __pthread_cxa_finalize symbol to not require resolution of the weak undefined symbol at initialization time. This cannot work, since rtld is relocated before sym_zero is set up. Idea by: kan Reviewed by: kan (previous version) MFC after: 3 weeks	2010-08-23 15:38:02 +00:00
David Xu	635f917a9d	In current implementation, thread cancellation is done in signal handler, which does not know what is the state of interrupted system call, for example, open() system call opened a file and the thread is still cancelled, result is descriptor leak, there are other problems which can cause resource leak or undeterminable side effect when a thread is cancelled. However, this is no longer true in new implementation. In defering mode, a thread is canceled if cancellation request is pending and later the thread enters a cancellation point, otherwise, a later pthread_cancel() just causes SIGCANCEL to be sent to the target thread, and causes target thread to abort system call, userland code in libthr then checks cancellation state, and cancels the thread if needed. For example, the cancellation point open(), the thread may be canceled at start, but later, if it opened a file descriptor, it is not canceled, this avoids file handle leak. Another example is read(), a thread may be canceled at start of the function, but later, if it read some bytes from a socket, the thread is not canceled, the caller then can decide if it should still enable cancelling or disable it and continue reading data until it thinks it has read all bytes of a packet, and keeps a protocol stream in health state, if user ignores partly reading of a packet without disabling cancellation, then second iteration of read loop cause the thread to be cancelled. An exception is that the close() cancellation point always closes a file handle despite whether the thread is cancelled or not. The old mechanism is still kept, for a functions which is not so easily to fix a cancellation problem, the rough mechanism is used. Reviewed by: kib@	2010-08-20 05:15:39 +00:00
David Xu	9b0f1823b5	Use umtx to implement process sharable semaphore, to make this work, now type sema_t is a structure which can be put in a shared memory area, and multiple processes can operate it concurrently. User can either use mmap(MAP_SHARED) + sem_init(pshared=1) or use sem_open() to initialize a shared semaphore. Named semaphore uses file system and is located in /tmp directory, and its file name is prefixed with 'SEMD', so now it is chroot or jail friendly. In simplist cases, both for named and un-named semaphore, userland code does not have to enter kernel to reduce/increase semaphore's count. The semaphore is designed to be crash-safe, it means even if an application is crashed in the middle of operating semaphore, the semaphore state is still safely recovered by later use, there is no waiter counter maintained by userland code. The main semaphore code is in libc and libthr only has some necessary stubs, this makes it possible that a non-threaded application can use semaphore without linking to thread library. Old semaphore implementation is kept libc to maintain binary compatibility. The kernel ksem API is no longer used in the new implemenation. Discussed on: threads@	2010-01-05 02:37:59 +00:00
Jilles Tjoelker	29670497af	Make openat(2) a cancellation point. This is required by POSIX and matches open(2). Reviewed by: kib, jhb MFC after: 1 month	2009-10-11 20:19:45 +00:00
David Xu	83a0758789	Make pthread_cleanup_push() and pthread_cleanup_pop() as a pair of macros, use stack space to keep cleanup information, this eliminates overhead of calling malloc() and free() in thread library. Discussed on: thread@	2008-06-09 01:14:10 +00:00
David Xu	850f4d66cb	- Reduce function call overhead for uncontended case. - Remove unused flags MUTEX_FLAGS_* and their code. - Check validity of the timeout parameter in mutex_self_lock().	2008-05-29 07:57:33 +00:00
David Xu	8d6a11a070	Use UMTX_OP_WAIT_UINT_PRIVATE and UMTX_OP_WAKE_PRIVATE to save time in kernel(avoid VM lookup).	2008-04-29 03:58:18 +00:00
David Xu	caad30a422	put THR_CRITICAL_LEAVE into do .. while statement.	2008-04-03 02:47:35 +00:00
David Xu	a6cba9400a	add __hidden suffix to _umtx_op_err, this eliminates PLT.	2008-04-03 02:13:51 +00:00
David Xu	7a30bcf04b	Add pthread_setaffinity_np and pthread_getaffinity_np to libc namespace.	2008-04-02 08:53:18 +00:00
David Xu	8b873a2328	Remove unused functions.	2008-04-02 08:33:42 +00:00
David Xu	d6e0eb0a48	Replace function _umtx_op with _umtx_op_err, the later function directly returns errno, because errno can be mucked by user's signal handler and most of pthread api heavily depends on errno to be correct, this change should improve stability of the thread library.	2008-04-02 07:41:25 +00:00
David Xu	8bf1a48cb3	Replace userland rwlock with a pure kernel based rwlock, the new implementation does not switch pointers when it resumes waiters. Asked by: jeff	2008-04-02 04:32:31 +00:00
David Xu	5ab512bb8e	Rewrite rwlock to user atomic operations to change rwlock state, this eliminates internal mutex lock contention when most rwlock operations are read. Orignal patch provided by: jeff	2008-03-31 02:55:49 +00:00
David Xu	54dff16b26	Use cpuset defined in pthread_attr for newly created thread, for now, we set scheduling parameters and cpu binding fully in userland, and because default scheduling policy is SCHED_RR (time-sharing), we set default sched_inherit to PTHREAD_SCHED_INHERIT, this saves a system call.	2008-03-05 07:01:20 +00:00
David Xu	76a9679f8e	If a new thread is created, it inherits current thread's signal masks, however if current thread is executing cancellation handler, signal SIGCANCEL may have already been blocked, this is unexpected, unblock the signal in new thread if this happens. MFC after: 1 week	2008-03-04 04:28:59 +00:00
David Xu	54c9b47c2b	Include cpuset.h, unbreak compiling.	2008-03-04 03:45:11 +00:00
David Xu	a759db946a	implement pthread_attr_getaffinity_np and pthread_attr_setaffinity_np.	2008-03-04 03:03:24 +00:00
David Xu	093fcf1694	1. Add function pthread_mutex_setspinloops_np to turn a mutex's spin loop count. 2. Add function pthread_mutex_setyieldloops_np to turn a mutex's yield loop count. 3. Make environment variables PTHREAD_SPINLOOPS and PTHREAD_YIELDLOOPS to be only used for turnning PTHREAD_MUTEX_ADAPTIVE_NP mutex.	2007-12-14 06:25:57 +00:00
David Xu	6fdfcacb4a	Remove umtx_t definition, use type long directly, add wrapper function _thr_umtx_wait_uint() for umtx operation UMTX_OP_WAIT_UINT, use the function in semaphore operations, this fixed compiler warnings.	2007-11-21 05:21:58 +00:00
David Xu	7416cdabcd	Add my recent work of adaptive spin mutex code. Use two environments variable to tune pthread mutex performance: 1. LIBPTHREAD_SPINLOOPS If a pthread mutex is being locked by another thread, this environment variable sets total number of spin loops before the current thread sleeps in kernel, this saves a syscall overhead if the mutex will be unlocked very soon (well written application code). 2. LIBPTHREAD_YIELDLOOPS If a pthread mutex is being locked by other threads, this environment variable sets total number of sched_yield() loops before the currrent thread sleeps in kernel. if a pthread mutex is locked, the current thread gives up cpu, but will not sleep in kernel, this means, current thread does not set contention bit in mutex, but let lock owner to run again if the owner is on kernel's run queue, and when lock owner unlocks the mutex, it does not need to enter kernel and do lots of work to resume mutex waiters, in some cases, this saves lots of syscall overheads for mutex owner. In my practice, sometimes LIBPTHREAD_YIELDLOOPS can massively improve performance than LIBPTHREAD_SPINLOOPS, this depends on application. These two environments are global to all pthread mutex, there is no interface to set them for each pthread mutex, the default values are zero, this means spinning is turned off by default.	2007-10-30 05:57:37 +00:00
David Xu	4aa80591b6	Output error message to STDERR_FILENO. Approved by: re (bmah)	2007-08-07 04:50:14 +00:00
David Xu	842a092b74	Check environment variable PTHREAD_ADAPTIVE_SPIN, if it is set, use it as a default spin cycle count.	2006-12-20 04:43:34 +00:00
David Xu	d99f6dac14	- Remove variable _thr_scope_system, all threads are system scope. - Rename _thr_smp_cpus to boolean variable _thr_is_smp. - Define CPU_SPINWAIT macro for each arch, only X86 supports it.	2006-12-15 11:52:01 +00:00
David Xu	4d617f2d10	Use ucond to implement barrier.	2006-12-05 06:54:25 +00:00
David Xu	2bd2c90703	Use kernel provided userspace condition variable to implement pthread condition variable.	2006-12-04 14:20:41 +00:00
David Xu	f08e1bf682	Eliminate atomic operations in thread cancellation functions, it should reduce overheads of cancellation points.	2006-11-24 09:57:38 +00:00
David Xu	0b90fa4ad0	Use type pthread_state for thread state.	2006-10-13 12:45:21 +00:00
David Xu	e6747c7ce1	use rtprio_thread system call to get or set thread priority.	2006-09-21 04:21:30 +00:00

1 2 3

119 Commits