02292f131a
o The polling mechanism for I/O readiness was changed from select() to poll(). In additon, a wrapped version of poll() is now provided. o The wrapped select routine now converts each fd_set to a poll array so that the thread scheduler doesn't have to perform a bitwise search for selected fds each time file descriptors are polled for I/O readiness. o The thread scheduler was modified to use a new queue (_workq) for threads that need work. Threads waiting for I/O readiness and spinblocks are added to the work queue in addition to the waiting queue. This reduces the time spent forming/searching the array of file descriptors being polled. o The waiting queue (_waitingq) is now maintained in order of thread wakeup time. This allows the thread scheduler to find the nearest wakeup time by looking at the first thread in the queue instead of searching the entire queue. o Removed file descriptor locking for select/poll routines. An application should not rely on the threads library for providing this locking; if necessary, the application should use mutexes to protect selecting/polling of file descriptors. o Retrieve and use the kernel clock rate/resolution at startup instead of hardcoding the clock resolution to 10 msec (tested with kernel running at 1000 HZ). o All queues have been changed to use queue.h macros. These include the queues of all threads, dead threads, and threads waiting for file descriptor locks. o Added reinitialization of the GC mutex and condition variable after a fork. Also prevented reallocation of the ready queue after a fork. o Prevented the wrapped close routine from closing the thread kernel pipes. o Initialized file descriptor table for stdio entries at thread init. o Provided additional flags to indicate to what queues threads belong. o Moved TAILQ initialization for statically allocated mutex and condition variables to after the spinlock. o Added dispatching of signals to pthread_kill. Removing the dispatching of signals from thread activation broke sigsuspend when pthread_kill was used to send a signal to a thread. o Temporarily set the state of a thread to PS_SUSPENDED when it is first created and placed in the list of threads so that it will not be accidentally scheduled before becoming a member of one of the scheduling queues. o Change the signal handler to queue signals to the thread kernel pipe if the scheduling queues are protected. When scheduling queues are unprotected, signals are then dequeued and handled. o Ensured that all installed signal handlers block the scheduling signal and that the scheduling signal handler blocks all other signals. This ensures that the signal handler is only interruptible for and by non-scheduling signals. An atomic lock is used to decide which instance of the signal handler will handle pending signals. o Removed _lock_thread_list and _unlock_thread_list as they are no longer used to protect the thread list. o Added missing RCS IDs to modified files. o Added checks for appropriate queue membership and activity when adding, removing, and searching the scheduling queues. These checks add very little overhead and are enabled when compiled with _PTHREADS_INVARIANTS defined. Suggested and implemented by Tor Egge with some modification by me. o Close a race condition in uthread_close. (Tor Egge) o Protect the scheduling queues while modifying them in pthread_cond_signal and _thread_fd_unlock. (Tor Egge) o Ensure that when a thread gets a mutex, the mutex is on that threads list of owned mutexes. (Tor Egge) o Set the kernel-in-scheduler flag in _thread_kern_sched_state and _thread_kern_sched_state_unlock to prevent a scheduling signal from calling the scheduler again. (Tor Egge) o Don't use TAILQ_FOREACH macro while searching the waiting queue for threads in a sigwait state, because a change of state destroys the TAILQ link. It is actually safe to do so, though, because once a sigwaiting thread is found, the loop ends and the function returns. (Tor Egge) o When dispatching signals to threads, make the thread inherit the signal deferral flag of the currently running thread. (Tor Egge) Submitted by: Daniel Eischen <eischen@vigrid.com> and Tor Egge <Tor.Egge@fast.no>
346 lines
11 KiB
C
346 lines
11 KiB
C
/*
|
|
* Copyright (c) 1995-1998 John Birrell <jb@cimlogic.com.au>
|
|
* All rights reserved.
|
|
*
|
|
* Redistribution and use in source and binary forms, with or without
|
|
* modification, are permitted provided that the following conditions
|
|
* are met:
|
|
* 1. Redistributions of source code must retain the above copyright
|
|
* notice, this list of conditions and the following disclaimer.
|
|
* 2. Redistributions in binary form must reproduce the above copyright
|
|
* notice, this list of conditions and the following disclaimer in the
|
|
* documentation and/or other materials provided with the distribution.
|
|
* 3. All advertising materials mentioning features or use of this software
|
|
* must display the following acknowledgement:
|
|
* This product includes software developed by John Birrell.
|
|
* 4. Neither the name of the author nor the names of any co-contributors
|
|
* may be used to endorse or promote products derived from this software
|
|
* without specific prior written permission.
|
|
*
|
|
* THIS SOFTWARE IS PROVIDED BY JOHN BIRRELL AND CONTRIBUTORS ``AS IS'' AND
|
|
* ANY EXPRESS OR IMPLIED WARRANTIES, INCLUDING, BUT NOT LIMITED TO, THE
|
|
* IMPLIED WARRANTIES OF MERCHANTABILITY AND FITNESS FOR A PARTICULAR PURPOSE
|
|
* ARE DISCLAIMED. IN NO EVENT SHALL THE AUTHOR OR CONTRIBUTORS BE LIABLE
|
|
* FOR ANY DIRECT, INDIRECT, INCIDENTAL, SPECIAL, EXEMPLARY, OR CONSEQUENTIAL
|
|
* DAMAGES (INCLUDING, BUT NOT LIMITED TO, PROCUREMENT OF SUBSTITUTE GOODS
|
|
* OR SERVICES; LOSS OF USE, DATA, OR PROFITS; OR BUSINESS INTERRUPTION)
|
|
* HOWEVER CAUSED AND ON ANY THEORY OF LIABILITY, WHETHER IN CONTRACT, STRICT
|
|
* LIABILITY, OR TORT (INCLUDING NEGLIGENCE OR OTHERWISE) ARISING IN ANY WAY
|
|
* OUT OF THE USE OF THIS SOFTWARE, EVEN IF ADVISED OF THE POSSIBILITY OF
|
|
* SUCH DAMAGE.
|
|
*
|
|
* $Id$
|
|
*/
|
|
|
|
/* Allocate space for global thread variables here: */
|
|
#define GLOBAL_PTHREAD_PRIVATE
|
|
|
|
#include <errno.h>
|
|
#include <stdlib.h>
|
|
#include <string.h>
|
|
#include <fcntl.h>
|
|
#include <paths.h>
|
|
#include <poll.h>
|
|
#include <unistd.h>
|
|
#include <sys/sysctl.h>
|
|
#include <sys/time.h>
|
|
#include <sys/ttycom.h>
|
|
#ifdef _THREAD_SAFE
|
|
#include <machine/reg.h>
|
|
#include <pthread.h>
|
|
#include "pthread_private.h"
|
|
|
|
#ifdef GCC_2_8_MADE_THREAD_AWARE
|
|
typedef void *** (*dynamic_handler_allocator)();
|
|
extern void __set_dynamic_handler_allocator(dynamic_handler_allocator);
|
|
|
|
static pthread_key_t except_head_key;
|
|
|
|
typedef struct {
|
|
void **__dynamic_handler_chain;
|
|
void *top_elt[2];
|
|
} except_struct;
|
|
|
|
static void ***dynamic_allocator_handler_fn()
|
|
{
|
|
except_struct *dh = (except_struct *)pthread_getspecific(except_head_key);
|
|
|
|
if(dh == NULL) {
|
|
dh = (except_struct *)malloc( sizeof(except_struct) );
|
|
memset(dh, '\0', sizeof(except_struct));
|
|
dh->__dynamic_handler_chain= dh->top_elt;
|
|
pthread_setspecific(except_head_key, (void *)dh);
|
|
}
|
|
return &dh->__dynamic_handler_chain;
|
|
}
|
|
#endif /* GCC_2_8_MADE_THREAD_AWARE */
|
|
|
|
/*
|
|
* Threaded process initialization
|
|
*/
|
|
void
|
|
_thread_init(void)
|
|
{
|
|
int fd;
|
|
int flags;
|
|
int i;
|
|
int len;
|
|
int mib[2];
|
|
struct clockinfo clockinfo;
|
|
struct sigaction act;
|
|
|
|
/* Check if this function has already been called: */
|
|
if (_thread_initial)
|
|
/* Only initialise the threaded application once. */
|
|
return;
|
|
|
|
/*
|
|
* Check for the special case of this process running as
|
|
* or in place of init as pid = 1:
|
|
*/
|
|
if (getpid() == 1) {
|
|
/*
|
|
* Setup a new session for this process which is
|
|
* assumed to be running as root.
|
|
*/
|
|
if (setsid() == -1)
|
|
PANIC("Can't set session ID");
|
|
if (revoke(_PATH_CONSOLE) != 0)
|
|
PANIC("Can't revoke console");
|
|
if ((fd = _thread_sys_open(_PATH_CONSOLE, O_RDWR)) < 0)
|
|
PANIC("Can't open console");
|
|
if (setlogin("root") == -1)
|
|
PANIC("Can't set login to root");
|
|
if (_thread_sys_ioctl(fd,TIOCSCTTY, (char *) NULL) == -1)
|
|
PANIC("Can't set controlling terminal");
|
|
if (_thread_sys_dup2(fd,0) == -1 ||
|
|
_thread_sys_dup2(fd,1) == -1 ||
|
|
_thread_sys_dup2(fd,2) == -1)
|
|
PANIC("Can't dup2");
|
|
}
|
|
|
|
/* Get the standard I/O flags before messing with them : */
|
|
for (i = 0; i < 3; i++)
|
|
if ((_pthread_stdio_flags[i] =
|
|
_thread_sys_fcntl(i,F_GETFL, NULL)) == -1)
|
|
PANIC("Cannot get stdio flags");
|
|
|
|
/*
|
|
* Create a pipe that is written to by the signal handler to prevent
|
|
* signals being missed in calls to _select:
|
|
*/
|
|
if (_thread_sys_pipe(_thread_kern_pipe) != 0) {
|
|
/* Cannot create pipe, so abort: */
|
|
PANIC("Cannot create kernel pipe");
|
|
}
|
|
/* Get the flags for the read pipe: */
|
|
else if ((flags = _thread_sys_fcntl(_thread_kern_pipe[0], F_GETFL, NULL)) == -1) {
|
|
/* Abort this application: */
|
|
PANIC("Cannot get kernel read pipe flags");
|
|
}
|
|
/* Make the read pipe non-blocking: */
|
|
else if (_thread_sys_fcntl(_thread_kern_pipe[0], F_SETFL, flags | O_NONBLOCK) == -1) {
|
|
/* Abort this application: */
|
|
PANIC("Cannot make kernel read pipe non-blocking");
|
|
}
|
|
/* Get the flags for the write pipe: */
|
|
else if ((flags = _thread_sys_fcntl(_thread_kern_pipe[1], F_GETFL, NULL)) == -1) {
|
|
/* Abort this application: */
|
|
PANIC("Cannot get kernel write pipe flags");
|
|
}
|
|
/* Make the write pipe non-blocking: */
|
|
else if (_thread_sys_fcntl(_thread_kern_pipe[1], F_SETFL, flags | O_NONBLOCK) == -1) {
|
|
/* Abort this application: */
|
|
PANIC("Cannot get kernel write pipe flags");
|
|
}
|
|
/* Allocate and initialize the ready queue: */
|
|
else if (_pq_alloc(&_readyq, PTHREAD_MIN_PRIORITY, PTHREAD_MAX_PRIORITY) != 0) {
|
|
/* Abort this application: */
|
|
PANIC("Cannot allocate priority ready queue.");
|
|
}
|
|
/* Allocate memory for the thread structure of the initial thread: */
|
|
else if ((_thread_initial = (pthread_t) malloc(sizeof(struct pthread))) == NULL) {
|
|
/*
|
|
* Insufficient memory to initialise this application, so
|
|
* abort:
|
|
*/
|
|
PANIC("Cannot allocate memory for initial thread");
|
|
} else {
|
|
/* Zero the global kernel thread structure: */
|
|
memset(&_thread_kern_thread, 0, sizeof(struct pthread));
|
|
_thread_kern_thread.flags = PTHREAD_FLAGS_PRIVATE;
|
|
memset(_thread_initial, 0, sizeof(struct pthread));
|
|
|
|
/* Initialize the waiting and work queues: */
|
|
TAILQ_INIT(&_waitingq);
|
|
TAILQ_INIT(&_workq);
|
|
|
|
/* Initialize the scheduling switch hook routine: */
|
|
_sched_switch_hook = NULL;
|
|
|
|
/*
|
|
* Write a magic value to the thread structure
|
|
* to help identify valid ones:
|
|
*/
|
|
_thread_initial->magic = PTHREAD_MAGIC;
|
|
|
|
/* Default the priority of the initial thread: */
|
|
_thread_initial->base_priority = PTHREAD_DEFAULT_PRIORITY;
|
|
_thread_initial->active_priority = PTHREAD_DEFAULT_PRIORITY;
|
|
_thread_initial->inherited_priority = 0;
|
|
|
|
/* Initialise the state of the initial thread: */
|
|
_thread_initial->state = PS_RUNNING;
|
|
|
|
/* Initialise the queue: */
|
|
TAILQ_INIT(&(_thread_initial->join_queue));
|
|
|
|
/* Initialize the owned mutex queue and count: */
|
|
TAILQ_INIT(&(_thread_initial->mutexq));
|
|
_thread_initial->priority_mutex_count = 0;
|
|
|
|
/* Initialise the rest of the fields: */
|
|
_thread_initial->poll_data.nfds = 0;
|
|
_thread_initial->poll_data.fds = NULL;
|
|
_thread_initial->sig_defer_count = 0;
|
|
_thread_initial->yield_on_sig_undefer = 0;
|
|
_thread_initial->specific_data = NULL;
|
|
_thread_initial->cleanup = NULL;
|
|
_thread_initial->flags = 0;
|
|
_thread_initial->error = 0;
|
|
TAILQ_INIT(&_thread_list);
|
|
TAILQ_INSERT_HEAD(&_thread_list, _thread_initial, tle);
|
|
_thread_run = _thread_initial;
|
|
|
|
/* Initialise the global signal action structure: */
|
|
sigfillset(&act.sa_mask);
|
|
act.sa_handler = (void (*) ()) _thread_sig_handler;
|
|
act.sa_flags = 0;
|
|
|
|
/* Initialize signal handling: */
|
|
_thread_sig_init();
|
|
|
|
/* Enter a loop to get the existing signal status: */
|
|
for (i = 1; i < NSIG; i++) {
|
|
/* Check for signals which cannot be trapped: */
|
|
if (i == SIGKILL || i == SIGSTOP) {
|
|
}
|
|
|
|
/* Get the signal handler details: */
|
|
else if (_thread_sys_sigaction(i, NULL,
|
|
&_thread_sigact[i - 1]) != 0) {
|
|
/*
|
|
* Abort this process if signal
|
|
* initialisation fails:
|
|
*/
|
|
PANIC("Cannot read signal handler info");
|
|
}
|
|
}
|
|
|
|
/*
|
|
* Install the signal handler for the most important
|
|
* signals that the user-thread kernel needs. Actually
|
|
* SIGINFO isn't really needed, but it is nice to have.
|
|
*/
|
|
if (_thread_sys_sigaction(_SCHED_SIGNAL, &act, NULL) != 0 ||
|
|
_thread_sys_sigaction(SIGINFO, &act, NULL) != 0 ||
|
|
_thread_sys_sigaction(SIGCHLD, &act, NULL) != 0) {
|
|
/*
|
|
* Abort this process if signal initialisation fails:
|
|
*/
|
|
PANIC("Cannot initialise signal handler");
|
|
}
|
|
|
|
/* Get the kernel clockrate: */
|
|
mib[0] = CTL_KERN;
|
|
mib[1] = KERN_CLOCKRATE;
|
|
len = sizeof (struct clockinfo);
|
|
if (sysctl(mib, 2, &clockinfo, &len, NULL, 0) == 0)
|
|
_clock_res_nsec = clockinfo.tick * 1000;
|
|
|
|
/* Get the table size: */
|
|
if ((_thread_dtablesize = getdtablesize()) < 0) {
|
|
/*
|
|
* Cannot get the system defined table size, so abort
|
|
* this process.
|
|
*/
|
|
PANIC("Cannot get dtablesize");
|
|
}
|
|
/* Allocate memory for the file descriptor table: */
|
|
if ((_thread_fd_table = (struct fd_table_entry **) malloc(sizeof(struct fd_table_entry *) * _thread_dtablesize)) == NULL) {
|
|
/*
|
|
* Cannot allocate memory for the file descriptor
|
|
* table, so abort this process.
|
|
*/
|
|
PANIC("Cannot allocate memory for file descriptor table");
|
|
}
|
|
/* Allocate memory for the pollfd table: */
|
|
if ((_thread_pfd_table = (struct pollfd *) malloc(sizeof(struct pollfd) * _thread_dtablesize)) == NULL) {
|
|
/*
|
|
* Cannot allocate memory for the file descriptor
|
|
* table, so abort this process.
|
|
*/
|
|
PANIC("Cannot allocate memory for pollfd table");
|
|
} else {
|
|
/*
|
|
* Enter a loop to initialise the file descriptor
|
|
* table:
|
|
*/
|
|
for (i = 0; i < _thread_dtablesize; i++) {
|
|
/* Initialise the file descriptor table: */
|
|
_thread_fd_table[i] = NULL;
|
|
}
|
|
|
|
/* Initialize stdio file descriptor table entries: */
|
|
if ((_thread_fd_table_init(0) != 0) ||
|
|
(_thread_fd_table_init(1) != 0) ||
|
|
(_thread_fd_table_init(2) != 0)) {
|
|
PANIC("Cannot initialize stdio file descriptor "
|
|
"table entries");
|
|
}
|
|
}
|
|
}
|
|
|
|
#ifdef GCC_2_8_MADE_THREAD_AWARE
|
|
/* Create the thread-specific data for the exception linked list. */
|
|
if(pthread_key_create(&except_head_key, NULL) != 0)
|
|
PANIC("Failed to create thread specific execption head");
|
|
|
|
/* Setup the gcc exception handler per thread. */
|
|
__set_dynamic_handler_allocator( dynamic_allocator_handler_fn );
|
|
#endif /* GCC_2_8_MADE_THREAD_AWARE */
|
|
|
|
/* Initialise the garbage collector mutex and condition variable. */
|
|
if (pthread_mutex_init(&_gc_mutex,NULL) != 0 ||
|
|
pthread_cond_init(&_gc_cond,NULL) != 0)
|
|
PANIC("Failed to initialise garbage collector mutex or condvar");
|
|
|
|
gettimeofday(&kern_inc_prio_time, NULL);
|
|
|
|
return;
|
|
}
|
|
|
|
/*
|
|
* Special start up code for NetBSD/Alpha
|
|
*/
|
|
#if defined(__NetBSD__) && defined(__alpha__)
|
|
int
|
|
main(int argc, char *argv[], char *env);
|
|
|
|
int
|
|
_thread_main(int argc, char *argv[], char *env)
|
|
{
|
|
_thread_init();
|
|
return (main(argc, argv, env));
|
|
}
|
|
#endif
|
|
#else
|
|
/*
|
|
* A stub for non-threaded programs.
|
|
*/
|
|
void
|
|
_thread_init(void)
|
|
{
|
|
}
|
|
#endif
|