freebsd-dev

Author	SHA1	Message	Date
Robert Watson	64c238075f	First step in simplifying accept filter socket option logic in the post-SMPng world order. Centralize handling of the socket option clear case in do_setopt_accept_filter().	2005-03-11 21:37:45 +00:00
Robert Watson	56856fbfb4	Remove an additional commented out reference to a possible future sx lock.	2005-03-11 19:16:02 +00:00
Robert Watson	2b37548a71	When setting up a socket in socreate(), there's no need to lock the socket lock around knlist_init(), so don't. Hard code the setting of the socket reference count to 1 rather than using soref() to avoid asserting the socket lock, since we've not yet exposed the socket to other threads. This removes two mutex operations from each socket allocation.	2005-03-11 16:30:02 +00:00
Robert Watson	5fab68b19e	Remove suggestive sx_init() comment in soalloc(). We will have something like this at some point, but for now it clutters the source.	2005-03-11 16:26:33 +00:00
Robert Watson	35a196154f	The SO_NOSIGPIPE socket option allows a user process to mark a socket so that the socket does not generate SIGPIPE, only EPIPE, when a write is attempted after socket shutdown. When the option was introduced in 2002, this required the logic for determining whether SIGPIPE was generated to be pushed down from dofilewrite() to the socket layer so that the socket options could be considered. However, the change in 2002 omitted modification to soo_write() required to add that logic, resulting in SIGPIPE not being generated even without SO_NOSIGPIPE when the socket was written to using write() or related generic system calls. This change adds the EPIPE logic to soo_write(), generating a SIGPIPE signal to the process associated with the passed uio in the event that the SO_NOSIGPIPE option is not set. Notes: - The are upsides and downsides to placing this logic in the socket layer as opposed to the file descriptor layer. This is really fd layer logic, but because we need so_options, we have a choice of layering violations and pick this one. - SIGPIPE possibly should be delivered to the thread performing the write, not the process performing the write. - uio->uio_td and the td argument to soo_write() might potentially differ; we use the thread in the uio argument. - The "sigpipe" regression test in src/tools/regression/sockets/sigpipe tests for the bug. Submitted by: Mikko Tyolajarvi <mbsd at pacbell dot net> Talked with: glebius, alfred PR: 78478 MFC after: 1 week	2005-03-11 15:06:16 +00:00
John-Mark Gurney	74e620476c	fix spelling of match in comment... MFC after: 3 days	2005-03-10 21:23:06 +00:00
Poul-Henning Kamp	b43ab0e378	Try to fix the mess I made of devname, with the minimal subset of the larger minor/major patch which was posted for testing.	2005-03-10 18:21:34 +00:00
Robert Watson	53358cc907	Document, via WITNESS, that the NFS server mutex falls ahead of the socket buffer mutexes.	2005-03-09 21:38:53 +00:00
Dag-Erling Smørgrav	628b83cd08	My addled brains didn't realize that since vtp points into value, we can't freeenv(value) before we're done inspecting vtp[0]. Tested by: Anish Mistry <mistry.7@osu.edu>	2005-03-09 12:16:45 +00:00
Stefan Farfeleder	b26244446b	Fix typo in comment.	2005-03-09 11:50:55 +00:00
Sam Leffler	a4e714295a	allow the destination of m_move_pkthdr to have external storage (e.g. a cluster) Glanced at by: rwatson, silby	2005-03-08 17:52:01 +00:00
Giorgos Keramidas	0a11e99990	Remove redundant initialization that is repeated in the for() loop right below it. Approved by: jhb	2005-03-08 16:57:20 +00:00
Maxim Sobolev	8d6e40c3f1	Add kernel-only flag MSG_NOSIGNAL to be used in emulation layers to surpress SIGPIPE signal for the duration of the sento-family syscalls. Use it to replace previously added hack in Linux layer based on temporarily setting SO_NOSIGPIPE flag. Suggested by: alfred	2005-03-08 16:11:41 +00:00
Poul-Henning Kamp	d9a54d5c23	Reengineer subr_unit Add support for passing in a mutex. If NULL is passed a global subr_unit mutex is used. Add alloc_unrl() which expects the mutex to be held. Allocating a unit will never sleep as it does not need to allocate memory. Cut possible range in half so we can use -1 to mean "out of number". Collapse first and last runs into the head by means of counters. This saves memory in the common case(s).	2005-03-08 10:40:48 +00:00
Poul-Henning Kamp	3238ec33e1	Fix signedness of minor2unit().	2005-03-08 10:40:03 +00:00
Jeff Roberson	ec346d1040	- Lock access to the buffer_map with the vm_map lock. In 4.x this was done with splbio, in 5.x this was done with Giant. Discussed with: alc Reported by: julian, pho	2005-03-08 09:34:54 +00:00
Giorgos Keramidas	46da8bf8fb	Typo & grammar fixes in comments.	2005-03-08 00:58:50 +00:00
Robert Watson	9bfb7389bc	When upcalling from a socket in soisconnected() for an accept filter, call with flag M_DONTWAIT rather than M_TRYWAIT, as we don't want to do blocking memory allocation (etc) in the netisr. MFC after: 3 days	2005-03-07 13:50:16 +00:00
Poul-Henning Kamp	3b3f38ed7d	Add placeholder mutex argument to new_unrhdr().	2005-03-07 11:05:47 +00:00
Bill Paul	58a6edd121	When you call MiniportInitialize() for an 802.11 driver, it will at some point result in a status event being triggered (it should be a link down event: the Microsoft driver design guide says you should generate one when the NIC is initialized). Some drivers generate the event during MiniportInitialize(), such that by the time MiniportInitialize() completes, the NIC is ready to go. But some drivers, in particular the ones for Atheros wireless NICs, don't generate the event until after a device interrupt occurs at some point after MiniportInitialize() has completed. The gotcha is that you have to wait until the link status event occurs one way or the other before you try to fiddle with any settings (ssid, channel, etc...). For the drivers that set the event sycnhronously this isn't a problem, but for the others we have to pause after calling ndis_init_nic() and wait for the event to arrive before continuing. Failing to wait can cause big trouble: on my SMP system, calling ndis_setstate_80211() after ndis_init_nic() completes, but _before_ the link event arrives, will lock up or reset the system. What we do now is check to see if a link event arrived while ndis_init_nic() was running, and if it didn't we msleep() until it does. Along the way, I discovered a few other problems: - Defered procedure calls run at PASSIVE_LEVEL, not DISPATCH_LEVEL. ntoskrnl_run_dpc() has been fixed accordingly. (I read the documentation wrong.) - Similarly, the NDIS interrupt handler, which is essentially a DPC, also doesn't need to run at DISPATCH_LEVEL. ndis_intrtask() has been fixed accordingly. - MiniportQueryInformation() and MiniportSetInformation() run at DISPATCH_LEVEL, and each request must complete before another can be submitted. ndis_get_info() and ndis_set_info() have been fixed accordingly. - Turned the sleep lock that guards the NDIS thread job list into a spin lock. We never do anything with this lock held except manage the job list (no other locks are held), so it's safe to do this, and it's possible that ndis_sched() and ndis_unsched() can be called from DISPATCH_LEVEL, so using a sleep lock here is semantically incorrect. Also updated subr_witness.c to add the lock to the order list.	2005-03-07 03:05:31 +00:00
Alan Cox	2b2c7a6b40	The m_ext reference counts are potentially shared and modified asynchronously by different threads. Thus, declare as volatile the reference count that is accessed through m_ext's pointer, ref_cnt. Revert the previous change, revision 1.144, that casts as volatile a single dereference of ref_cnt. Reviewed by: bmilekic, dwhite Problem reported by: kris MFC after: 3 days	2005-03-06 20:09:00 +00:00
Dag-Erling Smørgrav	f3301d15f1	Teach getenv_quad() to recognize k/m/g/t suffixes in both lower- and upper-case. This means (almost) all tunables now support those suffixes.	2005-03-05 15:52:12 +00:00
David Xu	bc8e6d817d	Allocate umtx_q from heap instead of stack, this avoids page fault panic in kernel under heavy swapping.	2005-03-05 09:15:03 +00:00
David Xu	627451c1d9	The td_waitset is pointing to a stack address when thread is waiting for a signal, because kernel stack is swappable, this causes page fault in kernel under heavy swapping case. Fix this bug by eliminating unneeded code.	2005-03-04 22:46:31 +00:00
Maxim Sobolev	4b1783363f	In linux emulation layer try to detect attempt to use linux_clone() to create kernel threads and call rfork(2) with RFTHREAD flag set in this case, which puts parent and child into the same threading group. As a result all threads that belong to the same program end up in the same threading group. This is similar to what linuxthreads port does, though in this case we don't have a luxury of having access to the source code and there is no definite way to differentiate linux_clone() called for threading purposes from other uses, so that we have to resort to heuristics. Allow SIGTHR to be delivered between all processes in the same threading group previously it has been blocked for s[ug]id processes. This also should improve locking of the same file descriptor from different threads in programs running under linux compat layer. PR: kern/72922 Reported by: Andriy Gapon <avg@icyb.net.ua> Idea suggested by: rwatson	2005-03-03 16:57:55 +00:00
Doug White	a1d0c3f203	Insert volatile cast to discourage gcc from optimizing the read outside of the while loop. Suggested by: alc MFC after: 1 day	2005-03-03 02:41:37 +00:00
Joerg Wunsch	a5f50ef9e4	netchild's mega-patch to isolate compiler dependencies into a central place. This moves the dependency on GCC's and other compiler's features into the central sys/cdefs.h file, while the individual source files can then refer to #ifdef __COMPILER_FEATURE_FOO where they by now used to refer to #if __GNUC__ > 3.1415 && __BARC__ <= 42. By now, GCC and ICC (the Intel compiler) have been actively tested on IA32 platforms by netchild. Extension to other compilers is supposed to be possible, of course. Submitted by: netchild Reviewed by: various developers on arch@, some time ago	2005-03-02 21:33:29 +00:00
David Xu	6675b36ec5	In kern_sigtimedwait, remove waitset bits for td_sigmask before sleeping, so in do_tdsignal, we no longer need to test td_waitset. now td_waitset is only used to give a thread higher priority when delivering signal to multithreads process. This also fixes a bug: when a thread in sigwait states was suspended and later resumed by SIGCONT, it can no longer receive signals belong to waitset.	2005-03-02 13:43:51 +00:00
Paul Saab	b8a4edc17e	Use kern_kevent instead of the stackgap for 32bit syscall wrapping. Submitted by: jhb Tested on: amd64	2005-03-01 17:45:55 +00:00
Paul Saab	c1aa81b6d9	regen	2005-03-01 17:44:34 +00:00
Paul Saab	96d31285fe	Change the prototype of kevent to remove the const from the changelist. Reviewed by: jhb	2005-03-01 17:43:08 +00:00
Robert Watson	081322613b	When mac_check_system_acct() fails, make sure to unlock as well as close the vnode. Pointed out by: jeff	2005-03-01 08:56:13 +00:00
Wes Peters	a09150446d	Add a sysctl that records the amount of physical memory in the machine. Submitted by: Nicko Dehaine <nicko@stbernard.com> MFC after: 1 day	2005-02-28 21:42:56 +00:00
Poul-Henning Kamp	8045ce213d	Also handle d_maj hints from cloning drivers correctly.	2005-02-27 22:57:32 +00:00
Poul-Henning Kamp	84f580a093	Whine about any drivers which hardcode the device major number.	2005-02-27 22:41:07 +00:00
Poul-Henning Kamp	acd102e64b	Use dynamic major number allocation.	2005-02-27 22:02:03 +00:00
Poul-Henning Kamp	78e253c8d5	Use dynamic major number allocation.	2005-02-27 22:00:45 +00:00
Poul-Henning Kamp	89685e2269	Use dynamic major number allocation for /dev/console, there is no longer any benefit from hard wiring it. Remove special hack used to wire major to zero despite zero having a different magic meaning as well.	2005-02-27 21:52:42 +00:00
Nate Lawson	789f03ceb4	Add locking to handle multiple threads getting/setting frequencies at the same time. We use an sx lock and serialize the cpufreq device's get/set/levels methods.	2005-02-27 01:34:08 +00:00
Nate Lawson	b070969b48	Allow users to reject levels below a given frequency (in MHz) via the debug.cpufreq.lowest tunable and sysctl. Some systems seem to have problems with the lowest frequencies so setting this prevents them from being available or used.	2005-02-26 22:37:49 +00:00
Tom Rhodes	183a16a3ec	Remove recently added note about DEVICE_POLLING not working with SMP. Remove warning from kern_poll.c to allow DEVICE_POLLING to be built with SMP. Discussed with: ru, glebius	2005-02-25 22:07:51 +00:00
Robert Watson	fa6fc5b819	Insert missing increment of (i) when walking the temporary semaphore vector during fork. Fix assertion which contained an off-by-one error. Submitted by: Antoine Brodin < antoine dot brodin at laposte dot net >	2005-02-25 21:00:14 +00:00
Robert Watson	590f242cc0	Add an exit hook, sem_forkhook(), which walks the list of POSIX semaphores owned by a process when it forks, and creates a matching set of references for the child process, as prescribed by POSIX. In order to avoid races with other threads in the parent process during fork(), it is necessary to allocate a temporary reference list while holding the sem_lock, then transfer those references to the new process once the sem_lock is released. The implementation is inefficient but appears functional; in order to improve the efficiency, it will be necessary to modify the existing structures and logic, which generally rely on O(n) operations over the global set of semaphores.	2005-02-25 19:10:51 +00:00
Robert Watson	955ec4156c	Assert sem_lock in id_to_sem() and sem_lookup_byname(), since these functions iterate over the global POSIX semaphore lists. MFC after: 3 days	2005-02-25 17:01:35 +00:00
Maxim Sobolev	90dc539be0	Welcome to the 21st century: increase MAXSHELLCMDLEN from 128 bytes to PAGE_SIZE. Unlike originator of the PR suggests retain MAXSHELLCMDLEN definition (he has been proposing to replace it with PAGE_SIZE everywhere), not only this reduced the diff significantly, but prevents code obfuscation and also allows to increase/decrease this parameter easily if needed. PR: kern/64196 Submitted by: Magnus Bäckström <b@etek.chalmers.se>	2005-02-25 11:49:42 +00:00
Maxim Sobolev	6916a1da50	o Replace two while {} do loops with more appropriate do {} while loops. This doesn't change functionality, but makes code more logical. Obtained from: DrafonFlyBSD o Use VOP_GETATTR() to obtain actual size of file and parse no more than that. Previously, we parsed MAXSHELLCMDLEN characters regardless of the actual file size. This makes the following working: $ printf '#!/bin/echo' > /tmp/test.sh $ chmod 755 /tmp/test.sh $ /tmp/test.sh Previously, attempts to execve() that shell script has been failing with bogus ENAMETOOLONG. PR: kern/64196 Submitted by: Magnus B.ckstr.m <b@etek.chalmers.se>	2005-02-25 10:17:53 +00:00
Maxim Sobolev	b4305f8d91	Try harder to not exceed MAXSHELLCMDLEN when parsing first line of shell script. Otherwise it's possible to panic kernel by constructing a shell script with first line not ending in '\n'. Also, treat '\0' as line terminating character, which may me useful in some situations. Submitted by: gad	2005-02-25 08:42:04 +00:00
Nate Lawson	d269386a24	Bump the maximum number of levels to 64 and add warning messages about what to do to fix reduced functionality if the number of levels is too low.	2005-02-24 20:21:41 +00:00
Sam Leffler	59d8b31002	change m_adj to reclaim unused mbufs instead of zero'ing m_len when trim'ing space off the back of a chain; this is indirect solution to a potential null ptr deref Noticed by: Coverity Prevent analysis tool (null ptr deref) Reviewed by: dg, rwatson	2005-02-24 00:40:33 +00:00
Christian S.J. Peron	cd13819433	Add locking assertions into vn_extattr_set, vn_extattr_get and vn_extattr_rm. This is meant to catch conditions where IO_NODELOCKED has been specified without the vnode being locked. Discussed with: rwatson MFC after: 1 week	2005-02-24 00:13:16 +00:00

1 2 3 4 5 ...

8249 Commits