freebsd-nq

Author	SHA1	Message	Date
Warner Losh	1a13e01f7f	Don't use spinlocks here. The iicbus transactions can take a long time, and this prevents interrupts (say for Hz/hardclock) from happening. Time stands still during the transfers...	2007-04-17 05:48:35 +00:00
Scott Long	b653ca76bc	Don't delete the devalias, as per the man page. Submitted by: jmg	2007-04-17 01:12:35 +00:00
Andrew Thompson	18242d3b09	Rename the trunk(4) driver to lagg(4) as it is too similar to vlan trunking. The name trunk is misused as the networking term trunk means carrying multiple VLANs over a single connection. The IEEE standard for link aggregation (802.3 section 3) does not talk about 'trunk' at all while it is used throughout IEEE 802.1Q in describing vlans. The lagg(4) driver provides link aggregation, failover and fault tolerance. Discussed on: current@	2007-04-17 00:35:11 +00:00
John Baldwin	2248f68064	- Add a 'show rman <rm>' DDB command to dump the resources in a resource manager similar to 'devinfo -u'. - Add a 'show allrman' DDB command that effectively does 'show rman' on all resource managers in the system.	2007-04-16 21:09:03 +00:00
Scott Long	84f824818c	For the XPT_SASYNC_CB operation, only decouple the broadcast to the bus and device lists instead of decoupling the whole operation. This avoids problems with SIMs going away.	2007-04-16 19:55:36 +00:00
Scott Long	f35487464c	Drop the topology lock before calling the periph oninvalidate and dtor vectors.	2007-04-16 19:42:23 +00:00
Scott Long	cd5c9285cd	Drop the periph/sim lock when calling disk_destroy().	2007-04-16 19:41:14 +00:00
Scott Long	d292906a7c	Destroy the devalias before destroying the dev.	2007-04-16 19:40:13 +00:00
Robert Watson	0e92f0d7dd	Merge OpenBSM 1.0 alpha 14 changes to src/sys/security/audit: - au_to_attr64(), au_to_process64(), au_to_subject64(), au_to_subject64_ex(), au_to_zonename(), au_to_header64_tm(). - Extended address token fixes. Obtained from: TrustedBSD Project	2007-04-16 16:20:45 +00:00
Robert Watson	bfbc9a096b	Update src/sys/bsm for OpenBSM 1.0 alpha 14 import. Add new audit event types.	2007-04-16 16:13:10 +00:00
Pawel Jakub Dawidek	6b3d6017e8	s/destory/destroy/ (except for the code in contrib/).	2007-04-16 12:31:35 +00:00
Pawel Jakub Dawidek	8cb195f758	Uncomment forgotten check. Without this check in-place, ZFS will panic on unload instead of returning EBUSY. This check tells if there are mounted ZFS file systems or not. We can't unload if there are mounted file systems. Reported by: Andrey V. Elsukov <bu7cher@yandex.ru>	2007-04-16 10:23:24 +00:00
Kip Macy	d302816a12	PHYS_TO_VM_PAGE requires explicit vm_page.h include on sparc64	2007-04-15 22:17:10 +00:00
Robert Watson	215c8d75b8	Remove unused variable tcbinfo_mtx.	2007-04-15 21:03:23 +00:00
Dag-Erling Smørgrav	8edf8ae133	Avoid "unused variable" warning when building without PSEUDOFS_TRACE.	2007-04-15 20:35:18 +00:00
Matt Jacob	07589439e5	Use %j and args cast to uintmax_t to print bus_addr_t && length args.	2007-04-15 19:03:45 +00:00
Christian S.J. Peron	db8086c4fa	Add an entry for AUT_ZONENAME and the prototype for the au_to_zonename() function that will be implemented shortly. This is being done for the openbsm import.	2007-04-15 17:24:41 +00:00
Dag-Erling Smørgrav	388596dffc	Make pseudofs (and consequently procfs, linprocfs and linsysfs) MPSAFE.	2007-04-15 17:10:01 +00:00
Dag-Erling Smørgrav	b1f9e8cec9	Instead of stating GIANT_REQUIRED, just acquire and release Giant where needed. This does not make a difference now, but will when procfs is marked MPSAFE.	2007-04-15 17:06:09 +00:00
Dag-Erling Smørgrav	78c3440e7d	Whitespace cleanup.	2007-04-15 17:02:03 +00:00
Robert Watson	a0bda9d077	In nfsrv_rcv(), don't reacquire the nfs server lock until after nfs_realign() has been called, as it may sleep waiting on memory allocation. Reported by: simon	2007-04-15 15:50:50 +00:00
Kip Macy	2b6dbb2afa	Add pmap includes needed by i386	2007-04-15 15:30:45 +00:00
Dag-Erling Smørgrav	302762c344	Fix the same bug as in procfs_doproc{,db}regs(): check that uio_offset is 0 upon entry, and don't reset it before returning. MFC after: 3 weeks	2007-04-15 13:29:36 +00:00
Dag-Erling Smørgrav	66cd74a611	Don't reset uio_offset to 0 before returning. Instead, refuse to service requests where uio_offset is not 0 to begin with. This fixes a long- standing bug where e.g. 'cat /proc/$$/regs' would loop forever. MFC after: 3 weeks	2007-04-15 13:24:03 +00:00
Randall Stewart	f1d6e6dc71	Fix stupid syntax error - Pointy hat to me :-(	2007-04-15 13:03:14 +00:00
Dag-Erling Smørgrav	ab26caf6af	Add macros to assert that the process is / isn't held in memory. MFC after: 3 weeks	2007-04-15 12:59:49 +00:00
Randall Stewart	478d3f0901	- Add more comments to sctps_stats struture in sctp_uio.h - Fix bug that prevented EEOR mode from working and simplified the can_we_split code in the process. - Reduce lock contention for the tcb_send_lock. I did this especially for EEOR mode, still need to look at why I need a lock when removing from the tailq and the ->next is NOT null. A lock fixes it but it implies a bug yet exists. - Activated Andre's proposed changes to better use the mbuf infrastructure. - Fixed places that were not using the aloc macro's to take advantage of the per assoc cache. - Adds ifdef fix so any logging will enable stat_logging to get the right data structures in place (suggested by Max Laier).	2007-04-15 11:58:26 +00:00
Pawel Jakub Dawidek	7ae6548e62	MFp4: Start DNLC after desiredvnodes variable is initialized. Before this change if zfs.ko was loaded by the loader, DNLC was automatically disabled. Reported by: Zephiris <zephiris@gmail.com>	2007-04-15 09:10:17 +00:00
Scott Long	2b83592fdc	Remove Giant from CAM. Drivers (SIMs) now register a mutex that CAM will use to synchornize and protect all data objects that are used for that SIM. Drivers that are not yet MPSAFE register Giant and operate as usual. RIght now, no drivers are MPSAFE, though a few will be changed in the coming week as this work settles down. The driver API has changed, so all CAM drivers will need to be recompiled. The userland API has not changed, so tools like camcontrol do not need to be recompiled.	2007-04-15 08:49:19 +00:00
Kip Macy	4f450d951a	back out option to disable packet zone Requested by: sam	2007-04-15 06:30:28 +00:00
Kip Macy	ba68b814cc	suck in more of busdma to enable more efficient mappings kill redundant INVARIANTS check	2007-04-15 05:46:34 +00:00
Kip Macy	d43f50b93a	Add sysctl for disabling/enabling mbuf chain collapsing remove map creation before calling bus_dmamap_load_mvec_sg	2007-04-15 05:45:10 +00:00
Kip Macy	52c81add3c	Implement ZERO_COPY_SOCKETS check in a way that doesn't make LINT unhappy	2007-04-15 04:55:39 +00:00
Pawel Jakub Dawidek	87e89536f1	Fix RAID-Z resilvering. Obtained from: OpenSolaris	2007-04-14 20:50:14 +00:00
Kip Macy	51580731ae	Add support for mbuf iovec in the TX path	2007-04-14 20:40:22 +00:00
Kip Macy	642046797b	add reference count pointer to mbuf iovec implement robust version of m_collapse add support for sf_buf add fix for m_iovappend add calls to m_sanity under INVARIANTS fix m_freem_vec to correctly travese the mbuf iovec chain	2007-04-14 20:38:38 +00:00
Kip Macy	21c5f3f383	hide static declaration remove extra white space	2007-04-14 20:31:05 +00:00
Kip Macy	21ee3e7aff	remove now invalid check from m_sanity panic on m_sanity check failure with INVARIANTS	2007-04-14 20:19:16 +00:00
Kip Macy	f8bbd17f06	Add option for disabling allocation from the packet zone	2007-04-14 20:16:03 +00:00
Kip Macy	38073c4181	pad out m_hdr to make pkthdr word-aligned shuffle pkthdr.len so that pkthdr.header is aligned without compiler added padding Reviewed by: rwatson, andre, sam	2007-04-14 19:42:20 +00:00
Max Laier	d0cf96b407	Fix a typeo - unbreak the build.	2007-04-14 18:27:34 +00:00
Maxim Konovalov	461f64fe8c	o Add bsm and security to a list of cscope dirs.	2007-04-14 16:29:15 +00:00
Pawel Jakub Dawidek	d48078479c	MFp4: Hmm, it seems to work now.	2007-04-14 15:01:50 +00:00
Dag-Erling Smørgrav	f61bc4ea5e	Further pseudofs improvements: The pfs_info mutex is only needed to lock pi_unrhdr. Everything else in struct pfs_info is modified only while Giant is held (during vfs_init() / vfs_uninit()); add assertions to that effect. Simplify pfs_destroy somewhat. Remove superfluous arguments from pfs_fileno_{alloc,free}(), and the assertions which were added in the previous commit to ensure they were consistent. Assert that Giant is held while the vnode cache is initialized and destroyed. Also assert that the cache is empty when it is destroyed. Rename the vnode cache mutex for consistency. Fix a long-standing bug in pfs_getattr(): it would uncritically return the node's pn_fileno as st_ino. This would result in st_ino being 0 if the node had not previously been visited by readdir(), and also in an incorrect st_ino for process directories and any files contained therein. Correct this by abstracting the fileno manipulations previously done in pfs_readdir() into a new function, pfs_fileno(), which is used by both pfs_getattr() and pfs_readdir().	2007-04-14 14:08:30 +00:00
Pawel Jakub Dawidek	8aff52ca4e	MFp4: Use max_ncpus, which is used in other places in the code.	2007-04-14 12:33:47 +00:00
Pawel Jakub Dawidek	8870baf005	MFp4: Add more debug, so we can see if zpool.cache was loaded or why it wasn't loaded.	2007-04-14 12:23:03 +00:00
Pawel Jakub Dawidek	dbd490e0e2	MFp4: Allow to tune vfs.zfs.debug from loader.conf.	2007-04-14 12:21:06 +00:00
Pawel Jakub Dawidek	c98fbf0418	MFp4: - Allow to tune number of spa_zio_* threads. - Reduce default number of spa_zio_* threads to Nspa_zio_issue plus Nspa_zio_intr threads per ZIO type, where N is the number of CPUs. - Put ZIO type number in thread's name.	2007-04-14 12:20:06 +00:00
Robert Watson	d72a615878	Some Linux applications (ping) pass a non-NULL msg_control argument to sendmsg() while using a 0-length msg_controllen. This isn't allowed in the FreeBSD system call ABI, so detect this case and set msg_control to NULL. This allows Linux ping to work. Submitted by: rdivacky	2007-04-14 10:35:09 +00:00
Randall Stewart	c105859eee	- fix source address selection when picking an acceptable address - name change of prefered -> preferred - CMT fast recover code added. - Comment fixes in CMT. - We were not giving a reason of cant_start_asoc per socket api if we failed to get init/or/cookie to bring up an assoc. Change so we don't just give a generic "comm lost" but look at actual states of dying assoc. - change "crc32" arguments to "crc32c" to silence strict/noisy compiler warnings when crc32() is also declared - A few minor tweaks to get the portable stuff truely portable for sctp6_usrreq.c :-D - one-2-one style vrf match problem. - window recovery would leave chks marked for retran during window probes on the sent queue. This would then cause an out-of-order problem and assure that the flight size "problem" would occur. - Solves a flight size logging issue that caused rwnd overruns, flight size off as well as false retransmissions.g - Macroize the up and down of flight size. - Fix a ECNE bug in its counting. - The strict_sacks options was causing aborts when window probing was active, fix to make strict sacks a bit smarter about what the next unsent TSN is. - Fixes a one-2-one wakeup bug found by Martin Kulas. - If-defed out form, Andre's copy routines pending his commit of at least m_last().. need to adjust for 6.2 as well.. since m_last won't exist. Reviewed by: gnn	2007-04-14 09:44:09 +00:00
Bruce M Simpson	05d91e4363	In member interface detach event handler, do not attempt to free state which has already been freed by in_ifdetach(). With this cumulative change, the removal of a member interface will not cause a panic in pfsync(4). Requested by: yar PR: 86848	2007-04-14 01:01:46 +00:00
Pawel Jakub Dawidek	24b0502ee0	Fix jails and jail-friendly file systems handling: - We need to allow for PRIV_VFS_MOUNT_OWNER inside a jail. - Move security checks to vfs_suser() and deny unmounting and updating for jailed root from different jails, etc. OK'ed by: rwatson	2007-04-13 23:54:22 +00:00
Pawel Jakub Dawidek	bd59d85850	Fix overflow, which was causing endless loops when 32bit machine had more than 2GB of RAM. This was because our physmem is long and 'physmem*PAGESIZE' can be negative for more than 2GB of memory. Reported by: Andrey V. Elsukov <bu7cher@yandex.ru> It is not yet tested by Andrey, so there can be other problems, but this was definiately a bug, so I'm committing a fix now.	2007-04-13 18:50:03 +00:00
Maxim Konovalov	b274ce9ef2	o Extend the list of supported CDMA-2000 terminals. Submitted by: R.Mahmatkhanov MFC after: 10 days	2007-04-13 18:15:07 +00:00
Alan Cox	0b76504872	Eliminate the misuse of PG_FRAME to truncate a virtual address to a virtual page boundary. Reviewed by: ru@	2007-04-13 16:07:29 +00:00
Christian S.J. Peron	f0cbfcc468	Fix the handling of IPv6 addresses for subject and process BSM audit tokens. Currently, we do not support the set{get}audit_addr(2) system calls which allows processes like sshd to set extended or ip6 information for subject tokens. The approach that was taken was to change the process audit state slightly to use an extended terminal ID in the kernel. This allows us to store both IPv4 IPv6 addresses. In the case that an IPv4 address is in use, we convert the terminal ID from an struct auditinfo_addr to a struct auditinfo. If getaudit(2) is called when the subject is bound to an ip6 address, we return E2BIG. - Change the internal audit record to store an extended terminal ID - Introduce ARG_TERMID_ADDR - Change the kaudit <-> BSM conversion process so that we are using the appropriate subject token. If the address associated with the subject is IPv4, we use the standard subject32 token. If the subject has an IPv6 address associated with them, we use an extended subject32 token. - Fix a couple of endian issues where we do a couple of byte swaps when we shouldn't be. IP addresses are already in the correct byte order, so reading the ip6 address 4 bytes at a time and swapping them results in in-correct address data. It should be noted that the same issue was found in the openbsm library and it has been changed there too on the vendor branch - Change A_GETPINFO to use the appropriate structures - Implement A_GETPINFO_ADDR which basically does what A_GETPINFO does, but can also handle ip6 addresses - Adjust get{set}audit(2) syscalls to convert the data auditinfo <-> auditinfo_addr - Fully implement set{get}audit_addr(2) NOTE: This adds the ability for processes to correctly set extended subject information. The appropriate userspace utilities still need to be updated. MFC after: 1 month Reviewed by: rwatson Obtained from: TrustedBSD	2007-04-13 14:55:19 +00:00
Pawel Jakub Dawidek	f0bc5ac3e1	Fix vnodes starvation caused by DNLC (ZFS name cache): - Tune number of namecache entires better (based on desiredvnodes). - Handle vfs_lowvnodes event by releasing requested number of name cache entries, but no less than 5%. Reported by: simokawa	2007-04-13 08:42:01 +00:00
Pawel Jakub Dawidek	6bc3ab2574	When we are running low on vnodes, there is currently no way to ask other subsystems to release some vnodes. Implement backpressure based on vfs_lowvnodes event (similar to vm_lowmem for memory).	2007-04-13 08:38:48 +00:00
Pawel Jakub Dawidek	6704017a15	MFp4: Synchronize with vendor (mostly 'zfs rename -r').	2007-04-12 23:16:02 +00:00
Pawel Jakub Dawidek	1da61b3665	MFp4: Bring back comments. Requested by: jhb	2007-04-12 23:14:25 +00:00
Lukas Ertl	a2237c41fc	-) Correct sdcount for a plex when removing or adding subdisks. -) Set correct sizes for plexes and volumes a subdisk has been removed. Submitted by: Ulf Lilleengen <lulf_AT_freebsd.org>	2007-04-12 17:54:35 +00:00
Lukas Ertl	9e357b05da	Avoid infinite loop if the device string given for a drive only consists of "/". Submitted by: Ulf Lilleengen <lulf_AT_freebsd.org>	2007-04-12 17:40:44 +00:00
Alan Cox	1434c1f34b	MFamd64 Define PGEX_RSV.	2007-04-12 17:00:56 +00:00
Ruslan Ermilov	f76f6cfd25	Fix PAE on SMP by enabling EFER_NXE on all APs. Reported by: kris Diagnosed by: alc	2007-04-12 11:05:24 +00:00
Kip Macy	aa84193acf	restore sense to get_imm_packet MFC after: 3 days	2007-04-12 04:48:54 +00:00
Kip Macy	98d6fba71d	switch over to per-txq dma tag to facilitate parallelism on TX MFC after: 3 days	2007-04-12 04:31:44 +00:00
Kip Macy	dd782506d8	explicitly check TSO flag don't clear and then set M_PKTHDR, m_gethdr sets it correctly improve error handling on m_gethdr failure MFC after: 3 days	2007-04-12 03:33:30 +00:00
Kip Macy	23ed7b513f	Add ETHER_HDR_LEN to hardware accepted mtu MFC after: 3 days	2007-04-12 03:07:24 +00:00
Andrew Thompson	575156b607	Fix a case where the multicast addresses were not removed from some ports. The first port to be removed from the trunk would free the multicast list so subsequent removed ports didnt have their multicast addresses removed.	2007-04-12 01:58:57 +00:00
Andre Oppermann	dfd389bf64	Add m_last() inline function.	2007-04-11 23:13:12 +00:00
Dag-Erling Smørgrav	15bad11fdb	Add a flag to struct pfs_vdata to mark the vnode as dead (e.g. process- specific nodes when the process exits) Move the vnode-cache-walking loop which was duplicated in pfs_exit() and pfs_disable() into its own function, pfs_purge(), which looks for vnodes marked as dead and / or belonging to the specified pfs_node and reclaims them. Note that this loop is still extremely inefficient. Add a comment in pfs_vncache_alloc() explaining why we have to purge the vnode from the vnode cache before returning, in case anyone should be tempted to remove the call to cache_purge(). Move the special handling for pfstype_root nodes into pfs_fileno_alloc() and pfs_fileno_free() (the root node's fileno must always be 2). This also fixes a bug where pfs_fileno_free() would reclaim the root node's fileno, triggering a panic in the unr code, as that fileno was never allocated from unr to begin with. When destroying a pfs_node, release its fileno and purge it from the vnode cache. I wish we could put off the call to pfs_purge() until after the entire tree had been destroyed, but then we'd have vnodes referencing freed pfs nodes. This probably doesn't matter while we're still under Giant, but might become an issue later. When destroying a pseudofs instance, destroy the tree before tearing down the fileno allocator. In pfs_mount(), acquire the mountpoint interlock when required. MFC after: 3 weeks	2007-04-11 22:40:57 +00:00
Robert Watson	949da0d8f8	Remove obsolete comment about privileges: SUSER_ALLOWJAIL is no longer set in this code.	2007-04-11 16:31:02 +00:00
Robert Watson	94b94b2b49	Remove now-obsolete comment regarding mqueue privileges in jail.	2007-04-11 16:22:59 +00:00
Ruslan Ermilov	7480de4305	Make "struct tcp_timer" visible only to the kernel, and unbreak world.	2007-04-11 14:08:42 +00:00
John Baldwin	e403490aa6	Fix m_freem_vec() to actually traverse the mbuf chain. This avoids double free's and an infinite loop. CID: 1834 Found by: Coverity Prevent (tm)	2007-04-11 13:47:24 +00:00
John Baldwin	4796ce4956	Group the loop to acquire/release Giant with the WITNESS_SAVE/RESTORE under a single conditional. The two operations are linked, but since the link is not very direct, Coverity can't see it. Humans might also miss the link as well. So, this isn't fixing any actual bugs, just improving readability. CID: 1787 (likely others as well) Found by: Coverity Prevent (tm)	2007-04-11 13:44:55 +00:00
Ruslan Ermilov	9fd6e3d4a4	This commit was generated by cvs2svn to compensate for changes in r168616, which included commits to RCS files with non-trunk default branches.	2007-04-11 11:09:18 +00:00
Ruslan Ermilov	1859f337c4	Unbreak world build.	2007-04-11 11:09:18 +00:00
Andre Oppermann	b8152ba793	Change the TCP timer system from using the callout system five times directly to a merged model where only one callout, the next to fire, is registered. Instead of callout_reset(9) and callout_stop(9) the new function tcp_timer_activate() is used which then internally manages the callout. The single new callout is a mutex callout on inpcb simplifying the locking a bit. tcp_timer() is the called function which handles all race conditions in one place and then dispatches the individual timer functions. Reviewed by: rwatson (earlier version)	2007-04-11 09:45:16 +00:00
Nate Lawson	1b96d500fb	Put some overly verbose prints under bootverbose. This is on the vendor branch but we need to work out a different interface with the vendor.	2007-04-11 02:03:36 +00:00
Nate Lawson	c1149e97bb	This commit was generated by cvs2svn to compensate for changes in r168609, which included commits to RCS files with non-trunk default branches.	2007-04-11 02:03:36 +00:00
Pyun YongHyeon	b5898b804f	Add work around for hardware Tx checksum offload bug in Yukon II. Yukon II generated corrupted TCP checksum for short TCP packets that's less than 60 bytes in size(e.g. window probe packet, pure ACK packet etc). Padding the frame with zeros to make the frame minimum ethernet frame size didn't work at all. Instead of dropping Tx checksum offload support we calculate TCP checksum with S/W method when we encounter short TCP frames. Fortunately it seems that short UDP datagrams appear to be handled correctly by Yukon II. While I'm here simplify ethernet/VLAN header size calculation logic. PR: 111384	2007-04-11 00:47:29 +00:00
Pawel Jakub Dawidek	7f64b05f79	Move rpc/types.h under sys/, as this is used by ZFS kernel module. Repo-copied by: simon	2007-04-10 22:10:16 +00:00
Wojciech A. Koszek	f7caeade24	strchr() and strrchr() are already present in the kernel, but with less popular names. Hence: - comment current index() and rindex() functions, as these serve the same functionality as, respectively, strchr() and strrchr() from userland; - add inlined version of strchr() and strrchr(), as we tend to use them more often; - remove str[r]chr() definitions from ZFS code; Reviewed by: pjd Approved by: cognet (mentor)	2007-04-10 21:42:12 +00:00
Pawel Jakub Dawidek	fef2a25971	Remove trailing '.' for consistency!	2007-04-10 21:40:13 +00:00
Scott Long	6eef46be3b	Whitespace fixes	2007-04-10 21:37:37 +00:00
Marius Strobl	5abeece6ab	Let brgphy(4) attach for the Broadcom BCM5755 ASIC based chipsets as well. Obtained from: OpenBSD MFC after: 1 week	2007-04-10 20:43:23 +00:00
Marius Strobl	a404cff674	On i386 compile the back-end with EISA support as well as the EISA front-end if the dpt(4) module is built along with a kernel that includes eisa(4) or when compiling it stand-alone (logic based on the corresponding ISA logic in sys/modules/sound/sound/Makefile). As as side-effect this fixes the stand-alone build of the dpt(4) module after dpt.h 1.17, dpt_eisa.c 1.22 and dpt_scsi.c 1.55. Breakage reported by: n_hibma	2007-04-10 20:33:31 +00:00
Scott Long	715ab2120d	A fix for the SG_GET_TIMEOUT function slipped into a previous commit by accident. Remove the text describing the problem as it is no longer relevant. Also give real implementations for the GET and SET ioctls.	2007-04-10 20:03:42 +00:00
Pawel Jakub Dawidek	57bcf75fd2	Add UFS_GJOURNAL options to the GENERIC kernel. Approved by: re (kensmith)	2007-04-10 16:49:41 +00:00
Robert Watson	e8c5c7a635	Update comment regarding how we check privilege on FreeBSD: we now use priv_check().	2007-04-10 16:09:00 +00:00
Robert Watson	4b08405682	Allow PRIV_NETINET_REUSEPORT in jail.	2007-04-10 15:59:49 +00:00
Robert Watson	6493245ded	Add a new privilege, PRIV_NETINET_REUSEPORT, which will replace superuser checks to see whether bind() can reuse a port/address combination while it's already in use (for some definition of use).	2007-04-10 15:58:38 +00:00
Robert Watson	cc807dbd0a	Remove unnecessary suser() check in the sysctl to set up ath_hal logging: the sysctl framework will already have checked for privilege if a sysctl value is being set. Discussed a long time ago with: sam	2007-04-10 15:48:45 +00:00
Robert Watson	9956b3f5e4	Do allow POSIX mqueue unlink privilege inside a jail, as we all all other POSIX mqueue privileges inside a jail.	2007-04-10 15:40:27 +00:00
Pawel Jakub Dawidek	08be819487	Minor style cleanups (mostly removal of trailing whitespaces).	2007-04-10 15:29:37 +00:00
Pawel Jakub Dawidek	21ff8c6715	Correct typos.	2007-04-10 15:22:40 +00:00
Pawel Jakub Dawidek	1b6e2c02fe	MFp4: Allow to set zfs_recover via vfs.zfs.recover from /boot/loader.conf.	2007-04-10 12:54:19 +00:00
Pawel Jakub Dawidek	5b9528e2d4	MFp4: Hide under '#ifdef _KERNEL' only what's really needed.	2007-04-10 12:52:14 +00:00
Giorgos Keramidas	a52da38f26	Minor typo fix, noticed while I was going through *_pager.c files.	2007-04-10 12:34:51 +00:00
Konstantin Belousov	5b959aa44f	Fix the NAMEI zone leak when snapshot was successfully created. Reported and tested by: Peter Holm MFC after: 2 weeks	2007-04-10 09:31:42 +00:00
Konstantin Belousov	9724167c2a	Recalculate the NEWBLOCK flag for pagedep structure after the softdep lock is dropped, since pagedep may be already processed and deallocated. Found and tested by: kris MFC after: 2 weeks	2007-04-10 09:30:41 +00:00
Konstantin Belousov	23743f6a11	When LK_NOWAIT is passed as argument to process_worklist_item(), this does not prevent handle_workitem_remove() from recursing into a blocking version. Add the dirrem to worklist instead of processing it now if this is the case. Reported and tested by: kris Submitted by: tegge MFC after: 2 weeks	2007-04-10 09:28:17 +00:00
Andrew Thompson	49fd43bdbc	Fix an uninitialized variable warning.	2007-04-10 08:02:33 +00:00
Andrew Thompson	40c97c2118	Fix build, trunk is a device not an option.	2007-04-10 03:09:38 +00:00
Pawel Jakub Dawidek	2d03e33170	Try to stabilize ZFS with regard to memory consumption: - Allow to shrink ARC down to 16MB (instead of 64MB). - Set arc_max to 1/2 of kmem_map by default. - Start freeing things earlier when low memory situation is detected. - Serialize execution of arc_lowmem(). I decided to setup minimum ZFS memory requirements to 512MB of RAM and 256MB of kmem_map size. If there is less RAM or kmem_map, a warning will be printed. World is cruel, be no better. In other words: modern file system requires modern hardware:) From ZFS administration guide: "Currently the minimum amount of memory recommended to install a Solaris system is 512 Mbytes. However, for good ZFS performance, at least one Gbyte or more of memory is recommended."	2007-04-10 02:35:57 +00:00
Pawel Jakub Dawidek	52124c7f1c	Reduce diff against vendor - we have now stronger check for "mutex already initialized", so we can go back to kmem_alloc().	2007-04-10 02:19:12 +00:00
Andrew Thompson	75efd6fd67	Add trunk(4) module.	2007-04-10 00:41:31 +00:00
Andrew Thompson	7b62d98bf8	Hook trunk(4) up to the build.	2007-04-10 00:35:31 +00:00
Andrew Thompson	b47888ceba	Add the trunk(4) driver for providing link aggregation, failover and fault tolerance. This driver allows aggregation of multiple network interfaces as one virtual interface using a number of different protocols/algorithms. failover - Sends traffic through the secondary port if the master becomes inactive. fec - Supports Cisco Fast EtherChannel. lacp - Supports the IEEE 802.3ad Link Aggregation Control Protocol (LACP) and the Marker Protocol. loadbalance - Static loadbalancing using an outgoing hash. roundrobin - Distributes outgoing traffic using a round-robin scheduler through all active ports. This code was obtained from OpenBSD and this also includes 802.3ad LACP support from agr(4) in NetBSD.	2007-04-10 00:27:25 +00:00
Pawel Jakub Dawidek	0404b7791b	Remove unused #define.	2007-04-09 23:30:28 +00:00
Andrew Thompson	6429a5cb9b	Fix a compiler warning so hash.h can be included in the kernel. This changes the args for hash32_stre and hash32_strne but there are no consumers in the base system and openbgpd does not use it which the initial import was for. Silence on: hackers	2007-04-09 22:55:14 +00:00
Pawel Jakub Dawidek	6db107202a	Fix build breakage.	2007-04-09 22:29:13 +00:00
Pawel Jakub Dawidek	151db24af1	Add zfs_load here. Reminded by: bmah	2007-04-09 22:09:09 +00:00
Nate Lawson	a363f67a81	Restore the locking for the sleep/wakeup to avoid waiting an extra 1 sec if a race was lost. We're still single-threaded at this point, but just be safe for the future.	2007-04-09 21:10:04 +00:00
Nate Lawson	6b1e469ea5	Clean up the root mount and mount wait code. No mutexes are needed here since a spurious wakeup() is the only possible outcome and this is fine in the BSD programming model.	2007-04-09 19:23:52 +00:00
Pawel Jakub Dawidek	82068fe7a9	Add kern.hostuuid sysctl, which will be used to keep host's UUID. Reviewed by: mlaier, rink, brooks, rwatson	2007-04-09 19:18:09 +00:00
Paolo Pisati	d640d2e29d	The old PacketAlias* API is not exported when libalias run in kernel land.	2007-04-09 17:08:27 +00:00
Kip Macy	a53b1c1753	throw sun4v into the check while we're at it	2007-04-09 17:05:54 +00:00
Kip Macy	3a0a4ac13d	busdma tags are opaque on all architectures except sparc64 for now simply don't compile/use on sparc64	2007-04-09 17:01:23 +00:00
Alexander Kabaev	74c7f74304	LINT on ia64 requires memset symbol too. Make fire it is present by adding it to libkern on this architecture.	2007-04-09 14:02:18 +00:00
Andre Oppermann	cc9164e2e6	Sort sctp_*.c files.	2007-04-09 12:51:29 +00:00
Scott Long	4400b36d94	Make use of M_ZERO in various malloc calls.	2007-04-09 05:47:32 +00:00
Scott Long	472cdbef04	Fix a logic bug that slipped in at the last minute and apparently escaped testing.	2007-04-09 05:43:02 +00:00
Pawel Jakub Dawidek	24bda1641f	Instead of detecting if lock is already initialized based on standard 1 bit check, use more accurate 13 bits check. We had too many false-positives with the standard check. Reported by: mlaier	2007-04-09 01:05:31 +00:00
Pawel Jakub Dawidek	1868634782	Always try to load zpool.cache instead of trying to find good place to document it. When there is no such file, it's invisible for the user.	2007-04-09 00:04:54 +00:00
Pawel Jakub Dawidek	33fc425c85	We don't have to wait for the root file system to be mounted anymore, now that kobj KPI supports operating on files loaded by the loader.	2007-04-09 00:03:45 +00:00
Pawel Jakub Dawidek	5fc5d6ed61	Drop the Giant lock before calling zfs_domount(), which is held when mounting root file system.	2007-04-09 00:02:11 +00:00
Pawel Jakub Dawidek	f92cb15e7b	Move zpool.cache from /etc/zfs/ to /boot/zfs/, so we can keep it on dedicated /boot/ file system and use ZFS for the root file system.	2007-04-08 23:59:39 +00:00
Pawel Jakub Dawidek	bdebccf9b9	Extend kobj compatibility KPI to support operating on files before and after the root file system is mounted. This is one of the changes that will allow to put root file system on ZFS.	2007-04-08 23:57:08 +00:00
Pawel Jakub Dawidek	df3aed4f96	Use root_mounted().	2007-04-08 23:54:23 +00:00
Pawel Jakub Dawidek	2eb68d493f	Add root_mounted() function that returns true if the root file system is already mounted.	2007-04-08 23:54:01 +00:00
Kip Macy	dc5a36e241	Add missing paren	2007-04-08 22:56:18 +00:00
Xin LI	9e3edba677	Bump __FreeBSDversion for CAM sg addition. Requested by: bsam	2007-04-08 22:45:20 +00:00
Søren Schmidt	ae4ce3ceef	OK, this is not my day, fix the former fix :/	2007-04-08 21:53:52 +00:00
Søren Schmidt	f27a14650f	Hopefully unbreak the 64bit DMA support this time.	2007-04-08 19:18:51 +00:00
Kip Macy	cae1990513	remove stale variable reference	2007-04-08 18:02:37 +00:00
Pawel Jakub Dawidek	ffe54ff0ec	MFp4: Synchronize with recent OpenSolaris changes.	2007-04-08 16:29:25 +00:00
Kip Macy	db2faf119f	add busdma function for mapping mbuf iovecs change m_collapse to return an error code	2007-04-08 15:59:07 +00:00
Pawel Jakub Dawidek	425d75486e	- Use 'name=value' so it can be properly recognized by devd(8). - Use only subclass as devd's type.	2007-04-08 15:55:48 +00:00
Søren Schmidt	cd945eed47	Dont zero out 64BIT flag on DMA ops.	2007-04-08 15:31:39 +00:00
Kip Macy	27f0ce0f2b	hook uipc_mvec.c into build	2007-04-08 15:18:03 +00:00
Kip Macy	c0a24dd4aa	Convert driver RX path over to using mbuf iovec	2007-04-08 15:04:19 +00:00
Kip Macy	a8d9a363f5	Add driver private mbuf iovec support routines	2007-04-08 14:56:16 +00:00
Pawel Jakub Dawidek	c2cda60911	prison_free() can be called with a mutex held. This wasn't a problem until I converted allprison_mtx mutex to allprison_lock sx lock. To fix this LOR, move prison removal to prison_complete() entirely. To ensure that noone will reference this prison before it's beeing removed from the list skip prisons with 'pr_ref == 0' in prison_find() and assert that pr_ref has to greater than 0 in prison_hold(). Reported by: kris OK'ed by: rwatson	2007-04-08 10:46:23 +00:00
Pawel Jakub Dawidek	61cfeccd58	Take vnode pointer and hold it under znode lock, so we won't race with zfs_reclaim(). This may or may not fix problem reported by kris, but it's definiatelly better that way.	2007-04-08 10:29:14 +00:00
Pawel Jakub Dawidek	b63b0c6529	Only use prison mutex to protect the fields that need to be protected by it.	2007-04-08 10:21:38 +00:00
Ariff Abdullah	319276aac0	Disable cmi_midiattach(). The implementation is incomplete, and causing various interesting memory leak issues.	2007-04-08 07:52:27 +00:00
Pawel Jakub Dawidek	264de85e73	pr_list is protected by the allprison_lock.	2007-04-08 02:13:32 +00:00
Pawel Jakub Dawidek	3dc4488c91	Move atomic.S files to directories that better fit OpenSolaris directory layout.	2007-04-07 23:54:54 +00:00
Pawel Jakub Dawidek	e321494eca	Fix libzpool compilation. Reported by: des	2007-04-07 23:47:14 +00:00
Pawel Jakub Dawidek	9a691cb33a	Limit the number of system taskq threads to the number of CPUs. They are only used when there is a need for reducing namecache. Observed by: kris, csjp	2007-04-07 21:41:11 +00:00
Scott Long	1eba4c7948	Add the CAM 'SG' peripheral device. This device implements a subset of the Linux SCSI SG passthrough device API. The intention is to allow for both running of Linux apps that want to talk to /dev/sg* nodes, and to facilitate porting of apps from Linux to FreeBSD. As such, both native and linuxolator entry points and definitions are provided. Caveats: - This does not support the procfs and sysfs nodes that the Linux SG driver provides. Some Linux apps may rely on these for operation, others may only use them for informational purposes. - More ioctls need to be implemented. - Linux uses a naming scheme of "sg[a-z]" for devices, while FreeBSD uses a scheme of "sg[0-9]". Devfs aliasis (symlinks) are automatically created to link the two together. However, tools like camcontrol only see the native names. - Some operations were originally designed to return byte counts or other data directly as the syscall return value. The linuxolator doesn't appear to support this well, so this driver just punts for these cases. Now that the driver is in place, others are welcome to add missing functionality. Thanks to Roman Divacky for pushing this work along.	2007-04-07 19:40:58 +00:00
Dag-Erling Smørgrav	48be553b82	Build ZFS on amd64 and pc98. Approved by: pjd@	2007-04-07 19:12:10 +00:00
Dag-Erling Smørgrav	29665eac3f	Fix some type mismatches. Reviewed by: pjd@	2007-04-07 19:11:41 +00:00
Pawel Jakub Dawidek	639fdcd852	Allow to tune maximum and minimum memory used by ARC.	2007-04-07 19:10:50 +00:00
Pawel Jakub Dawidek	2b6271b7f2	Hide SEEK_DATA and SEEK_HOLE under __BSD_VISIBLE. Suggested by: ache	2007-04-07 18:31:40 +00:00
Matt Jacob	7b88fb86e3	Hide bus reset announcements within bootverbose. MFC after: 3 days	2007-04-07 18:15:52 +00:00
Pawel Jakub Dawidek	f3fdfb670c	- Remove SEEK_DATA and SEEK_HOLE from stdio.h, they don't belong here. - Only define SEEK_DATA and SEEK_HOLE in sys/unistd.h when neither _POSIX_SOURCE nor _XOPEN_SOURCE is defined. Pointed out by: bde, ache	2007-04-07 16:02:30 +00:00
Yoshihiro Takahashi	55ccb0b485	Fix build.	2007-04-07 13:37:45 +00:00
Pawel Jakub Dawidek	a583dae953	Add missing mutex_init() which was causing assertion panic when on clone destruction. Reported by: kris	2007-04-07 11:04:37 +00:00
Paolo Pisati	c326cd0e62	Prevent the usage of an uninitialized variable: do not accept StartMediaTx message before an OpnRcvChnAck message was received. Reviewed by: glebius Approved by: glebius (mentor) MFC after: 3 days Found with: Coverity Prevent(tm) CID: 498	2007-04-07 09:52:36 +00:00
Paolo Pisati	f4296f2246	Silence Coverity about an unused variable. Reviewed by: glebius Approved by: glebius (mentor) MFC after: 3 days CID: 538	2007-04-07 09:47:39 +00:00
KATO Takenori	f2a081cfe4	Added the IPLware 3.33 support. - Added magic numbers to pretend the NEC original program version 2.70. - Added string display routine with Shift-JIS code support. - Added three nop instructions at start1 in start.s since the installaer of the IPLware put 'call $0x09ab' instruction. - Put the near return instruction at 0x9ab in selector.s. Since the Shit-JIS display routine must be located at 0x1243, the linker script file (ldscript) is applied.	2007-04-07 08:37:04 +00:00
Kip Macy	d330ae533a	back out last change Requested by: ru	2007-04-07 05:09:40 +00:00
Hidetoshi Shimokawa	54911451d5	Fix a bug for over 4GB media. MFC after: 3 days	2007-04-07 02:52:13 +00:00
Robert Watson	7b20aa9ca6	Remove XXX comment that changes to file fields should be protected with the file lock rather than the filedesc lock: I fixed this in the last revision. Spotted by: kris	2007-04-06 23:31:30 +00:00
Alexander Kabaev	7d80a3b493	pc98 boot2 is compiled with _KERNEL defined, and that makes non-static bootinfo variable declaration visible. It conflicts with static declaration in this file. Declare variable as globally visible in order to resolve the conflict.	2007-04-06 20:50:24 +00:00
Jung-uk Kim	6e612eca81	Fix kernel module dependency. linprocfs depends on sysvmsg and sysvsem. Submitted by: nork	2007-04-06 18:15:56 +00:00
Ruslan Ermilov	2e137367b4	Add the PG_NX support for i386/PAE. Reviewed by: alc	2007-04-06 18:15:03 +00:00
Søren Schmidt	fe2fb53542	Add 64bit addressing support to SiI 3132/3124	2007-04-06 17:36:35 +00:00
Søren Schmidt	2cfcfef1fc	Remove debug gunk.	2007-04-06 16:21:34 +00:00
Søren Schmidt	16194fc40b	Add support for 64bit addressing to AHCI and Marvell controllers. Munged into ATA shape and Marvell specifics my yours truely. Submitted by: jhb	2007-04-06 16:18:59 +00:00
Pawel Jakub Dawidek	68474f1930	Sysctl description is not a format string, so one % is enough.	2007-04-06 12:53:54 +00:00
Yoshihiro Takahashi	bc30e6ae00	MFi386: add libkern/memset.c	2007-04-06 11:30:31 +00:00
Yoshihiro Takahashi	9f94082ed0	sort.	2007-04-06 11:29:52 +00:00
Pawel Jakub Dawidek	93caf77f95	Use strcasecmp() from libkern.	2007-04-06 11:21:01 +00:00
Pawel Jakub Dawidek	4d00f78b40	We have strcasecmp() in libkern now.	2007-04-06 11:18:57 +00:00
Kip Macy	735d79b8df	make modules compile without updating etc	2007-04-06 06:05:45 +00:00
Alexander Kabaev	89c40e5fec	Be more conservative and compile libkern/memset.c only on architectures than need it. These are i386, amd64 and powerpc so far.	2007-04-06 04:51:50 +00:00
Pawel Jakub Dawidek	ba7c08b71b	Bump __FreeBSD_version on ZFS import. Requested by: nork	2007-04-06 02:33:43 +00:00
Pawel Jakub Dawidek	ceef0c312c	Connect ZFS to the build.	2007-04-06 02:13:30 +00:00
Pyun YongHyeon	ad6d01d151	If we've encountered unrecognized chipset don't access hardware anymore. Previously it tried to access interrupt register to disable interrupts which could result in hang if the hardware was not properly initialized by system BIOS/ACPI. Tested by: Benjamin Hansmann (benjamin.hansmann AT rub dot de) MFC after: 3 days	2007-04-06 02:02:07 +00:00
Pawel Jakub Dawidek	2109a92fd1	Add Makefile for zfs.ko kernel module.	2007-04-06 01:35:16 +00:00
Pawel Jakub Dawidek	e726fc7c37	Add ZFS-specific privileges.	2007-04-06 01:11:39 +00:00
Pawel Jakub Dawidek	f0a75d274a	Please welcome ZFS - The last word in file systems. ZFS file system was ported from OpenSolaris operating system. The code in under CDDL license. I'd like to thank all SUN developers that created this great piece of software. Supported by: Wheel LTD (http://www.wheel.pl/) Supported by: The FreeBSD Foundation (http://www.freebsdfoundation.org/) Supported by: Sentex (http://www.sentex.net/)	2007-04-06 01:09:06 +00:00
Alexander Kabaev	c8c0ba192e	Add local ptototype for memset function.	2007-04-06 00:06:26 +00:00
Pawel Jakub Dawidek	028e84c68b	allprison mutex was converted to sx(9) lock.	2007-04-05 23:32:32 +00:00
Pawel Jakub Dawidek	dc68a63332	Implement functionality I called 'jail services'. It may be used for external modules to attach some data to jail's in-kernel structure. - Change allprison_mtx mutex to allprison_sx sx(9) lock. We will need to call external functions while holding this lock, which may want to allocate memory. Make use of the fact that this is shared-exclusive lock and use shared version when possible. - Implement the following functions: prison_service_register() - registers a service that wants to be noticed when a jail is created and destroyed prison_service_deregister() - deregisters service prison_service_data_add() - adds service-specific data to the jail structure prison_service_data_get() - takes service-specific data from the jail structure prison_service_data_del() - removes service-specific data from the jail structure Reviewed by: rwatson	2007-04-05 23:19:13 +00:00
Alexander Kabaev	616db5f04c	Add trivial MI memset function implementation. GCC mandates the existence of this function as a linkable symbol in standalone configurations and existing inline memcpy from libkern.h fails this requirement.	2007-04-05 22:02:39 +00:00
Pawel Jakub Dawidek	54b369c1ae	Make prison_find() globally accessible.	2007-04-05 21:34:54 +00:00
Pawel Jakub Dawidek	f6521d1c31	Implement SEEK_DATA and SEEK_HOLE extensions to lseek(2) as found in OpenSolaris. For more information please refer to: http://blogs.sun.com/bonwick/entry/seek_hole_and_seek_data	2007-04-05 21:10:53 +00:00
Pawel Jakub Dawidek	f3a8d2f93c	Add security.jail.mount_allowed sysctl, which allows to mount and unmount jail-friendly file systems from within a jail. Precisely it grants PRIV_VFS_MOUNT, PRIV_VFS_UNMOUNT and PRIV_VFS_MOUNT_NONUSER privileges for a jailed super-user. It is turned off by default. A jail-friendly file system is a file system which driver registers itself with VFCF_JAIL flag via VFS_SET(9) API. The lsvfs(1) command can be used to see which file systems are jail-friendly ones. There currently no jail-friendly file systems, ZFS will be the first one. In the future we may consider marking file systems like nullfs as jail-friendly. Reviewed by: rwatson	2007-04-05 21:03:05 +00:00
Pawel Jakub Dawidek	0f2c2ce0a3	When KVA is exhausted, try the vm_lowmem event for the last time before panicing. This helps a lot in ZFS stability.	2007-04-05 20:52:51 +00:00
Pawel Jakub Dawidek	fcdd9721e4	Fix a problem for file systems that don't implement VOP_BMAP() operation. The problem is this: vm_fault_additional_pages() calls vm_pager_has_page(), which calls vnode_pager_haspage(). Now when VOP_BMAP() returns an error (eg. EOPNOTSUPP), vnode_pager_haspage() returns TRUE without initializing 'before' and 'after' arguments, so we have some accidental values there. This bascially was causing this condition to be meet: if ((rahead + rbehind) > ((cnt.v_free_count + cnt.v_cache_count) - cnt.v_free_reserved)) { pagedaemon_wakeup(); [...] } (we have some random values in rahead and rbehind variables) I'm not entirely sure this is the right fix, maybe we should just return FALSE in vnode_pager_haspage() when VOP_BMAP() fails? alc@ knows about this problem, maybe he will be able to come up with a better fix if this is not the right one.	2007-04-05 20:49:46 +00:00
Pawel Jakub Dawidek	24c3c19e73	Hide lbolt under _SOLARIS_C_SOURCE in preparation for ZFS import. I really couldn't avoid this with preprocessor magic.	2007-04-05 20:40:47 +00:00
Marcel Moolenaar	9760f68ca0	Add PCI IDs for the HP RMP3 serial port. This is often used as the serial console. MFC after: 1 week	2007-04-05 19:15:46 +00:00
Alexander Kabaev	b27c252dcf	Remove extern struct pcb stoppcbs[] declaration from this file. It breaks GCC 4.1 compiles and does not appear to be required.	2007-04-05 18:34:11 +00:00
Dag-Erling Smørgrav	56c62ab69c	Whitespace nits.	2007-04-05 13:43:00 +00:00
Kip Macy	0f4d9d04ea	Fix mb_ctor_clust and mb_dtor_clust to reference the appropriate zone, simplify setting refcnt Reviewed by: andre, rwatson, and glebius MFC after: 3 days	2007-04-04 21:27:01 +00:00
Andre Oppermann	995a77176f	Add INP_INFO_UNLOCK_ASSERT() and use it in tcp_input(). Also add some further INP_INFO_WLOCK_ASSERT() while there.	2007-04-04 18:30:16 +00:00
Andre Oppermann	0c38fd0a7a	Move last tcpcb initialization for the inbound connection case from tcp_input() to syncache_socket() where it belongs and the majority of it already happens. The "tp->snd_up = tp->snd_una" is removed as it is done with the tcp_sendseqinit() macro a few lines earlier.	2007-04-04 16:13:45 +00:00
Andre Oppermann	beaa515e95	Some local and style(9) cleanups.	2007-04-04 15:30:31 +00:00
Andre Oppermann	5dd9dfefd6	Retire unused TCP_SACK_DEBUG.	2007-04-04 14:44:15 +00:00
Andre Oppermann	b728e90260	In tcp_dooptions() skip over SACK options if it is a SYN segment.	2007-04-04 14:39:49 +00:00
Robert Watson	5e3f7694b1	Replace custom file descriptor array sleep lock constructed using a mutex and flags with an sxlock. This leads to a significant and measurable performance improvement as a result of access to shared locking for frequent lookup operations, reduced general overhead, and reduced overhead in the event of contention. All of these are imported for threaded applications where simultaneous access to a shared file descriptor array occurs frequently. Kris has reported 2x-4x transaction rate improvements on 8-core MySQL benchmarks; smaller improvements can be expected for many workloads as a result of reduced overhead. - Generally eliminate the distinction between "fast" and regular acquisisition of the filedesc lock; the plan is that they will now all be fast. Change all locking instances to either shared or exclusive locks. - Correct a bug (pointed out by kib) in fdfree() where previously msleep() was called without the mutex held; sx_sleep() is now always called with the sxlock held exclusively. - Universally hold the struct file lock over changes to struct file, rather than the filedesc lock or no lock. Always update the f_ops field last. A further memory barrier is required here in the future (discussed with jhb). - Improve locking and reference management in linux_at(), which fails to properly acquire vnode references before using vnode pointers. Annotate improper use of vn_fullpath(), which will be replaced at a future date. In fcntl(), we conservatively acquire an exclusive lock, even though in some cases a shared lock may be sufficient, which should be revisited. The dropping of the filedesc lock in fdgrowtable() is no longer required as the sxlock can be held over the sleep operation; we should consider removing that (pointed out by attilio). Tested by: kris Discussed with: jhb, kris, attilio, jeff	2007-04-04 09:11:34 +00:00
Xin LI	04533fc68e	Use *_EMPTY macros when appropriate.	2007-04-04 07:29:53 +00:00
Kip Macy	fa0521c0e9	Make DMA tags per-queue to facilate parallel mappings Defer mbuf allocation and initialization until after data has already been received in a cluster This reduces cpu utilization somewhat, but it only improves the rx path. Recent changes to TCP appear to make us rate limited by the TX path. This is the first step in reducing mbuf management overhead for manipulating clusters. MFC after: 3 days	2007-04-04 05:29:18 +00:00
Kip Macy	e0bfe940a4	m_extadd does not appear to do the right thing for the case of clusters allocated from UMA - add m_cljset to correspond to m_cljget MFC after: 3 days	2007-04-04 04:08:57 +00:00
Alexander Kabaev	edb2e5dca3	Include string.h for non-kernel builds to get proper memcpy prototype.	2007-04-04 03:16:59 +00:00
Alexander Kabaev	d8164209b3	Include string.h for non-kernel builds to get proper strcpy, strlen prototypes.	2007-04-04 03:14:15 +00:00
Alexander Kabaev	9160afee7c	Do not assign result of (char ) cast to u_char variable.	2007-04-04 03:10:42 +00:00
Kip Macy	ab43ffd2f6	add helper functions for mapping size to zonez and types eliminate duplicated zone lookup switch statements	2007-04-04 00:31:49 +00:00
Kip Macy	59a31e6acf	fix typo	2007-04-04 00:11:22 +00:00
Kip Macy	e2bc106690	style fixes and make sure that the lock is treated as released in the sharers == 0 case not that this is somewhat racy because a new sharer can come in while we're updating stats	2007-04-04 00:01:05 +00:00
Kip Macy	afc0bfbd90	Fixes to sx for newsx - fix recursed case and move out of inline Submitted by: Attilio Rao <attilio@freebsd.org>	2007-04-03 22:58:21 +00:00
Kip Macy	70fe8436c8	move lock_profile calls out of the macros and into kern_mutex.c add check for mtx_recurse == 0 when releasing sleep lock	2007-04-03 22:52:31 +00:00
Julian Elischer	1bd69ee131	Since we switched to using monatomically increasing timestamps, they have been reported back to the userland as being in 1970. Add boot time to the timestamp to give the time in the scale of the 'current' real timescale. Not perfect if you change the time a lot but good enough to keep all the rules correct relative to each other correct in terms of time relative to "now".	2007-04-03 22:45:50 +00:00
Kip Macy	8289600ce7	skip call to _lock_profile_obtain_lock_success entirely if acquisition time is non-zero (i.e. recursing or adding sharers)	2007-04-03 18:36:27 +00:00
Alexander Kabaev	585b090609	Add dl_iterate_phdr function prototype and corresponding dl_phdr_info structure definition.	2007-04-03 18:33:41 +00:00
Kip Macy	802d9610eb	Remove unneccessary LO_CONTESTED flag	2007-04-03 17:57:50 +00:00
Robert Watson	6246c6e2a7	Fix use after free bug: use temporary variable to hold next entry in linked list while freeing current entry, rather than using the free'd entry's next pointer. Found with: Coverity Prevent(tm) CID: 1333	2007-04-03 12:45:10 +00:00
Pawel Jakub Dawidek	afd894bb12	Add root_mount_wait() function which can be used to wait until the root file system is mounted. This is useful for kernel modules loaded from /boot/loader.conf, that have to access file system.	2007-04-03 11:45:28 +00:00
Randall Stewart	bff64a4db3	- fixed several places where we did not release INP locks. - fixed a refcount bug in the new ifa structures. - use vrf's from default stcb or inp whenever possible. - Address limits raised to account for a full IP fragmented packet (1000 addresses). - flight size correcting updated to include one message only and to handle case where the peer does not cumack the next segment aka lists 1/1 in sack blocks.. - Various bad init/init-ack handling could cause a panic since we tried to unlock the destroyed mutex. Fixes so we properly exit when we need to destroy an assoc. (Found by Cisco DevTest team :D) - name rename in src-addr-selection from pass to sifa. - route structure typedef'd to allow different platforms and updated into sctp_os_bsd file. - Max retransmissions a chunk can be made added. Reviewed by: gnn	2007-04-03 11:15:32 +00:00
Andrew Gallatin	e39a0a37cf	- Fix a bug in the TSO transmit routine where frames which had been defragged and had their headers in the same cluster as their payload would be fed to the NIC in header-sized chunks, and would likely exceed the number of available transmit descriptors. - If a TSO frame exceeds the number of available transmit descriptors, don't leak busdmma resources when freeing it. Sponsored by: Myricom Inc.	2007-04-03 10:41:33 +00:00
Kevin Lo	6d361569d5	Since the driver uses mutexes, remove splusb() and splx().	2007-04-03 05:59:17 +00:00
Alexander Kabaev	02b71ede34	Correct PT_GNU_EH_FRAME definition.	2007-04-03 01:47:07 +00:00
Marcel Moolenaar	35777a2a79	Don't use a time-limiting loop that's defined in terms of the baudrate in the putc() method. Likewise, in the getc() method, don't check for received characters with an interval defined in terms of the baudrate. In both cases it works equally well to implement a fixed delay. More importantly, it avoids calculating a delay that's roughly 1/10th the time it takes to send/receive a character. The calculation is costly and happens for every character sent or received, affecting low-level console or debug port performance significantly. Secondly, when the RCLK is not available or unreliable, the delays could disrupt normal operation. The fixed delay is 1/10th the time it takes to send a character at 230400 bps.	2007-04-03 01:21:10 +00:00
Marcel Moolenaar	f8100ce2a7	Don't expose the uart_ops structure directly, but instead have it obtained through the uart_class structure. This allows us to declare the uart_class structure as weak and as such allows us to reference it even when it's not compiled-in. It also allows is to get the uart_ops structure by name, which makes it possible to implement the dt tag handling in uart_getenv(). The side-effect of all this is that we're using the uart_class structure more consistently which means that we now also have access to the size of the bus space block needed by the hardware when we map the bus space, eliminating any hardcoding.	2007-04-02 22:00:22 +00:00
Warner Losh	cf5bdd4446	Loop on sdcard init. This helps if one hasn't plugged in the card fast enough, or there's other issues that cause the first try to fail.	2007-04-02 20:26:04 +00:00
John Baldwin	1ce2bc9187	Fix a fd leak in socketpair(): - Close the new file objects created during socketpair() if the copyout of the new file descriptors fails. - Add a test to the socketpair regression test for this edge case.	2007-04-02 19:15:47 +00:00
Jung-uk Kim	0a55a034ba	Enable MSI support on RELENG_6. MFC after: 3 days	2007-04-02 19:09:06 +00:00
Jung-uk Kim	357afa7113	MFP4: Turn emul_lock into a mutex. Submitted by: rdivacky	2007-04-02 18:38:13 +00:00
John Baldwin	ddda35b8f6	- Split out the part of SYSCALL_MODULE_HELPER() that builds a 'struct sysent' for a new system call into a new MAKE_SYSENT() macro. - Use MAKE_SYSENT() to build a full sysent for the nfssvc system call in the NFS server and use syscall_register() and syscall_deregister() to manage the nfssvc system call entry instead of manually frobbing the sysent[] array.	2007-04-02 13:53:26 +00:00
John Baldwin	ebb3c22c16	Don't go to a whole lot of extra work to handle the race where the new file descriptor is closed out from under us in kern_open(). This race is already handled and the file will be closed when kern_open() does an fdrop just before returning.	2007-04-02 13:40:38 +00:00
Ariff Abdullah	f505e02090	Revert busy refcount back to int. As a side note, multiple open is still (and always) possible and does not change previous behaviour. Requested by: netchild	2007-04-02 10:24:15 +00:00
Ariff Abdullah	ff7499570c	Disable seq_modevent(). The implementation is incomplete, and causing memory leak during unload.	2007-04-02 06:03:47 +00:00
Pyun YongHyeon	75a1d5a086	Use our own timer for watchdog instead of if_watchdog/if_timer interface.	2007-04-02 04:43:41 +00:00
Ariff Abdullah	3627e77dfa	No need to track every closing instance, and put busy counter to rest in its single bit coffin.	2007-04-02 03:46:25 +00:00
Scott Long	15735bec61	Freeze the simq, not the devq, if we run out of command slots. This fixes the last round of reported instability in the rev 13/14 driver. Approved by: Erich Chen	2007-04-02 03:31:37 +00:00
Ariff Abdullah	a9be51acfe	Provide hint / tunable for possible asynchronous USB execution. Async execution should help us avoiding potential deadlock and illegal locking while sleeping in various mixer -> usb calls. To enable it, use hint.uaudio.%d.async="1" or sysctl dev.uaudio.%d.async=1. Default is disable, to remain compatible with old behaviour (with slight risk of potential deadlock).	2007-04-02 03:25:39 +00:00
Ariff Abdullah	72e9d07fbf	- Don't wakeup() unnecessarily, so the behavior of dead interrupt or stalled DMA engine can be observed and predicted. - Minor sysctl/tunable cleanup.	2007-04-02 03:03:06 +00:00
Matt Jacob	9a1b0d43c2	Temporarily desupport simultaneous target and initiator mode. When the linux port changes were imported which split the target command list to be separate from the initiator command list and the handle format changed to encode a type in the handle the implications to the function isp_handle_index (which only the NetBSD/OpenBSD/FreeBSD ports use) were overlooked. The fault is twofold: first, the index into the DMA maps in isp_pci is wrong because a target command handle with the type bit left in place caused a bad index (and panic) into dma map. Secondly, the assumption of the array of DMA maps in either PCS or SBUS attachment structures is that there is a linear mapping between handle index and DMA map index. This can no longer be true if there are overlapping index spaces for initiator mode and target mode commands. These changes bandaid around the problem by forcing us to not have simultaneous dual roles and doing the appropriate masking to make sure things are indexed correctly. A longer term fix is being devloped.	2007-04-02 01:04:20 +00:00
Alexander Leidinger	02da6fa190	Handle errors from bus_setup_intr(). Found by: Coverity Prevent (tm) CID: 1066	2007-04-01 16:55:31 +00:00
Alexander Leidinger	68af68014e	Tell the user when the setup of the interrupt handler failed and return an error. Found by: Coverity Prevent (tm) CID: 71-78	2007-04-01 16:52:54 +00:00
Wojciech A. Koszek	4850546f51	ng_node and ng_worklist locks both migrated from being spinning locks to adaptive mutexes. Let witness(4) calm down and bring proper types of those locks to the lock order database. Glanced at by: rwatson	2007-04-01 15:48:10 +00:00
Pawel Jakub Dawidek	4874b3fb12	More style nits.	2007-04-01 15:40:56 +00:00
Alexander Leidinger	c9be0e5d4d	Tell a statistic checker that not checking the return value of the probing of the mii phy is intended for this chip. Found by: Coverity Prevent (tm) CID: 43	2007-04-01 14:15:26 +00:00
Alexander Leidinger	2acfcc2d4c	Make it obvious that we don't care about the return value of usbd_endpoint_count(), the failure case is handled implicit in the following code. Found by: Coverity Prevent (tm) CID: 56	2007-04-01 13:46:39 +00:00
Pawel Jakub Dawidek	daa88cdf0a	Style nit.	2007-04-01 13:41:10 +00:00

... 3 4 5 6 7 ...

63456 Commits