freebsd-skq

Author	SHA1	Message	Date
rwatson	ac6e0fc083	Reorder and clean up make_coda_node(), annotate weaknesses in the implementation. MFC after: 1 month	2008-02-15 11:58:11 +00:00
scottl	7163c9c1fc	Teach the dump and minidump code to respect the maxiosize attribute of the disk; the hard-coded assumption of 64K doesn't work in all cases.	2008-02-15 06:26:25 +00:00
marcel	257f2d8fc2	On Montecito processors, the instruction cache is in fact not coherent with the data caches. Implement a quick fix to allow us to boot on Montecito, while I'm working on a better fix in the mean time. Commit made on Montecito-based Itanium...	2008-02-14 18:46:50 +00:00
yar	9713f1f445	In the new order of things dictated by nmount(2), a read-only mount is to be requested via a "ro" option. At the same time, MNT_RDONLY is gradually becoming an indicator of the current state of the FS instead of a command flag. Today passing MNT_RDONLY alone to the kernel's mount machinery will lead to various glitches. (See the PRs for examples.) Therefore mount the root FS with a "ro" option instead of the MNT_RDONLY flag. (Note that MNT_RDONLY still is added to the mount flags internally, by vfs_donmount(), if "ro" was specified.) To be able to pass "ro" cleanly to kernel_vmount(), teach the latter function to accept options with NULL values. Also correct the comment explaining how mount_arg() handles length of -1. PR: bin/106636 kern/120319 Submitted by: Jaakko Heinonen <see PR kern/120319 for email> (originally)	2008-02-14 17:04:31 +00:00
gallatin	0e60a1b33c	Now that mxge supports MSI-X interrupts, reverse the logic and flag legacy interrupts rather than MSI as a special case. Prior to this commit, the interrupt handler was doing the slow handshaking with the device to ensure the legacy interrupt was lowered in both the legacy and MSI-X case. This handshaking was not required for MSI-X.	2008-02-14 16:24:14 +00:00
rpaulo	6137f591ed	Don't attach to non Core CPUs. This is needed because on the PIII, querying the number of sensors returns > 0. PR: 120541 Approved by: njl (mentor)	2008-02-14 14:14:59 +00:00
simon	49aa39283b	Fix sendfile(2) write-only file permission bypass. Security: FreeBSD-SA-08:03.sendfile Submitted by: kib	2008-02-14 11:44:31 +00:00
yongari	df40a22dd2	Prefer NULL over integer 0 for pointer type.	2008-02-14 01:25:01 +00:00
yongari	182a7cea0b	Nuke local jumbo allocator and switch to use of UMA backed page allocator for jumbo frame. o Removed unneeded jlist lock which was used to manage jumbo buffers. o Don't reinitialize hardware if MTU was not changed. o Added additional check for minimal MTU size. o Added a new tunable hw.skc.jumbo_disable to disable jumbo frame support for the driver. The tunable could be set for systems that do not need to use jumbo frames and it would save (9K * number of Rx descriptors) bytes kernel memory. o Jumbo buffer allocation failure is no longer critical error for the operation of sk(4). If sk(4) encounters the allocation failure it just disables jumbo frame support and continues to work without user intervention. With these changes jumbo frame performance of sk(4) was slightly increased and users should not encounter jumbo buffer allocation failure. Previously sk(4) tried to allocate physically contiguous memory, 3388KB for 256 Rx descriptors. Sometimes that amount of contiguous memory region could not be available for running systems which in turn resulted in failure of loading the driver. Tested by: Cy Schubert < Cy.Schubert () komquats dot com >	2008-02-14 01:10:48 +00:00
rwatson	021108eeb6	Remove debugging code under OLD_DIAGNOSTIC; this is all >10 years old and hasn't been used in that time. MFC after: 1 month	2008-02-14 00:55:03 +00:00
rwatson	0cf10cfc00	In Coda, flush the attribute cache for a cnode when its fid is changed, as its synthesized inode number may have changed and we want stat(2) to pick up the new inode number. MFC after: 1 month	2008-02-14 00:30:06 +00:00
gallatin	960266ca30	Add minimally invasive shims to ease MFCs of mxge back as far as RELENG_6 Sponsored by: Myricom, Inc.	2008-02-14 00:09:59 +00:00
jhb	fd8332efc0	Add KASSERT()'s to catch attempts to recurse on spin mutexes that aren't marked recursable either via mtx_lock_spin() or thread_lock(). MFC after: 1 week	2008-02-13 23:39:05 +00:00
jhb	b518019544	Mark the syscons video spin mutex as recursable since it is currently recursed in a few places. MFC after: 1 week	2008-02-13 23:38:08 +00:00
jhb	32100bd15f	Mark sleepqueue chain spin mutexes as recursable since the sleepq code now recurses on them in sleepq_broadcast() and sleepq_signal() when resuming threads that are fully asleep. MFC after: 1 week	2008-02-13 23:36:56 +00:00
jhb	64735ffb5f	Add a couple of assertions and KTR logging to thread_lock_flags() to match mtx_lock_spin_flags(). MFC after: 1 week	2008-02-13 23:33:50 +00:00
gallatin	f8f681d2a2	Make the type of the firmware arrays match those in the other eth*_z8e.h files.	2008-02-13 21:58:46 +00:00
jhb	e8b1d791b2	Add an automatic kernel module version dependency to prevent loading modules using invalid ABI versions (e.g. a 7.x module with an 8.x kernel) for a given kernel: - Add a 'kernel' module version whose value is __FreeBSD_version. - Add a version dependency on 'kernel' in every module that has an acceptable version range of __FreeBSD_version up to the end of the branch __FreeBSD_version is part of. E.g. a module compiled on 701000 would work on kernels with versions between 701000 and 799999 inclusive. Discussed on: arch@ MFC after: 1 week	2008-02-13 21:34:06 +00:00
attilio	784af2e88a	Bump __FreeBSD_version after the introduction of: - lockmgr_assert() - BUF_ASSERT_*() family functions which enriched the KPI.	2008-02-13 20:59:28 +00:00
attilio	456bfb1f0f	- Add real assertions to lockmgr locking primitives. A couple of notes for this: * WITNESS support, when enabled, is only used for shared locks in order to avoid problems with the "disowned" locks * KA_HELD and KA_UNHELD only exist in the lockmgr namespace in order to assert for a generic thread (not curthread) owning or not the lock. Really, this kind of check is bogus but it seems very widespread in the consumers code. So, for the moment, we cater to this untrusted behaviour, until the consumers are fixed and the options can be removed (hopefully during 8.0-CURRENT lifecycle) * Implementing KA_HELD and KA_UNHELD (not supported natively by WITNESS) made necessary the introduction of LA_MASKASSERT which specifies the range for default lock assertion flags * About other aspects, lockmgr_assert() follows exactly what other locking primitives offer about this operation. - Build real assertions for buffer cache locks on the top of lockmgr_assert(). They can be used with the BUF_ASSERT_*(bp) paradigm. - Add checks at lock destruction time and use a cookie for verifying lock integrity at any operation. - Redefine BUF_LOCKFREE() in order to not use a direct assert but let it rely on the aforementioned destruction time check. KPI results evidently broken, so __FreeBSD_version bumping and manpage update result necessary and will be committed soon. Side note: lockmgr_assert() will be used soon in order to implement real assertions in the vnode namespace replacing the legacy and still bogus "VOP_ISLOCKED()" way. Tested by: kris (earlier version) Reviewed by: jhb	2008-02-13 20:44:19 +00:00
rwatson	5e4721882e	Update cache flushing behavior in light of recent namecache and access cache improvements: - Flush just access control state on CODA_PURGEUSER, not the full namecache for /coda. - When replacing a fid on a cnode as a result of, e.g., reintegration after offline operation, we no longer need to purge the namecache entries associated with its vnode. MFC after: 1 month	2008-02-13 19:50:17 +00:00
jhb	f3a2cbebdb	Use RTFREE_LOCKED() instead of rtfree() when releasing a reference on the 'rt' route in rtredirect() as 'rt' is always locked. MFC after: 1 week PR: kern/117913 Submitted by: Stefan Lambrev stefan.lambrev of moneybookers.com	2008-02-13 16:57:58 +00:00
rwatson	d212ebbda3	Remove coda_namecache from coda5 as well. We should probably GC coda5 entirely at this point as coda6 is considered the supported branch. MFC after: 1 month	2008-02-13 16:31:04 +00:00
rwatson	e479ef560d	Remove coda_namecache from "options vcoda", it is no longer required. MFC after: 1 month Spotted by: Tinderbox	2008-02-13 16:15:47 +00:00
rwatson	621bdec0f6	Implement a rudimentary access cache for the Coda kernel module, modeled on the access cache found in NFS, smbfs, and the Linux coda module. This is a positive access cache of a single entry per file, tracking recently granted rights, but unlike NFS and smbfs, supporting explicit invalidation by the distributed file system. For each cnode, maintain a C_ACCCACHE flag indicating the validity of the cache, and a cached uid and mode tracking recently granted positive access control decisions. Prefer the cache to venus_access() in VOP_ACCESS() if it is valid, and when we must fall back to venus_access(), update the cache. Allow Venus to clear the access cache, either the whole cache on CODA_FLUSH, or just entries for a specific uid on CODA_PURGEUSER. Unlike the Coda module on Linux, we don't flush all entries on a user purge using a generation number, we instead walk present cnodes and clear only entries for the specific user, meaning it is somewhat more expensive but won't hit all users. Since the Coda module is aggressive about not keeping around unopened cnodes, the utility of the cache is somewhat limited for files, but works well for directories. We should make Coda less aggressive about GCing cnodes in VOP_INACTIVE() in order to improve the effectiveness of in-kernel caching of attributes and access rights. MFC after: 1 month	2008-02-13 15:45:12 +00:00
rwatson	70ff5c5a61	Remove now-unused Coda namecache. MFC after: 1 month	2008-02-13 13:26:01 +00:00
rwatson	a9d8becadf	Rather than having the Coda module use its own namecache, use the global VFS namecache, as is done by the Coda module on Linux. Unlike the Coda namecache, the global VFS namecache isn't tagged by credential, so use more conservative flushing behavior (for now) when CODA_PURGEUSER is issued by Venus. This improves overall integration with the FreeBSD VFS, including allowing __getcwd() to work better, procfs/procstat monitoring, and so on. This improves shell behavior in many cases, and improves ".." handling. It may lead to some slowdown until we've implemented a specific access cache, which should net improve performance, but in the mean time, lookup access control now always goes to Venus, whereas previously it didn't. MFC after: 1 month	2008-02-13 13:06:22 +00:00
attilio	313dc11b0b	Fix a lock leak in the ntfs locking scheme: When ntfs_ntput() reaches 0 in the refcount the inode lockmgr is not released and directly destroyed. Fix this by unlocking the lockmgr() even in the case of zero-refcount. Reported by: dougb, yar, Scot Hetzel <swhetzel at gmail dot com> Submitted by: yar	2008-02-13 13:02:12 +00:00
jhb	e83bd487c4	Consolidate the code to generate a new XID for a NFS request into a nfs_xid_gen() function instead of duplicating the logic in both nfsm_rpchead() and the NFS3ERR_JUKEBOX handling in nfs_request(). MFC after: 1 week Submitted by: mohans (a long while ago)	2008-02-13 00:04:58 +00:00
marcel	06a30b50ee	Remove SMP left-overs from NetBSD.	2008-02-12 20:55:51 +00:00
csjp	b24cb219b9	Make sure we restrict Linux only IPC calls from being executed through the FreeBSD ABI. IPC_INFO, SHM_INFO, SHM_STAT were added specifically for Linux binary support. They are not documented as being a part of the FreeBSD ABI, also, the structures necessary for them have been hidden away from the users for a long time. Also, the Linux ABI layer uses its own structures to populate the responses back to the user to ensure that the ABI is consistent. I think there is a bit more separation work that needs to happen. Reviewed by: jhb Discussed with: jhb Discussed on: freebsd-arch@ (very briefly) MFC after: 1 month	2008-02-12 20:55:03 +00:00
ru	841dab65e0	Regenerate for readlink(2).	2008-02-12 20:11:54 +00:00
ru	56aa644e2a	Change readlink(2)'s return type and type of the last argument to match POSIX. Prodded by: Alexey Lyashkov	2008-02-12 20:09:04 +00:00
marcel	636c607d81	There's no need to suppress option GDB.	2008-02-12 19:38:39 +00:00
marcel	ab259d0a33	Add PIC support for IPIs. When registering an interrupt handler, the PIC also informs the platform at which IRQ level it can start assigning IPIs, since this can depend on the number of IRQs supported for external interrupts.	2008-02-12 18:14:46 +00:00
scottl	db8258708b	If busdma is being used to realign dynamic buffers and the alignment is set to PAGE_SIZE or less, the bounce page counting logic was flawed and wouldn't reserve any pages. Adjust to be correct. Review of other architectures is forthcoming. Submitted by: Joseph Golio	2008-02-12 16:24:30 +00:00
jhb	266bdb9965	Fix a typo when testing for the NO_C3 quirk. MFC after: 3 days	2008-02-12 15:26:59 +00:00
raj	e825a75e75	Fix typo. Approved by: cognet (mentor)	2008-02-12 11:07:33 +00:00
raj	155d019a02	Eliminate BUS_DMA <-> cache incoherencies in USB transfers. With write-allocate cache we get into the following scenario: 1. data has been updated in the memory by the USB HC, but 2. D-cache holds an un-flushed value of it 3. when affected cache line is being replaced, the old (un-flushed) value is flushed and overwrites the newly arrived This is possible due to how write-allocate works with virtual caches (ARM for example). In case of USB transfers it leads to fatal tags discrepancies in umass(4) operation, which look like the following: umass0: Invalid CSW: tag 1 should be 2 (probe0:umass-sim0:0:0:0): Request completed with CAM_REQ_CMP_ERR (probe0:umass-sim0:0:0:0): Retrying Command umass0: Invalid CSW: tag 1 should be 3 (probe0:umass-sim0:0:0:0): Request completed with CAM_REQ_CMP_ERR (probe0:umass-sim0:0:0:0): Retrying Command umass0: Invalid CSW: tag 1 should be 4 (probe0:umass-sim0:0:0:0): Request completed with CAM_REQ_CMP_ERR (probe0:umass-sim0:0:0:0): Retrying Command umass0: Invalid CSW: tag 1 should be 5 (probe0:umass-sim0:0:0:0): Request completed with CAM_REQ_CMP_ERR (probe0:umass-sim0:0:0:0): Retrying Command umass0: Invalid CSW: tag 1 should be 6 (probe0:umass-sim0:0:0:0): Request completed with CAM_REQ_CMP_ERR (probe0:umass-sim0:0:0:0): error 5 (probe0:umass-sim0:0:0:0): Retries Exausted To eliminate this, a BUS_DMASYNC_PREREAD sync operation is required in usbd_start_transfer(). Credits for nailing this down go to Grzegorz Bernacki gjb AT semihalf DOT com. Reviewed by: imp Approved by: cognet (mentor)	2008-02-12 11:03:29 +00:00
brueffer	28d3c4acf3	Add missing \n. PR: 120341 Submitted by: CyberLeo <cyberleo@cyberleo.net> Approved by: rwatson (mentor), aradford@amcc.com MFC after: 3 days	2008-02-12 08:58:26 +00:00
kris	989a96d5cb	Switch the default NFS mount mode from UDP to TCP. UDP mounts are a historical relic, and are no longer appropriate for either LAN or WAN mounting. At modern (gigabit and 10 gigabit) LAN speeds packet loss from socket buffer fill events is common, and sequence numbers wrap quickly enough that data corruption is possible. TCP solves both of these problems without imposing significant overhead. MFC after: 1 month	2008-02-11 23:23:21 +00:00
marius	e3f122733a	The Sun disk label only uses 16-bit fields for cylinders, heads and sectors so the geometry of large IDE disks has to be adjusted. This corresponds to what the OpenSolaris dad(7D) driver does except that the latter only tweaks sectors and effectively limits the mediasize to 128GB so the cylinders and heads fields won't ever overflow. Not limiting the mediasize is a compromise between allowing to use Sun disk label as far as possible and being able to use the entire disk with another disk label. This allows to use the full capacity of large IDE disks if they were not labeled under (Open)Solaris (in both ways of the meaning). MFC after: 2 weeks	2008-02-11 21:40:22 +00:00
jkim	3bffed0bec	Fix Linux mmap with MAP_GROWSDOWN flag. Reported by: Andriy Gapon (avg at icyb dot net dot ua) Tested by: Andriy Gapon (avg at icyb dot net dot ua) Pointyhat: me MFC after: 3 days	2008-02-11 19:35:03 +00:00
raj	32549763d8	Clean up PowerPC loader(8) build config. Turn off TFTP support by default: when both TFTP and NFS are enabled in the loader, strange interactions occur in the pure netbooting scenario (i.e. loader is TFTP-ed, kernel+world mounted over NFS), leading to very slow access to the NFS-exported files. Reviewed by: grehan Approved by: cognet (mentor)	2008-02-11 12:30:32 +00:00
thompsa	25bc946ffc	Unbreak build, size_t is larger on 64bit platforms.	2008-02-11 09:20:01 +00:00
thompsa	18c68a31ef	Add missing GEOM_LVM option.	2008-02-11 05:36:15 +00:00
thompsa	c2724541ae	Hook geom_lvm(4) up to the build.	2008-02-11 03:10:40 +00:00
thompsa	1d945b74cc	Add a geom class to map Linux LVM logical volumes. The logical disks will appear as /dev/lvm/<vol group>-<logical vol>, for instance /dev/lvm/vg0-home. GLVM currently supports linear stripes with segments on multiple physical disks. The metadata is read only, logical volumes can not be allocated or resized. Reviewed by: Ivan Voras	2008-02-11 03:05:11 +00:00
rwatson	8d55ec6003	Clean up coda_pathconf() slightly while debugging a problem there. MFC after: 1 month	2008-02-11 00:01:45 +00:00
phk	22b65bed61	Add support for PC Engines ALIX boards. Style cleanup. Hide some messages behind bootverbose.	2008-02-10 19:14:42 +00:00
attilio	1ec6395a47	- Revert last ehci.c change - Include lock.h in lockmgr.h as nested header in order to safely use LOCK_FILE and LOCK_LINE. As long as this code will be replaced soon we can tolerate for a while this namespace pollution even if the real fix would be to let lockmgr() depend on lock.h as a separate header.	2008-02-10 15:50:21 +00:00
rwatson	8a831a9f22	Since we're now actively maintaining the Coda module in the FreeBSD source tree, restyle everything but coda.h (which is more explicitly shared across systems) into a closer approximation to style(9). Remove a few more unused function prototypes. Add or clarify some comments. MFC after: 1 month	2008-02-10 11:18:12 +00:00
kib	3474d93d40	After rev. 1.60 of sys/lockmgr.h, the header requires inclusion of the sys/lock.h.	2008-02-10 07:34:16 +00:00
iwasaki	9b9b90a013	Add `hw.ciss.nop_message_heartbeat' tunable (default disabled) for NOP-message polling in ciss_periodic(). Note that setting the tunable to non-zero can be workaround only for `ADAPTER HEARTBEAT FAILED' problem, and may freeze the system w/o the problem. Reviewed by: scottl Reported by: Attila Nagy MFC after: 3 days	2008-02-10 06:21:52 +00:00
attilio	7b35710fce	Use LOCK_FILE and LOCK_LINE where necessary instead of __FILE__ and __LINE__ with lockmgr.	2008-02-09 21:37:54 +00:00
attilio	4274e0aa54	namei() can call underlying nfs_readlink() passing a struct uio pointer owned by a NULL owner. This will lead consequent VOP_ISLOCKED() present into nfs_upgrade_vnlock() to panic as it only acquire curthread now. Fix nfs_upgrade_vnlock() and nfs_downgrade_vnlock() in order to no longer use the struct thread pointer passed as argument (as it is really no longer required there as vn_lock() and VOP_UNLOCK doesn't get the lock more). Using curthread, in place, doesn't get ambiguity as LK_EXCLOTHER should be handled as a "not locked" request by both functions. Reported by: kris Tested by: kris Reviewed by: ups	2008-02-09 20:13:19 +00:00
rwatson	5b9b5c8121	Various further non-functional cleanups to coda: - Rename print_vattr to coda_print_vattr and make static, rename print_cred to coda_print_cred. - Remove unused coda_vop_nop. - Add XXX comment because coda_readdir forwards to the cache vnode's readdir rather than venus_readdir, and annotate venus_readdir as unused. - Rename vc_nb_* to vc_. - Use d_open_t, d_close_t, d_read_t, d_write_t, d_ioctl_t and d_poll_t for prototyping vc_ as that is the intent, don't use our own definitions. - Rename coda_nb_statfs to coda_statfs, rename NB_SFS_SIZ to CODA_SFS_SIZ. - Replace one more OBE reference to NetBSD with a reference to FreeBSD. - Tidy up a little vertical whitespace here and there. - Annotate coda_nc_zapvnode as unused. - Remove unused vcodattach. - Annotate VM_INTR as unused. - Annotate that coda_fhtovp is unused and doesn't match the FreeBSD prototype, so isn't hooked up to vfs_fhtovp. If we want NFS export of Coda to work someday, this needs to be fixed. - Remove unused getNewVnode. - Remove unused coda_vget, coda_init, coda_quotactl prototypes. MFC after: 1 month	2008-02-09 12:49:18 +00:00
rwatson	f2fd79dc06	No reason not to maintain stats on statfs in Coda, as it's done for other VFS operations, so uncomment the existing statistics gathering. MFC after: 1 month	2008-02-09 11:40:49 +00:00
rwatson	c06ae37dfb	Remove unused devtomp(), which exploited UFS-specific knowledge to find the mountpoint for a specific device. This was implemented incorrectly, a bad idea in a fundamental sense, and also never used, so presumably a long-idle debugging function. MFC after: 1 month	2008-02-09 11:12:18 +00:00
rwatson	e16828cb77	Since Coda is effectively a stacked file system, use VOP_EOPNOTSUPP for vop_bmap; delete the existing stub that returned either EINVAL or EOPNOTSUPP, and had unreachable calls to VOP_BMAP on the cache vnode. MFC after: 1 month	2008-02-09 09:33:19 +00:00
marcus	7e24637c24	Add support for displaying a process' current working directory, root directory, and jail directory within procstat. While this functionality is available already in fstat, encapsulating it in the kern.proc.filedesc sysctl makes it accessible without using kvm and thus without needing elevated permissions. The new procstat output looks like: PID COMM FD T V FLAGS REF OFFSET PRO NAME 76792 tcsh cwd v d -------- - - - /usr/src 76792 tcsh root v d -------- - - - / 76792 tcsh 15 v c rw------ 16 9130 - - 76792 tcsh 16 v c rw------ 16 9130 - - 76792 tcsh 17 v c rw------ 16 9130 - - 76792 tcsh 18 v c rw------ 16 9130 - - 76792 tcsh 19 v c rw------ 16 9130 - - I am also bumping __FreeBSD_version for this as this new feature will be used in at least one port. Reviewed by: rwatson Approved by: rwatson	2008-02-09 05:16:26 +00:00
rwatson	7445f79ec2	Lock cache vnode when VOP_FSYNC() is called on a Coda vnode. MFC after: 1 month	2008-02-09 00:12:22 +00:00
rwatson	0a37acb8a6	Make all calls to vn_lock() in Coda, including recently added ones, use LK_RETRY, since failure is undesirable (and not handled). MFC after: 1 month Pointed out by: kib	2008-02-09 00:03:22 +00:00
rwatson	409c34ce7d	The Coda module was originally ported to NetBSD from Mach by rvb, and then later to FreeBSD. Update various NetBSD-related comments: in some cases delete them because they don't apply, in others update to say FreeBSD as they still apply but in FreeBSD (and might for that matter no longer apply on NetBSD), and flag one case where I'm not sure whether it applies. MFC after: 1 month	2008-02-08 23:15:36 +00:00
rwatson	83dcd82dd9	Before invoking vnode operations on cache vnodes, acquire the vnode locks of those vnodes. Probably, Coda should do the same lock sharing/ pass-through that is done for nullfs, but in the mean time this ensures that locks are adequately held to prevent corruption of data structures in the cache file system. Assuming most operations came from the top layer of Coda and weren't performed directly on the cache vnodes, in practice this corruption was relatively unlikely as the Coda vnode locks were ensuring exclusive access for most consumers. This causes WITNESS to squeal like a pig immediately when Coda is used, rather than waiting until file close; I noticed these problems because of the lack of said squealing. MFC after: 1 month	2008-02-08 23:01:40 +00:00
rwatson	09506e9ff8	Remove undefined code excluded by #if 1 #else, which previously protected vget() calls using inode numbers to query the root of /coda, which is not needed since we now cache the root vnode with the mountpoint. MFC after: 1 month	2008-02-08 22:37:15 +00:00
attilio	e1db4e70b3	Convert all explicit instances of VOP_ISLOCKED(arg, NULL) into VOP_ISLOCKED(arg, curthread). Now, VOP_ISLOCKED() and lockstatus() should only acquire curthread as argument; this will lead in axing the additional argument from both functions, making the code cleaner. Reviewed by: jeff, kib	2008-02-08 21:45:47 +00:00
sam	588482fb4c	belated bump for the addition of m_collapse	2008-02-08 21:23:36 +00:00
glebius	c845f83019	If the vhid is already present, return EEXIST instead of non-informative EINVAL.	2008-02-07 13:18:59 +00:00
glebius	415a259ae1	Remove unused structure member from struct in_ifadown_arg.	2008-02-07 11:26:52 +00:00
glebius	34f9d2c8a9	Use rtalloc1() instead of rtalloc_ign(). It returns a locked rtentry. We quickly copy the fields of interest, and then RTFREE_LOCKED(). This should be faster than locking and unlocking the rtentry twice.	2008-02-07 11:10:17 +00:00
jeff	e5687b20d7	- Add THREAD_LOCKPTR_ASSERT() to assert that the thread's lock points at the provided lock or &blocked_lock. The thread may be temporarily assigned to the blocked_lock by the scheduler so a direct comparison can not always be made. - Use THREAD_LOCKPTR_ASSERT() in the primary consumers of the scheduling interfaces. The schedulers themselves still use more explicit asserts. Sponsored by: Nokia	2008-02-07 06:55:38 +00:00
jeff	005506bb32	- In rw_wunlock_hard prefer to wakeup writers if there are both readers and writers available. Doing otherwise can cause deadlocks as no read locks can proceed while there are write waiters. Sponsored by: Nokia	2008-02-07 06:16:54 +00:00
alc	4a600fdd88	Change shm_dotruncate() so that it correctly handles cached pages that span the end of the object. (This change is analogous to revision 1.237 of vm/vnode_pager.c.) Discussed with: jhb	2008-02-07 05:55:16 +00:00
grehan	bd098ec7fe	Make the openfirmware getchar entry point non-blocking. This catches up with jhb's 2005/05/27 loader multiple-console change. Tested by: marius/sparc64, grehan/ofwppc	2008-02-06 22:04:28 +00:00
mav	a735f997fd	Do not use bcmp() to compare two bytes with constants.	2008-02-06 20:37:34 +00:00
mav	a51f95cd58	Cleanup and tune ng_snd_item() function as it is one of the most busy netgraph functions. Tune stack protection constants to avoid division operation.	2008-02-06 18:50:40 +00:00
silby	1ff81684ca	Replace the random IP ID generation code we obtained from OpenBSD with an algorithm suggested by Amit Klein. The OpenBSD algorithm has a few flaws; see Amit's paper for more information. For a description of how this algorithm works, please see the comments within the code. Note that this commit does not yet enable random IP ID generation by default. There are still some concerns that doing so will adversely affect performance. Reviewed by: rwatson MFC After: 2 weeks	2008-02-06 15:40:30 +00:00
scottl	3c799f8e4f	Remove an errant definition for AMR_CONFIG_ENQ3_SOLICITED NOTIFY that was accidentally reverted in the previous commit.	2008-02-06 14:26:31 +00:00
attilio	a715e455c6	td cannot be NULL in that place, so just axe out the check.	2008-02-06 13:26:01 +00:00
scottl	29a1384601	Fix a symbol conflict between hptrr and hptmv	2008-02-06 05:33:17 +00:00
scottl	0635509b37	Update the hptrr driver to version 1.2. This adds port multiplier support for several cards. See the Highpoint website for more information. Again, many thanks to Highpoint for their continued support of FreeBSD.	2008-02-06 01:02:20 +00:00
jeff	77ea5a24c7	Adaptive spinning in write path with readers and writer starvation avoidance. - Move recursion checking into rwlock inlines to free a bit for use with adaptive spinners. - Clear the RW_LOCK_WRITE_SPINNERS flag whenever the lock state changes causing write spinners to restart their loop. - Write spinners are limited by a count while readers hold the lock as there is no way to know for certain whether readers are running still. - In the read path block if there are write waiters or spinners to avoid starving writers. Use a new per-thread count, td_rw_rlocks, to skip starvation avoidance if it might cause a deadlock. - Remove or change invalid assertions in turnstiles. Reviewed by: attilio (developed parts of the patch as well) Sponsored by: Nokia	2008-02-06 01:02:13 +00:00
attilio	209634d0e1	Bump __FreeBSD_version in order to reflect introduction of LK_NODUP and LK_NOWITNESS options in the lockmgr namespace.	2008-02-06 00:42:26 +00:00
attilio	6234a71797	Add WITNESS support to lockmgr locking primitive. This support tries to be as parallel as possible with other locking primitives, but there are differences; more specifically: - The base witness support is already equipped for allowing lock duplication acquisition as lockmgr relies on this. - In the case of lockmgr_disown() the lock result unlocked by witness even if it is still held by the "kernel context" - In the case of upgrading we can have 3 different situations: * Total unlocking of the shared lock and nothing else * Real witness upgrade if the owner is the first upgrader * Shared unlocking and exclusive locking if the owner is not the first upgrade but it is still allowed to upgrade - LK_DRAIN is basically handled like an exclusive acquisition Additively new options LK_NODUP and LK_NOWITNESS can now be used with lockinit(): LK_NOWITNESS disables WITNESS for the specified lock while LK_NODUP enables duplicated locks tracking. This will require manpages update and a __FreeBSD_version bumping (addressed by further commits). This patch also fixes a problem occurring if a lockmgr is held in exclusive mode and the same owner tries to acquire it in shared mode: currently there is a spurious shared locking acquisition while what we really want is a lock downgrade. Probably, this situation can be better served with a EDEADLK failing errno return. Side note: first testing on this patch already revealed several LORs reported, so please expect LORs cascades until resolved. NTFS also is reported broken by WITNESS introduction. BTW, NTFS is exposing a lock leak which needs to be fixed, and this patch can help it out if rightly tweaked. Tested by: kris, yar, Scot Hetzel <swhetzel at gmail dot com>	2008-02-06 00:37:14 +00:00
attilio	acc2f89a7f	Really, no explicit checks against lock_class_* object should be done in consumers code: using locks properties is much more appropriate. Fix current code doing these bogus checks. Note: Really, callout are not usable by all !(LC_SPINLOCK \| LC_SLEEPABLE) primitives like rmlocks doesn't implement the generic lock layer functions, but they can be equipped for this, so the check is still valid. Tested by: matteo, kris (earlier version) Reviewed by: jhb	2008-02-06 00:04:09 +00:00
raj	8c4ccad611	Introduce a standalone shell script for embedding MFS image. This allows to fix a problem with ARM kernel.bin not having the MFS image embedded: it is objcopied from the kernel.noheader temporary ELF file, which was not subject to embedding the MFS image previously. Reviewed by: imp Approved by: cognet (mentor)	2008-02-05 10:46:30 +00:00
raj	172a5277b4	ARM locore cosmetics. Approved by: cognet (mentor)	2008-02-05 10:23:42 +00:00
raj	a6d33e3164	Improve ARM_TP_ADDRESS and RAS area. De-hardcode usage of ARM_TP_ADDRESS and RAS local storage, and move this special purpose page to a more convenient place i.e. after the vectors high page, more towards the end of address space. Previous location (0xe000_0000) caused grief if KVA was to go beyond the default limit. Note that ARM world rebuilding is required after this change since the location of ARM_TP_ADDRESS is shared between kernel and userland. Submitted by: Grzegorz Bernacki (gjb AT semihalf dot com) Reviewed by: imp Approved by: cognet (mentor)	2008-02-05 10:22:33 +00:00
mav	cd7d07a63f	Prepare hooks direct pointers on setup to avoid heavy ng_findhook() calls during operation.	2008-02-04 19:26:53 +00:00
rwatson	f23198af5c	Further clean up sorflush: - Expose sbrelease_internal(), a variant of sbrelease() with no expectations about the validity of locks in the socket buffer. - Use sbrelease_internal() in sorflush(), and as a result avoid initializing and destroying a socket buffer lock for the temporary stack copy of the actual buffer, asb. - Add a comment indicating why we do what we do, and remove an XXX since things have gotten less ugly in sorflush() lately. This makes socket close cleaner, and possibly also marginally faster. MFC after: 3 weeks	2008-02-04 12:25:13 +00:00
marcel	c1a1c62b2a	Allocate a stack for thread0 and switch to it before calling mi_startup(). This frees up kstack for static PAL/SAL calls and double-fault handling.	2008-02-04 02:21:33 +00:00
mav	083c0a5fcb	Move all possible node logic out of the rcvdata() function to the newhook()/disconnect(). Unify function names with other nodes.	2008-02-03 18:55:45 +00:00
scottl	c08cc256e2	Update the hptiop driver to version 1.3. This adds support for the 4xxx series of adapters. Thanks again to Highpoint for their continued support of FreeBSD. Submitted by: Highpoint Technologies	2008-02-03 16:07:12 +00:00
phk	13132840a1	Give sendfile(2) a SF_SYNC flag which makes it wait until all mbufs referencing the files VM pages are returned from the network stack, making changes to the file safe. This flag does not guarantee that the data has been transmitted to the other end.	2008-02-03 15:54:41 +00:00
sephe	f42ebbd1b7	- Correctly handle ALTQ in ieee80211_deliver_data() - Add comment from sam that ALTQ probably does not work well with WME PR: kern/119548 Approved by: sam (mentor)	2008-02-03 12:00:03 +00:00
sephe	24b66433f6	Various bug fixes for 2560 parts of ral(4): - Rename rt2560_read_eeprom to rt2560_read_config, we already have rt2560_eeprom_read - If hardware gives us wrong encryption done index, shout out loudly and terminate the processing loop - Process encryption done if RX done bit is set in interrupt status register (according to Ralink Linux driver) - Turn VALID/BUSY bits in TX descriptor only after TX descriptor is fully setup - Fix BBP read: RT2560_BBPCSR can't be written until its RT2560_BBP_BUSY bit is off (according to Ralink Linux driver) - Skip invalid (0 of 0xffff) BBP register/value entries stored in EEPROM - Fix channel TX power location in EEPROM, if channel TX power is above 31 set it to 24 (TX power only has 5bits in RF register, "24" is according to Ralink Linux driver) - Configure BBP according to the BBP register/value stored in EEPROM, restore BBP17 (RX sensitivity tuning) to default value after this. - Set TX/RX antenna after BBP is initialized; these two operations will try to set BBP registers - Reconfigure ACK TX time registers according to 802.11g standard (TX @36Mb, other side's ACK should be sent @24Mb). - 2560 parts have two TX rings: one for management/control packets, one for data packets. Add private OACTIVE flag for each of them. Turn on IFF_DRV_OACTIVE if one of private OACTIVE is on; turn off IFF_DRV_OACTIVE iff all of them are off. - Rework watchdog to mimic old if_watchdog action. Process TX done/encryption done in watchdog function (according to Ralink Linux driver) Obtained from: DragonFly Approved by: sam (mentor) Tested by: sam Related to PR: kern/117655 # Forcing long slot time setting is not included in this commit, comment and # related code is in place, so if problem pops up, quick tests could be done.	2008-02-03 11:47:38 +00:00
mav	45f131f91d	Revert previous commit. glebius@ noticed that it was not a bug, but undocumented feature.	2008-02-03 10:30:45 +00:00
brueffer	c8400e0702	Don't build the rr232x module, it has been removed. Approved by: rwatson (mentor)	2008-02-03 10:05:00 +00:00
scottl	249efc9b24	Remove the rr232x driver. It has been superseded by the hptrr driver.	2008-02-03 07:07:30 +00:00
das	e0a189d4ba	Add a few more CPUID feature bits while here. We don't support these features yet.	2008-02-02 23:17:27 +00:00
das	eca0289cd9	SSE4 CPUID bits	2008-02-02 22:40:17 +00:00
rwatson	b3df56a7f6	Add comment that bpfread() has multi-threading issues. Fix minor white space nit.	2008-02-02 20:35:05 +00:00
imp	9dfc4839d1	pc98 lint builds w/o warnings. Remove the last special case from our compiler upgrade. # if tinderbox breaks, I'll fix it, but it shouldn't...	2008-02-02 19:55:28 +00:00
ambrisko	68facb1ef3	Fix some bugs in dealing with DCMD'd without data. MegaCli was sending down some DCMD's without any data. Thanks to Dell and LSI for helping to provide clues to figure out this problem. Now MegaCli can upgrade the firmware and should work identical when run on Linux. Reviewed by: scottl, LSI MFC after: 1 day	2008-02-02 17:29:28 +00:00
cognet	c68488ca76	Arm should build fine with -Werror as well.	2008-02-02 16:47:15 +00:00
imp	4ee011bbd6	sun4v has a MACHINE_ARCH of sparc64, so it was covered under that clause and shouldn't have been added. Remove it.	2008-02-02 16:40:40 +00:00
bz	cfb85f0c07	Rather than passing around a cached 'priv', pass in an ucred to ipsec_set_policy and do the privilege check only if needed. Try to assimilate both ip_ctloutput code blocks calling ipsec*_set_policy. Reviewed by: rwatson	2008-02-02 14:11:31 +00:00
marck	92f929f0dd	Fix one more grammo. Noticed by: ru	2008-02-02 08:41:53 +00:00
imp	5c71f02efc	Some platforms that are currently under development have to cope with a variety of bootloaders. This sometimes means that different loader scripts are required within one ${MACHINE_ARCH}, which makes the current practice of using ldscript.${MACHINE_ARCH} unsuitable. Instead, make the default the current convention and allow the ld scripts to be overridden as necessary.	2008-02-02 07:52:24 +00:00
imp	2c8230f498	Wall of shame rather than wall of fame for the -Werror suppression. If we aren't arm, pc98 or sun4v, then enable treating warnings like errors. That doesn't mean these platforms aren't -Werror clean, just that we haven't enforced it before. Someone with some spare time should investigate these three platforms to see if any can be removed.	2008-02-02 07:43:38 +00:00
sam	3a6b566d92	add opaque pointer to tx ampdu state for drivers MFC after: 3 days	2008-02-02 00:38:02 +00:00
sam	a0c183dcb8	o correct typo that broke check when handling addba response o add a comment about the ht rates being for 20MHz channels w/ long GI; needs a separate fix after more thought MFC after: 3 days	2008-02-01 21:31:10 +00:00
jhb	51f10aedc0	Relax the check for a PCI-express chipset by assuming the system is a PCI-express chipset (and thus has functional MSI) if there are any PCI-express devices in the system, not requiring a root port device. With PCI-X the chipset detection has to be very conservative because there are known systems with PCI-X devices that do not appear to have PCI-X chipsets. However, with PCI-express I'm not sure it is possible to have a PCI-express device in a system with a non-PCI-express chipset. If we assume that is the case then this change is valid. It is also required for at least some PCI-express systems that don't have any devices with a root port capability (some ICH9 systems). MFC after: 1 week Reported by: jfv	2008-02-01 20:31:09 +00:00
phk	df9c99b9c2	Give MEXTADD() another argument to make both void pointers to the free function controllable, instead of passing the KVA of the buffer storage as the first argument. Fix all conventional users of the API to pass the KVA of the buffer as the first argument, to make this a no-op commit. Likely break the only non-conventional user of the API, after informing the relevant committer. Update the mbuf(9) manual page, which was already out of sync on this point. Bump __FreeBSD_version to 800016 as there is no way to tell how many arguments a CPP macro needs any other way. This paves the way for giving sendfile(9) a way to wait for the passed storage to have been accessed before returning. This does not affect the memory layout or size of mbufs. Parental oversight by: sam and rwatson. No MFC is anticipated.	2008-02-01 19:36:27 +00:00
marck	e190c967dd	Reword recent comment a bit.	2008-02-01 17:35:46 +00:00
rwatson	95bb3acd1f	Use FEATURE() macro to advertise aio availability.	2008-02-01 11:59:14 +00:00
mav	49092fb3fc	Add comments about stack protection mechanism.	2008-02-01 11:01:15 +00:00
mav	7e0b4128cb	Tune the message for better informativity. Print the hook pointer as other functions do.	2008-02-01 07:25:06 +00:00
benno	538bebc843	Band-aid recent commit by mav by replacing a variable in a CTR statement with the variable that appears as if it should've been there. Pointy hat to: mav Not tested either by: benno	2008-02-01 07:17:26 +00:00
mav	5af3bb221f	Implement Session-ID hashing to improve receive performance scalability for big number of concurrent sessions.	2008-01-31 22:42:37 +00:00
jhb	9c76956524	For no good reason I had assumed that ACPI table headers would be page aligned (or at least not cross a page boundary). However, it turns out that on at least one machine one table header does cross a page boundary. This caused problems with the MADT early probe as it uses the crash dump map to load ACPI tables by loading the RSDT/XSDT into pages 1 ... N and loading the header of each ACPI table header into page 0 looking for the MADT. However, if a table header crossed a page boundary, then page 1 would get trashed resulting in a panic. Fix this by reserving the first 2 pages for ACPI table headers (headers are less than a page in size, so 2 pages will be sufficient) and use pages 2 .. N for the RSDT and XSDT. Note: amd64 should probably be simplified to just use pmap_mapbios() for all these tables which will use the direct map and not need the crash dump hack. MFC after: 5 days Tested on: i386 Reported by: Pete French petefrench of ticketswitch.com	2008-01-31 16:51:43 +00:00
rwatson	56d2c18f8e	Use dump_write() instead of direct calls to di->dumper() in textdumps. Textdumps already do pretty much the same sanity checking, but abstractions and seatbelts are both useful. MFC after: 2 months	2008-01-31 16:22:14 +00:00
mav	94236d3d42	Some code reformat.	2008-01-31 10:13:04 +00:00
mav	5df3e934b9	Implement stack protection based on GET_STACK_USAGE() macro. This fixes system panics possible with complicated netgraph setups and allows avoiding unneeded extra queueing for stack unwrapping.	2008-01-31 08:51:48 +00:00
mav	739abe292f	Move GET_STACK_USAGE from MI header to i386/amd64 MD ones. Somebody who can, please feel free to implement it for other archs or copy this one if it suits.	2008-01-31 08:24:27 +00:00
rwatson	c57fa54759	Correct two problems relating to sorflush(), which is called to flush read socket buffers in shutdown() and close(): - Call socantrcvmore() before sblock() to dislodge any threads that might be sleeping (potentially indefinitely) while holding sblock(), such as a thread blocked in recv(). - Flag the sblock() call as non-interruptible so that a signal delivered to the thread calling sorflush() doesn't cause sblock() to fail. The sblock() is required to ensure that all other socket consumer threads have, in fact, left, and do not enter, the socket buffer until we're done flushing it. To implement the latter, change the 'flags' argument to sblock() to accept two flags, SBL_WAIT and SBL_NOINTR, rather than one M_WAITOK flag. When SBL_NOINTR is set, it forces a non-interruptible sx acquisition, regardless of the setting of the disposition of SB_NOINTR on the socket buffer; without this change it would be possible for another thread to clear SB_NOINTR between when the socket buffer mutex is released and sblock() is invoked. Reviewed by: bz, kmacy Reported by: Jos Backus <jos at catnook dot com>	2008-01-31 08:22:24 +00:00
grehan	54c503ac15	Enable ofwdump on powerpc (finally). Tested on G3 & G4 machines. Submitted by: Dan Stekloff <dsteklof at c i s c o dot com> Discussed with: marcel	2008-01-31 01:57:33 +00:00
cognet	4c1734c71d	Bring in the nice work from Mark Tinguely on arm pmap. The only downside is that it renames pmap_vac_me_harder() to pmap_fix_cache(). From Mark's email on -arm : pmap_get_vac_flags(), pmap_vac_me_harder(), pmap_vac_me_kpmap(), and pmap_vac_me_user() have been rewritten as pmap_fix_cache() to be more efficient in the kernel map case. I also removed the reference to the md.kro_mappings, md.krw_mappings, md.uro_mappings, and md.urw_mappings counts. In pmap_clearbit(), we can also skip over tests and writeback/invalidations in the PVF_MOD and PVF_REF cases if those bits are not set in the pv_flag. PVF_WRITE will turn caching back on and remove the PV_MOD bit. In pmap_nuke_pv(), the vm_page_flag_clear(pg, PG_WRITEABLE) has been moved to the pmap_fix_cache(). We can be more aggressive in attempting to turn caching back on by calling pmap_fix_cache() at times that may be appropriate to turn cache on (a kernel mapping has been removed, a write has been removed or a read has been removed and we know the mapping does not have multiple write mappings to a page). In pmap_remove_pages() the cpu_idcache_wbinv_all() is moved to happen before the page tables are NULLed because the caches are virtually indexed and virtually tagged. In pmap_remove_all(), the pmap_remove_write(m) is added before the page tables are NULLed because the caches are virtually indexed and virtually tagged. This also removes the need for the caches fixing routine (whichever is being used pmap_vac_me_harder() or pmap_fix_cache()) to be called on any of these mappings. In pmap_remove(), I simplified the cache cleaning process and removed extra TLB removals. Basically if more than PMAP_REMOVE_CLEAN_LIST_SIZE are removed, then just flush the entire cache.	2008-01-31 00:05:40 +00:00
mav	cdf5b4e4f4	Implement GET_STACK_USAGE() macro to get the current kernel thread stack usage. This implementation is made for growing down stack organization like i386/amd64 platforms have, but prefers different machine dependent version if it is present.	2008-01-30 21:24:10 +00:00
yongari	47f4378094	Fix link state handling in bfe(4). o conversion to callout(9) API. o add a missing driver lock in bfe_ifmedia_sts(). o use our callout to drive watchdog timer. o restart Tx routine if pending queued packets are present in watchdog handler. o unarm watchdog timer only if there are no queued packets. o don't blindly reset phy and let phy driver handle link change request in bfe_init_locked(). o return the status of mii_mediachg() to caller in bfe_ifmedia_upd(). Previously it always returned 0 to caller. o add check for IFF_DRV_RUNNING flag as well as IFF_DRV_OACTIVE in bfe_start_locked(). o implement miibus_statchg method that keeps track of current link state changes as well as negotiated speed/duplex/ flow-control configuration. Reprogram MAC to appropriate duplex state. Flow-control configuration was also implemented but commented out at the moment. The flow-control configuration will be enabled again after we have general flow-control framework in mii layer. Reported by: Yousif Hassan < yousif () alumni ! jmu ! edu > Tested by: Yousif Hassan < yousif () alumni ! jmu ! edu >	2008-01-29 02:15:11 +00:00
mav	5eebdfa072	Avoid data copying when it is possible. bpf_filter() is able to work directly on mbuf chain.	2008-01-28 22:37:17 +00:00
ru	910410640b	Add a wrapper function that bound checks writes to the dump device.	2008-01-28 19:04:07 +00:00
brueffer	2a2d8bde46	Correct the kernel config hint in a printf statement. Submitted by: R.Mahmatkhanov <cvs-src@yandex.ru> Reviewed by: scottl Approved by: rwatson (mentor) MFC after: 3 days	2008-01-28 18:13:03 +00:00
csjp	03cdec7669	Make sure that the termid type is initialized to AU_IPv4 by default. This makes sure that process tokens credentials with un-initialized audit contexts are handled correctly. Currently, when invariants are enabled, this change fixes a panic by ensuring that we have a valid termid family. Also, this fixes token generation for process tokens making sure that userspace is always getting a valid token. This is consistent with what Solaris does when an audit context is un-initialized. Obtained from: TrustedBSD Project MFC after: 1 week	2008-01-28 17:33:46 +00:00
iwasaki	b46ea5a5ec	Don't repeat error logging about NOP message sending if ciss_report_request() return an error (which is most likely data underrun). Noticed by: Mark Atkinson MFC after: 1 week	2008-01-28 16:21:34 +00:00
gallatin	98cf716012	Only reset driver state when a hardware error is detected. Preserve warning but do not reset if we enter the routine without seeing a hardware error.	2008-01-28 13:20:51 +00:00
rrs	ce5fec50e4	- Fix a comment about prison. - Fix it so the VRF is captured while locks are held. MFC after: 1 week	2008-01-28 10:34:38 +00:00
rrs	dbf34dbcc6	- Change back to using priority 0. Which means don't change the priority when running the thread. (this is for the sctp_interator thread). MFC after: 1 week	2008-01-28 10:33:41 +00:00
rrs	9df3360d89	- Fix a bug where the socket may have been closed which could cause a crash in the auth code. Obtained from: Michael Tuexen MFC after: 1 week	2008-01-28 10:31:12 +00:00
rrs	13897491cf	- Fixes a comparison wrap issue with sack gap ack blocks that span the 32 bit roll over mark.	2008-01-28 10:25:43 +00:00
rwatson	3043fa2dca	Properly return the error from mls_subject_privileged() in the ifnet relabel check for MLS rather than returning 0 directly. This problem didn't result in a vulnerability currently as the central implementation of ifnet relabeling also checks for UNIX privilege, and we currently don't guarantee containment for the root user in mac_mls, but we should be using the MLS definition of privilege as well as the UNIX definition in anticipation of supporting root containment at some point. MFC after: 3 days Submitted by: Zhouyi Zhou <zhouzhouyi at gmail dot com> Sponsored by: Google SoC 2007	2008-01-28 10:20:18 +00:00
iwasaki	956e8b1018	Return errno value rather than boolean in this context. MFC after: 1 week	2008-01-28 01:57:48 +00:00
iwasaki	53452c8147	Enter the sleep state immediately without waiting for timeout if devd(8) is not running such as the system in single user mode. MFC after: 1 week	2008-01-27 16:11:04 +00:00
iwasaki	a9f086bbd3	Add devctl_process_running() so that power management system driver can check whether devd(8) is running. MFC after: 1 week	2008-01-27 16:06:37 +00:00
mav	1a411ba3c5	Run expire even without export hook connected. PR: kern/119839	2008-01-27 15:01:16 +00:00
mav	c2d1050fba	Fix memory leak when export hook is not connected.	2008-01-27 09:22:10 +00:00
mav	2adafc5538	Remove one very strange unneeded if.	2008-01-27 08:52:41 +00:00
kmacy	4b5f86e85f	Fix loading for case where we don't overload tcp_usrreqs by calling tcp_drop directly	2008-01-27 04:39:38 +00:00
kmacy	f999d9af27	fix DISABLE_MBUF_IOVEC case by initializing mbuf header completely	2008-01-27 04:37:02 +00:00
mav	34b15a0c5e	Slightly simplify code.	2008-01-27 02:04:12 +00:00
marius	7851cf479d	- Fix a typo in a comment. - Fix whitespace according to style(9). - Sync the comment describing why we have to wait in nsphy_reset() with nsphyter_reset(). It's true that the manual tells to not do a reset within 500us of applying power but that's unlikely the cause of problems seen here. Generally having to wait 500us after a reset however is.	2008-01-27 01:30:02 +00:00
marius	edbc40c951	Fix a typo in a comment.	2008-01-27 01:11:57 +00:00
marius	6ebda87d5b	Add a driver for the National Semiconductor DP83815, DP83843 and DP83847 PHYs. The main reason for using a specific driver for these PHYs are reset quirks similar to the nsphy(4) driven DP83840A. PR: 112654 Obtained from: NetBSD MFC after: 2 weeks Thanks to: mlaier for testing w/ DP83815	2008-01-27 01:10:41 +00:00
marius	1bf40ea789	Sort values according to style.Makefile(5).	2008-01-27 01:02:29 +00:00
rwatson	44c26c0841	Increase maximum DDB capture buffer size to 5MB. PR: 119993 MFC after: 2 months Suggested by: Scot Hetzel <swhetzel at gmail dot com>	2008-01-26 23:02:14 +00:00
mav	afe3cc011f	Improve multilink receive performance by netgraph item reuse.	2008-01-26 22:42:47 +00:00
mav	56891c11ee	Improve multilink xmit performance by netgraph item reuse.	2008-01-26 22:41:14 +00:00
mav	4b87cfa8f8	Improve multilink receive performance with fragment headers preallocation.	2008-01-26 22:39:05 +00:00
sam	13fcc58a15	rx mbufs must have a pkthdr; use m_gethdr to populate the rx ring (and while here correct the mbuf type) Submitted by: Sam Banks <w0lfie@clear.net.nz> MFC after: 1 week	2008-01-26 22:35:57 +00:00
rwatson	db8d5a7a3d	Allow DDB_CAPTURE_DEFAULTBUFSIZE and DDB_CAPTURE_MAXBUFSIZE to be overridden at compile-time using kernel options of the same names. Rather than doing a compile-time CTASSERT of buffer sizes being even multiples of block sizes, just adjust them at boottime, as the failure mode is more user-friendly. MFC after: 2 months PR: 119993 Suggested by: Scot Hetzel <swhetzel at gmail dot com>	2008-01-26 22:32:23 +00:00
yar	ac1e4103b9	Our fts(3) API, as inherited from 4.4BSD, suffers from integer fields in FTS and FTSENT structs being too narrow. In addition, the narrow types creep from there into fts.c. As a result, fts(3) consumers, e.g., find(1) or rm(1), can't handle file trees an ordinary user can create, which can have security implications. To fix the historic implementation of fts(3), OpenBSD and NetBSD have already changed <fts.h> in somewhat incompatible ways, so we are free to do so, too. This change is a superset of changes from the other BSDs with a few more improvements. It doesn't touch fts(3) functionality; it just extends integer types used by it to match modern reality and the C standard. Here are its points: o For C object sizes, use size_t unless it's 100% certain that the object will be really small. (Note that fts(3) can construct pathnames _much_ longer than PATH_MAX for its consumers.) o Avoid the short types because on modern platforms using them results in larger and slower code. Change shorts to ints as follows: - For variables that count simple, limited things like states, use plain vanilla `int' as it's the type of choice in C. - For a limited number of bit flags use `unsigned' because signed bit-wise operations are implementation-defined, i.e., unportable, in C. o For things that should be at least 64 bits wide, use long long and not int64_t, as the latter is an optional type. See FTSENT.fts_number aka FTS.fts_bignum. Extending fts_number `to satisfy future needs' is pointless because there is fts_pointer, which can be used to link to arbitrary data from an FTSENT. However, there already are fts(3) consumers that require fts_number, or fts_bignum, have at least 64 bits in it, so we must allow for them. o For the tree depth, use `long'. This is a trade-off between making this field too wide and allowing for 64-bit inode numbers and/or chain-mounted filesystems. 
On the one hand, `long' is almost enough for 32-bit filesystems on a 32-bit platform (our ino_t is uint32_t now). On the other hand, platforms with a 64-bit (or wider) `long' will be ready for 64-bit inode numbers, as well as for several 32-bit filesystems mounted one under another. Note that fts_level has to be signed because -1 is a magic value for it, FTS_ROOTPARENTLEVEL. o For the `nlinks' local var in fts_build(), use `long'. The logic in fts_build() requires that `nlinks' be signed, but our nlink_t currently is uint16_t. Therefore let's make the signed var wide enough to be able to represent 2^16-1 in pure C99, and even 2^32-1 on a 64-bit platform. Perhaps the logic should be changed just to use nlink_t, but it can be done later w/o breaking fts(3) ABI any more because `nlinks' is just a local var. This commit also includes supporting stuff for the fts change: o Preserve the old versions of fts(3) functions through libc symbol versioning because the old versions appeared in all our former releases. o Bump __FreeBSD_version just in case. There is a small chance that some ill-written 3-rd party apps may fail to build or work correctly if compiled after this change. o Update the fts(3) manpage accordingly. In particular, remove references to fts_bignum, which was a FreeBSD-specific hack to work around the too narrow types of FTSENT members. Now fts_number is at least 64 bits wide (long long) and fts_bignum is an undocumented alias for fts_number kept around for compatibility reasons. According to Google Code Search, the only big consumers of fts_bignum are in our own source tree, so they can be fixed easily to use fts_number. o Mention the change in src/UPDATING. PR: bin/104458 Approved by: re (quite a while ago) Discussed with: deischen (the symbol versioning part) Reviewed by: -arch (mostly silence); das (generally OK, but we didn't agree on some types used; assuming that no objections on -arch let me stick to my opinion)	2008-01-26 17:09:40 +00:00
rwatson	2918e1cac2	Rename DB_ constants in db_capture.c to DDB_ so that when they are exposed as kernel compile options, they have more meaningful names. PR: 119993 MFC after: 2 months Suggested by: Scot Hetzel <swhetzel at gmail dot com>	2008-01-26 13:55:52 +00:00
rwatson	3b2455b135	Remove Giant acquisition around soreceive() and sosend() in fifofs. The bug that caused us to reintroduce it is believed to be fixed, and Kris says he no longer sees problems with fifofs in highly parallel builds. If this works out, we'll MFC it for 7.1. MFC after: 3 months Pointed out by: kris	2008-01-26 12:34:23 +00:00
kib	82cf20c0b8	In rev. 1.156, the conversion of the minor number to the unit number resulted in the argument to the make_dev() to be a unit number. Correct this by supplying a minor number to make_dev(), and using the unit number for the calculation of the slave tty name. Reported and tested by: Peter Holm Reviewed by: jhb Yet another pointy hat to: kib MFC after: 1 day	2008-01-26 06:09:23 +00:00
julian	e106c6b62c	One of my powerbooks has this chip in it.. Confirmed by looking at netbsd.. they have also added this. checked by grehen MFC After: 3 days	2008-01-26 05:11:09 +00:00
emaste	3c84281c0e	Allow arbitrary baud rates, not just the standard ones.	2008-01-26 04:30:48 +00:00
kmacy	d6170ff3e2	add opt_global.h dependency	2008-01-26 01:00:56 +00:00
jhb	dd3b84ba3a	Fix a bug where a thread that hit the race where the sleep timeout fires while the thread does not hold the thread lock would stop blocking for subsequent interruptible sleeps and would always immediately fail the sleep with EWOULDBLOCK instead (even sleeps that didn't have a timeout). Some background: - KSE has a facility for allowing one thread to interrupt another thread. During this process, the target thread aborts any interruptible sleeps much as if the target thread had a pending signal. Once the target thread acknowledges the interrupt, normal sleep handling resumes. KSE manages this via the TDF_INTERRUPTED flag. Specifically, it sets the flag when it sends an interrupt to another thread and clears it when the interrupt is acknowledged. (Note that this is purely a software interrupt sort of thing and has no relation to hardware interrupts or kernel interrupt threads.) - The old code for handling the sleep timeout race handled the race by setting the TDF_INTERRUPT flag and faking a KSE-style thread interrupt to the thread in the process of going to sleep. It probably should have just checked the TDF_TIMEOUT flag in sleepq_catch_signals() instead. - The bug was that the sleepq code would set TDF_INTERRUPT but it was never cleared. The sleepq code couldn't safely clear it in case there actually was a real KSE thread interrupt pending for the target thread (in fact, the sleepq timeout actually stomped on said pending interrupt). Thus, any future interruptible sleeps (sleep(.. PCATCH ..) or cv_wait_sig()) would see the TDF_INTERRUPT flag set and immediately fail with EWOULDBLOCK. The flag could be cleared if the thread belonged to a KSE process and another thread posted an interrupt to the original thread. However, in the more common case of a non-KSE process, the thread would pretty much stop sleeping. - Fix the bug by just setting TDF_TIMEOUT in the sleepq timeout code and not messing with TDF_INTERRUPT and td_intrval. 
With yesterday's fix to fix sleepq_switch() to check TDF_TIMEOUT, this is now sufficient. MFC after: 3 days	2008-01-25 19:44:46 +00:00
rwatson	1dcfe4a494	Hide ipfw internal data structures behind IPFW_INTERNAL rather than exposing them to all consumers of ip_fw.h. These structures are used in both ipfw(8) and ipfw(4), but not part of the user<->kernel interface for other applications to use, rather, shared implementation. MFC after: 3 days Reported by: Paul Vixie <paul at vix dot com>	2008-01-25 14:38:27 +00:00
emaste	fb709bf5c6	Calculate baud rate divisor instead of allowing only a fixed set of standard rates. Obtained from OpenBSD src/sys/dev/usb/uftdi.c 1.29 src/sys/dev/usb/uftdireg.h 1.11 OpenBSD revisions noted by: ticso, on hackers	2008-01-25 02:41:44 +00:00
jhb	5d22bdedcf	Fix a race in the sleepqueue timeout code that resulted in sleeps not being properly cancelled by a timeout. In general there is a race between the sleepq timeout handler firing while the thread is still in the process of going to sleep. In 6.x with sched_lock, the race was largely protected by sched_lock. The only place it was "exposed" and had to be handled was while checking for any pending signals in sleepq_catch_signals(). With the thread lock changes, the thread lock is dropped in between sleepq_add() and sleepq_wait() opening up a new window for this race. Thus, if the timeout fired while the sleeping thread was in between sleepq_add() and sleepq_wait(), the thread would be marked as timed out, but the thread would not be dequeued and sleepq_switch() would still block the thread until it was awakened via some other means. In the case of pause(9) where there is no other wakeup, the thread would never be awakened. Fix this by teaching sleepq_switch() to check if the thread has had its sleep canceled before blocking by checking the TDF_TIMEOUT flag and aborting the sleep and dequeueing the thread if it is set. MFC after: 3 days Reported by: dwhite, peter	2008-01-25 02:09:38 +00:00
dumbbell	ba3df23cb8	When asked to use kqueue, AIO stores its internal state in the `kn_sdata' member of the newly registered knote. The problem is that this member is overwritten by a call to kevent(2) with the EV_ADD flag, targeted at the same kevent/knote. For instance, a userland application may set the pointer to NULL, leading to a panic. A testcase was provided by the submitter. PR: kern/118911 Submitted by: MOROHOSHI Akihiko <moro@remus.dti.ne.jp> MFC after: 1 day	2008-01-24 17:10:19 +00:00
kib	17d99001fa	Do not dereference NULL scp in the case the screen is not opened. Instead, return ENXIO to the ioctl caller. Reported and tested by: Pawel Worach <pawel.worach gmail com> Discussed with: markus MFC after: 3 days	2008-01-24 15:37:48 +00:00
attilio	004bd547ab	Bump __FreeBSD_version in order to signal: - lockmgr() prototype changing - lockcount() axing - LOCKMGR_ASSERT() axing	2008-01-24 12:37:54 +00:00
attilio	7213f4c32b	Cleanup lockmgr interface and exported KPI: - Remove the "thread" argument from the lockmgr() function as it is always curthread now - Axe lockcount() function as it is no longer used - Axe LOCKMGR_ASSERT() as it is really bogus and not currently used. Hopefully this will soon be replaced by something suitable for it. - Remove the prototype for dumplockinfo() as the function is no longer present Additionally: - Introduce a KASSERT() in lockstatus() in order to let it accept only curthread or NULL as they should only be passed - Do a little bit of style(9) cleanup on lockmgr.h The KPI is heavily broken by this change, so manpages and FreeBSD_version will be modified accordingly by further commits. Tested by: matteo	2008-01-24 12:34:30 +00:00
pjd	fbc761a694	- Reduce how much ZFS caches by default. This is another change to mitigate 'kmem_map too small panics'. - Print two warnings if there is not enough memory and not enough address space. - Improve comment.	2008-01-24 11:24:16 +00:00
pjd	435a09e625	Change type of kmem_used() and kmem_size() functions to uint64_t, so it doesn't overflow in arc.c in this check: if (kmem_used() > (kmem_size() * 4) / 5) return (1); With this bug ZFS almost doesn't cache. Only 32bit machines are affected that have vm.kmem_size set to values >=1GB. Reported by: David Taylor <davidt@yadt.co.uk>	2008-01-24 11:21:54 +00:00
bz	1c376286e0	Replace the last susers calls in netinet6/ with privilege checks. Introduce a new privilege allowing to set certain IP header options (hop-by-hop, routing headers). Leave a few comments to be addressed later. Reviewed by: rwatson (older version, before addressing his comments)	2008-01-24 08:25:59 +00:00
bz	ca561e0217	Differentiate between addifaddr and delifaddr for the privilege check. Reviewed by: rwatson MFC after: 2 weeks	2008-01-24 08:14:38 +00:00
scottl	901f8fc119	Many improvements that have been collected over time: - Improve error handling for load operations. - Fix a memory corruption bug when using certain linux management apps. - Allocate all commands up front to avoid OOM deadlocks later on.	2008-01-24 07:26:53 +00:00
rwatson	8aff4dd3cd	tcp_usrreq.c:1.313 removed tcbinfo locking from tcp_usr_accept(), which while in principle a good idea, opened us up to a race inherent to the syncache's direct insertion of incoming TCP connections into the "completed connection" listen queue, as it transpires that the socket is inserted before the inpcb is fully filled in by syncache_expand(). The bug manifested with the occasional returning of 0.0.0.0:0 in the address returned by the accept() system call, which occurred if accept managed to execute tcp_usr_accept() before syncache_expand() had copied the endpoint addresses into inpcb connection state. Re-add tcbinfo locking around the address copyout, which has the effect of delaying the copy until syncache_expand() has finished running, as it is run while the tcbinfo lock is held. This is undesirable in that it increases contention on tcbinfo further, but a more significant change will be required to how the syncache inserts new sockets in order to fix this and keep more granular locking here. In particular, either more state needs to be passed into sonewconn() so that pru_attach() can fill in the fields before the socket is inserted, or the socket needs to be inserted in the incomplete connection queue until it is actually ready to be used. Reported by: glebius (and kris) Tested by: glebius	2008-01-23 21:15:51 +00:00
mav	1a704c3175	Fix bundle xmit octets stats for packet-split operation mode.	2008-01-23 11:47:09 +00:00
maxim	3250102574	o Add boot, gdb, nfsserver and opencrypto dirs to CSCOPEDIRS; sort.	2008-01-23 08:50:34 +00:00
jeff	be58be75dd	- sched_prio() should only adjust tdq_lowpri if the thread is running or on a run-queue. If the priority is numerically raised only change lowpri if we're certain it will be correct. Some slop is allowed; however, previously we could erroneously raise lowpri for an idle cpu that a thread had recently run on, which led to errors in load balancing decisions.	2008-01-23 03:10:18 +00:00
ache	fa95b66299	Bump FreeBSD_version after adding "ASCII" encoding to libc	2008-01-23 00:04:18 +00:00
gallatin	7bafc7ac86	Take advantage of the new physically contiguous 9K jumbos in 8.	2008-01-22 22:04:31 +00:00
rwatson	b33bafcdc2	Put "coda_rdwr: Internally Opening" printf generated by in-kernel writes to files, such as ktrace output, under CODA_VERBOSE. Otherwise, each such call to VOP_WRITE() results in a kernel printf. MFC after: 3 days Obtained from: NetBSD	2008-01-21 21:39:08 +00:00
rwatson	a718996964	Replace references to VOP_LOCK() w/o LK_RETRY to vn_lock() with LK_RETRY, avoiding extra error handling, or in some cases, missing error handling. MFC after: 3 days Discussed with: kib	2008-01-21 21:19:07 +00:00
ru	420c47639c	Add copyrights. PR: 119136	2008-01-21 13:26:33 +00:00
yongari	32ed0eb348	Fix build.	2008-01-21 09:51:28 +00:00
yongari	c4d5fd8820	Uncomment sf(4), sf(4) should work on all architectures.	2008-01-21 06:51:25 +00:00
yongari	571e260180	Overhaul sf(4) to make it run on all architectures and implement checksum offload by downloading AIC-6915 firmware. Changes are o Header file cleanup. o Simplified probe logic. o s/u_int{8,16,32}_t/uint{8,16,32}_t/g o K&R -> ANSI C. o In register access function, added support for both memory mapped and I/O space register access. The function will dynamically detect which method is chosen. o sf_setperf() was modified to support strict-alignment architectures. o Use SF_MII_DATAPORT instead of hardcoded value 0xffff. o Added link state/speed, duplex changes handling task q. The task q is also responsible for flow control settings. o Always honor link up/down state reported by mii layers. The link state information is used in sf_start() to determine whether we got a valid link. o Added experimental flow-control setup. It was commented out but will be activated once we have flow-control infrastructure in mii layer. o Simplify IFF_UP/IFCAP_POLLING and IFF_PROMISC handling logic. Rx filter always honors promiscuous mode. o Implemented suspend/resume methods. o Reorganized Rx filter routine so promiscuous mode changes don't require interface re-initialization. o Reimplemented driver probe routine such that it looks for matching device from supported hardware list table. This change will help to add newer hardware revision to the driver. o Use ETHER_ADDR_LEN instead of hardcoded value. o Prefer memory space register mapping over I/O space as the hardware requires lots of register access to get various consumer/producer index. Failing to get memory space mapping, sf(4) falls back to I/O space mapping. Use of memory space register mapping requires somewhat large memory space(512K), though. o Switch to simpler bus_{read,write}_{1,2,4}. o Use PCIR_BAR macro to get BARs. o Program PCI cache line size if the cache line size was set to 0 and enable PCI MWI. o Add a new sysctl node 'dev.sf.N.stats' that shows various MAC counters for Rx/Tx statistics. 
o Add a sysctl node to configure interrupt moderation timer. The timer defers interrupts generation until time specified in timer control register is expired. The value in the timer register is in units of 102.4us. The allowable range for the timer is 0 - 31 (0 ~ 3.276ms). The default value is 1(102.4us). Users can change the timer value with dev.sf.N.int_mod sysctl(8) variable/loader(8) tunable. o bus_dma(9) conversion - Enable 64bit DMA addressing. - Enable 64bit descriptor format support. - Apply descriptor ring alignment requirements(256 bytes alignment). - Apply Rx buffer address alignment requirements(4 bytes alignment). - Apply 4GB boundary restrictions(Tx/Rx ring and its completion ring should live in the same 4GB address space.) - Set the number of allowable DMA segments to 16. In fact, AIC-6915 doesn't have a limit for number of DMA segments but it would be a waste of Tx descriptor resource if we allow more than 16. - Rx/Tx side bus_dmamap_load_mbuf_sg(9) support. - Added alignment fixup code for strict-alignment architectures. - Added endianness support code in Tx/Rx descriptor access. With these changes sf(4) should work on all platforms. o Don't set if_mtu in device attach, it's handled in ether_ifattach. o Use our own callout to drive watchdog timer. o Enable VLAN oversized frames and announce sf(4)'s VLAN capability to upper layer. o In sf_detach(), remove mtx_initialized KASSERT as it's not possible to get there without initializing the mutex. Also mark that we're about to detach so active bpf listeners do not panic the system. o To reduce PCI register access cycles, Rx completion ring is directly scanned instead of reading consumer/producer index registers. In theory, Tx completion ring also can be directly scanned. However the completion ring is composed of two types of completion (1 for Tx done and 1 for DMA done). So reading producer index via register access would be a safer way to detect the ring wrap-around. 
o In sf_rxeof(), don't use m_devget(9) to align received frames. The alignment is required only for strict-alignment architectures and now the alignment is handled by sf_fixup_rx() if required. The removal of the copy operation in fast path should increase Rx performance a lot on non-strict-alignment architectures such as i386 and amd64. o In sf_newbuf(), don't set descriptor valid bit as sf(4) is programmed to run with normal mode. In normal mode, the valid bit has no meaning. The valid bit should be used only when the hardware uses polling(prefetch) mode. The end of descriptor queue bit could be used if needed, but sf(4) relies on auto-wrapping of hardware on 256 descriptor queue entries so both valid and descriptor end bit are not used anymore. o Don't disable generation of Tx DMA completion as said in datasheet and use the Tx DMA completion entry instead of relying on Tx done completion entry. Also added additional Tx completion entry type check in Tx completion handler. o Don't blindly reset watchdog timer in sf_txeof(). sf(4) now unarms the watchdog only if there are no active Tx descriptors in Tx queue. o Don't manually update various counters in driver, instead, use built-in MAC statistic registers to update them. The statistic registers are updated every second. o Modified Tx underrun handlers to increase the threshold value in units of 256 bytes. Previously it used to increase 16 bytes at a time which seems to take too long to stabilize whenever Tx underrun occurs. o In interrupt handler, additional check for the interrupt is performed such that only interrupts for this device are allowed to process descriptor rings. Because reading SF_ISR register clears all interrupts, nuke writing to a SF_ISR register. o Tx underrun is an abnormal condition and SF_ISR_ABNORMALINTR includes the interrupt. So there is no need to inspect the Tx underrun again in main interrupt loop. o Don't blindly reinitialize hardware for abnormal interrupt condition. 
sf(4) reinitializes the hardware only when it encounters DMA error which requires an explicit hardware reinitialization. o Fix a long standing bug that incorrectly clears MAC statistic registers in sf_init_locked. o Added strict-alignment safe way of ethernet address reprogramming as IF_LLADDR may return unaligned address. o Move sf_reset() to sf_init_locked in order to always reset the hardware to a known state prior to configuring hardware. o Set default Rx DMA, Tx DMA parameters as shown in datasheet. o Enable PCI busmaster logic and autopadding for VLAN frames. o Rework sf_encap. - Previously sf(4) used type 0 Tx descriptors with padding enabled to store driver private data. Embedding private data structures into descriptors is a bad idea as the structure size would be different between 64bit and 32bit architectures. The type 0 descriptor allows fixed number of DMA segments in a descriptor format and provides relatively simple interface to manage multi-fragmented frames. However, it wastes lots of Tx descriptors as not all frames are fragmented as the number of allowable segments in a descriptor. - To overcome the limitation of type 0 descriptor, switch to type 2 descriptor which allows 64bit DMA addressing and can handle unlimited number of fragmented DMA segments. The drawback of type 2 descriptor is in its complexity in managing descriptors as driver should handle the end of Tx ring manually. - Manually set Tx descriptor queue end mark and record number of used descriptors to reclaim used descriptors in sf_txeof(). o Rework sf_start. - Honor link up/down state before attempting transmission. - Because sf(4) uses only one of two Tx queues, use low priority queue instead of high one. This will remove one shift operation in each Tx kick command. - Cache last producer index into softc such that subsequent Tx operations don't need to access producer index register. o Rewrote sf_stats_update to include all available MAC statistic counters. 
o Employ AIC-6915 firmware from Adaptec and implement firmware download routine and TCP/UDP checksum offload. Partial checksum offload support was commented out due to the possibility of firmware bug in RxGFP. The firmware can strip VLAN tag in Rx path but the lack of firmware assistance of VLAN tag insertion in transmit side made it useless on FreeBSD. Unlike checksum offload, FreeBSD requires both Tx/Rx hardware VLAN assistance capability. The firmware may also detect wakeup frame and can wake system up from states other than D0. However, the lack of wakeup support from D3cold state keeps me from adding WOL capability. Also detecting WOL frame requires firmware support but it's not yet known to me whether the firmware can process the WOL frame. o Changed _ADDR_HIADDR to _ADDR_HI to match other definitions of registers. o Added definitions for interrupt moderation related constants. o Redefined SF_INTRS to include Tx DMA done and DMA errors. Removed Tx done as it's not needed anymore. o Added definition for Rx/Tx DMA high priority threshold. o Nuked unused macros SF_IDX_LO, SF_IDX_HI. o Added complete MAC statistic register definition. o Modified sf_stats structure to hold all MAC statistic registers. o Nuke various driver private padding data in Tx/Rx descriptor definition. sf(4) no longer requires private padding. Also remove unused padding related definitions. This greatly simplifies descriptor manipulation on 64bit architectures. o Because we no longer pad driver private data into descriptor, remove deprecated/not-applicable comments for padding. o Redefine Rx/Tx descriptor status. sf(4) doesn't use bit fields anymore to support endianness. Tested by: bruffer (initial version)	2008-01-21 06:38:23 +00:00
yongari	d1a53c74fd	It seems that the firmware for TxGFP does not work at all. I could be wrong but I couldn't find a way to make it work. In addition, the number of TxGFP instructions does not match the firmware image size, so I guess something was wrong when Adaptec generated the TxGFP firmware from their DDK. According to datasheet, normally, the first GFP instruction would be opcode C, WaitForStartOfFrame, to synchronize checksumming with incoming frame. But the first instruction in TxGFP firmware was opcode 1, BrToImmIfTrue, so it could not process checksum correctly, I guess. Checking for RxGFP firmware also indicates the first instruction should be opcode C. Since the number of instructions in TxGFP firmware lacks exactly one instruction, I prepended the opcode C to TxGFP firmware image. With this change, the resulting image size perfectly matches with the number of instructions and Tx checksum offload seems to work without problems.	2008-01-21 05:09:07 +00:00
yongari	84fda61777	Import AIC-6915 firmware for GFP from Adaptec. Special thanks to bruffer to send the firmware image to me. Submitted by: gibbs via bruffer	2008-01-21 05:02:44 +00:00
yongari	f8de05c123	sf(4) was repocopied to src/sys/dev/sf.	2008-01-21 04:45:58 +00:00
yongari	d48c6f0552	Update file list and Makefile after repocopying sf(4) from src/sys/pci to src/sys/dev.	2008-01-21 04:27:32 +00:00
yongari	3ce1d0d3db	Forced commit to note that sf(4) was repocopied from sys/pci and modified for its new location.	2008-01-21 04:23:47 +00:00
rwatson	0e6bbfc8e3	Regenerate.	2008-01-20 23:44:24 +00:00
rwatson	ff05f9dd9d	Use audit events AUE_SHMOPEN and AUE_SHMUNLINK with new system calls shm_open() and shm_unlink(). More auditing will need to be done for these calls to capture arguments properly.	2008-01-20 23:43:06 +00:00
rwatson	ff397597d9	Export a type for POSIX SHM file descriptors via kern.proc.filedesc as used by procstat, or SHM descriptors will show up as type unknown in userspace.	2008-01-20 19:55:52 +00:00
bz	866f483083	Correct the commented out debugging printf()s in REPLACE and NEXT macros. ip6_sprintf() needs a buffer as first argument these days. MFC after: 2 weeks	2008-01-20 10:08:15 +00:00
kmacy	ac46645d70	Re-enable pcpu caching by default make sysctl R/W	2008-01-19 22:47:43 +00:00
das	d6e95041fd	Correct the visibility macro surrounding SIGSYS. Submitted by: Andriy Gapon <avg@icyb.net.ua>	2008-01-19 21:41:31 +00:00
rwatson	8294510902	Remove unused oldhash definition from Coda namecache. MFC after: 3 days	2008-01-19 19:21:07 +00:00
attilio	2bedc0a58f	Bump FreeBSD_version in order to reflect introduction of lockmgr_recursed(), BUF_RECURSED(), BUF_ISLOCKED() and trimming out of BUF_REFCNT().	2008-01-19 17:39:25 +00:00
attilio	caa2ca048b	- Introduce the function lockmgr_recursed() which returns true if the lockmgr lkp, when held in exclusive mode, is recursed - Introduce the function BUF_RECURSED() which does the same for bufobj locks based on the top of lockmgr_recursed() - Introduce the function BUF_ISLOCKED() which works like the counterpart VOP_ISLOCKED(9), showing the state of lockmgr linked with the bufobj BUF_RECURSED() and BUF_ISLOCKED() entirely replace the usage of bogus BUF_REFCNT() in a more explicative and SMP-compliant way. This allows us to axe out BUF_REFCNT() and leaving the function lockcount() totally unused in our stock kernel. Further commits will axe lockcount() as well as part of lockmgr() cleanup. KPI results, obviously, broken so further commits will update manpages and freebsd version. Tested by: kris (on UFS and NFS)	2008-01-19 17:36:23 +00:00
rwatson	a45d8c6482	Improve default vnode operation handling for Coda: - Don't specify vnode operations for mknod, lease, and advlock--let them fall through to vop_default. - Implement vop_default with &default_vnodeops, rather than with VOP_PANIC, so that unimplemented vnode operations are handled in more sensible ways than panicking, such as EOPNOTSUPP on ACL queries generated by bsdtar, or mknod. MFC after: 3 days	2008-01-19 17:12:44 +00:00
rwatson	5baa8fe000	Rework coda_statfs(): no longer need to zero the statfs structure or fill out all fields, just fill out the ones the file system knows about. Among other things, this causes the output of "mount" and "df" to make quite a bit more sense as /dev/cfs0 is specified as the mountfrom name. MFC after: 3 days	2008-01-19 16:39:14 +00:00
rwatson	1d78104fa0	Zero mi_rootvp and coda_ctlvp immediately after calling vrele() on the vnodes during coda_unmount() in order to detect errant use of them after the vnode references may no longer be valid. No need to clear the VV_ROOT flag on mi_rootvp (especially after the vnode reference is no longer valid) as this isn't done on other file systems. MFC after: 3 days	2008-01-19 15:40:46 +00:00
rwatson	fc2cdfa748	Don't acquire an additional vnode reference to a vnode when it is opened and then release it when it is closed: we rely on the caller to keep the vnode around with a valid reference. This avoids vrele() destroying the vnode vop_close() is being called from during a call to vop_close(), and a crash due to lockmgr recursing the vnode lock when a Coda unmount occurs. MFC after: 3 days	2008-01-19 15:39:10 +00:00
rwatson	735d73fd1d	Don't declare functions as extern. Move all extern variable definitions to associated .h files, move some extern variable definitions between include files to place them more appropriately. MFC after: 3 days	2008-01-19 14:32:44 +00:00
rwatson	af893b7bf8	Use VOP_NULL rather than VOP_PANIC for Coda's vop_print routine, so as to avoid panicking in DDB show lockedvnods. MFC after: 3 days	2008-01-19 13:41:56 +00:00
rwatson	7c43871d32	Lock the new directory vnode returned by coda_mkdir(), as this is required by FreeBSD's vnode locking protocol. MFC after: 3 days	2008-01-19 13:29:14 +00:00
rwatson	642dbf24b6	Borrow the VM object associated with an underlying cache vnode with the Coda vnode derived from it, in the style of nullfs. This allows files in the Coda file system to be memory-mapped, such as with execve(2) or mmap(2). MFC after: 3 days Reported by: Rune <u+openafsdev-sr55 at chalmers dot se>	2008-01-19 13:27:14 +00:00
jkim	d0a4947e47	Add a flag for Ethernet@WireSpeed capability and correct chip revisions. The idea was taken from OpenBSD and cross-referenced with Linux driver.	2008-01-18 22:09:50 +00:00
jkim	dd27822e35	Fix mbuf pool watermark configuration (mismerged from OpenBSD in 1.132). Submitted by: sephe MFC after: 3 days	2008-01-18 20:33:21 +00:00
csjp	6f5423890a	Fix gratuitous whitespace bug MFC after: 1 week Obtained from: TrustedBSD Project	2008-01-18 19:57:21 +00:00
csjp	ad1f18f4a9	Add a case for AUE_LISTEN. This removes the following console error message: "BSM conversion requested for unknown event 43140" It should be noted that we need to audit the fd argument for this system call. Obtained from: TrustedBSD Project MFC after: 1 week	2008-01-18 19:50:34 +00:00
rwatson	dccd51b54f	Move unlock of global UNIX domain socket lock slightly lower in unp_connect(): it is expected to return with the lock held, and two possible error paths otherwise returned with it unlocked. The fix committed here is slightly different from the patch in the PR, but along an alternative line suggested in the PR. PR: 119778 MFC after: 3 days Submitted by: James Juran <james dot juran at baesystems dot com>	2008-01-18 19:16:03 +00:00
kib	3628ae460c	In rev. 1.153, one place that converts the minor number to a unit was missed. As a result, pty_create_slave() may index out of the names[] bounds, creating wrong slave tty names. Tested by: kensmith Reviewed by: jhb MFC after: 3 days	2008-01-18 18:07:04 +00:00
remko	c050b3d1bc	Use nfsrv_destroycache() only once, else it crashes the server. PR: kern/118152 Submitted by: Bjoern Groenvall <bg at sics dot se> Approved by: imp (mentor, a while ago already), jhb MFC After: 3 days	2008-01-18 17:03:36 +00:00
julian	d6aa139aef	Refactor code so it can run in a chroot without having to have /dev mounted. MFC After: 1 week	2008-01-18 17:02:14 +00:00
ambrisko	5e9948e31a	Style changes from avatar. Submitted by: avatar	2008-01-18 16:34:18 +00:00
ambrisko	22d76bf909	First real attempt at proper locking. The locking is a little complicated since the command and data that is being built to be sent to or read from the HW lives in the softc. Commands are later run via an_setdef etc. In the ioctl path various references are kept to the data stored in the softc so it needs to be protected. Think of the command in the softc almost as a global variable, since it essentially is. Since locking wasn't done in this type of context the commands would get corrupted. Thanks to avatar@ for catching some lock issues and dhw@ for testing. Things are a lot more stable except for the MPI-350 cards. My an(4) remote laptop stays on the network now. The driver should be changed so that it uses private memory that is passed to the functions that talk to the card. Then only those functions would really need to grab locks. Reviewed by: avatar@	2008-01-18 16:31:24 +00:00
davidxu	ae86af8218	Make sure reading td_runtime in critical section since thread may be preempted and td_runtime will be modified.	2008-01-18 13:00:28 +00:00
rwatson	ba4fb8ac52	In tcp_ctloutput(), don't hold the inpcb lock over sooptcopyin(), rather, drop the lock and then re-acquire it, revalidating TCP connection state assumptions when we do so. This avoids a potential lock order reversal (and potential deadlock, although none have been reported) due to the inpcb lock being held over a page fault. MFC after: 1 week PR: 102752 Reviewed by: bz Reported by: Václav Haisman <v dot haisman at sh dot cvut dot cz>	2008-01-18 12:19:50 +00:00
kib	8aee27b5a3	udf_vget() shall vgone() the vnode when the file_entry cannot be allocated or read from the volume. Otherwise, half-constructed vnode could be found later and cause panic when accessed. PR: 118322 MFC after: 1 week	2008-01-18 12:09:54 +00:00
thompsa	2d8d5733ef	Remove a chunk of duplicated code, test the destination address against the bridge the same way we check member interfaces.	2008-01-18 09:34:09 +00:00
yongari	c138b43f21	Use m_collapse(9) to collapse mbuf chains instead of relying on shortest possible chain of mbufs of m_defrag(9). What we want is chains of mbufs that can be safely stored to a Tx descriptor which can have up to STGE_MAXTXSEGS mbufs. The ethernet controller does not need to align Tx buffers on 32bit boundary. So the use of m_defrag(9) was waste of time.	2008-01-18 08:32:08 +00:00
davidxu	80ec49a2cf	Add POSIX clock id CLOCK_THREAD_CPUTIME_ID, this can be used to measure per-thread runtime in user code.	2008-01-18 07:04:42 +00:00
thompsa	f99e03f7fe	IEEE 802.1D-2004 states, frames containing any of the group MAC Addresses specified in Table 7-10 in their destination address field shall not be relayed by the Bridge. Add a check in bridge_forward() to adhere to this. PR: kern/119744	2008-01-18 00:19:10 +00:00
jhb	4d2d2276c8	- Retire npe_defrag(), gem_defrag(), msk_defrag(), nfe_defrag(), and re_defrag() and use m_collapse() instead. - Replace a reference to ath_defrag() in a comment in if_wpi.c with m_collapse().	2008-01-17 23:37:47 +00:00
jhb	7e32513a5b	Add 'compat_freebsd[4567]' features corresponding to the kernel options COMPAT_FREEBSD[4567]. MFC after: 1 week Requested by: kris	2008-01-17 22:46:32 +00:00
kmacy	53e7eb64df	- remove bogus_imm counter - disable pcpu cluster cache by default until reference counting is handled correctly for held clusters - can be re-enabled by sysctl	2008-01-17 21:25:58 +00:00
sam	e443f3b38c	promote ath_defrag to m_collapse (and retire private+unused m_collapse from cxgb) Reviewed by: pyun, jhb, kmacy MFC after: 2 weeks	2008-01-17 21:25:09 +00:00
jhb	e3c7bebe5f	Remove a conditional that is always true. MFC after: 2 weeks	2008-01-17 20:15:15 +00:00
jhb	bc40648c02	Use pause(9) instead of a tsleep() on a stack variable. MFC after: 1 week	2008-01-17 20:11:58 +00:00
jhb	a114208c34	Use cpu_spinwait() (i.e., "pause") when spinning on rdtsc during DELAY(). MFC after: 1 week	2008-01-17 18:59:38 +00:00
alc	7f5a9c7a36	Retire PMAP_DIAGNOSTIC. Any useful diagnostics that were conditionally compiled under PMAP_DIAGNOSTIC are now KASSERT()s. (Note: The kernel option DIAGNOSTIC still disables inlining of certain pmap functions.) Eliminate dead code from pmap_enter(). This code implemented an assertion. On i386, an equivalent check is already implemented. However, on amd64, a small change is required to implement an equivalent check. Eliminate \n from a nearby panic string. Use KASSERT() to reimplement pmap_copy()'s two assertions.	2008-01-17 18:25:52 +00:00
bde	691f99e98f	Translate from the i386. All FP constants and operations are evaluated in the range and precision of their type(s) on amd64, but FLT_EVAL_METHOD said that they were evaluated in the "interesting" (buggy) i387 methods. float_t was broken compatibly with FLT_EVAL_METHOD. These definitions seem to be broken on powerpc and possibly on arm. float_t is float on powerpc with gcc [-notraditional] according to glibc, and FLT_EVAL_METHOD is marked with XXX on arm.	2008-01-17 13:12:46 +00:00
cognet	961a1f29ef	Unbreak build by adding the missing parameter to pmap_enter().	2008-01-17 12:41:59 +00:00
thompsa	2ba124325f	Sync from OpenBSD r1.118, nuke clause 3 & 4.	2008-01-17 09:46:16 +00:00
kmacy	6dfbebef02	Fix lock ordering panic by not calling ether_ioctl with port lock held Reported by: rrs	2008-01-16 21:33:34 +00:00
jhb	e0a03b53e1	Add a header containing constants for the various HPET registers and their fields and update the code to match. The PR served more as an inspiration than providing the actual diffs. MFC after: 1 week PR: kern/112544	2008-01-16 18:47:07 +00:00
jhb	52da96d26e	Add a set of regression tests for the POSIX shm API (shm_open(2) and shm_unlink(2)).	2008-01-16 15:51:24 +00:00
remko	339d7dbc07	Don't accidentally remove a filesocket which is still in use. This gives problems when the DRM driver is loaded and the AIXGL extension is loaded, the AIXGL driver requests a drm_close and this will cause the radeon driver to fail while starting X windows. PR: kern/114688 Submitted by: vehemens <vehemens at verizon dot net> Prodded by: Robert Noland Approved by: imp (mentor, a while ago already), anholt MFC After: 1 week	2008-01-16 13:54:40 +00:00
keramida	4a8da7a19f	Bump manpage date for rev 1.27 MFC after: 3 days	2008-01-16 07:00:55 +00:00
keramida	50222e8f1f	Document that loader(8) stops reading `loader.conf' when it encounters a syntax error, and add a tip about adding first the `vital' options and then experimental ones. PR: docs/119658 Submitted by: Julian Stacey, jhs at berklix.org	2008-01-16 06:59:22 +00:00
kmacy	1e0f928b96	remove superfluous debug printfs	2008-01-16 02:39:33 +00:00


66486 Commits