freebsd-skq

Author	SHA1	Message	Date
glebius	e1d22638d0	Add CARP (Common Address Redundancy Protocol), which allows multiple hosts to share an IP address, providing high availability and load balancing. Original work on CARP done by Michael Shalayeff, with many additions by Marco Pfatschbacher and Ryan McBride. FreeBSD port done solely by Max Laier. Patch by: mlaier Obtained from: OpenBSD (mickey, mcbride)	2005-02-22 13:04:05 +00:00
ru	79c91b9063	Allocate the M_VLANTAG m_pkthdr flag, and use it to indicate that a packet has VLAN mbuf tag attached. This is faster to check than m_tag_locate(), and allows us to use the tags in non-vlan(4) VLAN producers. The first argument to VLAN_OUTPUT_TAG() is now unused but retained for backward compatibility. While here, embellish a fix in rev. 1.174 of if_ethersubr.c -- it now checks for packets with VLAN (mbuf) tags, and it should now be possible to bridge(4) on vlan(4)'s whose parent interfaces support VLAN decapsulation in hardware. Reviewed by: sam	2005-02-18 22:31:19 +00:00
glebius	db16f02fe3	Check for non-NULL ac_netgraph field in interface arpcom, instead of checking global presence of ng_ether(4). Reviewed by: ru	2005-02-14 11:58:54 +00:00
ru	5d07a7452c	If no vlan(4) interfaces are configured for the interface, and the driver did VLAN decapsulation in hardware, we were passing a frame as if it came for the parent (non-VLAN) interface. Stop this from happening. Reminded by: glebius Security: This could pose a security risk in some setups	2005-02-14 08:29:42 +00:00
delphij	2511132c4e	Validate ifc->ifc_len before submitting its incarnation to sbuf_new, which will finally lead to kernel panic. Security: This prevents a local (root-launched) DoS Submitted by: Wojciech A. Koszek [dunstan at freebsd czest pl] PR: 77421 MFC After: 1 week	2005-02-12 17:51:12 +00:00
phk	13100c3699	Make a bunch of malloc types static. Found by: src/tools/tools/kernxref	2005-02-10 12:02:37 +00:00
glebius	a7cdc1bdc6	Log changes of link state. Reviewed by: rwatson	2005-01-30 12:57:47 +00:00
rwatson	c07ace7f7b	Acquire the raw_cb mutex around LIST_REMOVE() of a raw socket control block from the global raw socket list. Submitted by: Roselyn Lee <rosel at verniernetworks dot com> MFC after: 1 week	2005-01-24 22:56:09 +00:00
yar	48c3845c46	Fix spelling in a comment.	2005-01-24 15:48:00 +00:00
yar	48509d66b6	Reduce the global name space pollution. The cloner structure isn't referenced by name outside this file.	2005-01-23 23:10:33 +00:00
glebius	4f5002e054	- Reduce number of arguments passed to dummynet_io(), we already have cookie in struct ip_fw_args itself. - Remove redundant &= 0xffff from dummynet_io().	2005-01-16 11:13:18 +00:00
glebius	5d69dda0d6	Remove ip_fw.h and ip_dummynet.h from includes.	2005-01-15 22:04:17 +00:00
glebius	4db2b8d392	o Clean up interface between ip_fw_chk() and its callers: - ip_fw_chk() returns action as function return value. Field retval is removed from args structure. Action is not a flag any more. It is one of integer constants. - Any action-specific cookies are returned either in new "cookie" field in args structure (dummynet, future netgraph glue), or in mbuf tag attached to packet (divert, tee, some future action). o Convert parsing of return value from ip_fw_chk() in ipfw_check_{in,out}() to a switch structure, so that the functions are more readable, and future actions can be added with fewer modifications. Approved by: andre MFC after: 2 months	2005-01-14 09:00:46 +00:00
keramida	e6fbd61f67	Fix a typo in a comment that may be confusing if one doesn't really check what the code does. Separators are spaces, commas or tabs; not '*' characters (as one may assume by reading the old comment).	2005-01-11 10:47:51 +00:00
ume	23e96af981	don't see NBPFILTER.	2005-01-11 07:17:33 +00:00
ume	28e58e1cdb	remove HAVE_OLD_BPF part.	2005-01-11 07:14:37 +00:00
ume	46c04961aa	we are not OLD_BPF system.	2005-01-11 07:08:15 +00:00
ume	ddb6478aa3	fix typo.	2005-01-11 07:05:56 +00:00
glebius	e3f4f22c01	This change adds reliability for Ethernet trunks built with ng_one2many: - Introduce another ng_ether(4) callback ng_ether_link_state_p, which is called from if_link_state_change(), every time link is changed. - In ng_ether_link_state() send netgraph control message notifying of link state change to a node connected to "lower" hook. Reviewed by: sam MFC after: 2 weeks	2005-01-08 12:42:03 +00:00
imp	a50ffc2912	/* -> /*- for license, minor formatting changes	2005-01-07 01:45:51 +00:00
rik	43775c98bd	Add FR support to sppp (MFCronyx). Silence on: net@, current@, hackers@. No objections: joerg Requested by: many (mostly Cronyx) users for a long long time. MFC after: 10 days PR: kern/21771, kern/66348	2004-12-28 00:07:57 +00:00
pjd	c06a300010	Fix mbuf leak. Submitted by: Johnny Eriksson <bygg@cafax.se> MFC after: 5 days	2004-12-27 15:53:44 +00:00
phk	cc0d4329c3	Include fcntl.h Include selinfo.h (don't rely on vnode.h to do so) Check O_NONBLOCK instead of IO_NDELAY Don't include vnode.h	2004-12-22 17:39:21 +00:00
phk	3fdb7bea32	Don't include filedesc.h Include fcntl.h Include selinfo.h (don't rely on vnode.h to do so) Check O_NONBLOCK instead of IO_NDELAY Don't include vnode.h	2004-12-22 17:38:43 +00:00
phk	0970167e88	Include fcntl.h Check O_NONBLOCK instead of IO_NDELAY Include uio.h Don't include vnode.h Don't include filedesc.h	2004-12-22 17:37:57 +00:00
phk	76e8599a69	Check O_NONBLOCK instead of IO_NDELAY. Don't include <sys/vnode.h>	2004-12-22 17:32:53 +00:00
jmg	584f9ac38a	don't try to recurse on the bpf lock.. kqueue already locks the bpf lock now... Submitted by: Ed Maste of Sandvine Inc. MFC after: 1 week	2004-12-17 03:21:46 +00:00
rik	8325619210	Kill double inclusion for <netinet/in.h> and <netinet/in_systm.h>.	2004-12-14 18:18:54 +00:00
rik	a09ae8d2bf	Make sppp MPSAFE. MPSAFE could be turned off by IFF_NEEDSGIANT. Silence on: net@, current@, hackers@. No objections: joerg	2004-12-12 14:54:15 +00:00
sam	2c929f635e	Cleanup link state change notification: o add new if_link_state_change routine that deals with link state changes o change mii to use if_link_state_change	2004-12-08 05:45:59 +00:00
sam	051e994615	Don't require a device to be marked up when issuing BIOCSETIF.	2004-12-08 05:40:02 +00:00
mlaier	834b0b8b46	Implement the check I was talking about in the previous message already. Introduce domain_init_status to keep track of the init status of the domains list (surprise). 0 = uninitialized, 1 = initialized/unpopulated, 2 = initialized/done. Higher values can be used to support late addition of domains which right now "works", but is potentially dangerous. I chose to only give a warning when doing so. Use domain_init_status with if_attachdomain[1]() to ensure that we have a complete domains list when we init the if_afdata array. Store the current value of domain_init_status in if_afdata_initialized. This way we can update if_afdata after a new protocol has been added (once that is allowed). Submitted by: se (with changes) Reviewed by: julian, glebius, se PR: kern/73321 (partly)	2004-11-30 22:38:37 +00:00
rwatson	b523874cef	Assign if_broadcastaddr to NULL not 0 in if_attach(). Printf() a warning if if_attachdomain() is called more than once on an interface to generate some noise on mailing lists when this occurs. Fix up style in if_start(), where spaces crept in instead of tabs at some point. MFC after: 1 week MFC note: Not the printf().	2004-11-23 23:31:33 +00:00
jmg	f5e433d72b	sync comment on IFF_OACTIVE with reality.. IFF_OACTIVE is set when the hardware cannot take any more packets, and so will suppress the calling of the device's if_start method... Submitted by: bde	2004-11-17 18:32:44 +00:00
mlaier	b188666781	Remove the #if 0 wrapping around !ALTQ stuff that can't be used due to ABI stability anyway.	2004-11-09 21:29:28 +00:00
phk	027fce30f5	Initialize struct pr_userreqs in new/sparse style and fill in common default elements in net_init_domain(). This makes it possible to grep these structures and see any bogosities.	2004-11-08 14:44:54 +00:00
cognet	c35b680996	Don't abuse tp->t_sc in sl(4) either.	2004-11-07 14:36:47 +00:00
cognet	13ca89b942	Don't abuse tp->t_sc, as it is now used by tty drivers. This fixes the panic that occurs when using ppp(4) Reported and tested by: Yann Berthier (yb at sainte-barbe dot org)	2004-11-07 14:35:53 +00:00
glebius	08501005ec	Utilize m_uiotombuf() in device write method, instead of home-grown implementation. This also gives a performance improvement, because m_uiotombuf() utilizes clusters. Approved by: julian (mentor) MFC after: 1 month	2004-10-31 17:39:46 +00:00
rwatson	f71b496ed7	Move if_handoff() from an inline in if_var.h to a function in if.c in order to harden the ABI for 5.x; this will permit us to modify the locking in the ifnet packet dispatch without requiring drivers to be recompiled. MFC after: 3 days Discussed at: EuroBSDCon Developer's Summit	2004-10-30 09:39:13 +00:00
rwatson	a9f55430f9	Add additional "spare" fields to 'struct ifnet' in order to improve the resistance of the network driver ABI to changes that will be required as we optimize locking. MFC after: 3 days Discussed at: Developer Summit	2004-10-30 08:45:13 +00:00
jmg	6cd4381f71	use NULL instead of 0 when casting/comparing w/ a pointer...	2004-10-25 17:04:40 +00:00
rwatson	2496b0e630	Define IFF_LOCKGIANT() and IFF_UNLOCKGIANT() macros, which conditionally acquire Giant if the passed interface has IFF_NEEDSGIANT set on it. Modify calls into (ifp)->if_ioctl() in if.c to use these macros in order to ensure that Giant is held. MFC after: 3 days Bumped into by: jmg	2004-10-19 18:11:55 +00:00
rwatson	4b81ce6dd2	Push acquisition of the accept mutex out of sofree() into the caller (sorele()/sotryfree()): - This permits the caller to acquire the accept mutex before the socket mutex, avoiding sofree() having to drop the socket mutex and re-order, which could lead to races permitting more than one thread to enter sofree() after a socket is ready to be free'd. - This also covers clearing of the so_pcb weak socket reference from the protocol to the socket, preventing races in clearing and evaluation of the reference such that sofree() might be called more than once on the same socket. This appears to close a race I was able to easily trigger by repeatedly opening and resetting TCP connections to a host, in which the tcp_close() code called as a result of the RST raced with the close() of the accepted socket in the user process resulting in simultaneous attempts to de-allocate the same socket. The new locking increases the overhead for operations that may potentially free the socket, so we will want to revise the synchronization strategy here as we normalize the reference counting model for sockets. The use of the accept mutex in freeing of sockets that are not listen sockets is primarily motivated by the potential need to remove the socket from the incomplete connection queue on its parent (listen) socket, so cleaning up the reference model here may allow us to substantially weaken the synchronization requirements. RELENG_5_3 candidate. MFC after: 3 days Reviewed by: dwhite Discussed with: gnn, dwhite, green Reported by: Marc UBM Bocklet <ubm at u-boot-man dot de> Reported by: Vlad <marchenko at gmail dot com>	2004-10-18 22:19:43 +00:00
glebius	30124ad883	Fix packet flow when both ng_ether(4) and bridge(4) are in use: - push all bridge logic from if_ethersubr.c into bridge.c make bridge_in() return mbuf pointer (or NULL). - call only bridge_in() from ether_input(), after ng_ether_input() was optionally called. - call bridge_in() from ng_ether_rcv_upper(). Long description: http://lists.freebsd.org/mailman/htdig/freebsd-net/2004-May/003881.html Reported by: Jian-Wei Wang <jwwang at FreeBSD.csie.NCTU.edu.tw> Tested by: myself, Sergey Lyubka Reviewed by: sam Approved by: julian (mentor) MFC after: 2 months	2004-10-12 10:33:42 +00:00
andre	8f39f6d2c2	Correctly unregister a netisr by clearing the ni->ni_queue field to NULL as well. This field is actually used by various netisr functions to determine the availability of the specified netisr. This incomplete unregister leads directly to a crash when the KLD unregistering the netisr is unloaded. Submitted by: Sam <sah@softcardsystems.com> MFC after: 3 days	2004-10-11 20:01:43 +00:00
rwatson	91c64388da	When harvesting entropy from an ethernet mbuf, do so before freeing the mbuf. RELENG_5 candidate.	2004-10-11 10:21:34 +00:00
glebius	659b05c3ca	Assign pointer NULL, not 0. Approved by: julian (mentor)	2004-10-11 07:28:36 +00:00
mlaier	46859ca7fc	Change pfil starvation prevention from fail-open to fail-close. We return ENOBUFS to indicate the problem, which is an errno that should be handled well everywhere. Requested & Submitted by: green Silently okay'ed by: The rest of the firewall gang MFC after: 3 days	2004-10-08 12:07:20 +00:00
brooks	ea3df621c9	Since net/net_osdep.c contained only one function that could be trivially implemented as a macro, do that and remove it. NetBSD did this quite a while ago.	2004-10-08 00:24:30 +00:00
green	a146714a11	Don't recurse the BPF descriptor lock during the BIOCSDLT operation (and panic). To try to finish making BPF safe, at the very least, the BPF descriptor lock really needs to change into a reader/writer lock that controls access to "settings," and a mutex that controls access to the selinfo/knote/callout. Also, use of callout_drain() instead of callout_stop() (which is really a much more widespread issue).	2004-10-06 04:25:37 +00:00
sam	4be594580c	Add 802.11-specific events that are dispatched through the routing socket. This really doesn't belong here but is preferred (for the moment) over adding yet another mechanism for sending msgs from the kernel to user apps. Reviewed by: imp	2004-10-05 19:48:33 +00:00
sam	e5887a56e2	add ETHERTYPE_PAE for EAPOL/802.1x	2004-10-05 19:28:52 +00:00
mlaier	b65eae4c19	Add an additional struct inpcb * argument to pfil(9) in order to enable passing along socket information. This is required to work around a LOR with the socket code which results in an easily reproducible hard lockup with debug.mpsafenet=1. This commit does not fix the LOR, but enables us to do so later. The missing piece is to turn the filter locking into a leaf lock and will follow in a separate (later) commit. This will hopefully be MT5'ed in order to fix the problem for RELENG_5 in foreseeable future. Suggested by: rwatson A lot of work by: csjp (he'd be even more helpful w/o mentor-reviews ;) Reviewed by: rwatson, csjp Tested by: -pf, -ipfw, LINT, csjp and myself MFC after: 3 days LOR IDs: 14 - 17 (not fixed yet)	2004-09-29 04:54:33 +00:00
mlaier	8c87efffcd	Switch order for mtx_unlock and cv_signal as (condvar(9)) sez: A thread must hold mp while calling cv_signal(), cv_broadcast(), or cv_broadcastpri() even though it isn't passed as an argument. and is right with this claim. While here remove a "\" from the macro -> __inline conversion. Found by: csjp MFC after: 4 days	2004-09-22 20:55:56 +00:00
stefanf	3bd075200e	Prefer C99's __func__ over GCC's __FUNCTION__.	2004-09-22 17:16:04 +00:00
green	f45221919b	Call sbuf_finish() before sbuf_data() so as to not panic the system.	2004-09-22 12:53:27 +00:00
brooks	f34045dc6a	Fix a LOR where ifconf() used copyout while holding a mutex. This LOR was seen when configuring addresses on interfaces using ifconfig. This patch has been verified to work with over eight thousand addresses assigned to an interface. LOR id: 031	2004-09-22 08:59:41 +00:00
brooks	4b3d75c228	Log the renaming of an interface. This should make it easier to follow kernel log files.	2004-09-18 05:02:08 +00:00
rwatson	e31f3d551d	Destroy global tapmtx when the if_tap module is unloaded. RELENG_5 candidate.	2004-09-17 03:55:50 +00:00
brooks	af4088bbfb	Fix a LOR where copyout was called while holding a lock. Reported by: rwatson	2004-09-15 04:41:56 +00:00
rwatson	e87cb48020	Reformulate bpf_detachd() to acquire the BIF_LOCK() as well as BPFD_LOCK() when removing a descriptor from an interface descriptor list. Hold both over the operation, and do a better job at maintaining the invariant that you can't find partially connected descriptors on an active interface descriptor list. This appears to close a race that resulted in the kernel performing a NULL pointer dereference when BPF sessions are detached during heavy network activity on SMP systems. RELENG_5 candidate.	2004-09-09 04:11:12 +00:00
rwatson	c30a3c01a1	Reformulate use of linked lists in 'struct bpf_d' and 'struct bpf_if' to use queue(3) list macros rather than hand-crafted lists. While here, move to doubly linked lists to eliminate iterating lists in order to remove entries. This change simplifies and clarifies the list logic in the BPF descriptor code as a first step towards revising the locking strategy. RELENG_5 candidate. Reviewed by: fenner	2004-09-09 00:19:27 +00:00
rwatson	a43f8c237d	Compare/set pointers using NULL not 0.	2004-09-09 00:11:50 +00:00
brooks	143d77da28	Re-add ifi_epoch to struct if_data, this time replacing ifi_unused to avoid ABI changes. It is set to the last time the interface counters were zeroed, currently the time if_attach() was called. It is intended to be a valid value for RFC2233's ifCounterDiscontinuityTime and to make it easier for applications to verify that the interface they find at a given index is the one that was there last time they looked. Due to space constraints ifi_epoch is a time_t rather than a struct timeval. SNMP would prefer higher precision, but this is unlikely to be useful in practice.	2004-09-08 04:50:55 +00:00
jmg	b29998067a	don't call f_detach if the filter has already removed the knote.. This happens when a proc exits, but needs to inform the user that this has happened.. This also means we can remove the check for detached from proc and sig f_detach functions as this is done in kqueue now... MFC after: 5 days	2004-09-06 19:02:42 +00:00
rwatson	6390604c61	Correct a comment typo: s/Note/Not/. Pointed out by: kensmith	2004-09-03 01:37:02 +00:00
brooks	9baee72236	Back out ifi_epoch. The ABI breakage is too disruptive this close to 5-STABLE. ifi_epoch will shortly be reintroduced with less precision using the space currently allocated to ifi_unused.	2004-09-02 05:07:29 +00:00
mlaier	9597d324e0	Fix an assertion when if_down()ing an ALTQ managed interface. The lock should have been in place all the time; the mtx_assert in the ALTQ code just discovered the shortcoming. PR: i386/71195 Tested by: Bettan (PR originator), myself MFC after: 5 days	2004-09-01 19:56:47 +00:00
brooks	ba918da2a5	Use a spare byte in struct if_data to store the structure size without increasing it. Add code to ifconfig to use this size to find the sockaddr_dl after the struct if_data in the routing message. This allows struct if_data to grow (up to 255 bytes) without breaking ifconfig. Submitted by: peter	2004-09-01 18:22:14 +00:00
brooks	922e581a21	Add a new variable, ifi_epoch, to struct if_data. It is set to the last time the interface counters were zeroed, currently the time if_attach() was called. It is intended to be a valid value for RFC2233's ifCounterDiscontinuityTime and to make it easier for applications to verify that the interface they find at a given index is the one that was there last time they looked. An if_epoch "compatibility" macro has not been created as ifi_epoch has never been a member of struct ifnet. Approved by: andre, bms, wollman	2004-08-30 06:29:26 +00:00
yar	7a438d757c	Use an ANSI-style definition for slstart() in accord with the rest of the file.	2004-08-30 04:48:52 +00:00
yar	39ca2a8636	Grant the poor old SLIP driver with an if_start handler so that it becomes happy and no longer panics the system upon getting the very first packet to transmit. Reported and tested by: Igor Timkin <ivt@gamma.ru> Reviewed by: rwatson MFC after: 5 days	2004-08-30 04:32:52 +00:00
rwatson	c409ad7413	Correct typo in printf() warning. Submitted by: Pawel Worach <pawel.worach at telia.com>	2004-08-28 19:27:25 +00:00
rwatson	69e658ec5a	Change the default disposition of debug.mpsafenet from 0 to 1, which will cause the network stack to operate without the Giant lock by default. This change has the potential to improve performance by increasing parallelism and decreasing latency in network processing. Due to the potential exposure of existing or new bugs, the following compatibility functionality is maintained: - It is still possible to disable Giant-free operation by setting debug.mpsafenet to 0 in loader.conf. - Add "options NET_WITH_GIANT", which will restore the default value of debug.mpsafenet to 0, and is intended for use on systems compiled with known unsafe components, or where a more conservative configuration is desired. - Add a new declaration, NET_NEEDS_GIANT("componentname"), which permits kernel components to declare dependence on Giant over the network stack. If the declaration is made by a preloaded module or a compiled in component, the disposition of debug.mpsafenet will be set to 0 and a warning concerning performance degraded operation printed to the console. If it is declared by a loadable kernel module after boot, a warning is displayed but the disposition cannot be changed. This is implemented by defining a new SYSINIT() value, SI_SUB_SETTINGS, which is intended for the processing of configuration choices after tunables are read in and the console is available to generate errors, but before much else gets going. This compatibility behavior will go away when we've finished the last of the locking work and are confident that operation is correct.	2004-08-28 15:11:13 +00:00
brooks	f71cc6cdec	When detaching an interface, don't leave an obsolete pointer to the soon to be deleted struct ifnet around. PR: kern/52260 MFC After: 3 days	2004-08-27 19:42:40 +00:00
andre	2126402238	Apply error and success logic consistently to the function netisr_queue() and its users. netisr_queue() now returns (0) on success and ERRNO on failure. At the moment ENXIO (netisr queue not functional) and ENOBUFS (netisr queue full) are supported. Previously it would return (1) on success but the return value of IF_HANDOFF() was interpreted wrongly and (0) was actually returned on success. Due to this schednetisr() was never called to kick the scheduling of the isr. However this was masked by other normal packets coming through netisr_dispatch() causing the dequeueing of waiting packets. PR: kern/70988 Found by: MOROHOSHI Akihiko <moro@remus.dti.ne.jp> MFC after: 3 days	2004-08-27 18:33:08 +00:00
andre	d243747d92	Always compile PFIL_HOOKS into the kernel and remove the associated kernel compile option. All FreeBSD packet filters now use the PFIL_HOOKS API and thus it becomes a standard part of the network stack. If no hooks are connected the entire packet filter hooks section and related activities are jumped over. This removes any performance impact if no hooks are active. Both OpenBSD and DragonFlyBSD have integrated PFIL_HOOKS permanently as well.	2004-08-27 15:16:24 +00:00
rwatson	26e22a1ea8	Revert previous revision, 1.7, as removal of GIANT_REQUIRED was made in the wrong branch (and hence to the wrong function).	2004-08-24 14:17:58 +00:00
rwatson	af140f017c	MT4 if_fwsubr.c:1.6: date: 2004/08/22 14:48:55; author: rwatson; state: Exp; lines: +0 -2 Don't need to assert Giant in fw_output(), only in the firewire start routine. Approved by: re (scottl)	2004-08-24 14:16:08 +00:00
roam	45a80babc1	Fix a typo (attacked -> attached). Approved by: sam	2004-08-24 08:47:15 +00:00
rwatson	769c4fdece	Style update: use newer style function prototypes in if_sl.c in prep for merging locking.	2004-08-22 21:32:52 +00:00
rwatson	5fe9f846c5	Don't need to assert Giant in fw_output(), only in the firewire start routine.	2004-08-22 14:48:55 +00:00
rwatson	5a65579e60	If a tunable for the routing socket netisr queue max is defined, allow it to override the default value, rather than the default value overriding the tunable.	2004-08-21 21:45:40 +00:00
rwatson	e40f2287d8	Allow the size of the routing socket netisr queue to be configured using the tunable or sysctl 'net.route.netisr_maxqlen'. Default the maximum depth to 256 rather than IFQ_MAXLEN due to the downsides of dropping routing messages. MT5 candidate. Discussed with: mdodd, mlaier, Vincent Jardin <jardin at 6wind.com>	2004-08-21 21:20:06 +00:00
csjp	657b6f650c	When a prison is given the ability to create raw sockets (when the security.jail.allow_raw_sockets sysctl MIB is set to 1) where privileged access to jails is given out, it is possible for prison root to manipulate various network parameters which affect the host environment. This commit plugs a number of security holes associated with the use of raw sockets and prisons. This commit makes the following changes: - Add a comment to rtioctl warning developers that if they add any ioctl commands, they should use super-user checks where necessary, as it is possible for PRISON root to make it this far in execution. - Add super-user checks for the execution of the SIOCGETVIFCNT and SIOCGETSGCNT IP multicast ioctl commands. - Add a super-user check to rip_ctloutput(). If the calling cred is PRISON root, make sure the socket option name is IP_HDRINCL, otherwise deny the request. Although this patch corrects a number of security problems associated with raw sockets and prisons, the warning in jail(8) should still apply, and by default we should keep the default value of security.jail.allow_raw_sockets MIB to 0 (or disabled) until we are certain that we have tracked down all the problems. Looking forward, we will probably want to eliminate the references to curthread. This may be a MFC candidate for RELENG_5. Reviewed by: rwatson Approved by: bmilekic (mentor)	2004-08-21 17:38:57 +00:00
andre	e4a34b65ad	Convert ipfw to use PFIL_HOOKS. This change is transparent to userland and preserves the ipfw ABI. The ipfw core packet inspection and filtering functions have not been changed, only how ipfw is invoked is different. However, there are many changes in how ipfw and its add-ons are handled: In general ipfw is now called through the PFIL_HOOKS and most associated magic, that was in ip_input() or ip_output() previously, is now done in ipfw_check_[in|out]() in the ipfw PFIL handler. IPDIVERT is entirely handled within the ipfw PFIL handlers. A packet to be diverted is checked if it is fragmented, if yes, ip_reass() gets in for reassembly. If not, or all fragments arrived and the packet is complete, divert_packet is called directly. For 'tee' no reassembly attempt is made and a copy of the packet is sent to the divert socket unmodified. The original packet continues its way through ip_input/output(). ipfw 'forward' is done via m_tag's. The ipfw PFIL handlers tag the packet with the new destination sockaddr_in. A check if the new destination is a local IP address is made and the m_flags are set appropriately. ip_input() and ip_output() have some more work to do here. For ip_input() the m_flags are checked and a packet for us is directly sent to the 'ours' section for further processing. Destination changes on the input path are only tagged and the 'srcrt' flag to ip_forward() is set to disable destination checks and ICMP replies at this stage. The tag is going to be handled on output. ip_output() again checks for m_flags and the 'ours' tag. If found, the packet will be dropped back to the IP netisr where it is going to be picked up by ip_input() again and then directly sent to the 'ours' section. When only the destination changes, the route's 'dst' is overwritten with the new destination from the forward m_tag. Then it jumps back at the route lookup again and skips the firewall check because it has been marked with M_SKIP_FIREWALL. ipfw 'forward' has to be compiled into the kernel with 'option IPFIREWALL_FORWARD' to enable it. DUMMYNET is entirely handled within the ipfw PFIL handlers. A packet for a dummynet pipe or queue is directly sent to dummynet_io(). Dummynet will then inject it back into ip_input/ip_output() after it has served its time. Dummynet packets are tagged and will continue from the next rule when they hit the ipfw PFIL handlers again after re-injection. BRIDGING and IPFW_ETHER are not changed yet and use ipfw_chk() directly as they did before. Later this will be changed to dedicated ETHER PFIL_HOOKS. More detailed changes to the code: conf/files Add netinet/ip_fw_pfil.c. conf/options Add IPFIREWALL_FORWARD option. modules/ipfw/Makefile Add ip_fw_pfil.c. net/bridge.c Disable PFIL_HOOKS if ipfw for bridging is active. Bridging ipfw is still directly invoked to handle layer2 headers and packets would get a double ipfw when run through PFIL_HOOKS as well. netinet/ip_divert.c Removed divert_clone() function. It is no longer used. netinet/ip_dummynet.[ch] Neither the route 'ro' nor the destination 'dst' need to be stored while in dummynet transit. Structure members and associated macros are removed. netinet/ip_fastfwd.c Removed all direct ipfw handling code and replace it with the new 'ipfw forward' handling code. netinet/ip_fw.h Removed 'ro' and 'dst' from struct ip_fw_args. netinet/ip_fw2.c (Re)moved some global variables and the module handling. netinet/ip_fw_pfil.c New file containing the ipfw PFIL handlers and module initialization. netinet/ip_input.c Removed all direct ipfw handling code and replace it with the new 'ipfw forward' handling code. ip_forward() no longer requires the 'next_hop' struct sockaddr_in argument. Disable early checks if 'srcrt' is set. netinet/ip_output.c Removed all direct ipfw handling code and replace it with the new 'ipfw forward' handling code. netinet/ip_var.h Add ip_reass() as general function. (Used from ipfw PFIL handlers for IPDIVERT.) netinet/raw_ip.c Directly check if ipfw and dummynet control pointers are active. netinet/tcp_input.c Rework the 'ipfw forward' to local code to work with the new way of forward tags. netinet/tcp_sack.c Remove include 'opt_ipfw.h' which is not needed here. sys/mbuf.h Remove m_claim_next() macro which was exclusively for ipfw 'forward' and is no longer needed. Approved by: re (scottl)	2004-08-17 22:05:54 +00:00
jmg	bc1805c6e8	Add locking to the kqueue subsystem. This also makes the kqueue subsystem a more complete subsystem, and removes the knowledge of how things are implemented from the drivers. Include locking around filter ops, so a module like aio will know when not to be unloaded if there are outstanding knotes using its filter ops. Currently, it uses the MTX_DUPOK even though it is not always safe to acquire duplicate locks. Witness currently doesn't support the ability to discover if a dup lock is ok (in some cases). Reviewed by: green, rwatson (both earlier versions)	2004-08-15 06:24:42 +00:00
rwatson	927adfff57	Use IFQ_SET_MAXLEN() to set the maximum queue depth of the routing socket netisr queue. Pointed out by: winter	2004-08-13 22:23:21 +00:00
tackerman	4386366573	Added two new media types for 10GBASE-SR and 10GBASE-LR	2004-08-12 23:48:26 +00:00
andre	3dc2f7c661	Convert the routing table to use an UMA zone for rtentries. The zone is called "rtentry". This saves a considerable amount of kernel memory. R_Zmalloc previously used 256 byte blocks (plus kmalloc overhead) whereas UMA only needs 132 bytes. Idea from: OpenBSD	2004-08-11 17:26:56 +00:00
emax	6e0dfecf1c	Set IFF_RUNNING flag on the interface as soon as the control device is opened.	2004-08-11 00:12:27 +00:00
mlaier	00ecbb6a92	Add a "void *if_carp" placeholder to struct ifnet with prospect to bring in the "Common address redundancy protocol" (CARP) during the 5-STABLE cycle. Hence doing the ABI break now. Approved by: re (scottl)	2004-08-07 09:32:04 +00:00
rwatson	ef39095fcd	As SLIP directly accesses the tty code from its if_start() routine, mark if_sl as IFF_NEEDSGIANT.	2004-08-06 22:41:13 +00:00
roam	e8cd412600	Do not attempt to clean up data that has not been initialized yet. This fixes two kernel panics on boot when the xl driver fails to allocate bus/port/memory resources. Reviewed by: silence on -net	2004-08-06 09:08:33 +00:00
sobomax	d3be2ab365	Set ip_v field properly. PR: kern/69957	2004-08-05 08:12:46 +00:00
rwatson	00b755c2a7	Do a lockless read of the BPF interface structure descriptor list head before grabbing BPF locks to see if there are any entries in order to avoid the cost of locking if there aren't any. Avoids a mutex lock/ unlock for each packet received if there are no BPF listeners.	2004-08-05 02:37:36 +00:00
kan	3140931e1f	Avoid casts as lvalues.	2004-07-28 06:59:55 +00:00
kan	1fc93948ca	Initialize ; variable early to shut up GCC warning.	2004-07-28 06:48:36 +00:00
rwatson	b463bc6c33	Add a new network interface flag, IFF_NEEDSGIANT, which will allow device drivers to declare that the ifp->if_start() method implemented by the driver requires Giant in order to operate correctly. Add a 'struct task' to 'struct ifnet' that can be used to execute a deferred ifp->if_start() in the event that if_start needs to be called in a Giant-free environment. To do this, introduce if_start(), a wrapper function for ifp->if_start(). If the interface can run MPSAFE, it directly dispatches into the interface start routine. If it can't run MPSAFE, we're running with debug.mpsafenet != 0, and Giant isn't currently held, the task is queued to execute in a swi holding Giant via if_start_deferred(). Modify if_handoff() to use if_start() instead of direct dispatch. Modify 802.11 to use if_start() instead of direct dispatch. This is intended to provide increased compatibility for non-MPSAFE network device drivers in the presence of Giant-free operation via asynchronous dispatch. However, this commit does not mark any network interfaces as IFF_NEEDSGIANT.	2004-07-27 23:20:45 +00:00
yar	a63ad31e7f	Stop tinkering with the parent's VLAN_MTU capability. Now it is user-controlled through ifconfig(8). The former ``automagic'' way of operation created more trouble than good. First, VLAN_MTU consumers other than vlan(4) had appeared, e.g., ng_vlan(4). Second, there was no way to disable VLAN_MTU manually if it were causing trouble, e.g., data corruption. Dropping the ``automagic'' should be completely invisible to the user since a) all the drivers supporting VLAN_MTU have it enabled by default, and in the first place b) there is only one driver that can really toggle VLAN_MTU in the hardware under its control (it's fxp(4), to which I added VLAN_MTU controls to illustrate the principle.)	2004-07-26 14:46:04 +00:00
rwatson	23fdd080dd	Prefer NULL to '0' when checking a pointer value.	2004-07-24 16:58:56 +00:00
brooks	69e2cf0e4d	Actually free the unit when destroying the interface. Reported by: la at delfi.lt Tested by: la at delfi.lt PR: 68618	2004-07-22 22:50:15 +00:00
mlaier	6cc5ed789d	When removing the last reference to a cloner, do not try to unlock twice - esp. not since the backing memory was just freed. Reviewed by: rwatson	2004-07-20 21:44:28 +00:00
rwatson	c3ae9c5291	Comment clarifying debug_mpsafenet.	2004-07-18 21:50:22 +00:00
rwatson	63066bad3b	Gratuitous whitespace change to un-wrap a short line.	2004-07-18 19:53:35 +00:00
phk	f00200d8a4	Preparation commit for the tty cleanups that will follow in the near future: rename ttyopen() -> tty_open() and ttyclose() -> tty_close(). We need the ttyopen() and ttyclose() for the new generic cdevsw functions for tty devices in order to have consistent naming.	2004-07-15 20:47:41 +00:00
phk	5c95d686a1	Do a pass over all modules in the kernel and make them return EOPNOTSUPP for unknown events. A number of modules return EINVAL in this instance, and I have left those alone for now and instead taught MOD_QUIESCE to accept this as "didn't do anything".	2004-07-15 08:26:07 +00:00
mlaier	d42002971f	Fix a copy-and-paste-o in IFQ_DRV_PREPEND - all pointyhats to me. While here also fix a (not less stupid) braino in IFQ_DRV_PURGE. Reported-by: clement Tested-by: clement (_PREPEND in sis(4))	2004-07-14 13:31:41 +00:00
rwatson	9893ed288d	Convert SLIP to using C99 structure initialization for its struct linesw.	2004-07-14 05:01:40 +00:00
bms	23d90b4453	Use ETHER_IS_MULTICAST() consistently in ether_resolvemulti(). Reviewed by: jmallett	2004-07-09 05:26:27 +00:00
bms	59286e68a5	Use M_ZERO instead of bzero().	2004-07-06 03:34:16 +00:00
bms	fc4a5b9caf	Be consistent and use bzero() instead of memset().	2004-07-06 03:29:41 +00:00
bms	42f466846a	Use M_ZERO instead of memset() (!).	2004-07-06 03:28:24 +00:00
bms	af7a129861	Use M_ZERO instead of bzero().	2004-07-06 03:26:26 +00:00
bms	70ed2c8cbe	Replace a bzero() after malloc() with M_ZERO.	2004-07-06 03:16:55 +00:00
bms	17a9559973	Style.	2004-07-06 03:07:50 +00:00
rwatson	afd2385482	In the BPF and ethernet bridging code, don't allow callouts to execute without Giant if we're not debug.mpsafenet=1.	2004-07-05 16:28:31 +00:00
bms	f58c856596	Workaround a locking problem in vlan(4). vlan_setmulti() may be called with sleepable locks held from further up in the network stack, and attempts to allocate memory to hold multicast group membership information with M_WAITOK. This panic was triggered specifically when an exiting routing daemon process closes its raw sockets after joining multicast groups on them. While we're here, comment some possible locking badness. PR: kern/48560	2004-07-04 18:32:54 +00:00
bms	6190bf9bc4	style(9)/whitespace cleanup while I'm in this file.	2004-07-04 16:43:24 +00:00
bms	b6bb334af4	The net.link.ether.bridge.enable sysctl MIB variable enables bridge functionality by setting to a non-zero value. This is an integer, but is treated as a boolean by the code, so clamp it to a boolean value when set so as to avoid unnecessary bridge reinitialization if it's changed to another value. PR: kern/61174 Requested by: Bruce Cran	2004-07-04 15:53:28 +00:00
brooks	5b1f1be739	Don't announce the ethernet address when it's 00:00:00:00:00:00. It's not of any interest. This primarily happens when vlan(4) interfaces are created.	2004-07-02 19:44:59 +00:00
mlaier	7bc770a254	Bring in the first chunk of altq driver modifications. This covers the following drivers: bfe(4), em(4), fxp(4), lnc(4), tun(4), de(4) rl(4), sis(4) and xl(4) More patches are pending on: http://peoples.freebsd.org/~mlaier/ Please take a look and tell me if "your" driver is missing, so I can fix this. Tested-by: many No-objection: -current, -net	2004-07-02 12:16:02 +00:00
rik	fb5ac405c9	Do not m_free packet since IF_HANDOFF (called from netisr_queue) will do it for us, just count it.	2004-06-28 15:32:24 +00:00
pjd	537ad587c5	Those are unneeded too.	2004-06-27 09:06:10 +00:00
pjd	5055061c5d	Add two missing includes and remove two unneeded ones. This is quite a serious fix, because even with MAC framework compiled in, MAC entry points in those two files were simply ignored.	2004-06-27 09:03:22 +00:00
phk	0567d4ef5f	Pick the hotchar out of the tty structure instead of caching private copies. No current line disciplines have a dynamically changing hotchar, and expecting to receive anything sensible during a change in ldisc is insane so no locking of the hotchar field is necessary.	2004-06-26 09:20:07 +00:00
phk	1aa6c5a754	Fix line discipline switching issues: If opening a new ldisc fails, we have to revert to TTYDISC which we know will successfully open rather than try the previous ldisc which might also fail to open. Do not let ldisc implementations muck about with ->t_line, and remove code which checks for reopens, it should never happen. Move ldisc->l_hotchar to tty->t_hotchar and have ldisc implementation initialize it in their open routines. Reset to zero when we enter TTYDISC. ("no" should really be -1 since zero could be a valid hotchar for certain old european mainframe protocols.)	2004-06-26 08:44:04 +00:00
rik	f844c60a44	Do not count loopbacks as other failures. As a result magic will not be rejected any more in case of loopback. Discussed with: joerg@	2004-06-25 10:25:33 +00:00
joerg	9b721035ea	Add a couple of #ifdef DEBUG printf()s in vlan_input() I found to be useful when debugging the ether_demux() problem (when bridging over VLANs).	2004-06-24 12:32:41 +00:00
joerg	f7a4300d05	When considering an ethernet frame that is not destined for us, do not only allow this to be further processed when bridging is active on that interface, but also if the current packet has a VLAN tag and VLANs are active on our interface. This gives the VLAN layers a chance to also consider the packet (and perhaps drop it instead of the main dispatcher). This fixes a situation where bridging was only active on VLAN interfaces but ether_demux() called on behalf of the main interface had already thrown the packet away. MFC after: 4 weeks	2004-06-24 12:31:44 +00:00
des	383d0b372c	Make dependencies on the TCP/IP stack conditional on INET / INET6. This makes it possible to build a kernel with NIC drivers but no TCP/IP stack. Sponsored by: Teleplan AS	2004-06-24 10:58:08 +00:00
brooks	e1dd867b55	Major overhaul of pseudo-interface cloning. Highlights include: - Split the code out into if_clone.[ch]. - Locked struct if_clone. [1] - Add a per-cloner match function rather than simply matching names of the form <name><unit> and <name>. - Use the match function to allow creation of <interface>.<tag> vlan interfaces. The old way is preserved unchanged! - Also use the match function to allow creation of stf(4) interfaces named stf0, stf, or 6to4. This is the only major user visible change in that "ifconfig stf" creates the interface stf rather than stf0 and does not print "stf0" to stdout. - Allow destroy functions to fail so they can refuse to delete interfaces. Currently, we forbid the deletion of interfaces which were created in the init function, particularly lo0, pflog0, and pfsync0. In the case of lo0 this was a panic implementation so it does not count as a user visible change. :-) - Since most interfaces do not need the new functionality, a family of wrapper functions, ifc_simple_*(), were created to wrap old style cloner functions. - The IF_CLONE_INITIALIZER macro is replaced with a new incompatible IFC_CLONE_INITIALIZER and ifc_simple consumers use IFC_SIMPLE_DECLARE instead. Submitted by: Maurycy Pawlowski-Wieronski <maurycy at fouk.org> [1] Reviewed by: andre, mlaier Discussed on: net	2004-06-22 20:13:25 +00:00
markm	ae932b023a	Give zlib the ability to be a module that can be depended on, in the MODULE_DEPEND() sense.	2004-06-20 17:42:35 +00:00
bde	e041a584a6	Include <sys/_lock.h>'s prerequisite <sys/queue.h> before including the former, not after. Don't hide this bug by including <sys/queue.h> in <sys/_lock.h>.	2004-06-19 14:58:35 +00:00
phk	40dd98a3bd	Second half of the dev_t cleanup. The big lines are: NODEV -> NULL NOUDEV -> NODEV udev_t -> dev_t udev2dev() -> findcdev() Various minor adjustments including handling of userland access to kernel space struct cdev etc.	2004-06-17 17:16:53 +00:00
phk	dfd1f7fd50	Do the dreaded s/dev_t/struct cdev */ Bump __FreeBSD_version accordingly.	2004-06-16 09:47:26 +00:00
mlaier	02300f227f	Replace IF_HANDOFF with new IFQ_HANDOFF to enqueue with ALTQ once enabled on the respective drivers.	2004-06-15 23:57:42 +00:00
rwatson	292410a6b8	Lock down rawcb_list, a global list of control blocks for raw sockets, using rawcb_mtx. Hold this mutex while modifying or iterating over the control list; this means that the mutex is held over calls into socket delivery code, which no longer causes a lock order reversal as the routing socket code uses a netisr to avoid recursing socket -> routing -> socket. Note: Locking of IPsec consumers of rawcb_list is not included in this commit.	2004-06-15 04:13:59 +00:00
mlaier	586342bb6a	Fix a typeo in IFQ_HANDOFF.	2004-06-15 03:40:39 +00:00
mlaier	de92edb6b4	Transform tbr_dequeue into a function pointer in order to build drivers with ALTQ enabled versions of IFQ_* macros by default, as requested by several others. This is a follow-up to the quick fix I committed yesterday which turned off the ALTQ checks for non-ALTQ kernels.	2004-06-15 01:45:19 +00:00
dfr	614bae2942	Fix big-endian build.	2004-06-14 08:17:51 +00:00
mlaier	131fb63c62	Unbreak non-ALTQ kernel linking. I forgot about tbr_dequeue. In the end drivers should be building with ALTQ checks by default, but for now build them with the old macros for non-ALTQ kernels. Note: Check new features w/ LINT and w/ LINT minus the new feature. Found-by: rwatson	2004-06-14 03:55:09 +00:00
dfr	79e1f4d678	Add MAC framework bits to the output path.	2004-06-13 19:55:16 +00:00
dfr	bc5900009b	Remove advertising clause.	2004-06-13 19:15:44 +00:00
mlaier	977d97b004	Link ALTQ to the build and break with ABI for struct ifnet. Please recompile your (network) modules as well as any userland that might make sense of sizeof(struct ifnet). This does not change the queueing yet. These changes will follow in a separate commit. Same with the driver changes, which need case by case evaluation. __FreeBSD_version bump will follow. Tested-by: (i386)LINT	2004-06-13 17:29:10 +00:00
dfr	a1fa8042f5	Add a new driver to support IP over firewire. This driver is intended to conform to the rfc2734 and rfc3146 standard for IP over firewire and should eventually supersede the fwe driver. Right now the broadcast channel number is hardwired and we don't support MCAP for multicast channel allocation - more infrastructure is required in the firewire code itself to fix these problems.	2004-06-13 10:54:36 +00:00
rwatson	82295697cd	Extend coverage of SOCK_LOCK(so) to include so_count, the socket reference count: - Assert SOCK_LOCK(so) macros that directly manipulate so_count: soref(), sorele(). - Assert SOCK_LOCK(so) in macros/functions that rely on the state of so_count: sofree(), sotryfree(). - Acquire SOCK_LOCK(so) before calling these functions or macros in various contexts in the stack, both at the socket and protocol layers. - In some cases, perform soisdisconnected() before sotryfree(), as this could result in frobbing of a non-present socket if sotryfree() actually frees the socket. - Note that sofree()/sotryfree() will release the socket lock even if they don't free the socket. Submitted by: sam Sponsored by: FreeBSD Foundation Obtained from: BSD/OS	2004-06-12 20:47:32 +00:00
rwatson	54cb112a38	Constify raw_sendspace and raw_recvspace, as they're not mutable.	2004-06-11 03:52:56 +00:00
rwatson	fe59af8e68	Switch to conditionally acquiring and dropping Giant around calls into ifp->if_output() based on debug.mpsafenet. That way once bpfwrite() can be called without Giant, it will acquire Giant (if desired) before entering the network stack.	2004-06-11 03:47:21 +00:00
rwatson	0fa5ca52c6	Un-staticize 'dst' sockaddr in the stack of bpfwrite() to prevent the need to synchronize access to the structure. I believe this should fit into the stack under the necessary circumstances, but if not we can either add synchronization or use a thread-local malloc for the duration.	2004-06-11 03:45:42 +00:00
rwatson	e550332ee6	Introduce a netisr to deliver kernel-generated routing, avoiding recursive entering of the socket code from the routing code: - Modify rt_dispatch() to bundle up the sockaddr family, if any, associated with a pending mbuf to dispatch to routing sockets, in an m_tag on the mbuf. - Allocate NETISR_ROUTE for use by routing sockets. - Introduce rtsintrq, an ifqueue to be used by the netisr, and introduce rts_input(), a function to unbundle the tagged sockaddr and inject the mbuf and address into raw_input(), which previously occurred in rt_dispatch(). - Introduce rts_init() to initialize rtsintrq, its mutex, and register the netisr. Perform this at the same point in system initialization as setup of the domains. This change introduces asynchrony between the generation of a pending routing socket message and delivery to sockets for use by userspace. It avoids socket->routing->rtsock->socket use and helps to avoid lock order reversals between the routing code and socket code (in particular, raw socket control blocks), as route locks are held over calls to rt_dispatch(). Reviewed by: "George V.Neville-Neil" <gnn@neville-neil.com> Conceptual head nod by: sam	2004-06-09 02:48:23 +00:00
phk	635c1632db	Use ldisc_[de]register() instead of frobbing linesw[] directly.	2004-06-07 20:43:37 +00:00
naddy	00ef095261	Add helper functions to calculate the standard ethernet CRC in little/big endian fashion, so that network drivers can just reference the standard implementation and don't have to bring their own. As discussed on arch@. Obtained from: NetBSD	2004-06-02 21:34:14 +00:00
phk	f43aa0c4bc	add missing #include <sys/module.h>	2004-05-30 20:27:19 +00:00
phk	d6f7d2bde6	Add some missing <sys/module.h> includes which are masked by the one on death-row in <sys/kernel.h>	2004-05-30 17:57:46 +00:00
dwmalone	43ffabb3fb	Make the comment for DLT_NULL slightly more accurate. PR: 62272 Submitted by: Radim Kolar <hsn@netmag.cz> MFC after: 1 week	2004-05-30 17:03:48 +00:00
yar	64caa10f3b	if_printf() won't emit a newline unless told to.	2004-05-26 11:41:26 +00:00
rik	210c22329d	Keepalive timer should be added if we did not have any sppp consumers before and should be deleted if we do not have any anymore.	2004-05-25 21:54:07 +00:00
yar	bd82e3f62a	After all the relevant drivers have been fixed, fix vlan(4) itself WRT manipulating capabilities of the parent interface: - use ioctl(SIOCSIFCAP) to toggle VLAN_MTU (the way that was done before was just wrong); - use the right order of conditional clauses to set the MTU fudge (that is logically independent from toggling VLAN_MTU.)	2004-05-25 14:30:12 +00:00
mux	f082205682	Remove another redundant if_output initialization.	2004-05-24 11:01:45 +00:00
yar	c06663e28d	Consult parent's if_capenable for active VLAN-related capabilities. This change is possible since all the relevant drivers have been fixed to set if_capenable properly. The field if_capabilities tracks supported capabilities, which may be disabled administratively. Inheriting checksum offload support from the parent interface isn't that easy because the checksumming capabilities of the parent may be toggled on the fly. Disable the code for now.	2004-05-23 22:32:15 +00:00
ru	418aa56fe4	Added dependency on the miibus module.	2004-05-21 08:43:38 +00:00
csjp	3cc360e7bb	Zero the un-used portions of the struct sockaddr data before sending it back to userspace, so it does not break bind(2) on raw sockets in jails. Currently some processes, like traceroute(8), construct a routing request to determine their source address based on the destination. This sockaddr data is fed directly to bind(2). When bind calls ifa_ifwithaddr(9) to make sure the address exists on the interface, the comparison will fail causing bind(2) to return EADDRNOTAVAIL if the data wasn't zeroed before initialization. Approved by: bmilekic (mentor)	2004-05-10 15:07:23 +00:00
scottl	4bc7b15849	Add route.h to pick up the rt_ifmsg() declaration.	2004-05-04 02:39:41 +00:00
maxim	96efdc3250	o Fix misindentation in the previous commit.	2004-05-03 17:15:34 +00:00
andre	25ae331e12	Link state change notification of ethernet media to the routing socket. o Extend the if_data structure with an ifi_link_state field and provide the corresponding defines for the valid states. o The mii_linkchg() callback updates the ifi_link_state field and calls rt_ifmsg() to notify listeners on the routing socket in addition to the kqueue KNOTE. o If vlans are configured on a physical interface notify and update all vlan pseudo devices as well with the vlan_link_state() callback. No objections by: sam, wpaul, ru, bms Brucification by: bde	2004-05-03 13:48:35 +00:00
bmilekic	6bbcc9da29	Give jail(8) the feature to allow raw sockets from within a jail, which is less restrictive but allows for more flexible jail usage (for those who are willing to make the sacrifice). The default is off, but allowing raw sockets within jails can now be accomplished by tuning security.jail.allow_raw_sockets to 1. Turning this on will allow you to use things like ping(8) or traceroute(8) from within a jail. The patch being committed is not identical to the patch in the PR. The committed version is more friendly to APIs which pjd is working on, so it should integrate into his work quite nicely. This change has also been presented and addressed on the freebsd-hackers mailing list. Submitted by: Christian S.J. Peron <maneo@bsdpro.com> PR: kern/65800	2004-04-26 19:46:52 +00:00
luigi	59063f7a08	This commit does two things: 1. rt_check() cleanup: rt_check() is only necessary for some address families to gain access to the corresponding arp entry, so call it only in/near the resolve() routines where it is actually used -- at the moment this is arpresolve(), nd6_storelladdr() (the call is embedded here), and atmresolve() (the call is just before atmresolve to reduce the number of changes). This change will make it a lot easier to decouple the arp table from the routing table. There is an extra call to rt_check() in if_iso88025subr.c to determine the routing info length. I have left it alone for the time being. The interface of arpresolve() and nd6_storelladdr() now changes slightly: + the 'rtentry' parameter (really a hint from the upper level layer) is now passed unchanged from _output(), so it becomes the route to the final destination and not to the gateway. + the routines will return 0 if resolution is possible, non-zero otherwise. + arpresolve() returns EWOULDBLOCK in case the mbuf is being held waiting for an arp reply -- in this case the error code is masked in the caller so the upper layer protocol will not see a failure. 2. arpcom untangling Where possible, use 'struct ifnet' instead of 'struct arpcom' variables, and use the IFP2AC macro to access arpcom fields. This mostly affects the netatalk code. === Detailed changes: === net/if_arcsubr.c rt_check() cleanup, remove a useless variable net/if_atmsubr.c rt_check() cleanup net/if_ethersubr.c rt_check() cleanup, arpcom untangling net/if_fddisubr.c rt_check() cleanup, arpcom untangling net/if_iso88025subr.c rt_check() cleanup netatalk/aarp.c arpcom untangling, remove a block of duplicated code netatalk/at_extern.h arpcom untangling netinet/if_ether.c rt_check() cleanup (change arpresolve) netinet6/nd6.c rt_check() cleanup (change nd6_storelladdr)	2004-04-25 09:24:52 +00:00
luigi	6d55bbb3f6	fix one typo and remove one wrong line	2004-04-25 01:39:00 +00:00
luigi	0e877d510e	Correct and extend the description of the behaviour of rt_check().	2004-04-24 23:34:56 +00:00
luigi	339997e711	document the locking behaviour of the functions that access the routing table.	2004-04-24 23:34:04 +00:00
luigi	62793e142c	arpcom untangling: consistently with the rest of the code, use IFP2AC(ifp) to access the arpcom structure given the ifp. In this case also fix a difference in assumptions WRT the rest of the net/ sources: it is not the 'struct *softc' that starts with a 'struct arpcom', but a 'struct arpcom' that starts with a 'struct ifnet'	2004-04-24 22:24:48 +00:00
luigi	3a8abc28c7	arpcom untangling: do not use struct arpcom directly, rather use IFP2AC(ifp).	2004-04-24 22:11:13 +00:00
luigi	963f4166f4	arpcom untangling: - use ifp instead of &ac->ac_if in a couple of nd6* calls; this removes a useless dependency. - use IFP2AC(ifp) instead of an extra variable to point to the struct arpcom; this does not remove the nesting dependency between arpcom and ifnet but makes it more evident.	2004-04-24 21:59:41 +00:00
andre	5b2a80f166	Add the comment of the previous commit to the source file directly. Requested by: ru	2004-04-23 16:57:43 +00:00
andre	6207069c8c	Call ip_output() with IP_FORWARD flag to prevent it from overwriting the ip_id again. ip_id is already set to the ip_id of the encapsulated packet. Make a comment about mbuf allocation failures more realistic. Reviewed by: sobomax	2004-04-23 16:10:23 +00:00
luigi	bce6deb4fb	Readability fixes: Clearly comment the assumptions on the structure of keys (addresses) and masks, and introduce a macro, LEN(p), to extract the size of these objects instead of using *(u_char *)p which might be confusing. Comment the confusion in the types used to pass around pointers to keys and masks, as a reminder to fix that at some point. Add a few comments on what some functions do. Comment a probably inefficient (but still correct) section of code in rn_walktree_from() The object code generated after this commit is the same as before. At some point we should also change some variable identifiers such as "t, tt, ttt" to fancier names such as "root, left, right" (just in case someone wants to understand the code!), replace misspelling of NULL as 0, remove 'register' declarations that make little sense these days.	2004-04-21 15:27:36 +00:00
luigi	214b2b05ae	Clearly comment the assumptions that allow us to cast a 'struct radix_node *' to a 'struct rtentry *' in this code, and introduce a macro, RNTORT(), to do this type conversion.	2004-04-21 15:16:08 +00:00
luigi	52a5485343	Fix the initial check for NULL arguments in rtfree (previously it checked for rt == NULL after dereferencing the pointer). We never check for those events elsewhere, so probably these checks might go away here as well. Slightly simplify (and document) the logic for memory allocation in rt_setgate(). The rest is mostly style changes -- replace 0 with NULL where appropriate, remove the macro SA() that was only used once, remove some useless debugging code in rt_fixchange, explain some odd-looking casts.	2004-04-20 07:04:47 +00:00
luigi	872141d7c7	Document an assumption on the structure of 'struct rtentry'	2004-04-20 07:03:30 +00:00
luigi	cb1916d883	Add some comments, move a static array of constants in the only place where it is used, and replace R_Malloc with R_Zalloc in a couple of places removing the corresponding bzero()'s	2004-04-19 17:28:39 +00:00
luigi	aeac3672f4	Fix a recently introduced panic in if_detach() by delaying the invalidation of ifindex_table[] entry. Probably this code should be moved even further down, but for the time being let's do it this way.	2004-04-19 17:28:15 +00:00
ru	126fcbbfad	More style and deobfuscation fixes. Submitted by: bde	2004-04-19 07:20:32 +00:00
brooks	3d1134df29	Use a temporary struct ifnet *ifp instead of sc->sc_if to access the ifnet in stf_clone_create. Also use if_printf() instead of printf().	2004-04-19 05:06:27 +00:00
rwatson	d599942920	First pass at softc list locking for if_ppp.c. Many parts of this patch were submitted by Maurycy Pawlowski-Wieronski. In addition to Maurycy's change, break out softc tear down from ppp_clone_destroy() into ppp_destroy() rather than performing a convoluted series of extraction casts and indirections during tear down at mod unload. Submitted by: Maurycy Pawlowski-Wieronski <maurycy@fouk.org>	2004-04-19 01:36:24 +00:00
ru	40fb1e73cd	Style and code unobfuscation.	2004-04-18 19:38:20 +00:00
ru	9540f2e593	Fixed a bug from rev. 1.42: cast to a correct type. Submitted by: luigi	2004-04-18 19:36:01 +00:00
mlaier	22ff34b571	Make if_(un)route static in if.c as they are called from if_up/if_down only. This is also cleanup to make locking easier. Reviewed by: luigi Approved by: bms(mentor)	2004-04-18 18:59:44 +00:00
luigi	c8ac6abb75	+ move MKGet()/MKFree() into the only file that can use them. + remove useless wrappers around bcmp(), bcopy(), bzero(). The code assumes that bcmp() returns 0 if the size is 0, but this is true for both the libc and the libkern versions. + nuke Bcmp, Bzero, Bcopy from radix.h now that nobody uses them anymore.	2004-04-18 11:48:35 +00:00
luigi	f2ac9cb854	+ replace Bcmp/Bzero with 'the real thing' as in the rest of the file. + remember to check and fix or explain a strange cast in route_output()	2004-04-18 11:47:04 +00:00
luigi	bc53551bd9	replace Bcopy with bcopy as in the rest of the file.	2004-04-18 11:46:29 +00:00
luigi	46400cdf4d	replace Bcmp() with the same bcmp() used in the rest of the file.	2004-04-18 11:01:15 +00:00
luigi	9cffdfc5ca	+ rename and document an unused field in struct arpcom (field is still there so there are no ABI changes); + replace 5 redefinitions of the IFP2AC macro with one in if_arp.h Eventually (but before freezing the ABI) we need to get rid of struct arpcom (initially with the help of some smart #defines to avoid having to touch each and every driver, see below). Apart from the struct ifnet, struct arpcom now only stores a copy of the MAC address (ac_enaddr, but we already have another copy in the struct ifnet -- if_addrhead), and a netgraph-specific field which is _always_ accessed through the ifp, so it might well go into the struct ifnet too (where, besides, there is already an entry for AF_NETGRAPH data...) Too bad ac_enaddr is widely referenced by all drivers. But this can be fixed as follows: #define ac_enaddr ac_if.the_original_ac_enaddr_in_struct_ifnet (note that the right hand side would likely be a pointer rather than the base address of an array.)	2004-04-18 01:15:32 +00:00
luigi	04f5fa0216	Minor changes to improve code readability (no actual code changes): + replace 0 with NULL where appropriate (not complete) + remove register declaration while there + add argument names to function prototypes to have a better idea of what they are used for + add 'const' qualifiers in 3 places	2004-04-18 00:56:44 +00:00
luigi	36ff2c8c63	make route_init() static	2004-04-17 15:10:20 +00:00
luigi	908ad90621	misc cleanup in sysctl_ifmalist(): + remove a partly incorrect comment that i introduced in the last commit; + deal with the correct part of the above comment by cleaning up the updates of 'info' -- rti_addrs need not be updated, rti_info[RTAX_IFP] can be set once outside the loop. While at it, correct a few misspellings of NULL as 0, but there are way too many in this file, and i did not want to clutter the important part of this commit.	2004-04-17 15:09:36 +00:00
luigi	a7f6bd46b9	Use if_link instead of the alias if_list, and change a for() into the TAILQ_FOREACH() form. Comment the need to store the same info (mac address for ethernet-type devices) in two different places. No functional changes. Even the compiler output should be unmodified by this change.	2004-04-16 10:32:13 +00:00
luigi	ea6500e14f	Documented the intended usage of if_addrhead and ifaddr_byindex() This commit only changes comments. Nothing to recompile.	2004-04-16 10:28:54 +00:00
luigi	457dbfc9de	Consistently use ifaddr_byindex() to access the link-level address of an interface. No functional change. On passing, comment a likely bug in net/rtsock.c:sysctl_ifmalist() which, if confirmed, would deserve to be fixed and MFC'ed	2004-04-16 08:14:34 +00:00


1779 Commits