When a fast machine first brings up a non-TCP networking program,
it is quite possible that we will drop packets because only one
packet can be held per ARP entry. This leads to packets being
missed when a program starts or restarts while the ARP data is not
yet in the ARP cache.
This code adds a new sysctl, net.link.ether.inet.maxhold, which defines
a system-wide maximum number of packets to be held in each ARP entry.
Up to maxhold packets are queued until an ARP reply is received or
the ARP request times out. The default setting is the old value of 1,
which has been part of the BSD networking code since time
immemorial.
Expose the time we hold an incomplete ARP entry by adding
the sysctl net.link.ether.inet.wait, which defaults to 20
seconds, the value used when the new ARP code was added.
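
For illustration only, a minimal userland sketch (not part of this
change) that reads both new knobs via sysctlbyname(3); error handling
is kept to the bare minimum:

#include <sys/types.h>
#include <sys/sysctl.h>
#include <err.h>
#include <stdio.h>

int
main(void)
{
    int maxhold, wait;
    size_t len;

    /* System-wide cap on packets queued per incomplete ARP entry. */
    len = sizeof(maxhold);
    if (sysctlbyname("net.link.ether.inet.maxhold", &maxhold, &len,
        NULL, 0) == -1)
        err(1, "net.link.ether.inet.maxhold");

    /* Seconds an incomplete ARP entry is held. */
    len = sizeof(wait);
    if (sysctlbyname("net.link.ether.inet.wait", &wait, &len,
        NULL, 0) == -1)
        err(1, "net.link.ether.inet.wait");

    printf("maxhold=%d wait=%d seconds\n", maxhold, wait);
    return (0);
}
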
Reviewed by: bz, rpaulo
MFC after: 3 weeks
the "sockarg" ipfw option matches packets associated to
a local socket and with a non-zero so_user_cookie value.
The value is made available as tablearg, so it can be used
as a skipto target or pipe number in ipfw/dummynet rules.
Code by Paul Joe, manpage by me.
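
For illustration, a hedged sketch of how an application might tag its
socket, assuming SO_USER_COOKIE is the socket option behind
so_user_cookie (the cookie value then becomes the tablearg):

#include <sys/types.h>
#include <sys/socket.h>
#include <stdint.h>
#include <err.h>

int
main(void)
{
    uint32_t cookie = 42;    /* exported to ipfw as tablearg */
    int s;

    if ((s = socket(AF_INET, SOCK_STREAM, 0)) == -1)
        err(1, "socket");
    /* SO_USER_COOKIE assumed to be what sets so_user_cookie. */
    if (setsockopt(s, SOL_SOCKET, SO_USER_COOKIE, &cookie,
        sizeof(cookie)) == -1)
        err(1, "SO_USER_COOKIE");
    /* Traffic on s can now be matched by a rule using "sockarg". */
    return (0);
}

A rule along the lines of "ipfw add pipe tablearg ip from any to any
sockarg" could then, in principle, steer such traffic into the pipe
named by the cookie.
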
Submitted by: Paul Joe
MFC after: 1 week
Control Algorithms for FreeBSD" FreeBSD Foundation-funded project. More details
about the project are available at: http://caia.swin.edu.au/freebsd/5cc/
- Add a KPI and supporting infrastructure to allow modular congestion control
algorithms to be used in the net stack. Algorithms can maintain per-connection
state if required, and connections maintain their own algorithm pointer, which
allows different connections to concurrently use different algorithms. The
TCP_CONGESTION socket option can be used with getsockopt()/setsockopt() to
query or change, respectively, the congestion control algorithm from within
an application at runtime (a usage sketch follows this list).
- Integrate the framework with the TCP stack in the least intrusive manner
possible. Care was also taken to develop the framework in a way that should
allow integration with other congestion-aware transport protocols (e.g. SCTP)
in the future. The hope is that we will one day be able to share a single set
of congestion control algorithm modules between all congestion-aware transport
protocols.
- Introduce a new congestion recovery (TF_CONGRECOVERY) state into the TCP stack
and use it to decouple recovery from a congestion event and recovery from
packet loss (TF_FASTRECOVERY), a la RFC 2581. ECN and delay-based congestion
control protocols generally do not need to recover from packet loss and need a
different way to note a congestion recovery episode within the stack.
- Remove the net.inet.tcp.newreno sysctl, which simplifies some portions of code
and ensures the stack always uses the appropriate mechanisms for recovering
from packet loss during a congestion recovery episode.
- Extract the NewReno congestion control algorithm from the TCP stack and
massage it into module form. NewReno is always built into the kernel and will
remain the default algorithm for the foreseeable future. Implementations of
additional algorithms will become available in the near future.
- Bump __FreeBSD_version to 900025 and note in UPDATING that rebuilding code
that relies on the size of "struct tcpcb" is required.
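
A usage sketch of the socket option mentioned above (illustrative only;
the buffer size is an arbitrary choice, and a non-default algorithm must
already be loaded for a different name to be accepted):

#include <sys/types.h>
#include <sys/socket.h>
#include <netinet/in.h>
#include <netinet/tcp.h>
#include <string.h>
#include <stdio.h>
#include <err.h>

int
main(void)
{
    char name[80];        /* arbitrary size, large enough for a name */
    socklen_t len = sizeof(name);
    int s;

    if ((s = socket(AF_INET, SOCK_STREAM, 0)) == -1)
        err(1, "socket");

    /* Query the algorithm currently attached to this connection. */
    memset(name, 0, sizeof(name));
    if (getsockopt(s, IPPROTO_TCP, TCP_CONGESTION, name, &len) == -1)
        err(1, "get TCP_CONGESTION");
    printf("current algorithm: %s\n", name);

    /* Request an algorithm by name; NewReno is always built in. */
    if (setsockopt(s, IPPROTO_TCP, TCP_CONGESTION, "newreno",
        strlen("newreno")) == -1)
        err(1, "set TCP_CONGESTION");
    return (0);
}
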
Many thanks go to the Cisco University Research Program Fund at Community
Foundation Silicon Valley and the FreeBSD Foundation. Their support of our work
at the Centre for Advanced Internet Architectures, Swinburne University of
Technology is greatly appreciated.
In collaboration with: David Hayes <dahayes at swin edu au> and
Grenville Armitage <garmitage at swin edu au>
Sponsored by: Cisco URP, FreeBSD Foundation
Reviewed by: rpaulo
Tested by: David Hayes (and many others over the years)
MFC after: 3 months
tree in preparation for another large code import. Swinburne University is the
legal entity that owns the copyright, and the 2-clause BSD licence is acceptable.
even if there is no route out to that mcast address. The code in
in_pcb would inadvertently return an error (no route) even though
the user may have specified the address with the
proper socket option (to specify the egress interface).
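
As an illustration of the scenario (not code from this commit), and
assuming IP_MULTICAST_IF is the socket option in question, with
placeholder addresses:

#include <sys/types.h>
#include <sys/socket.h>
#include <netinet/in.h>
#include <arpa/inet.h>
#include <string.h>
#include <err.h>

int
main(void)
{
    struct in_addr ifaddr;        /* address of the egress interface */
    struct sockaddr_in dst;
    int s;

    if ((s = socket(AF_INET, SOCK_DGRAM, 0)) == -1)
        err(1, "socket");

    /* IP_MULTICAST_IF assumed to be the "proper socket option". */
    inet_pton(AF_INET, "192.0.2.1", &ifaddr);    /* placeholder */
    if (setsockopt(s, IPPROTO_IP, IP_MULTICAST_IF, &ifaddr,
        sizeof(ifaddr)) == -1)
        err(1, "IP_MULTICAST_IF");

    memset(&dst, 0, sizeof(dst));
    dst.sin_family = AF_INET;
    dst.sin_len = sizeof(dst);
    dst.sin_port = htons(12345);                 /* placeholder */
    inet_pton(AF_INET, "239.1.1.1", &dst.sin_addr);

    /* Previously this could fail with "no route" despite the option. */
    if (sendto(s, "x", 1, 0, (struct sockaddr *)&dst,
        sizeof(dst)) == -1)
        err(1, "sendto");
    return (0);
}
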
Thanks bz for reminding me I forgot to commit this ;-)
Reviewed by: bz
MFC after: 1 week
function from the timer code to util, rename it appropriately, and
also fix a bug in sctp_get_prev_mtu(), where calling it with a value
already present in the MTU table did not return a smaller one.
MFC after: 3 days
r198301 itself. It also broke the logic of not sending more than one
ARP request per second, which consequently led to a potential problem
of flooding the network with broadcast packets.
MFC after: 1 week
legacy and IPv6 route destination address.
Previously, in the case of IPv6, there was a memory overwrite because
there was not enough space for the IPv6 address.
PR: kern/122565
MFC after: 2 weeks
Make it harder to exploit certain in_control() related races between the
initial lookup at the beginning and the time we remove the entry from
the lists, by re-checking that the entry is still in the list before
trying to remove it.
(*) It is believed that with the current code and locking strategy we
cannot completely fix all races.
Reported by: Nima Misaghian (nima_misa hotmail.com) on net@ 20100817
Tested by: Nima Misaghian (nima_misa hotmail.com) (original version)
PR: kern/146250
Submitted by: Mikolaj Golub (to.my.trociny gmail.com) (different version)
MFC after: 1 week
too coarse-grained to be useful, and the default value significantly degrades
TCP performance on moderate to high bandwidth-delay product paths with
non-zero loss (e.g. 5+Mbps connections across the public Internet often
suffer).
Replace the existing mechanism with an individual per-queue limit based on the
number of MSS segments that fit into the socket's receive buffer. This should
strike a good balance between performance and the potential for resource
exhaustion when FreeBSD is acting as a TCP receiver. With socket buffer
autotuning (which is enabled by default), the reassembly queue tracks the
socket buffer and benefits too.
As the XXX comment suggests, my testing uncovered some unexpected behaviour
which requires further investigation. By using so->so_rcv.sb_hiwat
instead of sbspace(&so->so_rcv), we allow more segments to be held across both
the socket receive buffer and reassembly queue than we probably should. The
tradeoff is better performance in at least one common scenario, versus a devious
sender's ability to consume more resources on a FreeBSD receiver.
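
As a back-of-the-envelope illustration (assumed numbers, not code from
this change), the new per-queue limit works out as follows:

#include <stdio.h>

int
main(void)
{
    unsigned int sb_hiwat = 65536;    /* assumed receive buffer size */
    unsigned int mss = 1460;          /* assumed maximum segment size */

    /* Number of MSS-sized segments that fit into the receive buffer. */
    printf("per-queue reassembly limit: %u segments\n",
        sb_hiwat / mss);
    return (0);
}
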
Sponsored by: FreeBSD Foundation
Reviewed by: andre, gnn, rpaulo
MFC after: 2 weeks
"net.inet.tcp.reass.maxsegments" sysctl variables to be based on UMA zone
stats. The value returned by the cursegments sysctl is approximate owing to
the way in which uma_zone_get_cur() is implemented.
- Discontinue use of V_tcp_reass_qsize as a global reassembly segment count
variable in the reassembly implementation. The variable was used without
proper synchronisation and was duplicating accounting done by UMA already. The
lack of synchronisation was particularly problematic on SMP systems
terminating many TCP sessions, resulting in poor TCP performance for
connections with non-zero packet loss.
Sponsored by: FreeBSD Foundation
Reviewed by: andre, gnn, rpaulo (as part of a larger patch)
MFC after: 2 weeks
This fixes a bug where setting bw > 1 MTU/tick resulted in
infinite bandwidth when io_fast=1.
PR: 147245 148429
Obtained from: Riccardo Panicucci
MFC after: 3 days
net.inet.ip.fw.one_pass and always moved to the next rule
after a successful nat.
This should fix several related PRs (waiting for feedback
before closing them).
PR: 145167 149572 150141
MFC after: 3 days
un-expiring.
The previous version of the code had no locking when testing rt_refcnt.
The lack of locking could result in a condition where a routing entry
has a reference count but at the same time has the RTPRF_OURS bit set
and an expiration timer. This would eventually lead to a panic:
panic: rtqkill route really not free
This happens, for instance, when the system accepts ICMP redirects
from the local gateway at a moderate frequency.
Commit this workaround for now until we have a better solution.
PR: kern/149804
Reviewed by: bz
Tested by: Zhao Xin, Pete French
MFC after: 2 weeks
not be used outside of the reassembly queue implementation. Provide a new
function to flush all segments from a reassembly queue and call it from the
appropriate places instead of manipulating the queue directly.
Sponsored by: FreeBSD Foundation
Reviewed by: andre, gnn, rpaulo
MFC after: 2 weeks
in the kernel (just as inet_ntoa() and inet_aton() are) and sync their
prototypes accordingly with the already mentioned functions.
Sponsored by: Sandvine Incorporated
Reviewed by: emaste, rstone
Approved by: dfr
MFC after: 2 weeks
separate the decision logic of whether we can do TSO and the
calculation of the burst length into two distinct parts.
Change the way the TSO burst length calculation is done. While
TSO could do bursts of 65535 bytes, that cannot be represented in
ip_len together with the IP and TCP headers. Account for that and
use IP_MAXPACKET instead of TCP_MAXWIN as the base constant (both
have the same value of 64K). When more data is available, prevent
segments smaller than the MSS from being sent during the current
TSO burst.
Add two more KASSERTs to ensure the integrity of the packets.
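
The burst length rules above can be illustrated with a small standalone
calculation (example numbers only; this is not the committed
tcp_output() code):

#include <sys/types.h>
#include <netinet/in.h>
#include <netinet/ip.h>    /* IP_MAXPACKET */
#include <stdio.h>

int
main(void)
{
    unsigned long avail = 200000;    /* assumed data in the send buffer */
    unsigned int hdrlen = 20 + 20;   /* IP + TCP header, no options */
    unsigned int mss = 1460;         /* assumed maximum segment size */
    unsigned long burst;

    /* ip_len must also cover the headers, so cap the payload. */
    burst = IP_MAXPACKET - hdrlen;
    if (burst > avail)
        burst = avail;

    /* If more data remains, do not end the burst with a runt segment. */
    if (avail > burst)
        burst -= burst % mss;

    printf("TSO burst payload: %lu bytes (%lu full segments)\n",
        burst, burst / mss);
    return (0);
}
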
Tested by: Ben Wilber <ben-at-desync com>
MFC after: 10 days