freebsd-dev

Author	SHA1	Message	Date
Garrett Wollman	98271db4d5	Convert socket structures to be type-stable and add a version number. Define a parameter which indicates the maximum number of sockets in a system, and use this to size the zone allocators used for sockets and for certain PCBs. Convert PF_LOCAL PCB structures to be type-stable and add a version number. Define an external format for infomation about socket structures and use it in several places. Define a mechanism to get all PF_LOCAL and PF_INET PCB lists through sysctl(3) without blocking network interrupts for an unreasonable length of time. This probably still has some bugs and/or race conditions, but it seems to work well enough on my machines. It is now possible for `netstat' to get almost all of its information via the sysctl(3) interface rather than reading kmem (changes to follow).	1998-05-15 20:11:40 +00:00
David Greenman	552b7df4c1	Ensure that TCP_REXMTVAL doesn't return a value less than t_rttmin. This is believed to have been broken with the Brakmo/Peterson srtt calculation changes. The result of this bug is that TCP connections could time out extremely quickly (in 12 seconds). Also backed out jdp's partial fix for this problem in rev 1.17 of tcp_timer.c as it is obsoleted by this commit. Bug was pointed out by Kevin Lehey <kml@roller.nas.nasa.gov>. PR: 6068	1998-04-24 09:25:39 +00:00
Poul-Henning Kamp	8e5db87cdb	Remove the last traces of TUBA. Inspired by: PR kern/3317	1998-04-06 06:52:47 +00:00
David Greenman	f498eeeead	Changes to support the addition of a new sysctl variable: net.inet.tcp.delack_enabled Which defaults to 1 and can be set to 0 to disable TCP delayed-ack processing (i.e. all acks are immediate).	1998-02-26 05:25:39 +00:00
David Greenman	c3229e05a3	Improved connection establishment performance by doing local port lookups via a hashed port list. In the new scheme, in_pcblookup() goes away and is replaced by a new routine, in_pcblookup_local() for doing the local port check. Note that this implementation is space inefficient in that the PCB struct is now too large to fit into 128 bytes. I might deal with this in the future by using the new zone allocator, but I wanted these changes to be extensively tested in their current form first. Also: 1) Fixed off-by-one errors in the port lookup loops in in_pcbbind(). 2) Got rid of some unneeded rehashing. Adding a new routine, in_pcbinshash() to do the initialial hash insertion. 3) Renamed in_pcblookuphash() to in_pcblookup_hash() for easier readability. 4) Added a new routine, in_pcbremlists() to remove the PCB from the various hash lists. 5) Added/deleted comments where appropriate. 6) Removed unnecessary splnet() locking. In general, the PCB functions should be called at splnet()...there are unfortunately a few exceptions, however. 7) Reorganized a few structs for better cache line behavior. 8) Killed my TCP_ACK_HACK kludge. It may come back in a different form in the future, however. These changes have been tested on wcarchive for more than a month. In tests done here, connection establishment overhead is reduced by more than 50 times, thus getting rid of one of the major networking scalability problems. Still to do: make tcp_fastimo/tcp_slowtimo scale well for systems with a large number of connections. tcp_fastimo is easy; tcp_slowtimo is difficult. WARNING: Anything that knows about inpcb and tcpcb structs will have to be recompiled; at the very least, this includes netstat(1).	1998-01-27 09:15:13 +00:00
Garrett Wollman	a29f300e80	The long-awaited mega-massive-network-code- cleanup. Part I. This commit includes the following changes: 1) Old-style (pr_usrreq()) protocols are no longer supported, the compatibility glue for them is deleted, and the kernel will panic on boot if any are compiled in. 2) Certain protocol entry points are modified to take a process structure, so they they can easily tell whether or not it is possible to sleep, and also to access credentials. 3) SS_PRIV is no more, and with it goes the SO_PRIVSTATE setsockopt() call. Protocols should use the process pointer they are now passed. 4) The PF_LOCAL and PF_ROUTE families have been updated to use the new style, as has the `raw' skeleton family. 5) PF_LOCAL sockets now obey the process's umask when creating a socket in the filesystem. As a result, LINT is now broken. I'm hoping that some enterprising hacker with a bit more time will either make the broken bits work (should be easy for netipx) or dike them out.	1997-04-27 20:01:29 +00:00
Peter Wemm	6875d25465	Back out part 1 of the MCFH that changed $Id$ to $FreeBSD$. We are not ready for it yet.	1997-02-22 09:48:43 +00:00
Jordan K. Hubbard	1130b656e5	Make the long-awaited change from $Id$ to $FreeBSD$ This will make a number of things easier in the future, as well as (finally!) avoiding the Id-smashing problem which has plagued developers for so long. Boy, I'm glad we're not using sup anymore. This update would have been insane otherwise.	1997-01-14 07:20:47 +00:00
Paul Traina	561c2ad31c	Move TCPCTL_KEEPINIT to end of MIB list (sigh)	1996-09-13 23:54:03 +00:00
Paul Traina	7b40aa327d	Make the misnamed tcp initial keepalive timer value (which is really the time, in seconds, that state for non-established TCP sessions stays about) a sysctl modifyable variable. [part 1 of two commits, I just realized I can't play with the indices as I was typing this commit message.]	1996-09-13 23:51:44 +00:00
Garrett Wollman	2c37256e5a	Modify the kernel to use the new pr_usrreqs interface rather than the old pr_usrreq mechanism which was poorly designed and error-prone. This commit renames pr_usrreq to pr_ousrreq so that old code which depended on it would break in an obvious manner. This commit also implements the new interface for TCP, although the old function is left as an example (#ifdef'ed out). This commit ALSO fixes a longstanding bug in the TCP timer processing (introduced by davidg on 1995/04/12) which caused timer processing on a TCB to always stop after a single timer had expired (because it misinterpreted the return value from tcp_usrreq() to indicate that the TCB had been deleted). Finally, some code related to polling has been deleted from if.c because it is not relevant t -current and doesn't look at all like my current code.	1996-07-11 16:32:50 +00:00
Garrett Wollman	6da5712b60	Correct formula for TCP RTO calculation. Also try to do a better job in filling in a new PCB's rttvar (but this is not the last word on the subject). And get rid of `#ifdef RTV_RTT', it's been true for four years now...	1996-06-05 16:57:38 +00:00
Garrett Wollman	a2352fc148	Delete #ifdef notdef blocks containing old method of srtt calculation. Requested by: davidg	1996-04-26 18:32:58 +00:00
Garrett Wollman	233e8c18e8	A number of performance-reducing flaws fixed based on comments from Larry Peterson &co. at Arizona: - Header prediction for ACKs did not exclude Fast Retransmit/Recovery. - srtt calculation tended to get ``stuck'' and could never decrease when below 8. It still can't, but the scaling factors are adjusted so that this artifact does not cause as bad an effect on the RTO value as it used to. The paper also points out the incr/8 error that has been long since fixed, and the problems with ACKing frequency resulting from the use of options which I suspect to be fixed already as well (as part of the T/TCP work). Obtained from: Brakmo & Peterson, ``Performance Problems in BSD4.4 TCP''	1996-03-22 18:09:21 +00:00
Bruce Evans	3420f4ab37	Spell tcp_listendrop consistently so that tcp_input.c and netstat compile.	1996-02-27 15:12:53 +00:00
Guido van Rooij	1347f5b8e5	Add a counter for the number of times the listen queue was overflowed to the tcpstat structure. (netstat -s) Reviewed by: wollman Obtained from: Steves, TCP/IP Ill. vol.3, page 189	1996-02-26 21:47:13 +00:00
Mike Pritchard	6c5e9bbdf5	Fix a bunch of spelling errors in the comment fields of a bunch of system include files.	1996-01-30 23:02:38 +00:00
Peter Wemm	34da58488d	remove tcp_lastport - it has not been used for quite a while (at least since the hashed pcb's I think).	1996-01-19 08:02:34 +00:00
David Greenman	858c045f77	Remove some bogus externs.	1995-12-29 01:12:02 +00:00
Bruce Evans	b62d102cbb	Uniformized pr_ctlinput protosw functions. The third arg is now `void *' instead of caddr_t and it isn't optional (it never was). Most of the netipx (and netns) pr_ctlinput functions abuse the second arg instead of using the third arg but fixing this is beyond the scope of this round of changes.	1995-12-16 02:14:44 +00:00
Garrett Wollman	b7a44e3486	Path MTU Discovery is now standard.	1995-12-05 17:46:50 +00:00
Poul-Henning Kamp	0312fbe97d	New style sysctl & staticize alot of stuff.	1995-11-14 20:34:56 +00:00
Andras Olah	a45d27261d	Fix a logical error in T/TCP: when we actively open a connection, we have to decide whether to send a CC or CCnew option in our SYN segment depending on the contents of our TAO cache. This decision has to be made once when the connection starts. The earlier code delayed this decision until the segment was assembled in tcp_output() and retransmitted SYN segments could have different CC options. Reviewed by: Richard Stevens, davidg, wollman	1995-11-03 22:08:13 +00:00
Garrett Wollman	3d1f141b23	The ability to administratively change the MTU of an interface presents a few new wrinkles for MTU discovery which tcp_output() had better be prepared to handle. ip_output() is also modified to do something helpful in this case, since it has already calculated the information we need.	1995-10-16 18:21:26 +00:00
Garrett Wollman	3abc79d2ee	The additional checks involving sequence numbers in MTU discovery resends turned out not to be necessary; simply watching for MTU decreases (which we already did) automagically eliminates all the cases we were trying to protect against.	1995-10-12 17:37:25 +00:00
Garrett Wollman	143d7a5499	More MTU discovery: avoid over-retransmission if route changes in the middle of a fully-open window. Also, keep track of how many retransmits we do as a result of MTU discovery. This may actually do more work than necessary, but it's an unusual condition... Suggested by: Janey Hoe <janey@lcs.mit.edu>	1995-10-10 17:45:43 +00:00
Garrett Wollman	6bb9a8e77d	Make a whole bunch of PCB variables ints rather than shorts. There appear to be no ill effects, and so far as Iknow none of the variables in question depend on 16-bit wraparound behavior. (The sizes are in many cases relics from when a PCB had to fit inside a 128-byte mbuf. PCBs are no longer stored in that way, and the old structure would not have fit, either.)	1995-10-04 20:49:03 +00:00
Peter Wemm	4dc45a5fa1	Remove duplicate definition for tcps_persistdrop, as added by davidg some time ago. I left in Garrett's one, because his was in the 4.4-Lite-2 location, making any diffs just that little bit smaller. I presume this choice means that netstat needs to be recompiled before "netstat -s" will give a meaningful answer on tcp stats.	1995-09-22 07:40:18 +00:00
Garrett Wollman	6c52bc4648	Merge with 4.4-Lite-2. This just adds a couple of tcpstat entries which we don't currently set, but might in the future.	1995-09-21 17:29:13 +00:00
David Greenman	cc0964fb2b	Add connection drop capability for persist timeouts. Reviewed by: Andras Olah Obtained from: 4.4BSD-lite2 via W. Richard Stevens	1995-07-29 18:48:44 +00:00
Garrett Wollman	dd22498271	tcp_input.c - keep track of how many times a route contained a cached rtt or ssthresh that we were able to use tcp_var.h - declare tcpstat entries for above; declare tcp_{send,recv}space in_rmx.c - fill in the MTU and pipe sizes with the defaults TCP would have used anyway in the absence of values here	1995-07-10 15:39:16 +00:00
Garrett Wollman	fc97827135	Keep track of the number of samples through the srtt filter so that we know better when to cache values in the route, rather than relying on a heuristic involving sequence numbers that broke when tcp_sendspace was increased to 16k.	1995-06-29 18:11:24 +00:00
Garrett Wollman	9167720192	Now that we've gone to all sorts of effort to allow TCP to cache some of its connection parameters, we want to keep statistics on how often this actually happens to see whether there is any work that needs to be done in TCP itself. Suggested by: John Wroclawski <jtw@lcs.mit.edu>	1995-06-19 16:45:33 +00:00
David Greenman	15bd2b4385	Implemented PCB hashing. Includes new functions in_pcbinshash, in_pcbrehash, and in_pcblookuphash.	1995-04-09 01:29:31 +00:00
Bruce Evans	b5e8ce9f12	Add and move declarations to fix all of the warnings from `gcc -Wimplicit' (except in netccitt, netiso and netns) and most of the warnings from `gcc -Wnested-externs'. Fix all the bugs found. There were no serious ones.	1995-03-16 18:17:34 +00:00
Garrett Wollman	41f82abe5a	Transaction TCP support now standard. Hack away!	1995-02-16 00:55:44 +00:00
Garrett Wollman	f2ea20e676	Add lots of useful MIB variables and a few not-so-useful ones for completeness.	1995-02-16 00:27:47 +00:00
Garrett Wollman	2f96f1f446	Get rid of some unneeded #ifdef TTCP lines. Also, get rid of some bogus commons declared in header files.	1995-02-14 02:35:19 +00:00
Garrett Wollman	a0292f2375	Merge Transaction TCP, courtesy of Andras Olah <olah@cs.utwente.nl> and Bob Braden <braden@isi.edu>. NB: This has not had David's TCP ACK hack re-integrated. It is not clear what the correct solution to this problem is, if any. If a better solution doesn't pop up in response to this message, I'll put David's code back in (or he's welcome to do so himself).	1995-02-09 23:13:27 +00:00
David Greenman	512ff5eaa0	Fix/#ifdef prototype for tcp_mss...apparantly overlooked by Garrett.	1995-02-09 00:49:20 +00:00
Garrett Wollman	eb6ad69646	Merge in T/TCP TCP header file changes.	1995-02-08 20:18:48 +00:00
Paul Richards	707f139edb	Made idempotent. Submitted by: Paul	1994-08-21 05:27:42 +00:00
David Greenman	3c4dd3568f	Added $Id$	1994-08-02 07:55:43 +00:00
Rodney W. Grimes	df8bae1de4	BSD 4.4 Lite Kernel Sources	1994-05-24 10:09:53 +00:00

44 Commits