freebsd kernel with SKQ
Go to file
Hans Petter Selasky f3e7afe2d7 Implement kernel support for hardware rate limited sockets.
- Add RATELIMIT kernel configuration keyword which must be set to
enable the new functionality.

- Add support for hardware driven, Receive Side Scaling, RSS aware, rate
limited sendqueues and expose the functionality through the already
established SO_MAX_PACING_RATE setsockopt(). The API support rates in
the range from 1 to 4Gbytes/s which are suitable for regular TCP and
UDP streams. The setsockopt(2) manual page has been updated.

- Add rate limit function callback API to "struct ifnet" which supports
the following operations: if_snd_tag_alloc(), if_snd_tag_modify(),
if_snd_tag_query() and if_snd_tag_free().

- Add support to ifconfig to view, set and clear the IFCAP_TXRTLMT
flag, which tells if a network driver supports rate limiting or not.

- This patch also adds support for rate limiting through VLAN and LAGG
intermediate network devices.

- How rate limiting works:

1) The userspace application calls setsockopt() after accepting or
making a new connection to set the rate which is then stored in the
socket structure in the kernel. Later on when packets are transmitted
a check is made in the transmit path for rate changes. A rate change
implies a non-blocking ifp->if_snd_tag_alloc() call will be made to the
destination network interface, which then sets up a custom sendqueue
with the given rate limitation parameter. A "struct m_snd_tag" pointer is
returned which serves as a "snd_tag" hint in the m_pkthdr for the
subsequently transmitted mbufs.

2) When the network driver sees the "m->m_pkthdr.snd_tag" different
from NULL, it will move the packets into a designated rate limited sendqueue
given by the snd_tag pointer. It is up to the individual drivers how the rate
limited traffic will be rate limited.

3) Route changes are detected by the NIC drivers in the ifp->if_transmit()
routine when the ifnet pointer in the incoming snd_tag mismatches the
one of the network interface. The network adapter frees the mbuf and
returns EAGAIN which causes the ip_output() to release and clear the send
tag. Upon next ip_output() a new "snd_tag" will be tried allocated.

4) When the PCB is detached the custom sendqueue will be released by a
non-blocking ifp->if_snd_tag_free() call to the currently bound network
interface.

Reviewed by:		wblock (manpages), adrian, gallatin, scottl (network)
Differential Revision:	https://reviews.freebsd.org/D3687
Sponsored by:		Mellanox Technologies
MFC after:		3 months
2017-01-18 13:31:17 +00:00
bin chmod: Add SIGINFO handler 2017-01-08 06:50:53 +00:00
cddl Convert ipv4_flags and ipv4_offset fields into host byte order. 2016-12-29 20:27:54 +00:00
contrib MFV r312333: zlib 1.2.11. 2017-01-17 05:55:47 +00:00
crypto MFV r311913: 2017-01-11 05:49:39 +00:00
etc Follow r311103: add "pool" to the keywords that rc.d/ntpdate examines to 2017-01-11 00:14:47 +00:00
gnu Enable /usr/lib32 for o32 binaries on mips64. 2017-01-06 23:30:54 +00:00
include Remove some uses of the GCC __nonnull() attribute. 2017-01-01 17:16:47 +00:00
kerberos5 Conditionalize adding ${KRB5DIR}/lib/gssapi/krb5/gkrb5_err.et to ETSRCS 2017-01-02 19:03:01 +00:00
lib Implement kernel support for hardware rate limited sockets. 2017-01-18 13:31:17 +00:00
libexec rtld: do not rely on a populated GOT on amd64 2017-01-16 14:49:29 +00:00
release Enable IPv6 networking on Amazon EC2. 2017-01-15 09:06:45 +00:00
rescue DIRDEPS_BUILD: Update dependencies. 2016-11-13 00:11:30 +00:00
sbin Implement kernel support for hardware rate limited sockets. 2017-01-18 13:31:17 +00:00
secure Conditionalize building libwrap support into sshd 2017-01-07 08:08:35 +00:00
share Add USB audio support for S/PDIF output with C-Media CM6206 devices. 2017-01-17 08:15:10 +00:00
sys Implement kernel support for hardware rate limited sockets. 2017-01-18 13:31:17 +00:00
targets Merge ^/head r308491 through r308841. 2016-11-19 16:05:55 +00:00
tests Fix typo in r312216 2017-01-15 10:29:53 +00:00
tools Add a new socket option SO_TS_CLOCK to pick from several different clock 2017-01-16 17:46:38 +00:00
usr.bin rpcgen(1): Check getrlimit() return for generated code. 2017-01-12 14:44:41 +00:00
usr.sbin Add smilint target to subdir targets so "make smilint" here will run 2017-01-17 03:44:45 +00:00
.arcconfig callsign isn't required anymore 2016-09-29 06:19:45 +00:00
.arclint phabricator related changes: 2015-04-20 20:33:22 +00:00
COPYRIGHT Bump copyright year. 2016-12-31 12:41:42 +00:00
LOCKS
MAINTAINERS Remove myself from kern_timeout.c yeah! 2016-07-27 20:37:32 +00:00
Makefile Add full softfloat and hardfloat support for RISC-V. 2016-11-16 15:21:32 +00:00
Makefile.inc1 Enable /usr/lib32 for o32 binaries on mips64. 2017-01-06 23:30:54 +00:00
Makefile.libcompat Enable /usr/lib32 for o32 binaries on mips64. 2017-01-06 23:30:54 +00:00
ObsoleteFiles.inc Fix typo from change 310985 in ObsoleteFiles.inc 2017-01-10 20:37:44 +00:00
README Vendor import of zlib 1.2.11. 2017-01-17 05:47:05 +00:00
UPDATING Deprecate kernel configuration option EM_MULTIQUEUE now that the em(4) 2017-01-12 14:38:18 +00:00

This is the top level of the FreeBSD source directory.  This file
was last revised on:
$FreeBSD$

For copyright information, please see the file COPYRIGHT in this
directory (additional copyright information also exists for some
sources in this tree - please see the specific source directories for
more information).

The Makefile in this directory supports a number of targets for
building components (or all) of the FreeBSD source tree.  See build(7)
and http://www.freebsd.org/doc/en_US.ISO8859-1/books/handbook/makeworld.html
for more information, including setting make(1) variables.

The `buildkernel` and `installkernel` targets build and install
the kernel and the modules (see below).  Please see the top of
the Makefile in this directory for more information on the
standard build targets and compile-time flags.

Building a kernel is a somewhat more involved process.  See build(7), config(8),
and http://www.freebsd.org/doc/en_US.ISO8859-1/books/handbook/kernelconfig.html
for more information.

Note: If you want to build and install the kernel with the
`buildkernel` and `installkernel` targets, you might need to build
world before.  More information is available in the handbook.

The kernel configuration files reside in the sys/<arch>/conf
sub-directory.  GENERIC is the default configuration used in release builds.
NOTES contains entries and documentation for all possible
devices, not just those commonly used.


Source Roadmap:
---------------

bin		System/user commands.

cddl		Various commands and libraries under the Common Development
		and Distribution License.

contrib		Packages contributed by 3rd parties.

crypto		Cryptography stuff (see crypto/README).

etc		Template files for /etc.

gnu		Various commands and libraries under the GNU Public License.
		Please see gnu/COPYING* for more information.

include		System include files.

kerberos5	Kerberos5 (Heimdal) package.

lib		System libraries.

libexec		System daemons.

release		Release building Makefile & associated tools.

rescue		Build system for statically linked /rescue utilities.

sbin		System commands.

secure		Cryptographic libraries and commands.

share		Shared resources.

sys		Kernel sources.

tests		Regression tests which can be run by Kyua.  See tests/README
		for additional information.

tools		Utilities for regression testing and miscellaneous tasks.

usr.bin		User commands.

usr.sbin	System administration commands.


For information on synchronizing your source tree with one or more of
the FreeBSD Project's development branches, please see:

  http://www.freebsd.org/doc/en_US.ISO8859-1/books/handbook/synching.html