Hans Petter Selasky f3e7afe2d7 Implement kernel support for hardware rate limited sockets.
- Add RATELIMIT kernel configuration keyword which must be set to
enable the new functionality.

- Add support for hardware driven, Receive Side Scaling, RSS aware, rate
limited sendqueues and expose the functionality through the already
established SO_MAX_PACING_RATE setsockopt(). The API support rates in
the range from 1 to 4Gbytes/s which are suitable for regular TCP and
UDP streams. The setsockopt(2) manual page has been updated.

- Add rate limit function callback API to "struct ifnet" which supports
the following operations: if_snd_tag_alloc(), if_snd_tag_modify(),
if_snd_tag_query() and if_snd_tag_free().

- Add support to ifconfig to view, set and clear the IFCAP_TXRTLMT
flag, which tells if a network driver supports rate limiting or not.

- This patch also adds support for rate limiting through VLAN and LAGG
intermediate network devices.

- How rate limiting works:

1) The userspace application calls setsockopt() after accepting or
making a new connection to set the rate which is then stored in the
socket structure in the kernel. Later on when packets are transmitted
a check is made in the transmit path for rate changes. A rate change
implies a non-blocking ifp->if_snd_tag_alloc() call will be made to the
destination network interface, which then sets up a custom sendqueue
with the given rate limitation parameter. A "struct m_snd_tag" pointer is
returned which serves as a "snd_tag" hint in the m_pkthdr for the
subsequently transmitted mbufs.

2) When the network driver sees the "m->m_pkthdr.snd_tag" different
from NULL, it will move the packets into a designated rate limited sendqueue
given by the snd_tag pointer. It is up to the individual drivers how the rate
limited traffic will be rate limited.

3) Route changes are detected by the NIC drivers in the ifp->if_transmit()
routine when the ifnet pointer in the incoming snd_tag mismatches the
one of the network interface. The network adapter frees the mbuf and
returns EAGAIN which causes the ip_output() to release and clear the send
tag. Upon next ip_output() a new "snd_tag" will be tried allocated.

4) When the PCB is detached the custom sendqueue will be released by a
non-blocking ifp->if_snd_tag_free() call to the currently bound network
interface.

Reviewed by:		wblock (manpages), adrian, gallatin, scottl (network)
Differential Revision:	https://reviews.freebsd.org/D3687
Sponsored by:		Mellanox Technologies
MFC after:		3 months
2017-01-18 13:31:17 +00:00
..
2016-06-04 20:20:14 +00:00
2016-08-18 06:39:09 +00:00
2016-09-30 15:47:13 +00:00
2013-01-15 14:09:08 +00:00
2015-10-20 23:52:37 +00:00
2013-08-16 13:16:27 +00:00
2016-03-09 13:45:03 +00:00
2016-02-29 17:20:04 +00:00
2016-09-30 13:04:18 +00:00
2015-03-29 19:14:41 +00:00
2016-10-06 17:35:50 +00:00
2016-02-18 18:50:03 +00:00
2014-05-30 02:16:28 +00:00
2016-03-18 08:47:17 +00:00
2012-05-08 18:56:21 +00:00
2012-05-08 18:56:21 +00:00
2016-08-29 19:53:13 +00:00
2016-08-29 18:47:51 +00:00
2016-02-29 17:22:34 +00:00
2016-08-18 06:39:09 +00:00
2016-08-18 06:39:09 +00:00
2015-02-07 13:11:45 +00:00
2016-09-30 16:02:25 +00:00
2012-05-08 18:56:21 +00:00
2016-01-18 20:21:38 +00:00