numam-dpdk

Morten Brørup b77f58604a mempool: align cache objects on cache lines

Add __rte_cache_aligned to the objs array.
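The effect of the attribute on the structure layout can be sketched as follows. This is a simplified illustration, not the actual DPDK definition: the struct names are hypothetical, only len and objs are modelled, the array size of 32 is arbitrary, and the GCC/Clang aligned attribute with an assumed 64 B cache line stands in for __rte_cache_aligned.

```c
#include <stddef.h>
#include <stdint.h>

#define CACHE_LINE_SIZE 64 /* assumed; DPDK configures RTE_CACHE_LINE_SIZE per target */

/* Hypothetical, simplified layout without the attribute. */
struct cache_unaligned {
	uint32_t len;    /* number of objects currently in the cache */
	void *objs[32];  /* starts right after len (plus pointer padding) */
};

/* Same layout with the objs array aligned on a cache line. */
struct cache_aligned {
	uint32_t len;
	/* GCC/Clang spelling of the alignment requested by the attribute */
	void *objs[32] __attribute__((aligned(CACHE_LINE_SIZE)));
};

/* With the attribute, objs starts on a cache line boundary. */
_Static_assert(offsetof(struct cache_aligned, objs) % CACHE_LINE_SIZE == 0,
	       "objs must start on a cache line boundary");
```

In this sketch the aligned array starts at offset 64, while the unaligned one shares cache line 0 with len.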

It makes no difference in the general case, but if get/put operations are
always 32 objects, it will reduce the number of memory (or last level
cache) accesses from five to four 64 B cache lines for every get/put
operation.
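The five versus four figure can be checked with a small calculation. The sketch below assumes 8 B object pointers, 64 B cache lines, and an unaligned objs array starting 8 B into the structure; the helper names are hypothetical. It counts only the lines holding object pointers, so the line holding len is extra unless the burst already covers it.

```c
#include <stddef.h>

#define CACHE_LINE_SIZE 64u /* assumed cache line size in bytes */
#define OBJ_SIZE 8u         /* assumed size of one object pointer */

/* Number of distinct cache lines covered by bytes [start, start + len), len > 0. */
static unsigned int
lines_spanned(size_t start, size_t len)
{
	size_t first = start / CACHE_LINE_SIZE;
	size_t last = (start + len - 1) / CACHE_LINE_SIZE;

	return (unsigned int)(last - first + 1);
}

/*
 * Cache lines holding the objects of the 0-based burst number `burst`,
 * where every burst is `n` objects and the objs array starts
 * `objs_offset` bytes into the structure.
 */
static unsigned int
object_lines(size_t objs_offset, unsigned int n, unsigned int burst)
{
	size_t bytes = (size_t)n * OBJ_SIZE;

	return lines_spanned(objs_offset + burst * bytes, bytes);
}
```

Under these assumptions, object_lines(8, 32, burst) is 5 for every burst, while object_lines(64, 32, burst) is 4.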

For readability reasons, an example using 16 objects follows:

Currently, with 16 objects (128 B), we access 3
cache lines:

      ┌────────┐
      │len     │
cache │********│---
line0 │********│ ^
      │********│ |
      ├────────┤ | 16 objects
      │********│ | 128B
cache │********│ |
line1 │********│ |
      │********│ |
      ├────────┤ |
      │********│_v_
cache │        │
line2 │        │
      │        │
      └────────┘

With the alignment, it is also 3 cache lines:

      ┌────────┐
      │len     │
cache │        │
line0 │        │
      │        │
      ├────────┤---
      │********│ ^
cache │********│ |
line1 │********│ |
      │********│ |
      ├────────┤ | 16 objects
      │********│ | 128B
cache │********│ |
line2 │********│ |
      │********│ v
      └────────┘---

However, accessing the objects at the bottom of the mempool cache is a
special case: without the alignment, cache line0 holds objects as well as len.

Consider the next burst (and any following bursts):

Current:
      ┌────────┐
      │len     │
cache │        │
line0 │        │
      │        │
      ├────────┤
      │        │
cache │        │
line1 │        │
      │        │
      ├────────┤
      │        │
cache │********│---
line2 │********│ ^
      │********│ |
      ├────────┤ | 16 objects
      │********│ | 128B
cache │********│ |
line3 │********│ |
      │********│ |
      ├────────┤ |
      │********│_v_
cache │        │
line4 │        │
      │        │
      └────────┘
4 cache lines touched, incl. line0 for len.

With the proposed alignment:
      ┌────────┐
      │len     │
cache │        │
line0 │        │
      │        │
      ├────────┤
      │        │
cache │        │
line1 │        │
      │        │
      ├────────┤
      │        │
cache │        │
line2 │        │
      │        │
      ├────────┤
      │********│---
cache │********│ ^
line3 │********│ |
      │********│ | 16 objects
      ├────────┤ | 128B
      │********│ |
cache │********│ |
line4 │********│ |
      │********│_v_
      └────────┘
Only 3 cache lines touched, incl. line0 for len.
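
The counts in the diagrams can be reproduced with a short calculation. As in the diagrams, this sketch assumes 8 B objects, 64 B cache lines, and len sitting in cache line 0, with objs starting at offset 8 (current) or 64 (aligned); the function name is hypothetical.

```c
#include <stddef.h>

#define CACHE_LINE_SIZE 64u /* assumed cache line size in bytes */
#define OBJ_SIZE 8u         /* assumed size of one object pointer */

/*
 * Cache lines touched by the 0-based burst number `burst` of `n` objects,
 * including line 0 for len, when the objs array starts `objs_offset`
 * bytes into the structure.
 */
static unsigned int
lines_touched(size_t objs_offset, unsigned int n, unsigned int burst)
{
	size_t bytes = (size_t)n * OBJ_SIZE;
	size_t first = (objs_offset + burst * bytes) / CACHE_LINE_SIZE;
	size_t last = (objs_offset + (burst + 1) * bytes - 1) / CACHE_LINE_SIZE;
	unsigned int lines = (unsigned int)(last - first + 1);

	/* len lives in line 0; add it unless the objects already cover line 0 */
	if (first != 0)
		lines++;

	return lines;
}
```

For 16 objects, lines_touched(8, 16, 0) and lines_touched(64, 16, 0) both give 3, matching the first pair of diagrams. For the next burst, lines_touched(8, 16, 1) gives 4 and lines_touched(64, 16, 1) gives 3.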

Credits go to Olivier Matz for the nice ASCII graphics.

Signed-off-by: Morten Brørup <mb@smartsharesystems.com>
Reviewed-by: Andrew Rybchenko <andrew.rybchenko@oktetlabs.ru>
Acked-by: Olivier Matz <olivier.matz@6wind.com>

2022-10-30 10:07:58 +01:00

.ci

ci: combine static and shared linking build tests

2022-10-27 13:20:12 +02:00

.github/workflows

ci: combine static and shared linking build tests

2022-10-27 13:20:12 +02:00

app

ethdev: add structure for indirect flow age update

2022-10-28 12:41:03 +02:00

buildtools

buildtools: fix NUMA nodes count

2022-10-11 02:13:52 +02:00

config

config/arm: add Phytium TengYun S2500

2022-10-26 17:39:24 +02:00

devtools

devtools: guess checkpatch.pl path

2022-10-11 02:18:48 +02:00

doc

ethdev: add structure for indirect flow age update

2022-10-28 12:41:03 +02:00

drivers

net/cnxk: handle SA hard expiry events

2022-10-18 12:59:55 +02:00

examples

examples/qos_sched: support higher rates for subport/pipe

2022-10-28 16:20:59 +02:00

kernel

kni: use dedicated function to set MAC address

2022-06-08 19:17:21 +02:00

lib

mempool: align cache objects on cache lines

2022-10-30 10:07:58 +01:00

license

license: add MIT license exception for GVE driver

2022-10-27 12:36:22 +02:00

usertools

usertools/pmdinfo: rewrite simpler script

2022-10-11 02:11:33 +02:00

.editorconfig

devtools: clarify that lines up to 100 characters are ok

2021-11-25 11:51:24 +01:00

.gitattributes

…

.gitignore

doc: add eventdev feature matrices

2021-11-26 16:29:25 +01:00

.travis.yml

version: 22.11-rc0

2022-07-21 12:13:48 +02:00

ABI_VERSION

version: 22.11-rc0

2022-07-21 12:13:48 +02:00

MAINTAINERS

maintainers: update for Microsoft vmbus and netvsc

2022-10-30 09:47:17 +01:00

Makefile

build: create dummy Makefile

2020-09-07 23:51:57 +02:00

meson_options.txt

flow_classify: mark library as deprecated

2022-10-28 16:20:59 +02:00

meson.build

build: export include directories list

2022-10-28 14:27:48 +02:00

README

license: introduce SPDX identifiers

2018-01-04 22:41:38 +01:00

VERSION

version: 22.11-rc1

2022-10-11 02:39:28 +02:00

README

DPDK is a set of libraries and drivers for fast packet processing.
It supports many processor architectures and both FreeBSD and Linux.

The DPDK uses the Open Source BSD-3-Clause license for the core libraries
and drivers. The kernel components are GPL-2.0 licensed.

Please check the doc directory for release notes,
API documentation, and sample application information.

For questions and usage discussions, subscribe to: users@dpdk.org
Report bugs and issues to the development mailing list: dev@dpdk.org