numam-dpdk

Author	SHA1	Message	Date
David Marchand	7a66c72d6c	virtio: fix check when mapping PCI resources According to the api, rte_eal_pci_map_device is only successful when returning 0. Fixes: 6ba1f63b5ab0 ("virtio: support specification 1.0") Signed-off-by: David Marchand <david.marchand@6wind.com> Acked-by: Yuanhan Liu <yuanhan.liu@linux.intel.com>	2016-02-16 22:55:44 +01:00
David Marchand	25294cd3a6	virtio: fix FreeBSD build Fixes: c52afa68d763 ("virtio: move left PCI stuff in the right file") Signed-off-by: David Marchand <david.marchand@6wind.com> Acked-by: Yuanhan Liu <yuanhan.liu@linux.intel.com>	2016-02-16 22:55:44 +01:00
Thomas Monjalon	0972d7c22b	eal: remove compiler optimization workaround The compiler optimization was disabled a long time ago without describing what was the exact issue. Maybe it does not apply anymore. As it looks unneeded, let's remove this strange pragma. Signed-off-by: Thomas Monjalon <thomas.monjalon@6wind.com>	2016-02-16 08:28:00 +01:00
Thomas Monjalon	9369dcb7a6	eal/ppc: adapt CPU flags check to the arch The structure feature_entry does not need leaf/subleaf which were copied from x86 CPUID implementation. On x86, a valid flag is detected with the non-zero leaf value. This check is replaced by a check with a dummy "none" register. Signed-off-by: Thomas Monjalon <thomas.monjalon@6wind.com>	2016-02-16 08:28:00 +01:00
Thomas Monjalon	5851aa9171	eal/arm: adapt CPU flags check to the arch The structure feature_entry does not need leaf/subleaf which were copied from x86 CPUID implementation. On x86, a valid flag is detected with the non-zero leaf value. This check is replaced by a check with a dummy "none" register. Signed-off-by: Thomas Monjalon <thomas.monjalon@6wind.com> Acked-by: Jerin Jacob <jerin.jacob@caviumnetworks.com> Tested-by: Jerin Jacob <jerin.jacob@caviumnetworks.com>	2016-02-16 08:28:00 +01:00
Thomas Monjalon	ba560ac30c	eal: move CPU flag functions out of headers The patch c344eab3ee has moved the hardware definition of CPU flags. Now the functions checking these hardware flags are also moved. The function rte_cpu_get_flag_enabled() is no more inline. The benefits are: - remove rte_cpu_feature_table from the ABI (recently added) - hide hardware details from the API - allow to adapt structures per arch (done in next patch) Signed-off-by: Thomas Monjalon <thomas.monjalon@6wind.com> Acked-by: Jerin Jacob <jerin.jacob@caviumnetworks.com>	2016-02-16 08:28:00 +01:00
Thomas Monjalon	9f8faed956	eal: get CPU flag name The new function rte_cpu_get_flag_name() is added to the EAL API. It is implemented (duplicated) in each arch because the next patch will remove the public exposure of the feature tables. Signed-off-by: Thomas Monjalon <thomas.monjalon@6wind.com>	2016-02-16 08:28:00 +01:00
Thomas Monjalon	1f1d7f76ed	examples: fix build dependencies When building for ARM some examples were failing to compile because of some dependencies disabled. Declaring these dependencies prevent from trying to compile some not supported examples. Signed-off-by: Thomas Monjalon <thomas.monjalon@6wind.com>	2016-02-16 07:33:44 +01:00
Thomas Monjalon	71e6e8c519	examples/ethtool: fix build When building for ARM, the spinlock structure was not found. It appears to be a mismatch with rwlock which is not used in this file. Fixes: bda68ab9d1e7 ("examples/ethtool: add user-space ethtool sample application") Signed-off-by: Thomas Monjalon <thomas.monjalon@6wind.com> Acked-by: Remy Horton <remy.horton@intel.com>	2016-02-16 07:33:44 +01:00
Thomas Monjalon	28377375c6	examples/ip_pipeline: fix build for x86_64 without SSE4.2 The compiler cannot use _mm_crc32_u64: examples/ip_pipeline/pipeline/hash_func.h:165:9: error: implicit declaration of function '_mm_crc32_u64' is invalid in C99 Fixes: 947024a26df7 ("examples/ip_pipeline: rework passthrough pipeline") Signed-off-by: Thomas Monjalon <thomas.monjalon@6wind.com>	2016-02-16 07:33:43 +01:00
Thomas Monjalon	a00341bbe5	examples/l3fwd: fix build without SSE4.1 clang reports this error: examples/l3fwd/main.c:550:1: error: unused function 'send_packetsx4' The function is used only when ENABLE_MULTI_BUFFER_OPTIMIZE is 1. Fixes: 96ff445371e0 ("examples/l3fwd: reorganise and optimize LPM code path") Fixes: 6f1c1e28d98e ("examples/l3fwd: fix build with exact-match enabled") Signed-off-by: Thomas Monjalon <thomas.monjalon@6wind.com> Acked-by: Pablo de Lara <pablo.de.lara.guarch@intel.com>	2016-02-16 07:33:38 +01:00
Jerin Jacob	64502fab90	examples/distributor: fix build for non-x86 arch _mm_prefetch is defined only in x86 compilers. Use rte_prefetch_non_temporal() abstraction instead of _mm_prefetch(x, 0) to in-order to build distributor application for non x86 platforms Signed-off-by: Jerin Jacob <jerin.jacob@caviumnetworks.com>	2016-02-16 07:21:31 +01:00
Jerin Jacob	ab3af0959d	eal: introduce non-temporal prefetch non-temporal/transient/stream version of rte_prefetch0() The non-temporal prefetch is intended as a prefetch hint that processor will use the prefetched data only once or short period, unlike the rte_prefetch0() function which imply that prefetched data to use repeatedly. Signed-off-by: Jerin Jacob <jerin.jacob@caviumnetworks.com> Acked-by: Jan Viktorin <viktorin@rehivetech.com>	2016-02-16 07:19:19 +01:00
Jerin Jacob	5fa83b5398	ethdev: reduce alignment requirement for 128-byte cache line slow-path data structures need not be 128-byte cache aligned. Reduce the alignment to 64-byte to save the memory. No behavior change for 64-byte cache aligned systems as minimum cache line size as 64. Signed-off-by: Jerin Jacob <jerin.jacob@caviumnetworks.com> Acked-by: Konstantin Ananyev <konstantin.ananyev@intel.com>	2016-02-11 12:45:35 +01:00
Jerin Jacob	0580a664e3	bitmap: optimize for 128-bytes cache line existing rte_bitmap library implementation optimally configured to run on 64-bytes cache line, extending to 128-bytes cache line targets. Signed-off-by: Jerin Jacob <jerin.jacob@caviumnetworks.com>	2016-02-11 12:45:35 +01:00
Jerin Jacob	99a5744147	mbuf: fix performance with 128-byte cache line No need to split mbuf structure to two cache lines for 128-byte cache line size targets as it can fit on a single 128-byte cache line. Signed-off-by: Jerin Jacob <jerin.jacob@caviumnetworks.com>	2016-02-11 12:45:35 +01:00
Jerin Jacob	acf7b47cdc	eal: introduce new cache line macros - RTE_CACHE_LINE_MIN_SIZE(Supported minimum cache line size) - __rte_cache_min_aligned(Force minimum cache line alignment) - RTE_CACHE_LINE_SIZE_LOG2(Express cache line size in terms of log2) Suggested-by: Konstantin Ananyev <konstantin.ananyev@intel.com> Signed-off-by: Jerin Jacob <jerin.jacob@caviumnetworks.com>	2016-02-11 12:45:35 +01:00
Jerin Jacob	6e757e6942	config: clean cache line size selection scheme by default, all the targets will be configured with the 64-byte cache line size, targets which have different cache line size can be overridden through target specific config file. Selected ThunderX and power8 as CONFIG_RTE_CACHE_LINE_SIZE=128 targets based on existing configuration. Signed-off-by: Jerin Jacob <jerin.jacob@caviumnetworks.com> Acked-by: Thomas Monjalon <thomas.monjalon@6wind.com>	2016-02-11 12:45:35 +01:00
Thomas Monjalon	94e4b3a607	config: add a common x86 flag Intel Architecture (IA), also called x86, is declined in - i686 - x86_x32 - x86_64 The code common to all of these architectures can now be guarded by a single flag RTE_ARCH_X86. Signed-off-by: Thomas Monjalon <thomas.monjalon@6wind.com> Acked-by: Konstantin Ananyev <konstantin.ananyev@intel.com>	2016-02-11 12:45:31 +01:00
Thomas Monjalon	5b71dc1b08	config: remove obsolete machine descriptions More and more machines and architectures are added without keeping the lists up-to-date. Replace the lists with a pointer to the reference directory. The same kind of pointer is used for the supported compilers and environments. Signed-off-by: Thomas Monjalon <thomas.monjalon@6wind.com> Acked-by: Konstantin Ananyev <konstantin.ananyev@intel.com>	2016-02-11 12:45:21 +01:00
Thomas Monjalon	50810f095a	config: remove useless explicit includes of generated header The file rte_config.h is automatically generated and included. No need to #include it. The example performance-thread needs a makefile fix to avoid overwriting the default cflags. Signed-off-by: Thomas Monjalon <thomas.monjalon@6wind.com>	2016-02-10 22:43:38 +01:00
Bruce Richardson	ad8f40dabe	doc: rename release notes 2.3 to 16.04 Updated release documentation to reflect new numbering scheme. Signed-off-by: Bruce Richardson <bruce.richardson@intel.com> Signed-off-by: John McNamara <john.mcnamara@intel.com> Acked-by: Thomas Monjalon <thomas.monjalon@6wind.com>	2016-02-10 22:43:32 +01:00
Bruce Richardson	6d7de6d2e3	version: switch to year.month numbers As discussed on list, switch numbering scheme to be based on year/month. Release 2.3 then becomes 16.04. Ref: http://dpdk.org/ml/archives/dev/2015-December/030336.html Also, added zero padding to the month so that it appear as 16.04 and not 16.4 in "make showversion" and rte_version(). Signed-off-by: Bruce Richardson <bruce.richardson@intel.com> Signed-off-by: John McNamara <john.mcnamara@intel.com> Acked-by: Thomas Monjalon <thomas.monjalon@6wind.com>	2016-02-10 22:43:26 +01:00
Thomas Monjalon	4b15247150	doc: drop old naming of the project It was requested by Intel, more than one year ago, to replace the name "Intel DPDK" by "DPDK". Some references to the old name were still in some docs and code comments, leading to confusion. Fixes: ac8ada004c12 ("doc: remove Intel references from release notes") Signed-off-by: Thomas Monjalon <thomas.monjalon@6wind.com> Acked-by: Bruce Richardson <bruce.richardson@intel.com>	2016-02-10 15:47:51 +01:00
Huawei Xie	693f715da4	remove extra parentheses in return statement fix the error reported by checkpatch: "ERROR: return is not a function, parentheses are not required" remove parentheses in return like: "return (logical expressions)" remove parentheses in return a function like: "return (rte_mempool_lookup(...))" Fixes: 6307b909b8e0 ("lib: remove extra parenthesis after return") Signed-off-by: Huawei Xie <huawei.xie@intel.com>	2016-02-10 15:47:50 +01:00
Kamil Rytarowski	6e7caa1ad9	eal/linux: support built-in kernel modules Currently rte_eal_check_module() detects Linux kernel modules via reading /proc/modules. Built-in ones aren't listed there and therefore they are not being found. Add support for checking built-in modules with parsing the sysfs files This commit obsoletes the /proc/modules parsing approach. Signed-off-by: Kamil Rytarowski <kamil.rytarowski@caviumnetworks.com> Acked-by: David Marchand <david.marchand@6wind.com> Acked-by: Yuanhan Liu <yuanhan.liu@linux.intel.com>	2016-02-09 16:03:46 +01:00
Kamil Rytarowski	bb9f408550	tools: support binding to built-in kernel modules Currently dpdk_nic_bind.py detects Linux kernel modules via reading /proc/modules. Built-in ones aren't listed there and therefore they are not being found by the script. Add support for checking built-in modules with parsing the sysfs files. This commit obsoletes the /proc/modules parsing approach. Signed-off-by: Kamil Rytarowski <kamil.rytarowski@caviumnetworks.com> Acked-by: Yuanhan Liu <yuanhan.liu@linux.intel.com>	2016-02-09 16:03:46 +01:00
Dawid Jurczak	16c1814c80	tools: support Python 3 in bind script This patch fixes syntax errors during binding ethernet device on systems where Python 3 is default. Backward compatibility with Python 2 is preserved. Signed-off-by: Dawid Jurczak <dawid_jurek@vp.pl>	2016-02-09 12:54:09 +01:00
Jeff Shaw	da82ee17e6	tools: fix unbinding failure handling We should call sys.exit(), not divide sys by exit(). Signed-off-by: Jeff Shaw <jeffrey.b.shaw@intel.com> Acked-by: Bruce Richardson <bruce.richardson@intel.com>	2016-02-09 12:54:09 +01:00
Thomas Monjalon	e45ae7065e	doc: introduce networking driver matrix In order to better compare the drivers and check what is missing for a common baseline, we need to fill a matrix. A CSS trick is used to fit the HTML page. The PDF output needs some LaTeX wizardry. Signed-off-by: Thomas Monjalon <thomas.monjalon@6wind.com>	2016-02-09 12:22:26 +01:00
Antonio Fischetti	9c699fd8e8	doc: add a further example in ACL guide Add a further ACL example where the elements of the search key are not entirely fitting into the 4 consecutive bytes of all input fields. Signed-off-by: Antonio Fischetti <antonio.fischetti@intel.com> Acked-by: John McNamara <john.mcnamara@intel.com>	2016-02-09 12:22:26 +01:00
Ferruh Yigit	f02730abde	doc: fix multi-process guide * remove outdated chapter reference to Multi-process support. * html output converts "--" to "-", this is wrong when explaining the command arguments, used fixed width quotes for them. Fixes: fc1f2750a3ec ("doc: programmers guide") Signed-off-by: Ferruh Yigit <ferruh.yigit@intel.com>	2016-02-09 12:22:26 +01:00
Zhihong Wang	bb62344cb7	eal/x86: fix build with clang for old AVX When configuring RTE_MACHINE to "default", rte_memcpy implementation is the default one (old AVX). In this code, clang raises a warning thanks to -Wsometimes-uninitialized: rte_memcpy.h:838:6: error: variable 'srcofs' is used uninitialized whenever 'if' condition is false if (dstofss > 0) { ^~~~~~~~~~~ rte_memcpy.h:849:6: note: uninitialized use occurs here if (srcofs == 0) { ^~~~~~ It is fixed by moving srcofs initialization out of the condition. Also dstofss calculation is corrected. Fixes: 1ae817f9f887 ("eal/x86: tune memcpy for platforms without AVX512") Reported-by: Pablo de Lara <pablo.de.lara.guarch@intel.com> Signed-off-by: Zhihong Wang <zhihong.wang@intel.com>	2016-02-04 22:36:02 +01:00
Yuanhan Liu	b86af7b1b5	virtio: move ioport macros virtio_pci.c is the only file references macros VIRTIO_READ/WRITE_REG_X. Move them there. Signed-off-by: Yuanhan Liu <yuanhan.liu@linux.intel.com> Tested-by: Qian Xu <qian.q.xu@intel.com> Reviewed-by: Tetsuya Mukawa <mukawa@igel.co.jp> Tested-by: Tetsuya Mukawa <mukawa@igel.co.jp> Acked-by: Huawei Xie <huawei.xie@intel.com>	2016-02-03 16:07:50 +01:00
Yuanhan Liu	6ba1f63b5a	virtio: support specification 1.0 Modern (v1.0) virtio pci device defines several pci capabilities. Each cap has a configure structure corresponding to it, and the cap.bar and cap.offset fields tell us where to find it. Firstly, we map the pci resources by rte_eal_pci_map_device(). We then could easily locate a cfg structure by: cfg_addr = dev->mem_resources[cap.bar].addr + cap.offset; Therefore, the entrance of enabling modern (v1.0) pci device support is to iterate the pci capability lists, and to locate some configs we care; and they are: - common cfg For generic virtio and virtqueue configuration, such as setting/getting features, enabling a specific queue, and so on. - nofity cfg Combining with `queue_notify_off' from common cfg, we could use it to notify a specific virt queue. - device cfg Where virtio_net_config structure is located. - isr cfg Where to read isr (interrupt status). If any of above cap is not found, we fallback to the legacy virtio handling. If succeed, hw->vtpci_ops is assigned to modern_ops, where all operations are implemented by reading/writing a (or few) specific configuration space from above 4 cfg structures. And that's basically how this patch works. Besides those changes, virtio 1.0 introduces a new status field: FEATURES_OK, which is set after features negotiation is done. Last, set the VIRTIO_F_VERSION_1 feature flag. Signed-off-by: Yuanhan Liu <yuanhan.liu@linux.intel.com> Tested-by: Qian Xu <qian.q.xu@intel.com> Reviewed-by: Tetsuya Mukawa <mukawa@igel.co.jp> Tested-by: Tetsuya Mukawa <mukawa@igel.co.jp> Acked-by: Huawei Xie <huawei.xie@intel.com>	2016-02-03 16:07:50 +01:00
Yuanhan Liu	962cf902e6	pci: export device mapping functions Normally we could set RTE_PCI_DRV_NEED_MAPPING flag so that eal will invoke pci_map_device internally for us. From that point view, there is no need to export pci_map_device. However, for virtio pmd driver, which is designed to work without binding UIO (or something similar first), pci_map_device() will fail, which ends up with virtio pmd driver being skipped. Therefore, we can not set RTE_PCI_DRV_NEED_MAPPING blindly at virtio pmd driver. Therefore, this patch exports pci_map_device, and let virtio pmd call it when necessary. Signed-off-by: Yuanhan Liu <yuanhan.liu@linux.intel.com> Tested-by: Santosh Shukla <sshukla@mvista.com> Tested-by: Qian Xu <qian.q.xu@intel.com> Reviewed-by: Tetsuya Mukawa <mukawa@igel.co.jp> Tested-by: Tetsuya Mukawa <mukawa@igel.co.jp> Acked-by: David Marchand <david.marchand@6wind.com> Acked-by: Huawei Xie <huawei.xie@intel.com>	2016-02-03 16:07:50 +01:00
Yuanhan Liu	1905e101dc	virtio: retrieve header size from device setting The mergeable virtio net hdr format has been the standard and the only virtio net hdr format since virtio 1.0. Therefore, we can not hardcode hdr_size to "sizeof(struct virtio_net_hdr)" any more at virtio_recv_pkts(), otherwise, there would be a mismatch of hdr size from rte_vhost_enqueue_burst() and virtio_recv_pkts(), leading a packet corruption. Instead, we should retrieve it from hw->vtnet_hdr_size; we will do proper settings at eth_virtio_dev_init() in later patches. Signed-off-by: Yuanhan Liu <yuanhan.liu@linux.intel.com> Tested-by: Qian Xu <qian.q.xu@intel.com> Reviewed-by: Tetsuya Mukawa <mukawa@igel.co.jp> Tested-by: Tetsuya Mukawa <mukawa@igel.co.jp> Acked-by: Huawei Xie <huawei.xie@intel.com>	2016-02-03 16:07:49 +01:00
Yuanhan Liu	3891f233f7	virtio: switch to 64 bit features Switch to 64 bit features, which virtio 1.0 supports. While legacy virtio only supports 32 bit features, it complains aloud and quit when trying to setting > 32 bit features. Signed-off-by: Yuanhan Liu <yuanhan.liu@linux.intel.com> Tested-by: Qian Xu <qian.q.xu@intel.com> Reviewed-by: Tetsuya Mukawa <mukawa@igel.co.jp> Tested-by: Tetsuya Mukawa <mukawa@igel.co.jp> Acked-by: Huawei Xie <huawei.xie@intel.com>	2016-02-03 16:07:49 +01:00
Yuanhan Liu	c52afa68d7	virtio: move left PCI stuff in the right file virtio_pci.c is a more proper place for pci stuff; virtio_ethdev is not. Signed-off-by: Yuanhan Liu <yuanhan.liu@linux.intel.com> Tested-by: Qian Xu <qian.q.xu@intel.com> Reviewed-by: Tetsuya Mukawa <mukawa@igel.co.jp> Tested-by: Tetsuya Mukawa <mukawa@igel.co.jp> Acked-by: Huawei Xie <huawei.xie@intel.com>	2016-02-03 16:07:49 +01:00
Yuanhan Liu	d5bbeefca8	virtio: introduce PCI implementation structure Introduce struct virtio_pci_ops, to let legacy virtio (v0.95) and modern virtio (1.0) have different implementation regarding to a specific pci action, such as read host status. With that, this patch reimplements all exported pci functions, in a way like: vtpci_foo_bar(struct virtio_hw *hw) { hw->vtpci_ops->foo_bar(hw); } So that we need pay attention to those pci related functions only while adding virtio 1.0 support. This patch introduced a new vtpci function, vtpci_init(), to do proper virtio pci settings. It's pretty simple so far: just sets hw->vtpci_ops to legacy_ops as we don't support 1.0 yet. Signed-off-by: Yuanhan Liu <yuanhan.liu@linux.intel.com> Tested-by: Qian Xu <qian.q.xu@intel.com> Reviewed-by: Tetsuya Mukawa <mukawa@igel.co.jp> Tested-by: Tetsuya Mukawa <mukawa@igel.co.jp> Acked-by: Huawei Xie <huawei.xie@intel.com>	2016-02-03 16:07:49 +01:00
Yuanhan Liu	4c2277ff45	virtio: define offset as size_t type offset arg of vtpci_read/write_dev_config is derived from offsetof(), which is of size_t type, instead of uint64_t. So, define it as size_t type. Signed-off-by: Yuanhan Liu <yuanhan.liu@linux.intel.com> Tested-by: Qian Xu <qian.q.xu@intel.com> Reviewed-by: Tetsuya Mukawa <mukawa@igel.co.jp> Tested-by: Tetsuya Mukawa <mukawa@igel.co.jp> Acked-by: Huawei Xie <huawei.xie@intel.com>	2016-02-03 16:07:49 +01:00
Yuanhan Liu	c47787cfaa	virtio: do not set vring address again at queue startup As we have already set up it at virtio_dev_queue_setup(), and a vq restart will not reset the settings. Signed-off-by: Yuanhan Liu <yuanhan.liu@linux.intel.com> Tested-by: Qian Xu <qian.q.xu@intel.com> Reviewed-by: Tetsuya Mukawa <mukawa@igel.co.jp> Tested-by: Tetsuya Mukawa <mukawa@igel.co.jp> Acked-by: Huawei Xie <huawei.xie@intel.com>	2016-02-03 16:07:49 +01:00
John McNamara	228d0c681c	doc: add example text to release notes Added example text to each of the release notes sections to show the preferred format. Signed-off-by: John McNamara <john.mcnamara@intel.com>	2016-02-03 16:07:49 +01:00
Ferruh Yigit	c344eab3ee	eal: move cpu flags out of headers Move cpu_feature_table array from arch specific rte_cpuflags.h files to new arch specific rte_cpuflags.c files. Main motivation is to escape from static variable declarations in header files. cpu_feature_table has many copies in final binary, even exist in some object files that does not use this variable at all. And this can be a sample to create architecture specific source files and move some functions which are not performance sensitive from architecture header files to source files. Signed-off-by: Ferruh Yigit <ferruh.yigit@intel.com>	2016-01-29 19:41:48 +01:00
Ferruh Yigit	dd34ff1f0e	lib: remove keyword extern for functions Remove "extern" keywords in header files, the ones for function prototypes Signed-off-by: Ferruh Yigit <ferruh.yigit@intel.com>	2016-01-28 18:40:46 +01:00
Anatoly Burakov	e61512e406	vfio: support no-IOMMU mode This commit is adding a generic mechanism to support multiple IOMMU types. For now, it's only type 1 (x86 IOMMU) and no-IOMMU (a special VFIO mode that doesn't use IOMMU at all), but it's easily extended by adding necessary definitions to eal_vfio.h, and DMA mapping functions to eal_pci_vfio.c. Since type 1 IOMMU module is no longer necessary to have VFIO, we fix the module check to check for vfio-pci instead. It's not ideal and triggers VFIO checks more often (and thus produces more error output, which was the reason behind the module check in the first place), so we compensate for that by providing more verbose logging, indicating whether VFIO initialization has succeeded or failed. Signed-off-by: Anatoly Burakov <anatoly.burakov@intel.com> Signed-off-by: Santosh Shukla <sshukla@mvista.com> Tested-by: Santosh Shukla <sshukla@mvista.com>	2016-01-28 17:56:05 +01:00
Michael Qiu	2593612db0	eal/x86: fix build with gcc 5.3.1 In fedora 22 with GCC version 5.3.1, when compile, will result an error: include/rte_memcpy.h:309:7: error: "RTE_MACHINE_CPUFLAG_AVX2" is not defined [-Werror=undef] #elif RTE_MACHINE_CPUFLAG_AVX2 Fixes: 9484092baad3 ("eal/x86: optimize memcpy for AVX512 platforms") Signed-off-by: Michael Qiu <michael.qiu@intel.com> Acked-by: Zhihong Wang <zhihong.wang@intel.com>	2016-01-28 09:33:50 +01:00
Zhihong Wang	48093287c8	app/test: adjust alignment unit for memcpy performance Decide alignment unit for memcpy perf test based on predefined macros. Signed-off-by: Zhihong Wang <zhihong.wang@intel.com>	2016-01-27 21:16:07 +01:00
Zhihong Wang	1ae817f9f8	eal/x86: tune memcpy for platforms without AVX512 For prior platforms, add condition for unalignment handling, to keep this operation from interrupting the batch copy loop for aligned cases. Signed-off-by: Zhihong Wang <zhihong.wang@intel.com>	2016-01-27 21:14:52 +01:00
Zhihong Wang	9484092baa	eal/x86: optimize memcpy for AVX512 platforms Implement AVX512 memcpy and choose the right implementation based on predefined macros, to make full utilization of hardware resources and deliver high performance. In current DPDK, memcpy holds a large proportion of execution time in libs like Vhost, especially for large packets, and this patch can bring considerable benefits for AVX512 platforms. The implementation is based on the current DPDK memcpy framework, some background introduction can be found in these threads: http://dpdk.org/ml/archives/dev/2014-November/008158.html http://dpdk.org/ml/archives/dev/2015-January/011800.html Signed-off-by: Zhihong Wang <zhihong.wang@intel.com>	2016-01-27 21:14:52 +01:00

... 3 4 5 6 7 ...

4003 Commits