net/i40e: remove compiler barrier from NEON Rx
As packet length extraction code was simplified,the ordering was not necessary any more.[1] 2% performance gain was measured on Marvell ThunderX2. 4.3% performance gain was measured on Ampere eMAG80 [1] http://mails.dpdk.org/archives/dev/2016-April/037529.html Fixes: ae0eb310f253 ("net/i40e: implement vector PMD for ARM") Cc: stable@dpdk.org Signed-off-by: Gavin Hu <gavin.hu@arm.com> Reviewed-by: Ruifeng Wang <ruifeng.wang@arm.com> Reviewed-by: Steve Capper <steve.capper@arm.com>
This commit is contained in:
parent
78b50591c8
commit
f1f0f39806
@ -307,9 +307,6 @@ _recv_raw_pkts_vec(struct i40e_rx_queue *rxq, struct rte_mbuf **rx_pkts,
|
||||
rte_mbuf_prefetch_part2(rx_pkts[pos + 3]);
|
||||
}
|
||||
|
||||
/* avoid compiler reorder optimization */
|
||||
rte_compiler_barrier();
|
||||
|
||||
/* pkt 3,4 shift the pktlen field to be 16-bit aligned*/
|
||||
uint32x4_t len3 = vshlq_u32(vreinterpretq_u32_u64(descs[3]),
|
||||
len_shl);
|
||||
|
Loading…
x
Reference in New Issue
Block a user