numam-dpdk

Author	SHA1	Message	Date
Wenzhuo Lu	b4e5e60cbe	e1000/base: fix EEPROM access for i210 The i210 has two EEPROM access registers that are located in non-standard offsets: EEARBC and EEMNGCTL. EEARBC was fixed previously and EEMNGCTL should also be corrected. Signed-off-by: Wenzhuo Lu <wenzhuo.lu@intel.com>	2015-10-27 16:12:56 +01:00
Wenzhuo Lu	9896a25392	e1000/base: fix redundant PHY power down for i210 The wrong bit is being used in PHYREG16 for PHY power down. In addition, the use of PHYREG 16 is unnecessary if bit 11 of PHYREG 0 is used. Signed-off-by: Wenzhuo Lu <wenzhuo.lu@intel.com>	2015-10-27 14:20:24 +01:00
Wenzhuo Lu	f26b979166	e1000/base: fix jumbo frame CRC failures This is a patch to change the value of register 776.20[11:2] for jumbo mode from 0x1A to 0x1F. This is to enlarge the gap between read and write pointers in the TX Fifo. And replace the magic number with a macro by the way. Signed-off-by: Wenzhuo Lu <wenzhuo.lu@intel.com>	2015-10-27 14:20:24 +01:00
Wenzhuo Lu	edcdb3c5f7	e1000/base: fix link flap on 82579 Several customers have reported a link flap issue on 82579. The symptoms are random and intermittent link losses when 82579 is connected to specific switches. Issue has been root caused as interoperability problem between the NIC and at least some Broadcom PHYs in the Energy Efficient Ethernet wake mechanism. To fix the issue, we are disabling the Phase Locked Loop shutdown in 100M Low Power Idle. This solution will cause an increase of power in 100M EEE link. It may cost additional 28mW in this specific mode. Signed-off-by: Wenzhuo Lu <wenzhuo.lu@intel.com>	2015-10-27 14:20:24 +01:00
Wenzhuo Lu	ea613aefc2	e1000/base: return error in resume workaround Add u32 return value to function e1000_resume_workarounds_pchlan, so that calling function can detect PHY access failure during resuming flow. Signed-off-by: Wenzhuo Lu <wenzhuo.lu@intel.com>	2015-10-27 14:20:24 +01:00
Wenzhuo Lu	32c646d6d2	e1000/base: check more errors for ESB2 init and reset Adding code where missing to handle case where calls to e1000_read_kmrn_reg_80003es2lan and e1000_write_kmrn_reg_80003es2lan return an error value. Also, when accessing the E1000_KMRNCTRLSTA_INBAND_PARAM offset to disable far-end loopback on 80003es2lan devices, make the handling of a read or write failure consistent between hw_init and hw_reset. Signed-off-by: Wenzhuo Lu <wenzhuo.lu@intel.com>	2015-10-27 14:20:24 +01:00
Wenzhuo Lu	a70cf8c3ee	e1000/base: check more NVM read errors Adding code to a case where e1000_nvn_read is called, but there is no consideration for when the read fails (returns an error code). Also, this patch adds an error message to a base NVM reading function that is missing it for consistency. This patch is not covering all cases of these conditions, it only covers the code used by the e1000e driver. Signed-off-by: Wenzhuo Lu <wenzhuo.lu@intel.com>	2015-10-27 14:20:24 +01:00
Wenzhuo Lu	939d7978bd	e1000/base: return code after setting receive address register Previously, the rar_set functions were of type void, and when they failed to program an address register they would, at most, put a message into the log and end. The fact that they failed to program an address into a address register, if checked for, should be captured and passed back to the caller so that the drivers can deal with the situation (or not) as they deem best. Drivers can ignore or use the return value. No change to base drivers is mandated by this change unless a driver wants to handle the failure to program an address register (e.g. evaluate the return value). Signed-off-by: Wenzhuo Lu <wenzhuo.lu@intel.com>	2015-10-27 14:20:24 +01:00
Wenzhuo Lu	bf36acdc7d	e1000/base: remove useless return variables Although this change should be optimized out by the compiler, just return a constant directly rather than declare a variable. Signed-off-by: Wenzhuo Lu <wenzhuo.lu@intel.com>	2015-10-27 14:20:24 +01:00
Wenzhuo Lu	3c027139fe	e1000/base: remove obsolete comment The "FIXME" comment is revomed from e1000_acquire_swfw_sync_80003es2lan but forgotten being removed from e1000_acquire_swfw_sync_82575 while the similar changes were made to both. Signed-off-by: Wenzhuo Lu <wenzhuo.lu@intel.com>	2015-10-27 14:20:24 +01:00
Wenzhuo Lu	cdef79acc2	e1000/base: cleanup unused tag Remove all NAHUM6LP_HW tags. Signed-off-by: Wenzhuo Lu <wenzhuo.lu@intel.com>	2015-10-27 14:20:23 +01:00
Wenzhuo Lu	71ddb0b93c	e1000/base: apply paranoia to macro arguments Macro arguments need to be in parens since we can pass in expressions. Signed-off-by: Wenzhuo Lu <wenzhuo.lu@intel.com>	2015-10-27 14:00:36 +01:00
Wenzhuo Lu	212deae353	e1000/base: add new devices Add some new i218 devices. Signed-off-by: Wenzhuo Lu <wenzhuo.lu@intel.com>	2015-10-26 21:52:50 +01:00
Wenzhuo Lu	36b2330acc	e1000/base: update readme and copyright Signed-off-by: Wenzhuo Lu <wenzhuo.lu@intel.com>	2015-10-26 21:46:37 +01:00
Changchun Ouyang	6d7740e2c1	virtio: fix deadloop after wrong config read The old code adjusts the config bytes we want to read depending on what kind of features we have, but we later cast the entire buf we read with "struct virtio_net_config", which is obviously wrong. The wrong config reading results to a dead loop at virtio_send_command() while starting testpmd. The right way to go is to read related config bytes when corresponding feature is set, which is exactly what this patch does. Fixes: `823ad64795` ("virtio: support multiple queues") Signed-off-by: Changchun Ouyang <changchun.ouyang@intel.com> Signed-off-by: Yuanhan Liu <yuanhan.liu@linux.intel.com> Acked-by: Flavio Leitner <fbl@sysclose.org> Acked-by: Huawei Xie <huawei.xie@intel.com>	2015-10-26 21:23:53 +01:00
Ravi Kerur	e83982c4b3	mpipe: return error for init allocation failure In function rte_pmd_mpipe_devinit, if rte_eth_dev_allocate fails return error which is inline with other drivers. Signed-off-by: Ravi Kerur <rkerur@gmail.com> Acked-by: Tetsuya Mukawa <mukawa@igel.co.jp> Acked-by: Zhigang Lu <zlu@ezchip.com>	2015-10-24 19:24:17 +02:00
Stephen Hemminger	1e7bd2380f	virtio: fix Coverity unsigned warnings There are some places in virtio driver where uint16_t or int are used where it would be safer to use unsigned. Signed-off-by: Stephen Hemminger <stephen@networkplumber.org>	2015-10-21 16:14:02 +02:00
Stephen Hemminger	954ea11540	virtio: do not report link state feature unless available If host does not support virtio link state (like current DPDK vhost) then don't set the flag. This keeps applications from incorrectly assuming that link state is available when it is not. It also avoids useless "guess what works in the config". Signed-off-by: Stephen Hemminger <stephen@networkplumber.org> Acked-by: Changchun Ouyang <changchun.ouyang@intel.com>	2015-10-21 16:12:32 +02:00
Bernard Iremonger	ce8e121870	virtio: fix crash when releasing null queue if input parameter vq is NULL, hw = vq->hw, causes a segmentation fault. Signed-off-by: Bernard Iremonger <bernard.iremonger@intel.com> Acked-by: Stephen Hemminger <stephen@networkplumber.org>	2015-10-20 23:29:37 +02:00
Didier Pallard	50d86a005e	mlx4: do not expose broadcast address in MAC list Use the last array entry to store the broadcast address and keep it hidden by not reporting the entire array size. This is done to prevent DPDK applications from attempting to modify or remove it. Signed-off-by: Didier Pallard <didier.pallard@6wind.com>	2015-10-20 21:58:16 +02:00
Francesco Santoro	04c6383de9	mlx4: save bound interface Allows applications to retrieve the name of the related netdevice. Signed-off-by: Francesco Santoro <francesco.santoro@6wind.com>	2015-10-20 21:50:29 +02:00
Adrien Mazarguil	08c192054b	mlx4: fix missing offload flags in scattered Rx They were dropped by mistake in the commit below. Fixes: `ab351fe1c9` ("mbuf: remove packet type from offload flags") Signed-off-by: Adrien Mazarguil <adrien.mazarguil@6wind.com>	2015-10-20 21:49:24 +02:00
David Marchand	922a5466c1	enic: fix hash creation when not using first numa node If dpdk is run with memory only available on socket != 0, then hash creation will fail and flow director feature won't be available. Fix this by asking for allocation on caller socket. Signed-off-by: David Marchand <david.marchand@6wind.com> Acked by: Sujith Sankar <ssujith@cisco.com>	2015-10-20 21:32:06 +02:00
David Marchand	7a0b8b7cab	enic: fix allocation when not using first numa node Seen by code review. If dpdk is run with memory only available on socket != 0, then enic pmd refuses to initialize ports as this pmd requires some memory on socket 0. Fix this by setting socket to SOCKET_ID_ANY, so that allocations happen on the caller socket. Signed-off-by: David Marchand <david.marchand@6wind.com> Acked by: Sujith Sankar <ssujith@cisco.com>	2015-10-20 21:30:48 +02:00
Rahul Lakkireddy	0ec33be4c8	cxgbe: allow to change mtu Add a mtu_set() eth_dev_ops to allow DPDK apps to modify device mtu. Signed-off-by: Rahul Lakkireddy <rahul.lakkireddy@chelsio.com> Signed-off-by: Kumar Sanghvi <kumaras@chelsio.com>	2015-10-20 18:49:18 +02:00
Rahul Lakkireddy	0740f6ea3c	cxgbe: receive jumbo frames Ensure jumbo mode is enabled and that the mbuf data room size can accommodate jumbo size. If the mbuf data room size can't accommodate jumbo size, chain mbufs to jumbo size. Signed-off-by: Rahul Lakkireddy <rahul.lakkireddy@chelsio.com> Signed-off-by: Kumar Sanghvi <kumaras@chelsio.com>	2015-10-20 18:49:18 +02:00
Rahul Lakkireddy	0dae2ba2cb	cxgbe: transmit jumbo frames Add a non-coalesce path. Skip coalescing for Jumbo Frames, and send the packet through non-coalesced path if there are enough credits. Also, free these non-coalesced packets while reclaiming credits. Signed-off-by: Rahul Lakkireddy <rahul.lakkireddy@chelsio.com> Signed-off-by: Kumar Sanghvi <kumaras@chelsio.com>	2015-10-20 18:49:18 +02:00
Rahul Lakkireddy	4b2eff452d	cxgbe: enable jumbo frames Increase max_rx_pktlen to accommodate jumbo frame size. Perform sanity checks and enable jumbo mode in rx queue setup. Set link mtu based on max_rx_pktlen. Signed-off-by: Rahul Lakkireddy <rahul.lakkireddy@chelsio.com> Signed-off-by: Kumar Sanghvi <kumaras@chelsio.com>	2015-10-20 18:49:18 +02:00
Rahul Lakkireddy	bf89cbedd2	cxgbe: optimize forwarding performance for 40G Update sge initialization with respect to free-list manager configuration and ingress arbiter. Also update refill logic to refill mbufs only after a certain threshold for rx. Optimize tx packet prefetch. Approx. 3 MPPS improvement seen in forwarding performance after the optimization. Signed-off-by: Rahul Lakkireddy <rahul.lakkireddy@chelsio.com> Signed-off-by: Kumar Sanghvi <kumaras@chelsio.com>	2015-10-20 18:49:18 +02:00
John W. Linville	b89664c07b	af_packet: check Tx error Coverity CID # 13200 If sendto fails, the packets will not get transmitted. Return 0 as the number of packets transmitted. Signed-off-by: John W. Linville <linville@tuxdriver.com>	2015-10-20 17:59:19 +02:00
John W. Linville	43254a3367	af_packet: refactor error handling to avoid NULL pointer dereference Coverity CID # 13321 Checking internals != NULL before accessing req is not good enough, because *internals is a function argument and the function doesn't really know what is passed-in. We can close our eyes and ignore the warning on the basis of controlling all the calling code, or we can refactor the error exit to avoid the issue entirely... Signed-off-by: John W. Linville <linville@tuxdriver.com> Acked-by: Neil Horman <nhorman@tuxdriver.com>	2015-10-20 17:58:10 +02:00
Wang Xiao W	97661df7d2	fm10k/base: add FM10420 device ids Add the device ID for Boulder Rapids and Atwood Channel to enable drivers to support those devices. Signed-off-by: Wang Xiao W <xiao.w.wang@intel.com>	2015-10-07 13:35:53 +02:00
Wang Xiao W	925c862cbc	fm10k/base: pack TLV overlay structures This patch adds #pragma pack(push, 1) around some structures which are passed via TLV messages. These structures must not be left unpacked as GCC and other compilers are wont to do. Otherwise, we get invalid message responses from the Switch Manager software since it sends 20 bytes and we expect 24. Solaris (and other OS's) are not C99 compliant, so they are not able to use the C99 style #pragma pack() code. Wrap with C99 tag for easy stripping. Signed-off-by: Wang Xiao W <xiao.w.wang@intel.com>	2015-10-07 13:35:53 +02:00
Wang Xiao W	b56d0781aa	fm10k/base: fix ieee1588 adjustment direction The SYSTIME_CFG.Adjust field has a Direction bit to indicate whether the adjustment is positive or negative. However, we incorrectly read the documentation and the direction bit should be set 1 when positive, not when negative. Signed-off-by: Wang Xiao W <xiao.w.wang@intel.com>	2015-10-07 13:35:48 +02:00
Wang Xiao W	d6eaec3589	fm10k/base: add clock offset message Add support for clock offset message from switch manager. Each PEP will be responsible for notifying its own VFs, and the originating PEP must notify its own VFs prior or in addition to sending, as it will not receive a copy of its own message. Base drivers are expected to need custom implementations so no message handler is provided in shared code. Signed-off-by: Wang Xiao W <xiao.w.wang@intel.com>	2015-10-07 13:35:48 +02:00
Wang Xiao W	001c2f311a	fm10k/base: add ieee1588 clock owner message support Add support for tx timestamp mode response message. The switch manager should send this message whenever the owner changes or when a new port appears. To simplify logic, treat this as full clock ownership, and call it the CLOCK_OWNER message. Implement this as a hw->flags field, so that base driver may use it to disable any functions which modify the clock including Tx timestamps, frequency adjustments, and offset adjustments. This ensures only one PEP will be handling these at a time. Signed-off-by: Wang Xiao W <xiao.w.wang@intel.com>	2015-10-07 13:35:40 +02:00
Wang Xiao W	685e5fb30b	fm10k/base: remove 1588 VF API Remove support for VF transmit timestamps. VFs should not write the timestamp bit in the Tx descriptor. Only one Tx timestamp can be realistically handled at once. It is expected that the switch manager use FFU logic to disable all timestamp requests except for those originating from a specific virtual port. It is not possible to correlate this timestamp accurately if more than one occurs out any given EPL at a time. Since the primary purpose of Tx timestamps is to implement PTP daemon, which also requires BAR4 access to change the clock, do not allow VFs to transmit timestamp. Remove the PF<->VF message for this behavior. Note, the VF already didn't have ability to request Tx timestamp mode, so it essentially wasn't allowed to timestamp before anyways under the old API. No longer support old API of request-response timestamp mode messages. New API only sends timestamp-response when the switch decides which port will be given control of timestamps. To simplify review of this code, completely remove the support and re-add support for the response message in a future patch. Signed-off-by: Wang Xiao W <xiao.w.wang@intel.com>	2015-10-07 13:25:07 +02:00
Wang Xiao W	8b8264bdb9	fm10k/base: check VF has a queue It is possible that the PF has not yet assigned resources to the VF. Although rare, this could result in the VF attempting to read queues it does not own and result in FUM or THI faults in the PF. To prevent this, check queue 0 before we continue in init_hw_vf. Signed-off-by: Wang Xiao W <xiao.w.wang@intel.com>	2015-10-07 13:25:07 +02:00
Wang Xiao W	6eca661890	fm10k/base: fix VF re-enabling When a VF issues an LPORT_STATE request to enable a port which is already enabled, the PF will first disable the VF. Then it is supposed to re-enable the VF again with new settings. This is primarily done in order to ensure that the switch management software properly clears the previous VF settings. (ie: switch flow rules and so forth). However, there is a bug in the flow because we check if VF is enabled and don't re-enable it at the end. The issue is that we disable the VF in order to clear switch rules, and never follow-up with a re-enable. This results in a call to enable the VF results in disabling the logical port. Signed-off-by: Wang Xiao W <xiao.w.wang@intel.com>	2015-10-07 13:25:07 +02:00
Wang Xiao W	46e018c501	fm10k/base: fix VF queues counting During initialization, the VF counts its rings by walking the TQDLOC registers. This only works if the TQMAP/RQMAP registers are set to map the out-of-bound rings to the first one, so the VF driver can detect when it has run out of queues cleanly. Update the PF to reset the empty TQMAP/RQMAP registers post-VFLR to prevent innocent VF drivers from triggering malicious events. Signed-off-by: Wang Xiao W <xiao.w.wang@intel.com>	2015-10-07 13:25:07 +02:00
Wang Xiao W	e78a599651	fm10k/base: fix VF multicast The VF will send a message to request multicast addresses with the default vid. In the current code, if the PF has statically assigned a VLAN to a VF, then the VF will not get the multicast addresses. Fix up all of the various vlan messages to use identical checks (since each check was different). Also use set as a variable, so that it simplifies our check for whether vlan matches the pf_vid. The new logic will allow set of a vlan if it is zero, automatically converting to the default vid. Otherwise it will allow setting the PF vid, or any VLAN if PF has not statically assigned a VLAN. This is consistent behavior, and allows VF to request either 0 or the default_vid without silently failing. Note that we need the check for zero since VFs might not get the default VID message in time to actually request non-zero VLANs. Create a function, fm10k_iov_select_vid which implements the logic for selecting a default vid. This helps us remove duplicate code and streamlines location of this logic so that we don't make similar bugs in the future. Signed-off-by: Wang Xiao W <xiao.w.wang@intel.com>	2015-10-07 13:25:07 +02:00
Wang Xiao W	9d87822572	fm10k/base: fix VF multicast update VFs were being improperly added to the switch's multicast group. The error stems from the fact that incorrect arguments were passed to the "update_mc_addr" function. It would seem to be a copy paste error since the parameters are similar to the "update_uc_addr" function. Signed-off-by: Wang Xiao W <xiao.w.wang@intel.com>	2015-10-07 13:25:07 +02:00
Wang Xiao W	ba4ea5ee35	fm10k/base: add mailbox counters A previous bug was uncovered by addition of a debug stat to indicate the actual number of DWORDS we pulled from the mbmem. It turned out this was not the same as the tx_dwords counter. While the previous bug fix should have corrected this in all cases, add some debug stats that count the number of DWORDs pushed or pulled from the mbmem. Base drivers can use this in debug builds to help detect this problem in the future. Signed-off-by: Wang Xiao W <xiao.w.wang@intel.com>	2015-10-07 13:25:07 +02:00
Wang Xiao W	c3cb69d9a6	fm10k/base: fix mailbox connect When we connect to the mailbox, we insert a fake disconnect header so that the code does not see an error and thus instantly error every time we bring up the mailbox. However, we incorrectly record the tail and head from the local perspective. Since the remote end shouldn't have anything for us, add a "create_fake_disconnect_hdr" function which inverts the TAIL and HEAD fields. This enables us to connect without any errors of either TAIL or HEAD incorrectness, and prevents creating extraneous error messages. This is necessary now since mbx_reset_work does not actually clear the Tx FIFO head and tail pointers. Signed-off-by: Wang Xiao W <xiao.w.wang@intel.com>	2015-10-07 13:25:07 +02:00
Wang Xiao W	00b18e3de1	fm10k/base: fix mailbox phantom messages The phantom messages were a result of incorrectly forgetting to drop already transmitted messages. We would reset pulled, and tail_len but left the head/tail pointers alone. The correct fix is to loop through pulled and drop messages until we've dropped at least as many bytes as we pulled (possibly dropping a message we've only partially transmitted. However, we also have to account for tail_len variable and the 'ack' value as in mbx_pull_head. This means that we need to re-read the HEAD field of the mailbox header. Based on testing, this resolves the phantom messages issue, as well as correctly keeping messages which have yet to be transmitted at all in the Tx FIFO. Thus, we will begin re-transmission once we have re-connected. Signed-off-by: Wang Xiao W <xiao.w.wang@intel.com>	2015-10-07 13:25:07 +02:00
Wang Xiao W	63cfcf90fe	fm10k/base: ignore oversized mailbox messages When we call update_max_size, it does not drop all oversized messages. This is due to the difficulty in performing this operation, since it is a FIFO which makes updating anything other than head or tail very difficult. To fix this, modify validate_msg_size to ensure that we error out later when trying to transmit the message that could be oversized. This will generally be a rare condition, as it requires the FIFO to include a message larger than the max_size negotiated during mailbox connect. Note that max_size is always smaller than rx.size, so it should be safe to use here. Also, update the update_max_size function header comment to clearly indicate that it does not drop all oversized messages, but only those at the head of the FIFO. Signed-off-by: Wang Xiao W <xiao.w.wang@intel.com>	2015-10-07 13:25:07 +02:00
Wang Xiao W	400f3c18fa	fm10k/base: avoid Tx drop increment during mailbox negotiation After shutting down the mailbox by force, we then go about resetting max size to 0, and clearing all messages in the FIFO. However, we should just reset the head pointer so that the FIFO will become empty, rather than changing the max size to 0. This helps prevent increment in tx_dropped counter during mailbox negotiation, which is confusing to viewers of Linux ethtool statistics output. Signed-off-by: Wang Xiao W <xiao.w.wang@intel.com>	2015-10-07 13:25:07 +02:00
Wang Xiao W	e18683f8ef	fm10k/base: scale interrupt on PCIe link speed Red Rock Canyon's interrupt throttle timers are based on the PCIe link speed. Because of this, the value being programmed into the ITR registers must be scaled. For the PF, this is as simple as reading the PCIe link speed and storing the result. However, in the case of SR-IOV, the VF's interrupt throttle timers are based on the link speed of the PF. However, the VF is unable to get the link speed information from its configuration space, so the PF must inform it of what scale to use. Rather than passing this scale via mailbox message, we take advantage of unused bits in the TDLEN register to pass the scale. It is the responsibility of the PF to program this for the VF while setting up the VF queues and the responsibility of the VF to get the information accordingly. This is preferable because it allows the VF to set up the interrupts properly during initialization and matches how the MAC address is passed in the TDBAL/TDBAH registers. A VF unload followed by a reload incorrectly left this value as 0. If the VF driver blindly trusted this value it could cause a divide by zero failure. Fix this by having stop_hw_vf reset the ITR scale as the device goes down, similar to the way we handle the MAC address. To prevent divide-by-zero issues, ensure that we always have an ITR scale. Default to Gen3 scaling if we don't know the speed. Also ensure the VF checks the register value and ensures we use Gen3 if we are provided a zero value. Signed-off-by: Wang Xiao W <xiao.w.wang@intel.com>	2015-10-07 13:25:07 +02:00
Wang Xiao W	76cf5b44b9	fm10k/base: set unlimited bandwidth for PF queues Set PF queues used for VMDq to unlimited bandwidth when virtualization resources are assigned. Signed-off-by: Wang Xiao W <xiao.w.wang@intel.com>	2015-10-07 13:25:07 +02:00
Wang Xiao W	ce1a476adc	fm10k/base: add VF Tx timestamp mode no-op This patch resolves a bug in Linux where we called the request_tx_timestamp_mode function that is undefined for VF. Implement a no-op function that simply ensures that the mode is NONE, otherwise it would fail with ERR_PARAM. Signed-off-by: Wang Xiao W <xiao.w.wang@intel.com>	2015-10-07 13:25:07 +02:00

1 2 3 4 5 ...

389 Commits