numam-spdk

Author	SHA1	Message	Date
Seth Howell	584a630287	nvme: don't fail the ctrlr from ctrlr_process_init If we are to have multiple reconnect attempts, we have to control whetehr the controller is placed in the failed state from outside the reset function itself. This will allow us to fail the controller only after all of our retries are exhausted. Change-Id: Ia82e10325272f25b2b8527336dc3bc507c93b401 Signed-off-by: Seth Howell <seth.howell@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/469932 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Alexey Marchuk <alexeymar@mellanox.com> Reviewed-by: Paul Luse <paul.e.luse@intel.com> Community-CI: Broadcom SPDK FC-NVMe CI <spdk-ci.pdl@broadcom.com>	2019-10-07 15:05:00 +00:00
Seth Howell	f5d88e46e2	nvme: always set ctrlr->is_failed through API Use the standard API function to fail the controller in all cases. This patch, and the several following patches are aimed at creating a mechanism for reporting up to the application layer that a controller is failed and or removed. To do this, I use the reset_cb to inform the upper layer that the controller is failed. This also requires changes to how we handle a controller reset to pave the way for doing optional reset retries in the libraries. Change-Id: I06dfce08326c23472a1caa8f6efbac2fd1a720f2 Signed-off-by: Seth Howell <seth.howell@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/469635 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2019-10-07 15:05:00 +00:00
Seth Howell	2c68fef058	nvme: move queued request resubmit to generic layer We were already passing up from each transport the number of completions done during the transport specific call. So just use that return code and batch all of the submissions together at one time in the generic code. This change and subsequent moves of code from the transport layer to the genric layer are aimed at making reset handling at the generic NVMe layer simpler. Change-Id: I028aea86d76352363ffffe661deec2215bc9c450 Signed-off-by: Seth Howell <seth.howell@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/469757 Reviewed-by: Alexey Marchuk <alexeymar@mellanox.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2019-10-07 15:05:00 +00:00
Seth Howell	afc9800b06	nvme: _nvme_qpair_submit_request does not requeue This will be handled by nvme_qpair_submit_request when it receives -EAGAIN from _nvme_qpair_submit_request. Change-Id: I5e76aae170c981df0cadaadcd5da1163c715006f Signed-off-by: Seth Howell <seth.howell@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/470407 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Alexey Marchuk <alexeymar@mellanox.com>	2019-10-07 15:05:00 +00:00
Seth Howell	18dc53c531	nvme: move submit_request impl to a private function This patch series is aimed at preserving the order of qpair entries when resubmitting queued requests. The hope is that we will make the API fool proof and future proof against ever reordering any queued requests. Change-Id: Ib20d61d3abaed637c9c305b75081947630190fd4 Signed-off-by: Seth Howell <seth.howell@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/470062 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Alexey Marchuk <alexeymar@mellanox.com>	2019-10-07 15:05:00 +00:00
Chunyang Hui	f74b33ad0b	Opal: Small fixes 1. Log level change to info when checking support 2. Delete new lines 3. Enlarge the timeout seconds to 10min for revert TPer as it sometimes need 6-7min for this operation. Change-Id: I1b7e32917bd99c859f1515b07f2530669418f0db Signed-off-by: Chunyang Hui <chunyang.hui@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/468915 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Paul Luse <paul.e.luse@intel.com>	2019-10-01 14:12:57 +00:00
Seth Howell	7630daa204	nvme: move queueing requests to the generic layer The tailq and the requests all belong to the generic layer, might as well put the queueing code there for better encapsulation. Change-Id: Id5f08f798121b50a21044cfc61856999c50ca227 Signed-off-by: Seth Howell <seth.howell@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/469758 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2019-09-30 21:17:47 +00:00
Seth Howell	fd892b333d	nvme_ctrlr: when reconnecting admin queue, check rc. This was being ignored, and can cause some problems when trying to reset a defunt controller over a fabric. Change-Id: I32c11a0e2df0e140e20f870fe0fb5b9045a567b3 Signed-off-by: Seth Howell <seth.howell@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/469638 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2019-09-30 21:17:47 +00:00
Seth Howell	13fb1b690e	nvme_rdma: add a timeout for spinning on cm events. Previously we would just sit forever. preventing us from properly attempting reconnects and timing out. Change-Id: Id7386ab95cf75fd9ac972b44afa2719aad412f49 Signed-off-by: Seth Howell <seth.howell@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/469021 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2019-09-30 21:17:47 +00:00
Seth Howell	5ac814e36c	nvme_rdma: share the cm_event channel between qpairs. This enables us to create a single file descriptor and a single event channel to poll for completions. With that accomplished, we can easily poll for events on the admin qpair each time we check it for completions. Change-Id: I8b901252510744a956bef12594d1e045715e002e Signed-off-by: Seth Howell <seth.howell@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/467549 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2019-09-30 21:17:47 +00:00
Seth Howell	f12e6bc041	nvme_rdma: in qp_disconnect, set resources to NULL This prevents us from failing a reset and then trying to double put the rqpair->cq which ends up causing seg faults. Change-Id: If3e14a3d039b4b19cc587a7482157f4b23f8ee32 Signed-off-by: Seth Howell <seth.howell@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/469609 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Alexey Marchuk <alexeymar@mellanox.com>	2019-09-30 21:17:47 +00:00
Seth Howell	06746448c1	nvme: fix confusion around nvme_ctrlr_set_state In most places, we are passing NVME_TIMEOUT_INFINITE as the timeout_in_ms argument to nvme_ctrlr_set_state, presumably in an attempt to specify an infinite timeout. However, nvme_ctrlr_set_state only checked against 0 when setting the actual timeout, and we didn't have any logic to check for overflow so we just ended up setting random timeout_tsc values which changes the behavior of the nvme_ctrlr_process_init function in several places. So, change NVME_TIMEOUT_INFINITE to 0, and add some integer overflow checking to nvme_ctrlr_set_state. Change-Id: Ic9d0cc57ed153df30c3b20313c3742072a5f992d Signed-off-by: Seth Howell <seth.howell@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/469485 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Alexey Marchuk <alexeymar@mellanox.com>	2019-09-30 21:17:47 +00:00
Benjamin Saunders	6bcd3588d1	nvme: add support for write uncorrectable command Change-Id: I9fb7a998f7c13ce53cba630a895e8e11cf5f4a1c Signed-off-by: Benjamin Saunders <bsaunders@google.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/467559 Reviewed-by: Paul Luse <paul.e.luse@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2019-09-26 18:42:57 +00:00
Seth Howell	8a2527836d	log: remove old-style errlog entries. SPDK_ERRLOG lists the function name, so remove old references that assume it doesn't and reprint the function name. Change-Id: I69da6ca0a25bf0eda07d8dad52bcfadf964ac715 Signed-off-by: Seth Howell <seth.howell@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/469487 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>	2019-09-26 16:15:11 +00:00
Changpeng Liu	acb9849c05	nvme: add arbitration configuration options to NVMe driver Weighted Round Robin can be enabled for users, and users can allocate different priority IO queues for different purpose. For now we will enable this feature in the NVMe driver first, following patches will enable this feature in bdev layer. Change-Id: I0f799236ca04eb85ef3c9f972ed63ff2718563ba Signed-off-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/466852 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>	2019-09-20 02:04:06 +00:00
Seth Howell	579d44b0ee	nvme_rdma: make handling of cm_events more robust By splitting all cm_event handling into a single function, we can create a single point of contact for cm_events, whether we want to process them synchronously or asynchronously. Change-Id: I053a850358605115362f424de55e66806a769320 Signed-off-by: Seth Howell <seth.howell@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/467546 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>	2019-09-18 22:19:37 +00:00
Seth Howell	ad7a01bde3	nvme_rdma: make cm_event fd asynchronous. This is paving the way for additional changes to enable polling for cm_events in the initiator. For now, just present the same blocking API on top of the now polled file descriptor. Later, we will change this API to be more useful. Change-Id: I174dac028720f95c30100f6dc2ed49b5bb2a7e40 Signed-off-by: Seth Howell <seth.howell@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/467545 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>	2019-09-18 22:19:37 +00:00
Darek Stojaczyk	c049304a95	env: add spdk_pci_device_unclaim() spdk_pci_device_claim() could create a file on the filesystem that couldn't be deleted programatically. It could only be overwritten - e.g. by another spdk instance - but this didn't really work if that another instance had less privileges and hence no access to the previous file. This is exactly the case we're seeing on our CI when running SPDK as non-root. In general it's a good idea not to leave any leftover files, so now we'll delete the pci claim file when the spdk process exits. spdk_pci_device_claim() used to return a file descriptor that could be simply closed to "un-claim" the device. It'll now return only a return code. The fd will be stored inside spdk_pci_device and will be closed either when user calls the newly introduced spdk_pci_device_unclaim(), or when the device is detached. We'll still need to clean up those files somewhere in our test scripts (probably ./setup.sh cleanup) to clean up after crashed processes or so - but we don't necessarily want to run such scripts inside the autotest whenever a non-root spdk is about to be started. Change-Id: I797e079417bb56491013cc5b92f0f0d14f451d18 Signed-off-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/467107 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2019-09-18 20:34:39 +00:00
Benjamin Saunders	7188bb994f	nvme: fix missing memory barrier in shadow doorbell update If the CPU reorders the eventidx read before the shadow doorbell write, it is indeterminate whether the controller will read the updated shadow doorbell without an MMIO write. See https://lkml.org/lkml/2018/8/14/1031 for details. Signed-off-by: Benjamin Saunders <bsaunders@google.com> Change-Id: I5aa08fdd5b32c7b81e8048ca6efe546318d80b5c Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/468188 Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2019-09-17 19:44:20 +00:00
Ben Walker	647afdec44	Revert "nvme: small code cleanup for nvme_transport_ctrlr_scan" This reverts commit `6129e78d26`. When the initiator sends the discovery log page, if the log page exceeds the size of its data buffer, it will break it up into multiple log page commands with appropriate offsets. However, supporting offsets in log pages is an optional feature in NVMe and reported by the EDLP bit in the identify data. This commit changed the discovery process to no longer send an identify command prior to doing the discovery log page command, so the values in the identify data are always 0. If the discovery log page exceeds the size of the data buffer (4k), it will then fail to send the second log page with an offset because it believes the controller does not support the feature. Revert this change to fix it. An identify should always be sent as part of the discovery process. A test case is included in a follow up patch the demonstrates the bug. Reported-by: Zahra Khatami <zahra.k.khatami@oracle.com> Reported-by: Akshay Shah <akshay.shah@oracle.com> Change-Id: Iefd512a7521e0fea90541b3eb547671cfa816ea6 Signed-off-by: Ben Walker <benjamin.walker@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/466819 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Paul Luse <paul.e.luse@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com>	2019-09-09 21:52:07 +00:00
Ziye Yang	24eb7a84b0	nvme/tcp: fix the iov vector count. Since we use pdu->data_iovcnt to build the iov in nvme_tcp_build_iovs, so send out pdu has the maximal iov number equals to: 2 + pdu->data_iovcnt, so we change the comparison. This makes sure that we can handle all the data owned by one pdu. Signed-off-by: Ziye Yang <ziye.yang@intel.com> Change-Id: I2b9258cc5716d706c0fa38af609726c439708768 Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/467207 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Broadcom SPDK FC-NVMe CI <spdk-ci.pdl@broadcom.com>	2019-09-09 02:08:31 +00:00
Changpeng Liu	6ad44e8be6	nvme: add weighted round robin supported flags Change-Id: I4b303e7096dfdd29ef5d39f30223d03c32d20ae1 Signed-off-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/466679 Reviewed-by: Broadcom SPDK FC-NVMe CI <spdk-ci.pdl@broadcom.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2019-09-09 01:55:18 +00:00
Changpeng Liu	2f9d2b811c	nvme: move nvme_ctrlr_construct() before the PCI initialization This will be consistent with TCP and RDMA transport, and we will use ctrlr->flags in nvme_ctrlr_init_cap() in next patch, the flags will be cleared to 0 for now. Change-Id: Ic360cd0c00d60c77452d19cdc1e7a32a5fc34df0 Signed-off-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/466678 Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2019-09-09 01:55:18 +00:00
Ziye Yang	ea5ad0b286	nvme/tcp: Change hdr in nvme_tcp_pdu to pointer Purpose: Prepare the further optimnization in the target side whening receving pdu headers, we expect to use zero copy. Change-Id: Iae7f9106844736d7160d39d0af1f5941084422ec Signed-off-by: Ziye Yang <ziye.yang@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/465380 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Broadcom SPDK FC-NVMe CI <spdk-ci.pdl@broadcom.com>	2019-08-28 15:38:02 +00:00
Jim Harris	32e22643ef	nvme: add NVME_QUIRK_DELAY_BEFORE_INIT quirk Currently we always wait 2 seconds before starting controller initialization during attach. This works around an issue where some older Intel NVMe SSDs could not handle MMIO writes too soon after a PCIe FLR (which would be triggered when VFIO was enabled). After further discussion with Intel experts, we know the SSD models that exhibit this issue. So we can quirk this so that only the older SSDs incur the extra delay. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: Ieb408c24f6afd5bd5147d1c87239aa20f2d13511 Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/466064 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Broadcom SPDK FC-NVMe CI <spdk-ci.pdl@broadcom.com>	2019-08-26 17:35:06 +00:00
Chunyang Hui	0fae4f64c4	Opal: Add support for erase locking range Change-Id: Ie40ea642bc266f84ad5a3dbad8012b9eac178360 Signed-off-by: Chunyang Hui <chunyang.hui@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/465244 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com>	2019-08-20 20:38:54 +00:00
Jim Harris	0aa72ffb74	nvme: fix WRITE_TO_RO_RANGE status code WRITE_TO_RO_PAGE was incorrect and misleading. This 0x82 NVMe status code indicates a write to a read-only range of LBAs. So modify the constant name and associated usages to use WRITE_TO_RO_RANGE instead. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: I993dbebb5acc2e685a0e99aa14084942ef79d659 Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/465083 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Paul Luse <paul.e.luse@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com>	2019-08-14 02:19:49 +00:00
Changpeng Liu	2226750a7c	nvme: add an option 'no_shn_notification' to driver spdk_nvme_detach() will do the normal shutdown notification for most cases, and it will take some time e.g. 2 seconds to finish the process for PCIe based controllers. If users' environment has several drives, each drive will call spdk_nvme_detach() one by one, and the shutdown process may take very long time. Since users know exactly what they would like to do for the next step, so here we provide an option to users, users can enable it to skip the shutdown notification process so that they can have very quick shutdown process, and when starting next time, the controller can be enabled again. Change-Id: Ie7f87115d57776729fab4cdac489cae6dc13511b Signed-off-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/463949 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Paul Luse <paul.e.luse@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2019-08-13 22:50:03 +00:00
Changpeng Liu	7cbe1ccd56	nvme: move SPDK_NVME_DEFAULT_RETRY_COUNT out from nvme.h SPDK_NVME_DEFAULT_RETRY_COUNT is the default value for each controller, so we can move it out from public header file, and change the value if users provide a new one. "NvmeRetryCount" was deprecated for a long time, so we removed the support for this configuration option as well. Change-Id: I187251cc1e5342abb4fce96727d06631b7c16a01 Signed-off-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/464489 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Broadcom SPDK FC-NVMe CI <spdk-ci.pdl@broadcom.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2019-08-09 00:44:50 +00:00
Changpeng Liu	62bb65289d	nvme: change retry count can be configured via bdev nvme driver Also eliminate 'spdk_nvme_retry_count' finally. Change-Id: I2f3e390e4b8a49208a11b54bb82c4891cf3e1845 Signed-off-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/464473 Reviewed-by: Broadcom SPDK FC-NVMe CI <spdk-ci.pdl@broadcom.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2019-08-09 00:44:50 +00:00
Changpeng Liu	936d856219	nvme: eliminate global configuration 'spdk_nvme_retry_count' option with PCIe transport We have defined NVMe controller initialization 'transport_retry_count' option, so global 'spdk_nvme_retry_count' can be removed, we will remove the variable with PCIe transport first, and make the retry count can be configured via RPC. Change-Id: I4d54f78c8da2180d536635587e7291f44a57c4fb Signed-off-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/464472 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2019-08-09 00:44:50 +00:00
Chunyang Hui	a4516ad2ed	opal: Fix get string for bigger length Skip token header length which varies for short, medium and long atom. Fix Issue #898 Change-Id: I2351193e5a43608495f3d816ff4e5932399a6312 Signed-off-by: Chunyang Hui <chunyang.hui@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/464502 Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2019-08-08 20:06:40 +00:00
Ziye Yang	73d9cef8c5	nvmf/tcp: add nvme_tcp_pdu_cal_psh function. Purpose: 1 Do not caculated the psh_len every time. 2 Small fix, for ch_valid_bypes, and psh_valid_bytes, we do not need to use uin32_t. Signed-off-by: Ziye Yang <ziye.yang@intel.com> Change-Id: I9b643da4b0ebabdfe50f30e9e0a738fe95beb159 Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/464253 Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Broadcom SPDK FC-NVMe CI <spdk-ci.pdl@broadcom.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2019-08-07 01:46:54 +00:00
Tomasz Zawadzki	8df52a0f4a	lib/nvme_tcp: assert tcp_req->req before it is dereferenced The value of tcp_req->req was asserted after it was already dereferenced. This patch fixes that. Change-Id: I5eb01e88be09d41fb8e632c49d5a7ccf2315788f Signed-off-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/462508 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>	2019-07-24 18:09:33 +00:00
Chunyang Hui	07f432641a	opal: Fix memory leakage Change-Id: I37f1468a41d568f7313143f0270f854f73bc4000 Signed-off-by: Chunyang Hui <chunyang.hui@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/461560 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: yidong0635 <dongx.yi@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>	2019-07-22 04:32:59 +00:00
Chunyang Hui	8522624d03	opal: Add multiuser support Admin can enable user and add user to locking range. Then the user can lock/unlock his range. Change-Id: Ifc5a8cf5c6b5febeb59c86333981f0cf5b938500 Signed-off-by: Chunyang Hui <chunyang.hui@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/460891 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>	2019-07-22 04:32:59 +00:00
Changpeng Liu	e27421b344	nvme: fix req leaks There are many req leaks when a controller failure occurs during submitting IO. It must free all of the children before freeing the parent req. If a part of the child req has been sent to the back end and a part of the child req fails, removes the failed req from the parent req and the parent req must be retained, freeing the parent req after all of the submitted reqs return. Change-Id: Ieb5423fd19c9bb0420f154b3cfc17918c2b80748 Signed-off-by: Huiming Xie <xiehuiming@huawei.com> Signed-off-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/461734 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com>	2019-07-22 04:15:34 +00:00
Changpeng Liu	c4f7c1bc2a	nvme: put child I/O helper functions in nvme_internal.h Existing children split functions defined in nvme_ns_cmd.c can also be used in nvme_qpair.c to free children requests with error paths. Change-Id: I640b32884424709da67ee89ff780c2de45acc54c Signed-off-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/461372 Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2019-07-22 04:15:13 +00:00
James Bergsten	5acf617c6e	nvme: add functions to pretty-print commands and completions This change attempts to address the Trello request to decode I/O errors in NVMe hello_world example. See https://trello.com/c/MzJJw7hM/2-decode-io-errors-in-nvme-helloworld-example As part of this change, spdk_nvme_cpl_get_status_string was declared in nvme.h, and spdk_nvme_qpair_print_command and spdk_nvme_qpair_print_completion were renamed and added to nvme.h, allowing all three to used "externally." To test the failing paths, two compile time defines were added to force a write or read error (bad LBA) respectively. As the example does a read after write, if the write fails, the example fails. Signed-off-by: James Bergsten <jamesx.bergsten@intel.com> Change-Id: Ib94b4a02495eb40966e3f49517a5bdf64485538a Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/457076 Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2019-07-15 07:47:03 +00:00
Richael Zhuang	d4cbbf1751	nvme: use atomic builtins for g_signal_lock The __sync builtin based implementation generates full memory barriers on some non-x86 platforms. Replace it with C11 atomic builtins can make: ·arm and ppc from full barrier to half barrier ·x86 code same as before Signed-off-by: Richael Zhuang <richael.zhuang@arm.com> Change-Id: Ib6624ef8e45af497b9eced6ecfa7710bcc88a733 Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/461590 Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2019-07-15 06:01:37 +00:00
yidong0635	ff0a7dfc42	nvme: Handle CQ polling failures by marking the controller as failed. nvme_transport_qpair_process_completions calls nvme_rdma_qpair_process_completions There are some cases return -1 due to failure of "CQ errors". Handle CQ polling failures by marking the controller as failed. That a completion with an error will be treated as controller failed. Requests will be aborted after retry counter exceeded. Otherwise, code will keep on reporting errors without recovery. This is to fix issue #850. Change-Id: I0b324232310e107bf7fd5722aca54d402a19b14d Signed-off-by: yidong0635 <dongx.yi@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/460569 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com>	2019-07-09 01:43:02 +00:00
Andrey Kuzmin	fa6bfa80af	Nvme: check spdk_nvme_qpair_process_completions return value. nvme_tcp_qpair_process_completions returns -1 on socket I/O error. Unless the caller checks this return value (which spdk_nvme_wait_for_completion_robust_lock currently doesn't), on connection loss or any other fatal connection error spdk_nvme_wait_for_completion will never exit the completion check loop. Change-Id: I92bb349beb071db312e6c31b84db2a7b51ec486c Signed-off-by: Andrey Kuzmin <akuzmin@jetstreamsoft.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/460657 Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2019-07-09 00:27:54 +00:00
Shuhei Matsumoto	8b539eb553	nvme: Set appropriate value to max_xfer_size and max_sge SPDK NVMe-oF initiator driver could not transfer IO whose size is more than 128KiB even if NVMe-oF target allows IO whose size is more than 128KiB both for RDMA and TCP transport. Some use cases need to transfer IO larger than 128KiB. For RDMA transport, max_mr_size by ibv_query_device of RDMA devices indicates the maximum size of a single memory region and is independent from the actual I/O size, and is very likely to be larger than 2 MiB which is the granularity we currently register memory regions. Actually some RDMA NICs return UINT64_MAX for max_mr_size by ibv_query_device. Hence use UINT32_MAX and let the generic layer use the controller data to moderate this value. On the other hand, for TCP transport, there is no limit for maximum IO size and hence use UINT32_MAX. Besides, for RDMA transport, max_sges should be the minimum of max_sge got by querying RDMA devices and NVME_RDMA_MAX_SGL_DESCRIPTORS. Hence do this change together in this patch. Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: Idc813afd3e525bf5f370c0fcd2623f9c146a5528 Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/459218 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Seth Howell <seth.howell5141@gmail.com> Reviewed-by: Ziye Yang <ziye.yang@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com>	2019-07-05 06:35:41 +00:00
Shuhei Matsumoto	cf3c54bc03	nvme: Ensure max_sges not to exceed what controller supports in generic layer Previously comparing the transport supported value and the target value was done in RDMA transport layer. However this comparison should be done in the generic layer like the maximum IO transfer size. Hence change the comparison to do in the generic layer in this patch. Besides, for MSDBD, the value 0 indicates no limit but we had handled this as maximum number of SGS entries was 0 by mistake. This patch fixes the bug together. Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: I54365cf114169b10180ec2c659f9c7302672674c Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/459574 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Paul Luse <paul.e.luse@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com>	2019-07-05 06:35:41 +00:00
Chunyang Hui	fbd2f3fd2e	opal: add support for getting locking range info Change-Id: I8e3e39673c260f823a9703e86006b5334dedc987 Signed-off-by: Chunyang Hui <chunyang.hui@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/457576 Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2019-07-05 02:23:28 +00:00
Chunyang Hui	505dbf59ff	Opal: Add locking range support Change-Id: I4974d4134aed3b63e204b79c9292ce940e32d40c Signed-off-by: Chunyang Hui <chunyang.hui@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/455175 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>	2019-07-05 02:23:28 +00:00
Chunyang Hui	755b4390f9	Opal: Add activate locking SP method Change-Id: I4189bdefdb5a6651bb73bd32e61c16e899b2ae5a Signed-off-by: Chunyang Hui <chunyang.hui@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/454211 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com>	2019-07-05 02:23:28 +00:00
Shuhei Matsumoto	3ff1ff004e	nvme/tcp: Minor cleanups for SGL operations Using naming rules consistent with other related libraries is helpful to ensure the quality as verified by this patch series. This patch changes a few parts to use iov and iovcnt for SGL operations. Besides, name of an array points to the head of the array and is constant. So copying name of array to an another pointer is not necessary and can be removed. Change-Id: I2324f28126b3088098c1c767cf6c060f22c175c3 Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/455629 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com> Reviewed-by: Paul Luse <paul.e.luse@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Maciej Szwed <maciej.szwed@intel.com> Reviewed-by: Ziye Yang <ziye.yang@intel.com>	2019-07-04 08:58:40 +00:00
Shuhei Matsumoto	3184884f9d	nvmf/tcp: Properly handle multiple iovecs in processing H2C and C2H NVMe/TCP target had assumed the size of each iovec was io_unit_size. Using nvme_tcp_pdu_set_data_buf() instead removes the assumption and supports any alignment transparently. Hence this patch moves nvme_tcp_pdu_set_data_buf() to include/spdk_internal/nvme_tcp.h and replaces the current code to use it. Besides, this patch simplifies spdk_nvmf_tcp_calc_c2h_data_pdu_num() because sum of iov_len of iovecs is equal to the variable length now. We cannot separate code movement (lib/nvme/nvme_tcp.c to include/ spdk_internal/nvme_tcp.h) and code replacement (lib/nvmf/tcp.c) because moved functions are static and compiler give warning if they are not referenced in lib/nvmf/tcp.c. The next patch will add UT code. Change-Id: Iaece5639c6d9a41bd35ee4eb2b75220682dcecd1 Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/455625 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ziye Yang <ziye.yang@intel.com> Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com>	2019-07-04 08:58:40 +00:00
Hailiang Wang	3a65c8729b	lib/nvme: fix a warning of spdk_pci_addr->domain Compilation Warning on fedora30. In file included from nvme_ut.c:42: /home/vagrant/spdk_repo/spdk/test/common/lib/test_env.c:517:17: warning: The left operand of '>' is a garbage value if (a1->domain > a2->domain) { ~~~~~~~~~~ ^ This is related to issue #822. Change-Id: I2b61e821130b89af04db3c475e81d2e91a380a90 Signed-off-by: Hailiang Wang <hailiangx.e.wang@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/459923 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2019-07-01 13:07:48 +00:00

1 2 3 4 5 ...

850 Commits