numam-spdk

Author	SHA1	Message	Date
Changpeng Liu	acb9849c05	nvme: add arbitration configuration options to NVMe driver Weighted Round Robin can be enabled for users, and users can allocate different priority IO queues for different purpose. For now we will enable this feature in the NVMe driver first, following patches will enable this feature in bdev layer. Change-Id: I0f799236ca04eb85ef3c9f972ed63ff2718563ba Signed-off-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/466852 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>	2019-09-20 02:04:06 +00:00
Changpeng Liu	6ad44e8be6	nvme: add weighted round robin supported flags Change-Id: I4b303e7096dfdd29ef5d39f30223d03c32d20ae1 Signed-off-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/466679 Reviewed-by: Broadcom SPDK FC-NVMe CI <spdk-ci.pdl@broadcom.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2019-09-09 01:55:18 +00:00
Jim Harris	32e22643ef	nvme: add NVME_QUIRK_DELAY_BEFORE_INIT quirk Currently we always wait 2 seconds before starting controller initialization during attach. This works around an issue where some older Intel NVMe SSDs could not handle MMIO writes too soon after a PCIe FLR (which would be triggered when VFIO was enabled). After further discussion with Intel experts, we know the SSD models that exhibit this issue. So we can quirk this so that only the older SSDs incur the extra delay. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: Ieb408c24f6afd5bd5147d1c87239aa20f2d13511 Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/466064 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Broadcom SPDK FC-NVMe CI <spdk-ci.pdl@broadcom.com>	2019-08-26 17:35:06 +00:00
Changpeng Liu	2226750a7c	nvme: add an option 'no_shn_notification' to driver spdk_nvme_detach() will do the normal shutdown notification for most cases, and it will take some time e.g. 2 seconds to finish the process for PCIe based controllers. If users' environment has several drives, each drive will call spdk_nvme_detach() one by one, and the shutdown process may take very long time. Since users know exactly what they would like to do for the next step, so here we provide an option to users, users can enable it to skip the shutdown notification process so that they can have very quick shutdown process, and when starting next time, the controller can be enabled again. Change-Id: Ie7f87115d57776729fab4cdac489cae6dc13511b Signed-off-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/463949 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Paul Luse <paul.e.luse@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2019-08-13 22:50:03 +00:00
Changpeng Liu	936d856219	nvme: eliminate global configuration 'spdk_nvme_retry_count' option with PCIe transport We have defined NVMe controller initialization 'transport_retry_count' option, so global 'spdk_nvme_retry_count' can be removed, we will remove the variable with PCIe transport first, and make the retry count can be configured via RPC. Change-Id: I4d54f78c8da2180d536635587e7291f44a57c4fb Signed-off-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/464472 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2019-08-09 00:44:50 +00:00
Shuhei Matsumoto	cf3c54bc03	nvme: Ensure max_sges not to exceed what controller supports in generic layer Previously comparing the transport supported value and the target value was done in RDMA transport layer. However this comparison should be done in the generic layer like the maximum IO transfer size. Hence change the comparison to do in the generic layer in this patch. Besides, for MSDBD, the value 0 indicates no limit but we had handled this as maximum number of SGS entries was 0 by mistake. This patch fixes the bug together. Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: I54365cf114169b10180ec2c659f9c7302672674c Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/459574 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Paul Luse <paul.e.luse@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com>	2019-07-05 06:35:41 +00:00
Darek Stojaczyk	f9a6588f57	nvme: switch to spdk_malloc(). spdk_dma_malloc() is about to be deprecated. Change-Id: I6c308ee546c28c479ceb903bc1749bf5209dc6fe Signed-off-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/448172 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: <uma.willpower@gmail.com>	2019-06-27 04:34:50 +00:00
James Bergsten	8785d5052d	nvme: spdk_nvme_ctrlr_alloc_io_qpair extensions Adds fields to structure spdk_nvme_io_qpair_opts. These fields allow specifying the locations of memory buffers used for the submission and/or completion queues. By default, vaddr is set to NULL meaning SPDK will allocate the memory to be used. If vaddr is NULL then paddr must be set to 0. If vaddr is non-NULL, and paddr is zero, SPDK derives the physical address for the NVMe device, in this case the memory must be registered. If a paddr value is non-zero, SPDK uses the vaddr and paddr as passed. SPDK assumes that the memory passed is both virtually and physically contiguous. If these fields are used, SPDK will NOT impose any restriction on the number of elements in the queues. The buffer sizes are in number of bytes, and are used to confirm that the buffers are large enough to contain the appropriate queue. These fields are only used by PCIe attached NVMe devices. They are presently ignored for other transports. Signed-off-by: James Bergsten <jamesx.bergsten@intel.com> Change-Id: Ibfab3939eefe48109335f43a1167082dd4865e7c Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/454074 Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2019-06-18 12:19:41 +00:00
James Bergsten	f2d46446ca	nvme: add spdk_nvme_ctrlr_get_registers implementation Prior merge contained all of the code EXCEPT for the user-callable function. Signed-off-by: James Bergsten <jamesx.bergsten@intel.com> Change-Id: I1cb7105ab85ffae8ed4f600261fed86c9c778893 Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/456282 Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2019-05-30 22:38:27 +00:00
Jim Harris	f0dd2b789e	nvme: add spdk_nvme_ctrlr_get_transport_id() Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: Ie32a1bb144c239b923b5cbb9e608a7dfc9c05208 Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/456076 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Maciej Szwed <maciej.szwed@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2019-05-29 20:27:10 +00:00
Jim Harris	af38d200e6	nvme: add ctrlr option for logging errors Currently the nvme driver will always log any request completed with error status. Some applications may not want this behavior. So provide an option to disable it at the controller level. When this option is enabled, any failed requests from queues associated with that controller (including the admin queue) will not log the failed request. Of course the application will still receive the failed status code and can decide to do its own logging there. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: Ia093fcd23cf321a820fd53183ee7e2dac4f9d378 Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/454081 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>	2019-05-14 13:51:44 +00:00
Jim Harris	bb01a08915	nvme: plumb disconnect/connect in reset path This will (finally) enable resets for fabrics controllers. Move some of the work previously done in enable_admin_queue up to this new disconnect/connect logic. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: I6239f0c0f36192db921d33f2322b1874b9382a01 Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/453939 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>	2019-05-14 13:49:19 +00:00
Jim Harris	963e450a71	nvme: complete error reqs when re-enabling queue We cannot complete error reqs from spdk_nvme_ctrlr_reset - this could result in completions on threads not expected by the user for I/O queues. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: I2e266a2618f1791ef1a1b713d1940357f23f7bff Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/453932 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com>	2019-05-14 08:48:11 +00:00
Jim Harris	8986de8b98	nvme: rename transport reconnect function to just connect The RDMA transport was the only one implementing this function, and it only does a connect - not a disconnect followed by a connect. A later patch will add a matching disconnect function. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: Ib68eb0ff2f8e59f437d6d8831bb37dfddf83e9a4 Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/453929 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com>	2019-05-14 08:48:11 +00:00
Jim Harris	4aac975b35	nvme: make nvme_qpair_enable just set the is_enabled flag Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: I6782f311156dba87875a754fc64525f5ad7d06ea Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/453748 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com>	2019-05-14 08:48:11 +00:00
Jim Harris	67882ed76f	nvme: add calls to nvme_qpair_disable These were accidentally removed in a previous patch. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: Idab274427c064ff8aff1cdca2dd80d7d24e8cce4 Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/453747 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2019-05-09 19:03:18 +00:00
Jim Harris	fabd7fbb41	nvme: remove qpair_disable This transport function is a complete nop now, so remove it. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: I5cc6ac75795a3cf5311f24e2ac293fb53d4b9f8c Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/453487 Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2019-05-08 01:44:20 +00:00
Jim Harris	74aa552ef9	nvme: make helper function to abort outstanding err reqs The nvme_qpair_disable functions will be going away in an upcoming patch, so move this one bit of functionality into a helper function in advance. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: I61c2de535c2230b988d56dea13b00f39cb59dcfa Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/453483 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com>	2019-05-08 01:44:20 +00:00
Jim Harris	f366e261a6	nvme: abort aers at common layer We submit AERs to all controllers - both pcie and fabrics. But currently we only manually abort the aers when disabling the qpair for pcie. Make this common instead by creating a new transport function for aborting aers. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: I1e926b61b8035488cdc6e8cb4336b373732f985e Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/453482 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com>	2019-05-08 01:44:20 +00:00
Jim Harris	14e67af3c5	nvme: rename reinit_io_qpair to reconnect_qpair This better explains what the function is doing, and makes the name more general so we can use it for the adminq as well. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: I6b55761cb141a9a79cdef876be47995d8813b312 Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/453480 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>	2019-05-08 01:44:20 +00:00
cranechu	6a67d5178e	nvme: remove set_state after nvme_ctrlr_identify_id_desc_namespaces Fixes #722. The state was set in nvme_ctrlr_identify_id_desc_async Signed-off-by: cranechu <cranechu@gmail.com> Change-Id: I232f0035e8c45d49eca2de7174c91860a299d804 Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/449527 Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2019-04-01 09:09:07 +00:00
Changpeng Liu	2e6dbe7539	nvme: reduce default Admin timeout to 30 seconds 120 seconds is too long for controllers which can't be setup during initialization, because this value is only used for Admin commands so also rename as it is. Change-Id: I0a3d3192252c0f6fc0bef4d8b868eaef2ae40fe3 Signed-off-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/448601 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2019-03-21 18:22:28 +00:00
Darek Stojaczyk	27c42e313f	nvme: don't rely on phys_addr retrieved from spdk_malloc() The phys_addr param in spdk_malloc() is about to be deprecated, so use a separate spdk_vtophys() call to retrieve physical addresses. This patch also adds error checks against SPDK_VTOPHYS_ERROR. The error handling paths are already there to account for spdk_malloc() failures themselves, so reuse them in case of vtophys failures. Change-Id: I377636e66b8c570d013c1bb2021f04bce4e6c0ce Signed-off-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/416998 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com>	2019-03-20 01:06:09 +00:00
Ben Walker	cf0eac8c66	nvme: Add qpair option to batch command submissions Avoid ringing the submission queue doorbell until the call to spdk_nvme_qpair_process_completions(). Change-Id: I7b3cd952e5ec79109eaa1c3a50f6537d7aaea51a Signed-off-by: Ben Walker <benjamin.walker@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/447239 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com>	2019-03-19 07:27:44 +00:00
Chunyang Hui	51ab378862	nvme: Add getting supported flag for controllers New API added for upper level to get controllers' supported flags. Change-Id: I51e9d0e57c355fa37f092602a94f4c08deb8898c Signed-off-by: Chunyang Hui <chunyang.hui@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/446091 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com>	2019-03-07 00:03:34 +00:00
Jim Harris	4680db9e09	nvme: clarify nvme_ctrlr_update_namespaces assignment The nsdata assignment is strangely aligned with some variable declarations - fix it to make it more clear. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: I43b1a6d5a69ca035a21f3996e8f859a45bd10b9c Reviewed-on: https://review.gerrithub.io/c/446447 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>	2019-03-01 16:05:37 +00:00
Jim Harris	518c8add8a	nvme: add SHST_COMPLETE quirk for VMWare emulated SSDs VMWare Workstation NVMe emulation does not seem to write the SHST_COMPLETE bit within 10 seconds, resulting in an ERRLOG during detach/shutdown. So add a quirk to cover these VMWare SSDs. But rather than squashing the ERRLOG completely for these SSDs, just add a message instead indicating this is somewhat expected on these VMWare emulated SSDs. Fixes issue #676. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: I3dfcb631feda639926fd712f1f41abb66cbf2096 Reviewed-on: https://review.gerrithub.io/c/445942 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com>	2019-02-27 01:46:32 +00:00
Changpeng Liu	7d4d22a846	nvme: add a wait for completion timeout API Althrough SPDK already provides a API to users which can process runtime timeout NVMe commands, but it's nice to have another API here, SPDK NVMe driver can use it to break the endless wait. Also use the API first in the initialization process, because we don't want to add another initialization state with Intel only supported log pages. Change-Id: Ibe7cadbc59033a299a1fcf02a66e98fc4eca8100 Signed-off-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-on: https://review.gerrithub.io/c/444353 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2019-02-14 03:47:13 +00:00
Ben Walker	993c4a0799	nvme: Add a function to query controller memory buffer support Change-Id: Id539f4eaabe2038d4925eaa140864c0abd9b2649 Signed-off-by: Ben Walker <benjamin.walker@intel.com> Reviewed-on: https://review.gerrithub.io/c/442635 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: wuzhouhui <wuzhouhui@kingsoft.com>	2019-02-06 16:01:56 +00:00
Changpeng Liu	44c6faac9a	nvme: move hardcoded keep alive timeout value to macro definition Change-Id: I27ab6ea046ade42f941b323cea5f104bb952c53d Signed-off-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-on: https://review.gerrithub.io/c/441994 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Arshad Hussain <arshad.super@gmail.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2019-01-25 18:52:45 +00:00
Ziye Yang	3608464f04	nvme: fix the aer request sent to disabled controller The purpose this patch is to fix the following issue: https://github.com/spdk/spdk/issues/568. The root cause of issue is in nvme_rdma_fail_qpair since we want to recycle all outstanding rdma_reqs. There is an aer req, the callback of which is: nvme_ctrlr_async_event_cb. In this function, we will call nvme_ctrlr_construct_and_submit_aer again, however the nvme controller is already in shutdown state. (The ctrlr->vcprop.cc.bits.en is set to 0). Change-Id: I422f0fe5faf472e9a1cb6bbd174e806e6405b95c Signed-off-by: Ziye Yang <ziye.yang@intel.com> Reviewed-on: https://review.gerrithub.io/c/440014 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2019-01-18 17:44:08 +00:00
Igor Konopko	2077fbd7e4	nvme: do not fail init when Intel log pages are not supported Currently for all the Intel drives nvme driver tries to add Intel VS log pages support. When this log pages are not supported whole init process fails. This patch changes this behaviour by allowing to init Intel drives which rejects VS log pages. This is valid scenario for drives which are in states other than healthy. Such a drives are still accesible via admin queue, but does not expose some of the features, such as this particular VS log pages. Change-Id: I3764f2d67fd7153b6b1889273a9fedeb9c4213d3 Signed-off-by: Igor Konopko <igor.j.konopko@intel.com> Reviewed-on: https://review.gerrithub.io/c/437162 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Chandler-Test-Pool: SPDK Automated Test System <sys_sgsw@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com>	2019-01-07 16:23:21 +00:00
Chunyang Hui	19feb4e181	nvme: add security receive and security send wrapper Change-Id: Id25040d62f89d4e8f2268bb3383c5665c0508f5a Signed-off-by: Chunyang Hui <chunyang.hui@intel.com> Reviewed-on: https://review.gerrithub.io/c/438776 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Chandler-Test-Pool: SPDK Automated Test System <sys_sgsw@intel.com>	2019-01-07 05:51:07 +00:00
Lance Hartmann	e865a52415	nvme: Eliminate identify errors to Discovery ctrlr The nvme/identify cmd issued some cmds to a ctrlr irrespective of its type, and when the target was a Discovery ctrlr which only accepts a very limited cmd set, that would result in errors observable both on the initiator side (from nvme/identify) and in the output on the target (nvmf_tgt). Introduce new API, spdk_nvme_ctrlr_is_discovery(), and alter identify to make use of that in determining which commands to send to the target. Change-Id: I974a569843f1d2b9e1ece7bd3bf9ceee1bfae872 Signed-off-by: Lance Hartmann <lance.hartmann@oracle.com> Reviewed-on: https://review.gerrithub.io/436225 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Chandler-Test-Pool: SPDK Automated Test System <sys_sgsw@intel.com>	2018-12-11 17:39:52 +00:00
Ziye Yang	be4fbb2141	nvme_tcp: Make the header and data digest configurable. Change-Id: Ia65e235a85207c128ba274e1bab38d6c35344239 Signed-off-by: Ziye Yang <optimistyzy@gmail.com> Reviewed-on: https://review.gerrithub.io/435563 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Chandler-Test-Pool: SPDK Automated Test System <sys_sgsw@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2018-12-07 23:24:12 +00:00
Ziye Yang	b4692083f1	nvme: Fix the race condition in nvme_ctrlr_get_cc When the applications call spdk_nvme_ctrlr_alloc_io_qpair, there will be cmd to the admin qpairs in nvme_ctrlr_get_cc, so there is contention. We should use the lock to protect nvme_ctrl_get_cc. Otherwise, the multiple threads will have contention on the admin qpair, thus there will be coredump issue. We get the bug when testing NVMe-oF TCP transport, and this patch can address this issue. Change-Id: I7247f98cdf890c2eafaf8fb94580ecd714010bd5 Signed-off-by: Ziye Yang <optimistyzy@gmail.com> Reviewed-on: https://review.gerrithub.io/435577 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Seth Howell <seth.howell5141@gmail.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2018-12-05 00:32:21 +00:00
Jim Harris	72f8c6a1f3	log: remove "trace" from internal API Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: I8b1c0d4b00d5d41aae89d3b33f18d1ae957567dc Reviewed-on: https://review.gerrithub.io/435344 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Xiaodong Liu <xiaodong.liu@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>	2018-12-03 19:50:15 +00:00
Darek Stojaczyk	1d3e0340b4	nvme: fix pci device leak when detaching a controller in primary process This case isn't particularly supported, but still caused a memory leak and rendered the pci device inaccessible for the rest of the primary process lifetime. This happens when a controller is removed from the primary process while a secondary process still uses it. The controller will likely misbehave without its primary process managing it, but at least there won't be a leak. Change-Id: I67581cffa33ce14ff516b5743d13c9ef7b351625 Signed-off-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com> Reviewed-on: https://review.gerrithub.io/434408 Chandler-Test-Pool: SPDK Automated Test System <sys_sgsw@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2018-11-30 16:25:16 +00:00
Changpeng Liu	2706cd4238	nvme: add timeout for Admin commands when initialization Currently there are no timeout mechanism for Admin commands when initialization, the NVMe driver may enter infinite loop. While here, add a new parameter to the controller initialization options, NVMe controller will report an error when timeout happens during initialization. Change-Id: Id0c6b6fa15abe5227b486bee95c8e02914b0d358 Signed-off-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-on: https://review.gerrithub.io/424622 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com> Chandler-Test-Pool: SPDK Automated Test System <sys_sgsw@intel.com>	2018-11-16 15:29:33 +00:00
Liu Xiaodong	5aace13984	lib/nvme: tolerate abnormal char device In some special cases, NVMe device with cdata.nn=0 may be used to do validation or other test work. cdata.nn=0 means the device can't support NS at all. Change-Id: I55f75a8cb21b8d1b99c5318e27c876a4371d6dd4 Signed-off-by: Liu Xiaodong <xiaodong.liu@intel.com> Reviewed-on: https://review.gerrithub.io/432191 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: joevannip <jparairo@nvxltech.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2018-11-08 23:35:28 +00:00
Darek Stojaczyk	5a588715d9	nvme: detach PCI device in secondary process We only detached the PCI device on the controller destruction, which happens just once - in the primary process, but secondary process needs the PCI detach as well. Requesting to hotremove the NVMe PCIe controller in secondary process is broken, because DPDK will still keep the device reference and won't allow SPDK to hotplug it again. Fix this by detaching the local PCI device whenever removing a secondary process from spdk_nvme_ctrlr. This does require an additional transport check in the generic NVMe layer, but I found it an overkill to create a multi-process transport abstraction just for this case. Change-Id: I812dc1c878ade5b149556806228a2afcb49f0b17 Signed-off-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com> Reviewed-on: https://review.gerrithub.io/431487 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Chandler-Test-Pool: SPDK Automated Test System <sys_sgsw@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2018-11-02 18:30:09 +00:00
Darek Stojaczyk	0258728f2b	nvme/pci: increase the init delay to 2s The time required to wait increases with the amount of submitted FLR resets. Now that DPDK takes less and less time to initialize, this starts to become an issue. We can even see on our CI within regular tests where a single application is start-stopped in a short period of time. This is also a problem if a device is detached via RPC and immediately attached afterwards. The time required to wait seems to cap at 2 seconds, so instruct our driver to wait exactly that. Change-Id: I18b6fbdea9b0dca5d7e1756e9ead7d97119f2fa2 Signed-off-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com> Reviewed-on: https://review.gerrithub.io/429415 Chandler-Test-Pool: SPDK Automated Test System <sys_sgsw@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2018-10-22 17:57:24 +00:00
Darek Stojaczyk	951bb3a458	env/pci: move the vfio init delay to nvme/pci This is an NVMe-specific issue and I/OA or VirtIO devices don't need it. Additionally, the delay is now asynchronous, meaning that potentially multiple NVMe controllers can wait all at once. The drawback of this change is that we're needlessly waiting even when using uio_pci_generic. However, since the delay does not block anymore, its impact is significantly minimized. Change-Id: I5d16a7fd7cb66c785acb687f14690e95f6188b9e Signed-off-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com> Reviewed-on: https://review.gerrithub.io/429414 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Chandler-Test-Pool: SPDK Automated Test System <sys_sgsw@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2018-10-22 17:57:24 +00:00
Jim Harris	073f2dd8f2	nvme: do not retry AER if ASYNC_LIMIT_EXCEEDED received This indicates an out-of-spec device, so just print an error message but don't bother retrying the AER. While here, add status code type (sct) check for the other status code check when an AER fails - it is not enough to compare just the status code. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: Ibd26549aa08d3eb4814c239b6b2c6fe95e069a54 Reviewed-on: https://review.gerrithub.io/429533 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Chandler-Test-Pool: SPDK Automated Test System <sys_sgsw@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2018-10-17 04:51:07 +00:00
Changpeng Liu	a2fdc4dd73	nvme: make identify NS id descriptors can be executed asynchronously With Identify Namespace Identification Descriptors can be executed asynchronously, most of functions in the controller initialization now can be executed asynchronously now, for host with multiple controllers this can save some time during initialization. Change-Id: I70e3c6c2c691134d2ae4c5969288cced1538c6cc Signed-off-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-on: https://review.gerrithub.io/428585 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Chandler-Test-Pool: SPDK Automated Test System <sys_sgsw@intel.com> Reviewed-by: GangCao <gang.cao@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2018-10-15 17:57:56 +00:00
Changpeng Liu	92bf76c9a9	nvme: make identify ns can be executed asynchronously Change-Id: I189ad8889c74937bf43bcf2c3029416ddb94976d Signed-off-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-on: https://review.gerrithub.io/425705 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Chandler-Test-Pool: SPDK Automated Test System <sys_sgsw@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Xiaodong Liu <xiaodong.liu@intel.com> Reviewed-by: Paul Luse <paul.e.luse@intel.com> Reviewed-by: GangCao <gang.cao@intel.com>	2018-10-15 17:57:56 +00:00
Changpeng Liu	d9ecb5724e	nvme: broke up NS construction with extra states Change-Id: I4e95e6283283be48cc8682a5e18a84618e2f34d9 Signed-off-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-on: https://review.gerrithub.io/425704 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Chandler-Test-Pool: SPDK Automated Test System <sys_sgsw@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2018-10-09 18:09:32 +00:00
Changpeng Liu	cf5448a910	nvme: make nvme_ctrlr_configure_aer() can be executed asynchronously Change-Id: I1cc4c79dc5f27aef18936e00953b72ed45c859bd Signed-off-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-on: https://review.gerrithub.io/425070 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Chandler-Test-Pool: SPDK Automated Test System <sys_sgsw@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2018-09-14 22:39:52 +00:00
Changpeng Liu	38a396d959	nvme: make nvme_ctrlr_set/get_num_queues() can be executed asynchronously Change-Id: I6d4bd667df1842b76119de21e6ba5a589237cc7e Signed-off-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-on: https://review.gerrithub.io/425064 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Chandler-Test-Pool: SPDK Automated Test System <sys_sgsw@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2018-09-14 22:39:52 +00:00
Changpeng Liu	8b95dbab84	nvme: broken up nvme_ctrlr_set_num_qpairs() into set/get functions Change-Id: If5744389ae36f9af0964040d30f81afca3fc4962 Signed-off-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-on: https://review.gerrithub.io/425063 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Chandler-Test-Pool: SPDK Automated Test System <sys_sgsw@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2018-09-14 22:39:52 +00:00

1 2 3 4 5 ...

256 Commits