numam-spdk

Author	SHA1	Message	Date
Seth Howell	7d6d95db3c	nvmf: change the function signature of spdk_nvmf_tgt_create This is necessary to allow the spdk_nvmf_tgt structure to evolve over time without having to further change the target API. Change-Id: Ib0f0f9b1f190913feff0229c96df4e84b1bf35f7 Signed-off-by: Seth Howell <seth.howell@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/465363 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Anil Veerabhadrappa <anil.veerabhadrappa@broadcom.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com>	2019-08-20 19:15:04 +00:00
Seth Howell	0ac5050624	lib/nvmf: add a global list of targets As part of moving the nvmf rpc code to the library, we will need to make it more inclusive of use cases outside of the example spdk nvmf_tgt application. That application only supports a single nvmf target structure. As such, many of the RPCs have this assumption built into them. In order to enable the multi-target use case, we need to configure a way to translate between user supplied RPCs and actual target objects in the library. Change-Id: I5d3745afe9c2ca1c33f6e1a1bcc2b8bb3196ccd6 Signed-off-by: Seth Howell <seth.howell@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/465329 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Paul Luse <paul.e.luse@intel.com>	2019-08-20 19:15:04 +00:00
Ben Walker	1e82ec0640	nvmf: Delay sending AER until subsystem resumes Change-Id: Id5152a793c6b530cb1419c559ac3ed71ee042037 Signed-off-by: Ben Walker <benjamin.walker@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/464614 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Seth Howell <seth.howell@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2019-08-14 21:24:27 +00:00
Evgeniy Kochetov	c9c80e6932	nvmf/rpc: Fix io channel reference counting in NVMf statistics NVMf statistics functions use spdk_get_io_channel function to get a poll group. It increases reference counter in io channel and causes problems on application exit. spdk_put_io_channel calls were added to release the channel. Signed-off-by: Evgeniy Kochetov <evgeniik@mellanox.com> Change-Id: I832d1eae346c3bc3858ed0ed063ff7a7a897a2f5 Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/463389 Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2019-07-29 18:05:09 +00:00
Evgeniy Kochetov	fca6ff8f75	rpc: Add nvmf_get_stats RPC method This patch adds nvmf_get_stats RPC method and basic infrastructure to report NVMf global and per poll group statistics in JSON format. Signed-off-by: Evgeniy Kochetov <evgeniik@mellanox.com> Change-Id: I13b83e28b75a02bc1dcb7b95cbce52ae10ff0f7b Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/452298 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com>	2019-07-12 12:46:29 +00:00
Ziye Yang	750a4213ef	nvmf: add spdk_nvmf_get_optimal_poll_group This patch does the following work: 1. It is optimized for the NVMe/TCP transport. If qpairs' sockets have the same NAPI_ID, the qpairs will be handled by the same polling group. 2. We add a new connection scheduling strategy, named ConnectionScheduler in the configuration file. It selects the scheduler according to the customer's input. Signed-off-by: Ziye Yang <ziye.yang@intel.com> Change-Id: Ifc9246eece0da69bdd39fd63bfdefff18be64132 Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/454550 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com>	2019-07-10 02:30:41 +00:00
Ben Walker	09ef0593d4	nvmf: Leverage bdev uuid to correctly detect remove+add ns while paused Change-Id: Idbf00956394f7ee7ff7e27f2627785cd7146b01f Signed-off-by: Ben Walker <benjamin.walker@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/459605 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Paul Luse <paul.e.luse@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Seth Howell <seth.howell5141@gmail.com>	2019-07-10 01:59:05 +00:00
Ben Walker	85e9760161	nvmf: Capture ns_info onto stack in poll_group_update_subsystem By capturing this pointer onto the stack, we inform the compiler that we don't expect it to change. That allows the compiler to generate more efficient code. Change-Id: I0f3ff9373662198e915269c4498e4902a2cdb808 Signed-off-by: Ben Walker <benjamin.walker@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/459754 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Paul Luse <paul.e.luse@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Seth Howell <seth.howell5141@gmail.com>	2019-07-10 01:59:05 +00:00
Ben Walker	ab3abc15aa	nvmf: Capture channel variable to stack when updating poll groups This signals to the compiler and analysis programs that this won't change during iteration, so it may produce better code. Change-Id: I478c0c9445d4ddf8a69ab1b3deaf628b82a0eaea Signed-off-by: Ben Walker <benjamin.walker@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/459753 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Paul Luse <paul.e.luse@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Seth Howell <seth.howell5141@gmail.com>	2019-07-10 01:59:05 +00:00
JinYu	8fc9ac7b0e	nvmf: complete all I/Os before changing sgroup to PAUSED For the nvme device, I/Os are completed asynchronously. So we need to check the outstanding I/Os before putting IO channel when we hot remove the device. We should be sure that all the I/Os have been completed when we change the sgroup->state to PAUSED, so that we can update the subsystem. Fix #615 #755 Change-Id: I0f727a7bd0734fa9be1193e1f574892ab3e68b55 Signed-off-by: JinYu <jin.yu@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/452038 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>	2019-06-11 01:51:56 +00:00
Gregory Shapiro	14032a984c	NVMF: Add model number as parameter to construct_nvmf_subsystem (-d option). Change-Id: Ia1a458a0ac1c5a17d2955a3f31c6dfe77538eb17 Signed-off-by: Gregory Shapiro <gregory.shapiro@kaminario.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/438562 Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ziye Yang <ziye.yang@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2019-04-23 16:51:16 +00:00
Changpeng Liu	7c331adfeb	nvmf: update the subsystem poll group's reservation information correctly The existing condition for updating the subsystem poll group's reservation information is wrong: when a RELEASE command is received, the reservation type may be changed to none, but the change is not saved to the subsystem's poll group. Change-Id: Idc177a0f03fb9611d6eda1e25a5b90caaa73d1be Signed-off-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/450727 Reviewed-by: Liang Yan <liang.z.yan@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2019-04-11 01:13:59 +00:00
Changpeng Liu	ba431e231e	nvmf: store registrants' host id into subsystem's poll group Now the data structure spdk_nvmf_subsystem_pg_ns_info holds all the reservation information from the associated namespace, so the IO processing routine doesn't need to send a message to the subsystem's thread to check whether the IO command is permitted. Change-Id: Ib6be6abf7bf5f24c230dff80c163a1eb963e20d0 Signed-off-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/448256 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2019-03-21 17:35:11 +00:00
Changpeng Liu	d11aa87320	nvmf: add reservation information to each subsystem's poll group Change-Id: Idcbc3053daf756c818ae3715b4ba0cbd91ed3d44 Signed-off-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/446212 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2019-03-15 20:45:43 +00:00
Changpeng Liu	2099401e94	nvmf: rename subsystem poll group's num_channels to num_ns Array channels in the subsystem's poll group are indexed by nsid - 1, so renaming the previous num_channels to num_ns makes more sense. Also embed the channels into a namespace data structure here, so that this can be reused in the following patch. Change-Id: If5d9aab4b1d5bcf7a3c22f29fa58d84752f0d4cc Signed-off-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/446211 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2019-03-15 20:45:43 +00:00
Evgeniy Kochetov	ed0b611fc5	nvmf/rdma: Add shared receive queue support
This is a new feature for NVMEoF RDMA target, that is intended to save resource allocation (by sharing them) and utilize the locality (completions and memory) to get the best performance with Shared Receive Queues (SRQs). We'll create a SRQ per core (poll group), per device and associate each created QP/CQ with an appropriate SRQ.
Our testing environment has 2 hosts.
Host 1:
  CPU: Intel(R) Xeon(R) CPU E5-2609 0 @ 2.40GHz dual socket (8 cores total)
  Network: ConnectX-5, ConnectX-5 VPI, 100GbE, single-port QSFP28, PCIe3.0 x16
  Disk: Intel Optane SSD 900P Series
  OS: Fedora 27 x86_64
Host 2:
  CPU: Intel(R) Xeon(R) CPU E5-2630 v2 @ 2.60GHz dual-socket (24 cores total)
  Network: ConnectX-4 VPI, 100GbE, dual-port QSFP28
  Disk: Intel Optane SSD 900P Series
  OS: CentOS 7.5.1804 x86_64
Hosts are connected via Spectrum switch. Host 1 is running SPDK NVMeoF target. Host 2 is used as initiator running fio with SPDK plugin.
Configuration:
- SPDK NVMeoF target: cpu mask 0x0F (4 cores), max queue depth 128, max SRQ depth 1024, max QPs per controller 1024
- Single NVMf subsystem with single namespace backed by physical SSD disk
- fio with SPDK plugin: randread pattern, 1-256 jobs, block size 4k, IO depth 16, cpu_mask 0xFFF0, IO rate 10k, rate process “poisson”
Here is a full fio command line:
  fio --name=Job --stats=1 --group_reporting=1 --idle-prof=percpu \
      --loops=1 --numjobs=1 --thread=1 --time_based=1 --runtime=30s \
      --ramp_time=5s --bs=4k --size=4G --iodepth=16 --readwrite=randread \
      --rwmixread=75 --randrepeat=1 --ioengine=spdk --direct=1 \
      --gtod_reduce=0 --cpumask=0xFFF0 --rate_iops=10k \
      --rate_process=poisson \
      --filename='trtype=RDMA adrfam=IPv4 traddr=1.1.79.1 trsvcid=4420 ns=1'
SPDK allocates the following entities for every work request in receive queue (shared or not): reqs (1024 bytes), recvs (96 bytes), cmds (64 bytes), cpls (16 bytes), in_capsule_buffer. All except the last one are fixed size. In capsule data size is configured to 4096.
Memory consumption calculation (target):
- Multiple SRQ: core_num * ib_devs_num * SRQ_depth * (1200 + in_capsule_data_size)
- Multiple RQ: queue_num * RQ_depth * (1200 + in_capsule_data_size)
We ignore admin queues in calculations for simplicity.
Cases:
1. Multiple SRQ with 1024 entries:
   - Mem = 4 * 1 * 1024 * (1200 + 4096) = 20.7 MiB (Constant number – does not depend on initiators number)
2. RQ with 128 entries for 64 initiators:
   - Mem = 64 * 128 * (1200 + 4096) = 41.4 MiB
Results:
FIO_JOBS  kIOPS           Bandwidth,MiB/s  AvgLatency,us    MaxResidentSize,kiB
          RQ      SRQ     RQ      SRQ      RQ      SRQ      RQ       SRQ
1         8.623   8.623   33.7    33.7     13.89   14.03    144376   155624
2         17.3    17.3    67.4    67.4     14.03   14.1     145776   155700
4         34.5    34.5    135     135      14.15   14.23    146540   156184
8         69.1    69.1    270     270      14.64   14.49    148116   156960
16        138     138     540     540      14.84   15.38    151216   158668
32        276     276     1079    1079     16.5    16.61    157560   161936
64        513     502     2005    1960     1673    1612     170408   168440
128       535     526     2092    2054     3329    3344     195796   181524
256       571     571     2232    2233     6854    6873     246484   207856
We can see the benefit in memory consumption.
Change-Id: I40c70f6ccbad7754918bcc6cb397e955b09d1033 Signed-off-by: Evgeniy Kochetov <evgeniik@mellanox.com> Signed-off-by: Sasha Kotchubievsky <sashakot@mellanox.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/428458 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2019-03-15 19:19:17 +00:00
Seth Howell	145485769e	nvmf: remove qpair state activating. This intermediate state is unused and meaningless. the qpair transitions into this state right before calling a synchronous operation and then transitions to active as soon as that operation completes successfully. If the operation did not complete successfully, we were leaving qpairs in this weird intermediate state when for all intents and purposes they had reverted to an uninitialized state. Keeping qpairs in the uninitialized state until they have been added to a poll group creates a meaningful distinction between states that can be actionable from the transport level. Change-Id: I6de9bc424b393b6fff221aa2f4212aaa91488629 Signed-off-by: Seth Howell <seth.howell@intel.com> Reviewed-on: https://review.gerrithub.io/c/443471 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2019-02-12 20:39:44 +00:00
Seth Howell	ceb32abbd8	nvmf: don't set qpair->group to NULL.
The typical rdma qpair disconnect function goes through the function _nvmf_rdma_disconnect_retry. When this function was introduced, it was discovered that we could receive a qpair disconnect event for a given qpair before that qpair had been assigned to a poll group. In order to ensure that the disconnect procedure completed properly, we waited on the current thread in _nvmf_rdma_disconnect_retry for the qpair to be assigned a poll group before we finally disconnected. See rdma.c:2250.
Since _nvmf_rdma_disconnect_retry was not necessarily called from the poll group's thread, we relied upon the assumption that the group variable would never be set back to NULL. See the comment on rdma.c:2243. However, in _spdk_nvmf_qpair_destroy we were setting the group back to NULL. This operation can result in the following set of operations across multiple threads that prevent a qpair from ever being fully destroyed.
1. thread 1: receive a disconnect event - call nvmf_rdma_disconnect
2. thread 1: from nvmf_rdma_disconnect call spdk_nvmf_rdma_qpair_inc_refcnt - setting rqpair->refcnt to 1.
3. thread 2: call spdk_nvmf_rdma_poller_poll.
4. thread 2: in spdk_nvmf_rdma_poller_poll reap a completion with an error status which causes us to call spdk_nvmf_qpair_disconnect - rdma:2846
5. thread 2: spdk_nvmf_qpair_disconnect calls _spdk_nvmf_qpair_destroy which sets qpair->group = NULL
6. thread 1: from nvmf_rdma_disconnect we call _nvmf_rdma_disconnect_retry which checks if qpair->group == NULL. If that is the case, we assume that the qpair has not been assigned a group yet and send ourself a message to call _nvmf_rdma_disconnect_retry again. See rdma.c:2253
7. thread 2: from _spdk_nvmf_qpair_destroy we call spdk_nvmf_transport_qpair_fini which results in a call to spdk_nvmf_rdma_close_qpair, which sends dummy send and recvs to the qpair.
8. thread 2: we call poller_poll and get completions for both the send and recv dummy requests. This results in a call to spdk_nvmf_rdma_qpair_destroy.
9. thread 2: spdk_nvmf_rdma_qpair_destroy checks rqpair->refcnt and when it sees that it does not = 0 (see step 2 above) it returns without freeing the resources. See rdma.c:629
10. thread 1: we keep churning in _nvmf_rdma_disconnect_retry sending ourselves messages because rqpair->group is going to be null. Thread 1 never reaches line 2257 where it sends a message to call _nvmf_rdma_qpair_disconnect. _nvmf_rdma_qpair_disconnect is the function that decreases the rqpair->refcnt and allows us to make forward progress on destroying the qpair.
I encountered this issue while trying to disconnect from our target using the kernel initiator with an x722 NIC. I think the timing on this bug comes out with that specific configuration because some of the calls in the disconnect path on thread 1 fail, causing it to take longer and giving the second thread a chance to delete the qpair.
There are really two issues at play here. We don't have a single point of entry for disconnecting RDMA qpairs, and we rely on the qpair->group variable never being set back to NULL. This patch addresses the second issue, and the next patch in the series addresses the first.
Change-Id: I65395d0bbb67edfa7bad2ddc70906606c3d83781 Signed-off-by: Seth Howell <seth.howell@intel.com> Reviewed-on: https://review.gerrithub.io/c/443304 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Paul Luse <paul.e.luse@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>	2019-02-11 19:25:51 +00:00
Seth Howell	4620386417	nvmf: abort I/O from pg queued list when destroying qp This change was provided by GitHub user vikasbrcm to fix issue 562. I am uploading his change to facilitate testing of the issues and possibly get it merged before the 19.01 window closes. Change-Id: I58fb1058f68c6c02006ceed6e577be627e6dbc09 Signed-off-by: Seth Howell <seth.howell@intel.com> Reviewed-on: https://review.gerrithub.io/c/441611 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com>	2019-01-24 20:27:21 +00:00
Ziye Yang	9d11abfd0e	nvmf: Do not set the error state of the qpair Reason: I checked the code in the different transports; the qpair is already freed, so we do not need to set any state. Change-Id: I3d78c259c3f79ea4426dc9408e5c3469bc171358 Signed-off-by: Ziye Yang <optimistyzy@gmail.com> Reviewed-on: https://review.gerrithub.io/437493 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Chandler-Test-Pool: SPDK Automated Test System <sys_sgsw@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com>	2018-12-18 04:00:59 +00:00
Ziye Yang	ea8aa1bf0a	nvmf: check the qpair->ctrlr The ctrlr may be NULL, so we need to add a check here to prevent a segmentation fault. Change-Id: I6c5361cc829af065082a95df0b8cc2f8d49a6002 Signed-off-by: Ziye Yang <ziye.yang@intel.com> Reviewed-on: https://review.gerrithub.io/436950 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Chandler-Test-Pool: SPDK Automated Test System <sys_sgsw@intel.com> Reviewed-by: Maciej Szwed <maciej.szwed@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2018-12-13 21:52:45 +00:00
Ziye Yang	527c825c81	nvmf: Re-add spdk_nvmf_transport_poll_group_remove For TCP/IP transport, we need to remove the socket from the polling group since we do not want to keep the tgroup info in the NVMe/TCP qpair, it should be general. Change-Id: I4b064d8378f66ea5d91ac554fe628d9ccebd07f4 Signed-off-by: Ziye Yang <optimistyzy@gmail.com> Reviewed-on: https://review.gerrithub.io/434128 Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Chandler-Test-Pool: SPDK Automated Test System <sys_sgsw@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2018-12-13 02:41:14 +00:00
Seth Howell	962ba4e89a	nvmf: remove tgt_opts from nvmf_tgt This option is deprecated. Also, rename the rpc and configuration options for setting the opts to reflect that they now only set the max number of subsystems Change-Id: Iaabcbf33dd0a0dc489d81233fda74e9e7f3e0d2e Signed-off-by: Seth Howell <seth.howell@intel.com> Reviewed-on: https://review.gerrithub.io/430161 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Chandler-Test-Pool: SPDK Automated Test System <sys_sgsw@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2018-11-08 23:08:26 +00:00
Seth Howell	7f128c757b	nvmf: don't implicitly create the transport in tgt listen. In order to prepare for multiple transports, the nvmf tgt should never implicitly create a transport when listen is called. Change-Id: If1286e7e3f7bce422a4acd66390852736113df7a Signed-off-by: Seth Howell <seth.howell@intel.com> Reviewed-on: https://review.gerrithub.io/430160 Chandler-Test-Pool: SPDK Automated Test System <sys_sgsw@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2018-11-02 18:04:06 +00:00
yidong0635	bb2486a468	nvmf: change the return type of calloc failed 1.nvmf: change the return type of calloc failed to -ENOMEM and keep consistency in this file. 2.thread: revise rc condition to ( rc!= 0),to deal with all abnormal return. Change-Id: I7cccb548f30448eaa1bac1a5904c3edcad9c1208 Signed-off-by: yidong0635 <dongx.yi@intel.com> Reviewed-on: https://review.gerrithub.io/431459 Chandler-Test-Pool: SPDK Automated Test System <sys_sgsw@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2018-11-02 17:56:40 +00:00
Ben Walker	91b9b4b2a1	nvmf: Simplify qpair states When we thought we could do error recovery we differentiated between inactive and error states. However, that's not possible, so collapse them back into one. Change-Id: I57622c400378f2d4c518efbc12fb52e665a9ba4c Signed-off-by: Ben Walker <benjamin.walker@intel.com> Reviewed-on: https://review.gerrithub.io/430627 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Chandler-Test-Pool: SPDK Automated Test System <sys_sgsw@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Seth Howell <seth.howell5141@gmail.com>	2018-11-02 16:39:37 +00:00
GangCao	98e119f7a9	lib/nvmf: add the nvmf qpair to the available poll group A subsystem in a poll group may have a NULL IO channel assigned when resources run out, for example when the NVMe SSD hardware itself supports only a limited number of IO qpairs. The subsystems in that poll group could then have zero valid channels. In this case, creation of the associated poll group fails, so when adding a new qpair to the specified poll group we need to check for this and pick an available poll group. Change-Id: Iedee2a6375e48eb7bf899cfb0542c565c7ebd231 Signed-off-by: GangCao <gang.cao@intel.com> Reviewed-on: https://review.gerrithub.io/423646 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Chandler-Test-Pool: SPDK Automated Test System <sys_sgsw@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2018-10-16 12:54:02 +00:00
Ben Walker	523810947e	nvmf: Dump new-style configuration RPCs Avoid using the deprecated construct_nvmf_subsystem when dumping configuration. Change-Id: I908d87bdd77a8b2a8e54baeb7b73e8b52c4912ee Signed-off-by: Ben Walker <benjamin.walker@intel.com> Reviewed-on: https://review.gerrithub.io/425186 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Chandler-Test-Pool: SPDK Automated Test System <sys_sgsw@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>	2018-09-18 15:54:21 +00:00
John Barnard	183d81d0c6	nvmf: Move target opts to transport opts (part 2) - Add independent functions to create transport with specific opts and add to target while maintaining backward compatibility with current apps and rpc configuration that still use the add listener method to create a transport. - Add new rpc function to create transport and add to target. + Update json reporting to include new rpc function. + Update python scripts to support new rpc function. + New nvmf test script (cr_trprt.sh) to test new rpc function. Change-Id: I12d0a42e34c9edff757755f18a78b722d5e1523e Signed-off-by: John Barnard <john.barnard@broadcom.com> Reviewed-on: https://review.gerrithub.io/423590 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Chandler-Test-Pool: SPDK Automated Test System <sys_sgsw@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2018-09-17 20:42:16 +00:00
Ben Walker	f10a91ed0d	nvmf: Add function to get local addr for a qpair Change-Id: I19b9834c709bf97b1bbc1a9278b8c3b9350546e2 Signed-off-by: Ben Walker <benjamin.walker@intel.com> Reviewed-on: https://review.gerrithub.io/425185 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Chandler-Test-Pool: SPDK Automated Test System <sys_sgsw@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>	2018-09-11 15:23:33 +00:00
Ben Walker	311ce0e2ee	nvmf: Add a function to get the listen addr for a qpair The function returns the transport ID describing the listen address on which the connection originated. Change-Id: Ib11cddb8ff2ceb04a5f3ce236ba96c68b7226773 Signed-off-by: Ben Walker <benjamin.walker@intel.com> Reviewed-on: https://review.gerrithub.io/425023 Chandler-Test-Pool: SPDK Automated Test System <sys_sgsw@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>	2018-09-11 15:23:33 +00:00
Ben Walker	756bf3be20	nvmf: No longer send message on spdk_nvmf_qpair_disconnect Now that it is required to be on the same thread, the message isn't necessary. Change-Id: I714b77b46467dbcfa51186c8404c5976eaeea08a Signed-off-by: Ben Walker <benjamin.walker@intel.com> Reviewed-on: https://review.gerrithub.io/424593 Chandler-Test-Pool: SPDK Automated Test System <sys_sgsw@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>	2018-09-10 16:44:33 +00:00
Ben Walker	8f64db180e	nvmf: Add a function to get the source address for a qpair Change-Id: I6ae1f380aebbcf090a0ff31ff96fc4592fc29591 Signed-off-by: Ben Walker <benjamin.walker@intel.com> Reviewed-on: https://review.gerrithub.io/421173 Chandler-Test-Pool: SPDK Automated Test System <sys_sgsw@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>	2018-09-07 16:03:06 +00:00
Ben Walker	fd94895432	nvmf: Require qpair disconnect to be performed from owning thread I observed that spdk_nvmf_qpair_disconnect is only ever called from the thread that owns the qpair - i.e. the one associated with the poll group - with only one exception where the qpair wasn't fully initialized. Add a check that enforces this condition, as it will allow some major simplifications. Change-Id: Ied434c9ea63fd4f2a6f9eacdf8f3f26a7b6bcf3f Signed-off-by: Ben Walker <benjamin.walker@intel.com> Reviewed-on: https://review.gerrithub.io/424591 Reviewed-by: Seth Howell <seth.howell5141@gmail.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Chandler-Test-Pool: SPDK Automated Test System <sys_sgsw@intel.com>	2018-09-05 18:08:02 +00:00
Ben Walker	c94020001a	thread: Add a name parameter to spdk_register_io_device This is a string name used for debugging only. Change-Id: I9827f0e6c83be7bc13951c7b5f0951ce6c2a1ece Signed-off-by: Ben Walker <benjamin.walker@intel.com> Reviewed-on: https://review.gerrithub.io/424127 Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Chandler-Test-Pool: SPDK Automated Test System <sys_sgsw@intel.com>	2018-09-05 16:00:54 +00:00
Ben Walker	194ba5833f	nvmf: Add helper function to verify qpair state is set from correct thread In debug mode this will verify that the state is being set from the correct thread only. Change-Id: I6234299d1fcdb63cd047417b6255c91e29991242 Signed-off-by: Ben Walker <benjamin.walker@intel.com> Reviewed-on: https://review.gerrithub.io/423411 Chandler-Test-Pool: SPDK Automated Test System <sys_sgsw@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Seth Howell <seth.howell5141@gmail.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2018-08-28 16:13:38 +00:00
John Barnard	8e8084903e	nvmf: Move target opts to transport opts (part 1) - Move most of the target opts from nvmf_tgt to nvmf_transport. - Update transport create functions to pass in transport opts. - When transport opts are NULL in transport create function, use target opts. (for backward compatibility) - Part 1 of 2 patches. Part 2 (to follow after part 1 accepted) will allow independent creation of transport with specific opts while maintaining backward compatibility with current apps and rpc configuration that still use the add listener method to create a transport. Change-Id: I0e27447c4a98e0b6a6c590541404b4e4be879b47 Signed-off-by: John Barnard <john.barnard@broadcom.com> Reviewed-on: https://review.gerrithub.io/423329 Chandler-Test-Pool: SPDK Automated Test System <sys_sgsw@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2018-08-27 20:43:53 +00:00
GangCao	da01835d84	lib/nvmf: handle the failed case when activating the subsystem If the spdk_nvmf_poll_group_add_subsystem() operation fails, the subsystem still needs to initialize the related queue so that requests arriving later can be properly queued. The expected state in this failed condition also needs to be handled correctly, so that the subsystem can be destroyed properly. Change-Id: I419f2ac7164c25258c3911952c38b9433fca762b Signed-off-by: GangCao <gang.cao@intel.com> Reviewed-on: https://review.gerrithub.io/422799 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Chandler-Test-Pool: SPDK Automated Test System <sys_sgsw@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>	2018-08-21 22:19:54 +00:00
Ben Walker	008ec0bd91	nvmf: Store thread in controller structure The admin queue pair may get disconnected before the controller is entirely destroyed and can't be relied on to obtain the correct thread. Change-Id: I5e80ef286693d53a161134610dd8354c458f8390 Signed-off-by: Ben Walker <benjamin.walker@intel.com> Reviewed-on: https://review.gerrithub.io/422134 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Chandler-Test-Pool: SPDK Automated Test System <sys_sgsw@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: qun wan <qun.wan@intel.com>	2018-08-16 03:30:24 +00:00
Ben Walker	ed60507d5e	nvmf: Queue pairs can no longer be removed from poll groups In RDMA, qpairs can't be removed from poll groups because the poll group defines the completion queue. So don't allow this operation anymore, even if it were theoretically possible on other transports. Change-Id: I69a3d1b336decd2d25e43ddea94f8b2095ef662f Signed-off-by: Ben Walker <benjamin.walker@intel.com> Reviewed-on: https://review.gerrithub.io/421174 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Chandler-Test-Pool: SPDK Automated Test System <sys_sgsw@intel.com>	2018-08-13 18:57:45 +00:00
GangCao	25a89b2ac3	nvmf: return error when getting the NULL I/O channel In the case that NVMe SSD itself has limited number of hardware I/O QPairs, the corresponding abstraction of I/O channel where upper module used to send I/Os down will be NULL. Add a check here for the NVMe-oF module and return the error if the related I/O channel is NULL. Change-Id: I97b799c6ecb026a01b0a414f1b49b949aa2407fd Signed-off-by: GangCao <gang.cao@intel.com> Reviewed-on: https://review.gerrithub.io/416689 Chandler-Test-Pool: SPDK Automated Test System <sys_sgsw@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2018-08-09 00:39:50 +00:00
Ziye Yang	4c4cba9a95	nvmf: simplify the qpair_mask handling. We should not use mutex, but use the spdk_send_msg policy, then we can let only one thread handle that, which eliminates the segmentation fault issue. Now in the code, the qpair_mask is handled by the same thread, e.g., the thread which owns the admin qpair of the ctrlr. Change-Id: I609fd4d49f5ecc85bc47bf9c23afbb507900be7c Signed-off-by: Ziye Yang <optimistyzy@gmail.com> Reviewed-on: https://review.gerrithub.io/420827 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Chandler-Test-Pool: SPDK Automated Test System <sys_sgsw@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>	2018-08-03 03:38:34 +00:00
Ben Walker	6779479067	nvmf: Simplify spdk_nvmf_qpair_disconnect Asking which thread we're currently on is more expensive than sending a message. Change-Id: I9d9007c9f7f30e4cdd9a97de6bf7a10b0e2a0594 Signed-off-by: Ben Walker <benjamin.walker@intel.com> Reviewed-on: https://review.gerrithub.io/420933 Chandler-Test-Pool: SPDK Automated Test System <sys_sgsw@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com>	2018-08-01 17:13:48 +00:00
Seth Howell	b0171f79c3	nvmf_tgt: delete connections accepted during shutdown With the reordering of the nvmf_tgt states, we need to remove any connections accepted during the shutdown phase of the target. Change-Id: I768484366da8273df74b8d52a3e8de6158b6995f Signed-off-by: Seth Howell <seth.howell@intel.com> Reviewed-on: https://review.gerrithub.io/420681 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Ziye Yang <optimistyzy@gmail.com> Chandler-Test-Pool: SPDK Automated Test System <sys_sgsw@intel.com>	2018-07-31 16:14:29 +00:00
Seth Howell	e5a6540777	nvmf: disconnect qpairs before freeing i/o channel Previously, qpair deletion was synchronous and handled by the io_channel_destroy_cb for the target. However, with the new asynchronous qpair deletion api, these qpairs need to be completely removed before we free the i/o channel and the poll group. Change-Id: I42c62391df62825d53e158306c4372523403ad27 Signed-off-by: Seth Howell <seth.howell@intel.com> Reviewed-on: https://review.gerrithub.io/420208 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Chandler-Test-Pool: SPDK Automated Test System <sys_sgsw@intel.com>	2018-07-31 16:14:29 +00:00
Seth Howell	e4c1e5f866	nvmf: destroy_poll_group uses disconnect_qpair asynch api Change-Id: I47eff0db1ab33be23881f694d104e903706f1c28 Signed-off-by: Seth Howell <seth.howell@intel.com> Reviewed-on: https://review.gerrithub.io/417371 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Chandler-Test-Pool: SPDK Automated Test System <sys_sgsw@intel.com>	2018-07-31 16:14:29 +00:00
Seth Howell	f2b22d68d6	subsystem: defer channel iter until pg functions return The poll group pause, resume, remove, and add functions are only called from the subsystem_state_change_on_pg function. Previously, they would return immediately and the state change would move on to the next channel. However, some of these functions (specifically remove) kick off asynchronous APIs and we should not iterate past them until those asynchronous operations complete. Change-Id: I78804273b39f2d171ba26ac4478ad515356833f3 Signed-off-by: Seth Howell <seth.howell@intel.com> Reviewed-on: https://review.gerrithub.io/419289 Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Chandler-Test-Pool: SPDK Automated Test System <sys_sgsw@intel.com>	2018-07-31 16:14:29 +00:00
Seth Howell	d3995f6eca	nvmf: remove_subsystem now uses qpair_remove asynch api This is necessary to avoid race conditions when freeing subsystems. Change-Id: I9b4a7d006cc42cd29e13179e940ced0cc580f548 Signed-off-by: Seth Howell <seth.howell@intel.com> Reviewed-on: https://review.gerrithub.io/417351 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Chandler-Test-Pool: SPDK Automated Test System <sys_sgsw@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>	2018-07-27 20:50:36 +00:00
Seth Howell	1e2c9afa95	nvmf: always call qpair_delete cb on original thread This ensures that when we continue to iterate through channels after deleting the qpair, we will be able to continue iterating through channels. Change-Id: I6fba43dc14a3e5e8faac78f8b37e9e0c6aad2687 Signed-off-by: Seth Howell <seth.howell@intel.com> Reviewed-on: https://review.gerrithub.io/419920 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Chandler-Test-Pool: SPDK Automated Test System <sys_sgsw@intel.com> Reviewed-by: Ziye Yang <optimistyzy@gmail.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>	2018-07-27 20:50:36 +00:00
Seth Howell	4bee4e03b6	nvmf: free AER resources before disconnecting qpair It is necessary to free the AER without sending a completion to ensure that the host does not attempt to send an additional AER upon receiving the first completion. Change-Id: I2b3f8f286d6396019d8ace97d2376547705b8d9d Signed-off-by: Seth Howell <seth.howell@intel.com> Reviewed-on: https://review.gerrithub.io/420661 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Chandler-Test-Pool: SPDK Automated Test System <sys_sgsw@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2018-07-27 20:50:36 +00:00

189 Commits