numam-spdk

Author	SHA1	Message	Date
Changpeng Liu	5adf099eed	nvme: don't send Identify NS ID Descriptor List to inactive NS The specification says it will return INVALID FIELD if the NS is in inactive state. Fix issue #1551. Change-Id: I1b32f023ed665d410f4705e439068699e2b2f8de Signed-off-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3860 Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2020-08-21 08:45:31 +00:00
Alexey Marchuk	fdf2490a32	nvmf/rdma: Don't destroy qpair if rdma_accept fails Failed qpair will be destroyed on generic nvmf layer during handling of error code returned from spdk_nvmf_poll_group_add. The current approach leads to heap-use-after-free. Change-Id: I99331150fa36a3c3c18176589afb973dee449b3a Signed-off-by: Alexey Marchuk <alexeymar@mellanox.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3538 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>	2020-08-21 08:26:15 +00:00
paul luse	c3fd3e95bb	lib/accel: change task alloc from global mempool->per chan list The one large global mempool was a waste of memory for apps that don't use the accel framework as its always allocated a pool sized to handle a heavy load with multiple threads. Instead move to a per channel list of just 1024 tasks greatly decreasing the memory footprint but still able to scale as more threads are added. Also renamed all accel_req to acccel_taak and simply task to accel_task as this was being touched anyways and not consistent. fixes issue #1510 Signed-off-by: paul luse <paul.e.luse@intel.com> Change-Id: I0e93ca6270323e2df4b739711c5d9b667a52e1eb Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3740 Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2020-08-21 08:25:04 +00:00
Alexey Marchuk	8bec9feb76	nvme/rdma: Remove unused spdk_nvme_send_wr_list nvme_rdma_qpair::sends_to_post is not used, remove it and spdk_nvme_send_wr_list structure Signed-off-by: Alexey Marchuk <alexeymar@mellanox.com> Change-Id: If9c42736d4e796a947bbfe80f59efd2fd7f77859 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3822 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>	2020-08-21 08:24:43 +00:00
Alexey Marchuk	58f43df1f5	nvmf/rdma: Handle several ibv events in a row Currently rdma acceptor handles only one ibv event per poll Taking into account the default acceptor poll rate (10ms), it can take a long time to handle e.g. LAST_WQE_REACHED events when we close huge amount of qpairs at the same time. This patch allows to handle up to 32 ibv events per acceptor poll. Change-Id: Ic2884dfc5b54c6aec0655aaa547b491a9934a386 Signed-off-by: Alexey Marchuk <alexeymar@mellanox.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3821 Community-CI: Mellanox Build Bot Community-CI: Broadcom CI Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>	2020-08-21 08:24:43 +00:00
Seth Howell	ce83fc2aff	lib/nvme: remove qpair from ctrl list in connect fail path This is an oversight that can cause issues with looping through the list if we end up allocating the same qpair twice. Signed-off-by: Seth Howell <seth.howell@intel.com> Change-Id: I513ea35398f4b724366c21be144531fbfbdb4347 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3835 Community-CI: Mellanox Build Bot Community-CI: Broadcom CI Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2020-08-21 08:24:18 +00:00
Jim Harris	ed7848f2df	blob: handle overlapping open case We only create one spdk_blob object for a given blob, and just increase the ref_count if it is opened multiple times. bs_open_blob would do the lookup for existing opened blobs. But if the blob is opened again, before the previous open operation has completed, we would end up with two spdk_blob objects for the same blob. Solution is to do another lookup when the open operation completes. If we find the blob, free the one we just finished opening and return the existing one instead. Also added unit test that failed on the existing code but passes now with this patch. Signed-off-by: Jim Harris <james.r.harris@intel.com> Reported-by: Mike Cui Change-Id: I00c3a913b413deddf06f0b63f7a669efb2b5658f Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3855 Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2020-08-21 08:24:09 +00:00
yidong0635	361cddfd63	nvme/nvme_pcie: Remove unused codes. pctrlr->cmb.mem_register_addr and pctrlr->cmb.mem_register_size are assigned after spdk_mem_register. if spdk_mem_register is failed , ctrlr_map_cmb hasn't been executed. they are not be used. So remove them. Signed-off-by: yidong0635 <dongx.yi@intel.com> Change-Id: I3d1996eee8b5260b79c4c3e0a2e1d376da2343b7 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3856 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>	2020-08-21 08:24:00 +00:00
Changpeng Liu	01b6bd8a92	nvmf: fix the associate timeout value SPDK poller uses microsecond as the input parameter, so we need to change the correct value when opts.association_timeout is expressed by millisecond. Change-Id: Ia674f0115ea176b998e4c0c70b8ce75b28984701 Signed-off-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3861 Community-CI: Mellanox Build Bot Community-CI: Broadcom CI Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2020-08-21 08:23:45 +00:00
Changpeng Liu	5d5a9077a3	nvme: don't assert on custom transport Change-Id: I2d425c127dc070f7bb508f5a61e6304f6042fdf7 Signed-off-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3857 Community-CI: Mellanox Build Bot Community-CI: Broadcom CI Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>	2020-08-21 08:23:19 +00:00
Shuhei Matsumoto	1da94ed7b8	rpc/nvmf: Add ana_reporting parameter to nvmf_create_subsystem RPC Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: I9adc8373050e68872a4d9e89518c137e61005254 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3852 Community-CI: Mellanox Build Bot Community-CI: Broadcom CI Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com>	2020-08-21 08:22:40 +00:00
Shuhei Matsumoto	6f2265734d	lib/nvmf: Optionalize ANA reporting feature After supporting ANA reporting by default, Linux kernel 5.3 reported error when parsing NVMe ANA log. The newer kernel fixed the issue but we should optionalize ANA reporting feature to avoid error for Linux kernel 5.3 or before. Add a bool variable ana_reporting to struct spdk_nvmf_subsystem and disable ANA reporting and initialization of related variables if it is false. We can expose MNAN (Maximum Number of Allowed Namespaces) even if ANA reporting is disabled. But MNAN is not required if ANA reporting is disabled. So do not set MNAN if it is false too. Add a public API spdk_nvmf_subsystem_set_ana_reporting() to set ana_reporting by the nvmf_create_subssytem RPC. The next patch will add ana_reporting to nvmf_create_subsystem RPC. Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: Icc77773b4c9513daba2f1a9fdaf951d80574f379 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3850 Community-CI: Mellanox Build Bot Community-CI: Broadcom CI Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Monica Kenguva <monica.kenguva@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2020-08-21 08:22:40 +00:00
Shuhei Matsumoto	4cc04a1251	lib/nvmf: Add nvmf_subsystem_get_controllers RPC Add an new RPC, nvmf_subsystem_get_controllers to retrieve the list of NVMe-oF controllers of an NVMe-oF subsystem. One of the main use cases will be to get identification information of NVMe-oF controllers to configure their ANA states dynamically. Pause and resume the subsystem to access the controllers safely. One subtle issue remains. The JSON RPC returns success even if resuming the subsystem fails. Write FIXME explicitly to address this. Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: Ibf8d1cf56850a705e343b86022d101b4c7204199 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3848 Community-CI: Mellanox Build Bot Community-CI: Broadcom CI Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2020-08-21 08:22:40 +00:00
Michael Haeuptle	43ad7febb2	lib/nvmf: Fixes stuck subsystem RPC A subsystem RPC is not transitioned to a paused state when there are ios outstanding (tracked by subsystem poll group). In general AERs, are not tracked as outstanding IOs. However, there are 3 paths in nvmf_ctrlr_async_event_request which do not adjust the outstanding io count. If we get into any of these 3 paths, the subsystem pause can hang forever. The issue was reproduced with hot plug stress testing under load. We can get into the second path (SPDK_NVME_ASYNC_EVENT_TYPE_NOTICE) under these circumstances: - An AER completion is sent to the initiator due to a namespace change (e.g. hot remove/add) - In this case, type is set to SPDK_NVME_ASYNC_EVENT_TYPE_NOTICE - The initiator sends a new AER admin command, hitting the second path where we return without adjusting the outstanding ios. Fixes: 1552 Change-Id: I45f781966cc1e9a601b2305c7985a21154d802e8 Signed-off-by: Michael Haeuptle <michael.haeuptle@hpe.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3854 Community-CI: Mellanox Build Bot Community-CI: Broadcom CI Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Seth Howell <seth.howell@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: JinYu <jin.yu@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>	2020-08-20 09:31:17 +00:00
Sochin Jiang	db3d1201a4	lib/blob: fix a data corruption bug There is a fatal bug that could easily cause data corruption when using thin-provisioned blobs. In blob_request_submit_rw_iov(), we first get lba by calling blob_calculate_lba_and_lba_count(), blob_calculate_lba_and_lba_count() calculates different lbas according to the return of bs_io_unit_is_allocated(). Later, we call bs_io_unit_is_allocated() again to judge whether the specific cluster is allocated, the problem is it may have be allocated here while not be allocated when calling blob_calculate_lba_and_lba_count() before. To ensure the correctness of lba, we can do lba recalculation when bs_io_unit_is_allocated() returns true, or make blob_calculate_lba_and_lba_count() return the result of bs_io_unit_is_allocated(), use the second solution in this patch. By configuring more than one cpu core, md thread will run in a separate SPDK thread, this data corruption scenario could be easily reproduced by running fio verify in VMs using thin-provisioned Lvols as block devices. Signed-off-by: Sochin Jiang <jiangxiaoqing.sochin@bytedance.com> Change-Id: I099865ff291ea42d5d49b693cc53f64b60881684 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3318 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com>	2020-08-20 09:26:23 +00:00
Ziye Yang	0d3cc15a62	nvme/tcp: Correct the incapsule data usage According to page35 in recent NVMe-oF spec ( NVMe-over-Fabrics-1.1-2019.10.22-Ratified), ioccsz is used to restrict the incapsule size of I/O command, so do not restrict the NVMe-oF OPC command and also the admin command. We accidently trigger an bug in kernel since we do not send the fabrics command with the incapsule and make the kernel coredump, though the kernel has bugs. Signed-off-by: Ziye Yang <ziye.yang@intel.com> Change-Id: I869a2c8ab7b9c2ac1e5cc5b603920662591c2c64 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3837 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: <dongx.yi@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>	2020-08-20 09:26:06 +00:00
yupeng	2d30df9b0b	bdev: add bdev_examine_bdev API The bdev_examine_bdev api will examine a bdev explicitly. After disabling the auto_examine feature, a user could call bdev_examine_bdev to examine a specific bdev he/she wants. Signed-off-by: Peng Yu <yupeng0921@gmail.com> Change-Id: Ifbbfb6f667287669ddf6175b8208efee39762933 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3219 Community-CI: Mellanox Build Bot Community-CI: Broadcom CI Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Michael Haeuptle <michaelhaeuptle@gmail.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2020-08-20 08:56:53 +00:00
Ziye Yang	2031f8f70d	nvme: set the error code if we cannot send keep alive command. If the transport is broken, we should set errno code in spdk_nvme_ctrlr_process_admin_completions instead of keeping silence. Signed-off-by: Ziye Yang <ziye.yang@intel.com> Change-Id: Ie73763e1329e12a8c82a0223d360991f86c39be3 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3773 Community-CI: Broadcom CI Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>	2020-08-19 07:29:26 +00:00
Seth Howell	518a1e013a	lib/nvme: make fabrics connect timeout configurable. Signed-off-by: Seth Howell <seth.howell@intel.com> Change-Id: If829d399882ef948d95673c17e5689c91386c21d Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3795 Community-CI: Mellanox Build Bot Community-CI: Broadcom CI Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2020-08-19 07:29:19 +00:00
Seth Howell	b3bb3a1bbf	lib/nvme: change timeout in wait_for_completions to usec This allows for much more granular control over the timeout. Signed-off-by: Seth Howell <seth.howell@intel.com> Change-Id: Ib23de21e60eec4207c55320579699edf284f4e16 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3794 Community-CI: Mellanox Build Bot Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2020-08-19 07:29:19 +00:00
Ziye Yang	85ff3fcea6	rdma: Do not use the poller to handle the qpair exiting. Generally, this patch did the following work: Remove the destruct poller. I think that we do not need this, the destruct poller is specially for Softwaare RoCE case. Since SoftRoCE will not have IBV_EVENT_QP_LAST_WQE_REACHED event, we will not wait the last_wqe_reached flag when srq is enabled. So we can avoid using the poller. And the purpose of this patch is to solve the coredump issue. For example, if we run rdma local test such as, e.g., test/nvmf/host/bdevperf.sh --transport=rdma The coredump reason: the qpair is freed twice. Because for RDMA transport, we do not really remove the qpair from the group if the upper layer does it. The first time is called by nvmf_rdma_destroy_drained_qpair in nvmf_rdma_poller_poll, and the second time is called by nvmf_rdma_qpair_reject_connection in in nvme_rdma_close_qpair. Since nvme_rdma_close_qpair will always called, so we need make sure that the qpair will be close after calling this function. Otherwise we will have the double free qpair. So our approach here is add a flag ("to_close")in rqpair structure and make sure the rqpair be freed after the "to_close" is set nvme_rdma_close_qpair Signed-off-by: Ziye Yang <ziye.yang@intel.com> Change-Id: I6f97debbcd29bbb7c6e3f9725907b4102a1d2892 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3661 Community-CI: Broadcom CI Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Seth Howell <seth.howell@intel.com>	2020-08-19 07:28:36 +00:00
Shuhei Matsumoto	05cd697757	lib/iscsi: Add MaxR2TPerConnection to iSCSI options Add MaxR2TPerConnection to iSCSI global options and make it configurable by JSON RPC. Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: Ida95e5c7dac301a22520656709e1aa4d611f31ef Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3777 Community-CI: Broadcom CI Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2020-08-18 07:27:45 +00:00
Shuhei Matsumoto	5af42000c1	lib/iscsi: Make max outstanding R2Ts per connection configurable By the recent refactoring, we have no static size array for outstanding R2Ts per connection. It looks that we do not have any critical reason to prohibit us from making max outstanding R2Ts per connection configurable. There are some use cases to use large write I/O intensively (e.g. 128KB). Let such use cases change the value of max R2Ts per connection by their responsibility to do performance tuning. Maximum outstanding R2Ts per task are defined both for iSCSI target and NVMe-TCP target but maximum outstanding R2Ts per connection is unique for iSCSI target. The next patch will add the corresponding iSCSI option. Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: I4f6fd3c750a9a0a99bcf23064fe43a3389829aa9 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3776 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2020-08-18 07:27:45 +00:00
Shuhei Matsumoto	fb229e1eb2	lib/iscsi: Count R2T and Data Out PDUs into PDU pool size It is likely that the raw number 8 in the macro NUM_PDU_PER_CONNECTION means 2 * DEFAULT_MAXR2T and the raw number 2 means R2T and Data Out, but is not certain. On the other hand, the next patch will make the max number of outstanding R2Ts per connection configurable. As a preparation to the next patch, add 2 * DEFAULT_MAXR2T explicitly to the macro NUM_PDU_PER_CONNECTION. The next patch will replace DEFAULT_MAXR2T by an new variable. Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: I8a3be14d53c0abf11d7aade401386601d8fe6c11 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3783 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2020-08-18 07:27:45 +00:00
Shuhei Matsumoto	07f2d83dea	lib/iscsi: Change the type of pending_r2t from int to uint32_t Other count variables in iSCSI library have used uint32_t rather than int. Change the type of spdk_iscsi_conn::pending_r2t from int to uint32_t and add assert to check if pending_r2t is not negative. Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: I9bd296c0142b0808ae822952277c9ecc133e5f62 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3775 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2020-08-18 07:27:45 +00:00
Shuhei Matsumoto	5aaf754f81	lib/iscsi: Add MaxLargeDataInPerConnection to iSCSI options Add MaxLargeDataInPerConnection to iSCSI global options and make it configurable by JSON RPC. Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: Ibcd16da2eac64241217bedeb89a7929bbdc67871 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3756 Community-CI: Broadcom CI Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2020-08-18 07:27:45 +00:00
Shuhei Matsumoto	5b2c76f062	lib/iscsi: Make the max number of read subtasks for large read I/O configurable For some use case that there is heavy large read I/O, the performance bottleneck due to MAX_LARGE_DATAIN_PER_CONNECTION was reported. The following assumes that all I/Os are large read. Large read primary task whose I/O size is more than SPDK_BDEV_LARGE_BUF_MAX_SIZE (=64KB) is split into multiple read subtasks. spdk_iscsi_globals::MaxQueueDepth limits maximum number of outstanding read primary tasks, and MAX_LARGE_DATAIN_PER_CONNECTION (=64) limits maximum number of outstanding read subtasks. MAX_LARGE_DATAIN_PER_CONNECTION is also used to calculate PDU pool. To remove the performance bottleneck, change the macro constant MAX_LARGE_DATAIN_PER_CONNECTION to a global variable spdk_iscsi_globals::MaxLargeDataInPerConnection. We don't see any negative side effect if we set spdk_iscsi_globals::MaxLargeDataInPerConnection to 64. The use case that reported the performance issue will change the value of spdk_iscsi_globals::MaxLargeDataInPerConnection by its own responsibility. The next patch will add the value of spdk_iscsi_globals::MaxLargeDataInPerConnection to iSCSI options, and make it configurable by JSON RPC. Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: Ifc30cdb8e00d50f4d3755ff399263cf5d0b681b6 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3755 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2020-08-18 07:27:45 +00:00
Seth Howell	0162da7f76	lib/nvmf: add an in_destruct flag to the ctrlr struct Helps us avoid adding a new I/O qpair while the ctrlr is being destroyed. Signed-off-by: Seth Howell <seth.howell@intel.com> Change-Id: I3bf9318b075125b9d432b885fa9f6f2f44d422d7 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3686 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2020-08-17 08:28:49 +00:00
Shuhei Matsumoto	e97fd6c936	lib/iscsi: Add iscsi_target_node_request_logout RPC For the login redirection feature, the current implementation works only if a portal is redirected from an initial portal to a redirect portal. However, the login redirection feature should work even if a portal is redirected from one redirect portal to another redirect portal. A public portal group knows only a redirect portal and does not know the portal group of the redirect portal. Moreover, it is very likely that an initial portal and a redirect portal exist in different SPDK iSCSI target applications. To cover all these concerns, add an new iscsi_target_node_request_logout RPC to request connections whose portal group tag match for the target node. To cover potential use cases, make the second parameter portal group tag optional. Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: I612672490722fb22fd4eba055998b7408ab84ca5 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3780 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Broadcom CI Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com>	2020-08-17 08:26:59 +00:00
Shuhei Matsumoto	8cf5581fa9	lib/iscsi: Remove async logout request from iscsi_target_node_redirect() As written in doc/iscsi.md, typically the login redirection feature will be used in scale out iSCSI target system, which runs multiple SPDK iSCSI target applications. In scale out iSCSI target system, the initial portal, the current redirect portal, and the next redirect portal are likely to be in different SPDK iSCSI target applications. In this case, asynchronous logout request should be sent independently from the iSCSI target application which has the current redirect portal. However, we had added asynchronous logout request into the iSCSI target application which has the next redirect portal. This idea works only for the case that login is redirected from the initial portal to a redirect portal. We remove asynchronous logout request from iscsi_target_node_redirect() in this patch, and update the corresponding help documents. The next patch will add an new RPC to send asynchronous logout request to all connections to the specified portal group and the specified target. Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: Ib0ac72e8cdad7e8c64e446b7495e572fac4b5bae Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3779 Community-CI: Broadcom CI Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com>	2020-08-17 08:26:59 +00:00
Ziye Yang	b68d89bcc2	nvmf/rdma: Remove the unused data structure spdk_nvmf_send_wr_list This data structure is not used. Signed-off-by: Ziye Yang <ziye.yang@intel.com> Change-Id: I143fb9256f692d7bd9bb5e14cdc479f64ddcef45 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3746 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2020-08-13 07:54:36 +00:00
Alexey Marchuk	db09de9866	nvmf/rdma: Update Work Completion error logging 1. Retrieve actual IBV state when we receive WC with bad status 2. Don't log an error if WC status is IBV_WC_WR_FLUSH_ERR. This means that we are performing qpair cleanup and this WC is expected. Change-Id: Id23634092f537861e66ca0f83ab79db9e052507b Signed-off-by: Alexey Marchuk <alexeymar@mellanox.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3736 Community-CI: Mellanox Build Bot Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: <dongx.yi@intel.com> Reviewed-by: Michael Haeuptle <michaelhaeuptle@gmail.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2020-08-13 07:54:22 +00:00
Jacek Kalwas	47ce1fe307	nvmf: association timer triggered on reset Signed-off-by: Jacek Kalwas <jacek.kalwas@intel.com> Change-Id: I85e886e8912009ec5761b5cd0e5b5cef87b25d6e Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3463 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com>	2020-08-13 07:53:08 +00:00
Jacek Kalwas	c322453ccc	nvmf: disable keep alive timer during shutdown From the time a shutdown is initiated the controller shall disable Keep Alive timer. Signed-off-by: Jacek Kalwas <jacek.kalwas@intel.com> Change-Id: Id499dabce1913b9da2f0b3fd961fdfc8b621afa9 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3462 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com>	2020-08-13 07:53:08 +00:00
Jacek Kalwas	71cd42e139	nvmf: association timer triggered on shutdown After CC.EN transitions to ‘0’ (due to shutdown or reset), the association between the host and controller shall be preserved for at least 2 minutes. After this time, the association may be removed if the controller has not been re-enabled. Signed-off-by: Jacek Kalwas <jacek.kalwas@intel.com> Signed-off-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com> Change-Id: I4734600067fd4b7306b46f1325fdd5031e81c079 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/2984 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com>	2020-08-13 07:53:08 +00:00
Tomasz Kulasek	549b9f31c6	lib/nvme: implement SPDK_NVME_DATA_NONE data transfer in CUSE Change-Id: Ifb2a53bdbaabd74b7f412923a97d79b44afde861 Signed-off-by: Tomasz Kulasek <tomaszx.kulasek@intel.com> Signed-off-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/1744 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Karol Latecki <karol.latecki@intel.com>	2020-08-12 10:39:29 +00:00
Maciej Szwed	210f61ec0e	bdev: Return when locking LBA range for fused command failed Add missing return statement on LBA range locking failure. Fixes github issue #1531 Signed-off-by: Maciej Szwed <maciej.szwed@intel.com> Change-Id: I5506f34acd51714b9947b9692d0d5d9793144adc Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3737 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>	2020-08-12 07:52:09 +00:00
Seth Howell	fa01f99692	nvmf/rdma: disconnect qpair from ibv_event ctx This call can be made directly now that spdk_nvmf_qpair_disconnect is thread safe. It's actually better that we do it this way, because the qp destruct call is guaranteed to block until the ib events associated with it are acknowledged. this means that by processing the disconnect before we ack the event, we will have valid memory to do the atomic checks. Signed-off-by: Seth Howell <seth.howell@intel.com> Change-Id: If6882b7dc568fe4c35f4a35375769634326e9d76 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3681 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>	2020-08-12 07:51:15 +00:00
Seth Howell	86a6ac995c	nvmf/rdma: always go through spdk_nvmf_qpair_disconnect. We should use this function as the synchronization point for all qpair disconnects. Signed-off-by: Seth Howell <seth.howell@intel.com> Change-Id: Ic685ac3481765190cc56eeec3ee24dad52e336c9 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3675 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ziye Yang <ziye.yang@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2020-08-12 07:51:15 +00:00
Seth Howell	4bdfe13d4d	lib/nvmf: make spdk_nvmf_qpair_disconnect thread safe. This function should be the synchronization point for all disconnects regardless of whether they begin on the transport, from an RPC, or in response to application termination. Signed-off-by: Seth Howell <seth.howell@intel.com> Change-Id: If3553ab3a9e265b0938c84832cb9f774852d7565 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3674 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Ziye Yang <ziye.yang@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>	2020-08-12 07:51:15 +00:00
Monica Kenguva	332e846fe6	lib/nvmf: implement ANA log page SPDK NVMe-oF controller creates a ANA group for each namespace, ANA group ID matches namespace ID, and default ANA state of ANA group is optimized, and the MNAN field is set equal to the NN field. If a ANA log page contains multiple ANA group descriptors, it has one or more descriptors will not be 8 bytes aligned. Hence we create one descriptor and copy it to the ANA log page at a time. Change count will be supported later. Signed-off-by: Monica Kenguva <monica.kenguva@intel.com> Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: I56ba6aa78983480caa3dfbf22aefc9aeabfd5405 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/2920 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Paul Luse <paul.e.luse@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2020-08-12 07:50:44 +00:00
Wojciech Malikowski	1a91a68fae	lib/ftl: Return SPDK_POLLER_BUSY flag only when writes were submitted FTL core poller should return SPDK_POLLER_BUSY flag only when some writes operations were processed. Signed-off-by: Wojciech Malikowski <wojciech.malikowski@intel.com> Change-Id: I50e2b536fbec819887148cc045d76c5c5d78beb2 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3619 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Konrad Sztyber <konrad.sztyber@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com>	2020-08-11 09:49:51 +00:00
Seth Howell	d156d2f771	lib/nvmf: don't free ctrlr->qpair_mask early. There are 2 messages passed between when _nvmf_ctrlr_free_from_qpair is executed and when nvmf_ctrlr_destruct is executed. That leaves time when the controller->qpair_mask is not a valid pointer, but it is still in the subsystem controllers list. The purpose of this patch is to close that hole. It is part of a larger series aimed at cleaning up the controller destruct path. Signed-off-by: Seth Howell <seth.howell@intel.com> Change-Id: I0c0199c8392ee278f36df56f599beb10e7a46948 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3685 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2020-08-11 08:34:15 +00:00
Seth Howell	d1b0d2cbe5	nvmf/rpc: stop_listen rpc now uses the async stop listen function. Signed-off-by: Seth Howell <seth.howell@intel.com> Change-Id: Ie7352d6f1a9d74557a92c6e39c376856804f021c Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3450 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Ziye Yang <ziye.yang@intel.com>	2020-08-11 08:29:14 +00:00
Seth Howell	49ee92a61f	lib/nvmf:add spdk_nvmf_transport_stop_listen_async API. This API differs from spdk_nvmf_tranpsort_stop_listen in that it also disconnects the qpairs associated with that listener. Change-Id: Iadfc6d2debc0ef8f1a8cd5db4f20168aeae8264d Signed-off-by: Seth Howell <seth.howell@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3279 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2020-08-11 08:29:14 +00:00
Shuhei Matsumoto	4ea197883c	lib/iscsi: Add iscsi_target_node_set_redirect RPC Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: I090f55debe4ecdc47459bcfe0571a937f064313b Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3439 Community-CI: Mellanox Build Bot Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2020-08-11 08:27:43 +00:00
Shuhei Matsumoto	420b2353ea	lib/iscsi: Inform initiator that target has temporary moved to a different address If the portal group map of the target has a redirect portal, iscsi_tgt_node_is_moved() fills the buffer by the redirected address and returns true. iscsi_op_login_check_target() calls iscsi_tgt_node_is_redirected() before calling iscsi_tgt_node_access() because login redirection can be checked before any or after all security check. If iscsi_tgt_node_is_redirected() returns true, notify login redirection to the corresponding initiator. Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: I4573a69c0a32eafcfe48080a033c135e127da321 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3221 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2020-08-11 08:27:43 +00:00
Shuhei Matsumoto	2c8309e00e	lib/iscsi: Update redirect portal of public portal group iscsi_tgt_node_redirect() updates redirect portal of the initial portal iin a primary portal group for the target node. Check if the specified portal group is a public portal group and is mapped to the target node first. Then if the passed IP address-port pair is NULL, clear the current redirect setting. Public portal group and private portal group are clearly separated and redirect portal must be chosen from a private portal group. Hence this clear method is intuitive and simple. If the passed IP address-port pair is not NULL, check if they are valid, and are not in the specified portal group. Then update a redirect portal of the portal group map. Finally, send asynchronous logout request to all corresponding initiators. Besides, change allocating pg_map from malloc to calloc to initialize redirect portal. Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: I79d826663f4c3d5a117add286f133adeb1ce07f5 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3222 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2020-08-11 08:27:43 +00:00
Shuhei Matsumoto	651f6d6a3a	lib/iscsi: Return portals only in public portal group for SendTargets All redirect portals in private portal groups are temporary and so they should be informed only by temporary login redirection response. Then this patch changes SendTargets operation to return portals only in primary portal groups. Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: Ic62ada749886290df2d1490377cc5ca883b3f47a Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3492 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2020-08-11 08:27:43 +00:00
Shuhei Matsumoto	10d6218444	lib/iscsi: Create portal group as public or private portal group In SPDK iSCSI target, portal group works almost as identifier of portal. To support iSCSI login redirection, we need to have two types of portal groups, public and private portal groups. We need portals of public portal groups to redirect to a portal in a private portal groups at login via temporary login redirection funciton, and we need to make SendTargets return only portals in public portal groups. To do these simply, we mark primary or secondary portal group expicitly at its creation by this patch. Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: Iccf87a4b9dd1f4a8fbb857a399b8f2dbc7c0b3ab Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3491 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com>	2020-08-11 08:27:43 +00:00

1 2 3 4 5 ...

7468 Commits