numam-spdk

Author	SHA1	Message	Date
Alexey Marchuk	8bec9feb76	nvme/rdma: Remove unused spdk_nvme_send_wr_list nvme_rdma_qpair::sends_to_post is not used, remove it and spdk_nvme_send_wr_list structure Signed-off-by: Alexey Marchuk <alexeymar@mellanox.com> Change-Id: If9c42736d4e796a947bbfe80f59efd2fd7f77859 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3822 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>	2020-08-21 08:24:43 +00:00
Alexey Marchuk	58f43df1f5	nvmf/rdma: Handle several ibv events in a row Currently rdma acceptor handles only one ibv event per poll Taking into account the default acceptor poll rate (10ms), it can take a long time to handle e.g. LAST_WQE_REACHED events when we close huge amount of qpairs at the same time. This patch allows to handle up to 32 ibv events per acceptor poll. Change-Id: Ic2884dfc5b54c6aec0655aaa547b491a9934a386 Signed-off-by: Alexey Marchuk <alexeymar@mellanox.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3821 Community-CI: Mellanox Build Bot Community-CI: Broadcom CI Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>	2020-08-21 08:24:43 +00:00
Michal Berger	8e43a261ea	rxe_cfg_small: Remove unused function link_up_rxes() is not used anywhere so remove it. That said, make sure that link_up() is called while adding the rxe device(s). Change-Id: Ic0f9573b911d83d58b42f62d104ef916aa010e4e Signed-off-by: Michal Berger <michalx.berger@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3824 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com>	2020-08-21 08:24:29 +00:00
Michal Berger	0b4d8bf59e	test/nvmf: Always remove lingering soft rxe devices In case there are lingering rxe devices present on the system, adding them again under the infiniband subsystem will simply fail the test. To avoid such scenario, make sure that we start the rxe with a clean slate by removing all soft devices first. This is mostly relevant for systems running jobs which init the rxe but don't run tests which call to revert_soft_roce() for cleanup (see BlobFS-autotest as an example). Change-Id: I12997fbaf7343ae3e9bc0b38f5455f6332c4e6c5 Signed-off-by: Michal Berger <michalx.berger@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3823 Community-CI: Mellanox Build Bot Community-CI: Broadcom CI Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com>	2020-08-21 08:24:29 +00:00
Seth Howell	ce83fc2aff	lib/nvme: remove qpair from ctrl list in connect fail path This is an oversight that can cause issues with looping through the list if we end up allocating the same qpair twice. Signed-off-by: Seth Howell <seth.howell@intel.com> Change-Id: I513ea35398f4b724366c21be144531fbfbdb4347 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3835 Community-CI: Mellanox Build Bot Community-CI: Broadcom CI Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2020-08-21 08:24:18 +00:00
Jim Harris	ed7848f2df	blob: handle overlapping open case We only create one spdk_blob object for a given blob, and just increase the ref_count if it is opened multiple times. bs_open_blob would do the lookup for existing opened blobs. But if the blob is opened again, before the previous open operation has completed, we would end up with two spdk_blob objects for the same blob. Solution is to do another lookup when the open operation completes. If we find the blob, free the one we just finished opening and return the existing one instead. Also added unit test that failed on the existing code but passes now with this patch. Signed-off-by: Jim Harris <james.r.harris@intel.com> Reported-by: Mike Cui Change-Id: I00c3a913b413deddf06f0b63f7a669efb2b5658f Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3855 Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2020-08-21 08:24:09 +00:00
yidong0635	361cddfd63	nvme/nvme_pcie: Remove unused codes. pctrlr->cmb.mem_register_addr and pctrlr->cmb.mem_register_size are assigned after spdk_mem_register. if spdk_mem_register is failed , ctrlr_map_cmb hasn't been executed. they are not be used. So remove them. Signed-off-by: yidong0635 <dongx.yi@intel.com> Change-Id: I3d1996eee8b5260b79c4c3e0a2e1d376da2343b7 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3856 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>	2020-08-21 08:24:00 +00:00
Changpeng Liu	01b6bd8a92	nvmf: fix the associate timeout value SPDK poller uses microsecond as the input parameter, so we need to change the correct value when opts.association_timeout is expressed by millisecond. Change-Id: Ia674f0115ea176b998e4c0c70b8ce75b28984701 Signed-off-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3861 Community-CI: Mellanox Build Bot Community-CI: Broadcom CI Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2020-08-21 08:23:45 +00:00
Changpeng Liu	5d5a9077a3	nvme: don't assert on custom transport Change-Id: I2d425c127dc070f7bb508f5a61e6304f6042fdf7 Signed-off-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3857 Community-CI: Mellanox Build Bot Community-CI: Broadcom CI Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>	2020-08-21 08:23:19 +00:00
Shuhei Matsumoto	1da94ed7b8	rpc/nvmf: Add ana_reporting parameter to nvmf_create_subsystem RPC Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: I9adc8373050e68872a4d9e89518c137e61005254 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3852 Community-CI: Mellanox Build Bot Community-CI: Broadcom CI Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com>	2020-08-21 08:22:40 +00:00
Shuhei Matsumoto	6f2265734d	lib/nvmf: Optionalize ANA reporting feature After supporting ANA reporting by default, Linux kernel 5.3 reported error when parsing NVMe ANA log. The newer kernel fixed the issue but we should optionalize ANA reporting feature to avoid error for Linux kernel 5.3 or before. Add a bool variable ana_reporting to struct spdk_nvmf_subsystem and disable ANA reporting and initialization of related variables if it is false. We can expose MNAN (Maximum Number of Allowed Namespaces) even if ANA reporting is disabled. But MNAN is not required if ANA reporting is disabled. So do not set MNAN if it is false too. Add a public API spdk_nvmf_subsystem_set_ana_reporting() to set ana_reporting by the nvmf_create_subssytem RPC. The next patch will add ana_reporting to nvmf_create_subsystem RPC. Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: Icc77773b4c9513daba2f1a9fdaf951d80574f379 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3850 Community-CI: Mellanox Build Bot Community-CI: Broadcom CI Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Monica Kenguva <monica.kenguva@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2020-08-21 08:22:40 +00:00
Shuhei Matsumoto	4cc04a1251	lib/nvmf: Add nvmf_subsystem_get_controllers RPC Add an new RPC, nvmf_subsystem_get_controllers to retrieve the list of NVMe-oF controllers of an NVMe-oF subsystem. One of the main use cases will be to get identification information of NVMe-oF controllers to configure their ANA states dynamically. Pause and resume the subsystem to access the controllers safely. One subtle issue remains. The JSON RPC returns success even if resuming the subsystem fails. Write FIXME explicitly to address this. Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: Ibf8d1cf56850a705e343b86022d101b4c7204199 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3848 Community-CI: Mellanox Build Bot Community-CI: Broadcom CI Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2020-08-21 08:22:40 +00:00
Wang Shilong	78ba12ecf4	module/blobfs: fix segmentation fault when fuse mount fail if mountpoint doesn't exist, fuse mount will fail and @bfuse will be null, blobfs_fuse_stop() try to access it without check which cause segmentation finally. Patch also improve test messages output to include terminator properly. Signed-off-by: Wang Shilong <wangshilong1991@gmail.com> Change-Id: Idba23e3922778f1985d1f4841a1701cdc272ef17 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3863 Community-CI: Mellanox Build Bot Reviewed-by: Xiaodong Liu <xiaodong.liu@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2020-08-21 08:22:25 +00:00
Alexey Marchuk	09d6e90a9a	nvme_perf: Add warmup Add -a cli parameter to enable warmup stage. Move ns_ctx statistics-related variable to a new structure for easier cleanup. Change-Id: Id536209bc49322f1bc9b5c2296f98f0183b7d4b9 Signed-off-by: Alexey Marchuk <alexeymar@mellanox.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3784 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>	2020-08-20 09:36:03 +00:00
Karol Latecki	6127b1a33b	test/common: use sudo to remove old temp files Most common scenario is to run SPDK tests using "autorun.sh", which calls autotest.sh with "sudo" and permissions are not an issue then. However we have a number of tests which are run independently (e.g. test/fuzz/autofuzz.sh) and try to remove old temp files before sudo is used to elevate privileges. This causes tests to fail early. Using "sudo" explicitly in autotest_common.sh to remove old files will fix the problem. Signed-off-by: Karol Latecki <karol.latecki@intel.com> Change-Id: I84f75c189bb04d0e3208dfc4f221d07d17db5ee5 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3819 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Reviewed-by: Michal Berger <michalx.berger@intel.com>	2020-08-20 09:35:47 +00:00
WANGHAILIANG	fa729fd3a7	configure: add a tips for installing libubsan In practice, libubsan has been installed but there will still be such alarms sometimes. In fact, this has something to do with the GCC version. Most users may not think of it immediately, so they will be confused and waste time here. So I think we should add a hint here. Change-Id: Ib01cf4b35be5b75a3620aea556f4c097a1ee7a01 Signed-off-by: WANGHAILIANG <hailiangx.e.wang@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3832 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Liang Yan <liang.z.yan@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com>	2020-08-20 09:35:41 +00:00
Michal Berger	98678abe89	vm_setup: Add some missing dependencies btrfs-progs and xfsprofs are required by some of the tests, e.g. the nvmf/target/filesystem.sh. abigail package provides abidiff needed for check_so_deps.sh tests. Change-Id: I5199fafef2ac03ce04ea3709e9ff76fefa1ac717 Signed-off-by: Michal Berger <michalx.berger@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/2923 Community-CI: Broadcom CI Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2020-08-20 09:34:23 +00:00
Michael Haeuptle	43ad7febb2	lib/nvmf: Fixes stuck subsystem RPC A subsystem RPC is not transitioned to a paused state when there are ios outstanding (tracked by subsystem poll group). In general AERs, are not tracked as outstanding IOs. However, there are 3 paths in nvmf_ctrlr_async_event_request which do not adjust the outstanding io count. If we get into any of these 3 paths, the subsystem pause can hang forever. The issue was reproduced with hot plug stress testing under load. We can get into the second path (SPDK_NVME_ASYNC_EVENT_TYPE_NOTICE) under these circumstances: - An AER completion is sent to the initiator due to a namespace change (e.g. hot remove/add) - In this case, type is set to SPDK_NVME_ASYNC_EVENT_TYPE_NOTICE - The initiator sends a new AER admin command, hitting the second path where we return without adjusting the outstanding ios. Fixes: 1552 Change-Id: I45f781966cc1e9a601b2305c7985a21154d802e8 Signed-off-by: Michael Haeuptle <michael.haeuptle@hpe.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3854 Community-CI: Mellanox Build Bot Community-CI: Broadcom CI Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Seth Howell <seth.howell@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: JinYu <jin.yu@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>	2020-08-20 09:31:17 +00:00
Michal Berger	59c1e1690e	test/dd: Lower the number of block size tests These tests take significant amount of time on bare-metal systems causing some random timeouts in the CI pool. Reclaim some time by lowering the number of test iterations. Change-Id: Id70e00230e0e49dc0f2f1ccc4d499420fc84d7dd Signed-off-by: Michal Berger <michalx.berger@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3843 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com>	2020-08-20 09:27:21 +00:00
Michal Berger	4eed3b0839	test/dd: Make the AIO file smaller Make the footprint of the test smaller by creating smaller AIO file - 2G instead of 8G. Adjust offseting as well to not fall outside of the device\|file. Also, cap in half the size of the source file. Change-Id: I625eebe545477239606cd65ac6949cca4a2b8c80 Signed-off-by: Michal Berger <michalx.berger@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3842 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com>	2020-08-20 09:27:21 +00:00
Michal Berger	a52c6b194c	test/dd: Remove plain_copy test This test is unnecessarily redundant since posix tests cover this functionality in detail. Change-Id: Ice7a6616a2106c098ff32595547371ea4c716d45 Signed-off-by: Michal Berger <michalx.berger@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3841 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com>	2020-08-20 09:27:21 +00:00
Sochin Jiang	db3d1201a4	lib/blob: fix a data corruption bug There is a fatal bug that could easily cause data corruption when using thin-provisioned blobs. In blob_request_submit_rw_iov(), we first get lba by calling blob_calculate_lba_and_lba_count(), blob_calculate_lba_and_lba_count() calculates different lbas according to the return of bs_io_unit_is_allocated(). Later, we call bs_io_unit_is_allocated() again to judge whether the specific cluster is allocated, the problem is it may have be allocated here while not be allocated when calling blob_calculate_lba_and_lba_count() before. To ensure the correctness of lba, we can do lba recalculation when bs_io_unit_is_allocated() returns true, or make blob_calculate_lba_and_lba_count() return the result of bs_io_unit_is_allocated(), use the second solution in this patch. By configuring more than one cpu core, md thread will run in a separate SPDK thread, this data corruption scenario could be easily reproduced by running fio verify in VMs using thin-provisioned Lvols as block devices. Signed-off-by: Sochin Jiang <jiangxiaoqing.sochin@bytedance.com> Change-Id: I099865ff291ea42d5d49b693cc53f64b60881684 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3318 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com>	2020-08-20 09:26:23 +00:00
Ziye Yang	0d3cc15a62	nvme/tcp: Correct the incapsule data usage According to page35 in recent NVMe-oF spec ( NVMe-over-Fabrics-1.1-2019.10.22-Ratified), ioccsz is used to restrict the incapsule size of I/O command, so do not restrict the NVMe-oF OPC command and also the admin command. We accidently trigger an bug in kernel since we do not send the fabrics command with the incapsule and make the kernel coredump, though the kernel has bugs. Signed-off-by: Ziye Yang <ziye.yang@intel.com> Change-Id: I869a2c8ab7b9c2ac1e5cc5b603920662591c2c64 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3837 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: <dongx.yi@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>	2020-08-20 09:26:06 +00:00
yupeng	2d30df9b0b	bdev: add bdev_examine_bdev API The bdev_examine_bdev api will examine a bdev explicitly. After disabling the auto_examine feature, a user could call bdev_examine_bdev to examine a specific bdev he/she wants. Signed-off-by: Peng Yu <yupeng0921@gmail.com> Change-Id: Ifbbfb6f667287669ddf6175b8208efee39762933 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3219 Community-CI: Mellanox Build Bot Community-CI: Broadcom CI Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Michael Haeuptle <michaelhaeuptle@gmail.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2020-08-20 08:56:53 +00:00
WANGHAILIANG	f6727316d7	test/opal: add a trap for opal_revert_and_init() When opal_revert_and_init() is interrupted for some reason, the spdk_tgt still exists, but it should be killed at the same time. Change-Id: I8546d3b0b4d6a0fda1687558a664decb535ef2b4 Signed-off-by: WANGHAILIANG <hailiangx.e.wang@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3830 Community-CI: Mellanox Build Bot Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2020-08-19 07:30:04 +00:00
WANGHAILIANG	c4e1d35c13	test/opal: fix a typo for P4510 devices ID Change-Id: Ic30fe63a60d2a151a47444fa84e1c99d9b69a454 Signed-off-by: WANGHAILIANG <hailiangx.e.wang@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3829 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2020-08-19 07:30:04 +00:00
Ziye Yang	c67fb06d51	nvme/perf: Fix the hang issue of nvme perf. When using nvme perf program to test against NVME-oF target, the nvme perf program will hang if we kill the NVMe-oF target. For example, if we run the following command: 1 On the target side, start a SPDK NVMe-oF target; 2 On the initiator side, we run: ./build/examples/perf -r 'trtype:rdma adrfam:IPv4 traddr:192.168.7.55 trsvcid:4420' -q 128 -o 4096 -w randwrite -t 100 3 Then we kill the NVMe-oF target on the target, the nvme perf program will hang. For NVMe perf program, I think that we should check it in Signed-off-by: Ziye Yang <ziye.yang@intel.com> Change-Id: Ia864394acdb6e705484dd0db6f015b567eb527a7 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3774 Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2020-08-19 07:29:26 +00:00
Ziye Yang	2031f8f70d	nvme: set the error code if we cannot send keep alive command. If the transport is broken, we should set errno code in spdk_nvme_ctrlr_process_admin_completions instead of keeping silence. Signed-off-by: Ziye Yang <ziye.yang@intel.com> Change-Id: Ie73763e1329e12a8c82a0223d360991f86c39be3 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3773 Community-CI: Broadcom CI Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>	2020-08-19 07:29:26 +00:00
Seth Howell	518a1e013a	lib/nvme: make fabrics connect timeout configurable. Signed-off-by: Seth Howell <seth.howell@intel.com> Change-Id: If829d399882ef948d95673c17e5689c91386c21d Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3795 Community-CI: Mellanox Build Bot Community-CI: Broadcom CI Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2020-08-19 07:29:19 +00:00
Seth Howell	b3bb3a1bbf	lib/nvme: change timeout in wait_for_completions to usec This allows for much more granular control over the timeout. Signed-off-by: Seth Howell <seth.howell@intel.com> Change-Id: Ib23de21e60eec4207c55320579699edf284f4e16 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3794 Community-CI: Mellanox Build Bot Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2020-08-19 07:29:19 +00:00
Ziye Yang	85ff3fcea6	rdma: Do not use the poller to handle the qpair exiting. Generally, this patch did the following work: Remove the destruct poller. I think that we do not need this, the destruct poller is specially for Softwaare RoCE case. Since SoftRoCE will not have IBV_EVENT_QP_LAST_WQE_REACHED event, we will not wait the last_wqe_reached flag when srq is enabled. So we can avoid using the poller. And the purpose of this patch is to solve the coredump issue. For example, if we run rdma local test such as, e.g., test/nvmf/host/bdevperf.sh --transport=rdma The coredump reason: the qpair is freed twice. Because for RDMA transport, we do not really remove the qpair from the group if the upper layer does it. The first time is called by nvmf_rdma_destroy_drained_qpair in nvmf_rdma_poller_poll, and the second time is called by nvmf_rdma_qpair_reject_connection in in nvme_rdma_close_qpair. Since nvme_rdma_close_qpair will always called, so we need make sure that the qpair will be close after calling this function. Otherwise we will have the double free qpair. So our approach here is add a flag ("to_close")in rqpair structure and make sure the rqpair be freed after the "to_close" is set nvme_rdma_close_qpair Signed-off-by: Ziye Yang <ziye.yang@intel.com> Change-Id: I6f97debbcd29bbb7c6e3f9725907b4102a1d2892 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3661 Community-CI: Broadcom CI Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Seth Howell <seth.howell@intel.com>	2020-08-19 07:28:36 +00:00
WANGHAILIANG	ea80818901	test/bdevperf: fix an int types for offset which can result in core dumped. Originally, config->offset was defined as int type. When the capacity of SSD is very large, such as 8T(P4510), then bdev->blockcnt2 is 7814037168, config->offset is 3907018584. At this time, it exceeds the maximum int range of 2147483647 and becomes a negative number, resulting in core dumped. Debug info: config->filename is Nvme1n1. make_cli_job_config offset is -387948712. This should be: config->filename is Nvme1n1. make_cli_job_config offset is 3907018584. Change-Id: Ia83d88cc4e56d6c97a6d3fc1a2593b6fc31655b2 Signed-off-by: WANGHAILIANG <hailiangx.e.wang@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3818 Community-CI: Broadcom CI Reviewed-by: GangCao <gang.cao@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2020-08-18 15:41:04 +00:00
Shuhei Matsumoto	05cd697757	lib/iscsi: Add MaxR2TPerConnection to iSCSI options Add MaxR2TPerConnection to iSCSI global options and make it configurable by JSON RPC. Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: Ida95e5c7dac301a22520656709e1aa4d611f31ef Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3777 Community-CI: Broadcom CI Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2020-08-18 07:27:45 +00:00
Shuhei Matsumoto	5af42000c1	lib/iscsi: Make max outstanding R2Ts per connection configurable By the recent refactoring, we have no static size array for outstanding R2Ts per connection. It looks that we do not have any critical reason to prohibit us from making max outstanding R2Ts per connection configurable. There are some use cases to use large write I/O intensively (e.g. 128KB). Let such use cases change the value of max R2Ts per connection by their responsibility to do performance tuning. Maximum outstanding R2Ts per task are defined both for iSCSI target and NVMe-TCP target but maximum outstanding R2Ts per connection is unique for iSCSI target. The next patch will add the corresponding iSCSI option. Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: I4f6fd3c750a9a0a99bcf23064fe43a3389829aa9 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3776 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2020-08-18 07:27:45 +00:00
Shuhei Matsumoto	fb229e1eb2	lib/iscsi: Count R2T and Data Out PDUs into PDU pool size It is likely that the raw number 8 in the macro NUM_PDU_PER_CONNECTION means 2 * DEFAULT_MAXR2T and the raw number 2 means R2T and Data Out, but is not certain. On the other hand, the next patch will make the max number of outstanding R2Ts per connection configurable. As a preparation to the next patch, add 2 * DEFAULT_MAXR2T explicitly to the macro NUM_PDU_PER_CONNECTION. The next patch will replace DEFAULT_MAXR2T by an new variable. Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: I8a3be14d53c0abf11d7aade401386601d8fe6c11 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3783 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2020-08-18 07:27:45 +00:00
Shuhei Matsumoto	07f2d83dea	lib/iscsi: Change the type of pending_r2t from int to uint32_t Other count variables in iSCSI library have used uint32_t rather than int. Change the type of spdk_iscsi_conn::pending_r2t from int to uint32_t and add assert to check if pending_r2t is not negative. Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: I9bd296c0142b0808ae822952277c9ecc133e5f62 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3775 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2020-08-18 07:27:45 +00:00
Shuhei Matsumoto	5aaf754f81	lib/iscsi: Add MaxLargeDataInPerConnection to iSCSI options Add MaxLargeDataInPerConnection to iSCSI global options and make it configurable by JSON RPC. Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: Ibcd16da2eac64241217bedeb89a7929bbdc67871 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3756 Community-CI: Broadcom CI Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2020-08-18 07:27:45 +00:00
Shuhei Matsumoto	bc814aad67	doc/jsonrpc: Increase column width of parameter table for iscsi_set_options RPC The next patch will add an new parameter max_large_datain_per_connection to the iscsi_set_options RPC. It is longer than the column width of the parameter table. As a preparation, increase the column width of the parameter table for the iscsi_set_options RPC first. Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: Id0f27d608f9c186166cf7a132ae786ba70e398d8 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3782 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2020-08-18 07:27:45 +00:00
Shuhei Matsumoto	5b2c76f062	lib/iscsi: Make the max number of read subtasks for large read I/O configurable For some use case that there is heavy large read I/O, the performance bottleneck due to MAX_LARGE_DATAIN_PER_CONNECTION was reported. The following assumes that all I/Os are large read. Large read primary task whose I/O size is more than SPDK_BDEV_LARGE_BUF_MAX_SIZE (=64KB) is split into multiple read subtasks. spdk_iscsi_globals::MaxQueueDepth limits maximum number of outstanding read primary tasks, and MAX_LARGE_DATAIN_PER_CONNECTION (=64) limits maximum number of outstanding read subtasks. MAX_LARGE_DATAIN_PER_CONNECTION is also used to calculate PDU pool. To remove the performance bottleneck, change the macro constant MAX_LARGE_DATAIN_PER_CONNECTION to a global variable spdk_iscsi_globals::MaxLargeDataInPerConnection. We don't see any negative side effect if we set spdk_iscsi_globals::MaxLargeDataInPerConnection to 64. The use case that reported the performance issue will change the value of spdk_iscsi_globals::MaxLargeDataInPerConnection by its own responsibility. The next patch will add the value of spdk_iscsi_globals::MaxLargeDataInPerConnection to iSCSI options, and make it configurable by JSON RPC. Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: Ifc30cdb8e00d50f4d3755ff399263cf5d0b681b6 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3755 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2020-08-18 07:27:45 +00:00
Michal Berger	6733330cf5	test/common: nullglob the tmp storage path pattern This is to avoid including non-matching pattern as an actual item. Change-Id: Ie4fbb27e66efa1f56618959bb7db6f0fccfc2847 Signed-off-by: Michal Berger <michalx.berger@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3290 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com>	2020-08-17 11:56:47 +00:00
Darek Stojaczyk	49473bdc81	test/common: consolidate test params for running with external DPDK Get rid of WITH_DPDK_DIR and SPDK_RUN_INSTALLED_DPDK, introduce SPDK_RUN_EXTERNAL_DPDK which can point to a DPDK dir. It's an empty string by default. Change-Id: Iff2b3773a4614db07f4196165087a79472e02b9a Signed-off-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/867 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com>	2020-08-17 11:56:32 +00:00
Tomasz Zawadzki	52aaee43ab	changelog: update titles of releases When browsing https://spdk.io/doc/changelog.html left hand side navigation bar is missing some of the releases. This is due to missing 'title' of particular release that is signified by ":" right after version number. This patch adds missing ":" and fills out missing titles for all releases, so they show up properly on the website. Signed-off-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Change-Id: I3bcb0b2e819d311a033d78101034a7adb2c3395a Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3748 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: GangCao <gang.cao@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>	2020-08-17 08:29:15 +00:00
Tomasz Zawadzki	ba3c468647	test/cuse: do not disable exit on error Originally the idea was to disable error checking, to match output from Kernel and SPDK NVMe cuse. This includes passing test commands and failures. Any discrepancy would be caught by log output diff at the end. Flaw in this logic is that test command itself might be incorrect. We shouldn't depend on that, nor attempt to cover up some of the failures even if they occur on both interfaces. Most probable cause for this at all, was NVMe emulated in QEMU not really working with all the nvme-cli commands from this test. Since the original creation of this test, CUSE executes on physical devices (to be able to support namespace management). The behavior there is predictable and works with current test commands, thus the test exits on any error with this patch. Signed-off-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Change-Id: I086faf38b2cbbb6225935cc50d4fad14e81f1972 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3032 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Karol Latecki <karol.latecki@intel.com>	2020-08-17 08:28:56 +00:00
Seth Howell	0162da7f76	lib/nvmf: add an in_destruct flag to the ctrlr struct Helps us avoid adding a new I/O qpair while the ctrlr is being destroyed. Signed-off-by: Seth Howell <seth.howell@intel.com> Change-Id: I3bf9318b075125b9d432b885fa9f6f2f44d422d7 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3686 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2020-08-17 08:28:49 +00:00
Shuhei Matsumoto	5a92ce53df	doc/iscsi: Fix description about iSCSI login redirection By the previous patches, updating redirect portal and requesting logout are done using different RPCs. Update description to reflect this change. Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: Id1f00bde39446bc2a8de9635135136b8f0194faf Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3781 Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2020-08-17 08:26:59 +00:00
Shuhei Matsumoto	e97fd6c936	lib/iscsi: Add iscsi_target_node_request_logout RPC For the login redirection feature, the current implementation works only if a portal is redirected from an initial portal to a redirect portal. However, the login redirection feature should work even if a portal is redirected from one redirect portal to another redirect portal. A public portal group knows only a redirect portal and does not know the portal group of the redirect portal. Moreover, it is very likely that an initial portal and a redirect portal exist in different SPDK iSCSI target applications. To cover all these concerns, add an new iscsi_target_node_request_logout RPC to request connections whose portal group tag match for the target node. To cover potential use cases, make the second parameter portal group tag optional. Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: I612672490722fb22fd4eba055998b7408ab84ca5 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3780 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Broadcom CI Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com>	2020-08-17 08:26:59 +00:00
Shuhei Matsumoto	8cf5581fa9	lib/iscsi: Remove async logout request from iscsi_target_node_redirect() As written in doc/iscsi.md, typically the login redirection feature will be used in scale out iSCSI target system, which runs multiple SPDK iSCSI target applications. In scale out iSCSI target system, the initial portal, the current redirect portal, and the next redirect portal are likely to be in different SPDK iSCSI target applications. In this case, asynchronous logout request should be sent independently from the iSCSI target application which has the current redirect portal. However, we had added asynchronous logout request into the iSCSI target application which has the next redirect portal. This idea works only for the case that login is redirected from the initial portal to a redirect portal. We remove asynchronous logout request from iscsi_target_node_redirect() in this patch, and update the corresponding help documents. The next patch will add an new RPC to send asynchronous logout request to all connections to the specified portal group and the specified target. Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: Ib0ac72e8cdad7e8c64e446b7495e572fac4b5bae Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3779 Community-CI: Broadcom CI Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com>	2020-08-17 08:26:59 +00:00
Monica Kenguva	7a71ae51df	doc: Moved RAID section in alphabetical order Signed-off-by: Monica Kenguva <monica.kenguva@intel.com> Change-Id: I9db49b40a4d8276ae754f89257cbcdeae7a50914 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3683 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2020-08-17 08:25:07 +00:00
Monica Kenguva	7a660b30a4	doc: bdev module path and external bdev info update Signed-off-by: Monica Kenguva <monica.kenguva@intel.com> Change-Id: I78ec57d7c0105f5c16261342ddce1294c1a9a2d8 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3468 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>	2020-08-17 08:25:07 +00:00
Seth Howell	7fade028eb	test/check_so_deps: remove old suppressions. These suppressions are only valid for the course of one release. Now that we have started with a new release, remove all of the suppressions. Signed-off-by: Seth Howell <seth.howell@intel.com> Change-Id: I1a36bc49bb3a16f98de870cc06e56dbfa75d72d6 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3722 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2020-08-17 08:24:46 +00:00

1 2 3 4 5 ...

13098 Commits