numam-spdk

Author	SHA1	Message	Date
Tomasz Zawadzki	c2bd95ee54	vbdev_compress: reduce MAX_NUM_QP This is a workaround for #2338. Ideally the fix should remove this define and use the number of cores from the application. With a large number of QAT devices the following error can be observed: compdev_isal_create(): ISA-L library version used: 2.30.0 vbdev_compress.c: 358:vbdev_init_compress_drivers: NOTICE: created virtual PMD compress_isal EAL: memzone_reserve_aligned_thread_unsafe(): Number of requested memzone segments exceeds RTE_MAX_MEMZONE RING: Cannot reserve memory isal_comp_pmd_qp_setup(): Failed to create unique name for isal compression device vbdev_compress.c: 268:create_compress_dev: NOTICE: FYI failed to setup a queue pair on compressdev 48 with error 4294967295 so limiting to 84 qpairs Signed-off-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Change-Id: I689ab6bda991e3864da9f4135f57849e3c0c3986 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/11179 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-01-20 20:08:12 +00:00
Tomasz Zawadzki	ca89b502aa	vbdev_crypto: skip handling QAT_ASYM devices Historically only QAT_SYM devices for crypto were supported. The DPDK submodule explicitly disabled QAT_ASYM compilation. For details please see: https://review.spdk.io/gerrit/c/spdk/dpdk/+/9217 Starting with DPDK 21.11 QAT_SYM and QAT_ASYM were merged together, so it is no longer possible to disable QAT_ASYM as before. As vbdev_crypto didn't make use of it, this driver is now skipped in preparation for the update to DPDK 21.11. Signed-off-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Change-Id: Ib606a4b450cd224d96bc21a64384297b2182967c Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/11178 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-01-20 20:08:12 +00:00
GangCao	6b7e9d0af2	Lib/iSCSI: add the LUN Resize support From SAM-4, section 5.13 (Sense Data); “When a command terminates with a CHECK CONDITION status, sense data shall be returned in the same I_T_L_Q nexus transaction (see 3.1.50) as the CHECK CONDITION status. After the sense data is returned, it shall be cleared except when it is associated with a unit attention condition and the UA_INTLCK_CTRL field in the Control mode page (see SPC-4) contains 10b or 11b.” SPDK does not set UA_INTLCK_CTRL to 10b or 11b, so we set the unit attention condition immediately against a single IO or Admin IO after reporting it via a CHECK CONDITION. Once the failed IO received at iSCSI initiator side, it will be retried. In the case of resize operation, if there is no IO from iSCSI initiator side, the unit attention condition will be delayed to report until the first IO is received at the iSCSI target side. Meanwhile, we clear the resizing (newly added) flag on our SCSI LUN structure after first time we report the resize unit attention condition. The kernel initiator won’t actually resize the corresponding block device automatically. It will report a uevent, and then you can set up udev rules to trigger a rescan. SPDK iSCSI initiator will automatically report the LUN size change. Change-Id: Ifc85b8d4d3fbea13e76fb5d1faf1ac6c8f662e6c Signed-off-by: GangCao <gang.cao@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/11086 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Xiaodong Liu <xiaodong.liu@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com>	2022-01-20 07:56:23 +00:00
Jim Harris	e0415f1720	bdev/nvme: set default bdev_retry_count to 3 Now that we have a much more robust retry framework, set the default bdev_retry_count to 3. Users can still override this default with the bdev_nvme_set_options RPC as before. This ensures that by default, we will retry I/O when possible. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: I045bf4969d02be32b951e72a148ce6b6e251dec1 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/11107 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Dong Yi <dongx.yi@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2022-01-19 08:55:46 +00:00
Shuhei Matsumoto	a9fd7f0ba6	bdev/nvme: Add nvme_ctrlr's state string to the bdev_nvme_get_controllers RPC The state of a nvme_ctrlr can be more fine grained than a boolean and such state gives more information to end users for debug or root cause analysis. Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Change-Id: I3e2459f449e2dac73f04b155e38b696495f1a335 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10183 Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com>	2022-01-17 14:25:15 +00:00
Shuhei Matsumoto	80e81273e2	bdev/nvme: Do not use ctrlr for I/O submission if reconnect failed repeatedly If ctrlr_loss_timeout_sec is set to -1, reconnect is tried repeatedly indefinitely, and I/Os continue to be queued. This patch adds another option fast_io_fail_timeout_sec, a flag fast_io_fail_timedout to nvme_ctrlr. If the time fast_io_fail_timeout_sec passed after starting reset, set fast_io_fail_timedout to true not to use the path for I/O submission. fast_io_fail_timeout_sec is initialized to zero as same as ctrlr_loss_timeout_sec and reconnect_delay_sec. The name of the parameter follows the famous DM-multipath, its fast_io_fail_tmo. Change-Id: Ib870cf8e2fd29300c47f1df69617776f4e67bd8c Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10301 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2022-01-17 14:25:15 +00:00
Shuhei Matsumoto	ae4e54fdc3	bdev/nvme: Retry reconnecting ctrlr after seconds if reset failed Previously reconnect retry was not controlled and was repeated indefinitely. This patch adds two options, ctrlr_loss_timeout_sec and reconnect_delay_sec, to nvme_ctrlr and adds reset_start_tsc, reconnect_is_delayed, and reconnect_delay_timer to nvme_ctrlr to control reconnect retry. Both ctrlr_loss_timeout_sec and reconnect_delay_sec are initialized to zero. This means reconnect is not throttled, as was the case before this patch. A few more changes are added. Change nvme_io_path_is_failed() to return false if reset is throttled even if nvme_ctrlr is resetting or is to be reconnected. spdk_nvme_ctrlr_reconnect_poll_async() may continue returning -EAGAIN indefinitely. To detect this exceptional case, use ctrlr_loss_timeout_sec. Not only ctrlr reset but also non-multipath ctrlr failover is controlled. So we need to include path failover in ctrlr reconnect. When the active path is removed and switched to one of the alternative paths, if ctrlr reconnect is scheduled, connecting to the alternative path is left to the scheduled reconnect. If a ctrlr reset or reconnect fails and the retry is scheduled, switch the active path to one of the alternative paths. Restore unit test cases removed in the previous patches. Change-Id: Idec636c4eced39eb47ff4ef6fde72d6fd9fe4f85 Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10128 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Monica Kenguva <monica.kenguva@intel.com>	2022-01-17 14:25:15 +00:00
Shuhei Matsumoto	f85370b168	bdev/nvme: Use enum to select operations after reset complete This is a clean up as a preparation to the following patches. Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Change-Id: Ib8bc90e17f52086d4e887463e04f65273bb1079b Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/11068 Community-CI: Mellanox Build Bot Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>	2022-01-17 14:25:15 +00:00
Shuhei Matsumoto	962c4c3800	bdev/nvme: Fix a degradation that I/O gets queued infinitely We noticed a difference between SPDK 21.10 and the latest master in a test. The simplified scenario is as follows: 1. Start SPDK NVMe-oF target 2. Run bdevperf for the target with -f parameter to suppress exit on failure. 3. Kill the target after I/O started. With SPDK 21.10, bdevperf retries failed I/Os and exits after the test time is over. With the latest SPDK master, bdevperf hangs and does not exit even after the test time is over. The cause was as follows: reset ctrlr is repeated very quickly (once per 10ms by default) and hence I/Os were queued infinitely because nvme_io_path_is_failed() returned false if nvme_ctrlr is resetting. We should queue I/O when nvme_ctrlr is resetting only if reset is throttled and fail-fast for the repeated failures is supported. Hence in this patch, fix the degradation and remove the related unit test cases. Reported-by: Evgeniy Kochetov <evgeniik@nvidia.com> Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Change-Id: I4047d42dc44488a05264c6a841d101a7c371358b Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/11062 Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2022-01-17 14:25:15 +00:00
Tan Long	a79af5e7a5	bdev/rbd: Support config_param and config_file simultaneously for rbd_register_cluster config_param and config_file do not conflict when specifying rados configurations, so supporting both at once is more reasonable. Therefore, after this patch, users can choose one of three ways: config_param, config_file + key_file, or config_param + config_file + key_file. Signed-off-by: Tan Long <tanl12@chinatelecom.cn> Change-Id: Ide17af72c4965df1e6541f4f50d4fa5309865486 Signed-off-by: Tan Long <tanl12@chinatelecom.cn> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10679 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Reviewed-by: GangCao <gang.cao@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Xiaodong Liu <xiaodong.liu@intel.com> Reviewed-by: Dong Yi <dongx.yi@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2022-01-17 09:44:56 +00:00
Tan Long	20c8a3b8db	bdev/rbd: Add key_file to the rbd_register_cluster RPC In project practice, config_file and key_file are often used to connect to a rados cluster, config_file includes "mon_host" and other rados configurations like "rbd_cache", and key_file includes the secret key and the access authority to each pool for current user. This patch adds key_file option, user can specify config_file and key_file or only config_param to connect rados cluster. This will make it much more flexible for users with his/her convenience. Signed-off-by: Tan Long <tanl12@chinatelecom.cn> Change-Id: I6b49aad70b578bdeb3ac8ea9ca0fcbd931582025 Signed-off-by: Tan Long <tanl12@chinatelecom.cn> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10485 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Reviewed-by: GangCao <gang.cao@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Xiaodong Liu <xiaodong.liu@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2022-01-17 09:44:56 +00:00
Changpeng Liu	31d684d759	bdev_malloc: exit early in case of no acceleration task If acceleration tasks are exhausted, then we can exit the submission loop earlier, also print number of IOVs for each R/W request. Change-Id: Ia98ed43b0bb2be229b7c0054f3ade0ad39337b09 Signed-off-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10836 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Reviewed-by: Konrad Sztyber <konrad.sztyber@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>	2022-01-14 08:35:32 +00:00
Jim Harris	932ee64b8f	bdev/nvme: add bdev_nvme_stop_discovery RPC This RPC will stop the specified discovery service, including detaching from any controllers that were attached as part of that discovery service. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: I9222876457fc45e1acde680a7bd1925917c22308 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10832 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com>	2022-01-12 08:20:23 +00:00
Jim Harris	f2bf7e9727	bdev/nvme: connect to discovered controllers Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: I3b05ab3d22851d433e3d0573e65943c4a30b9aa4 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10695 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Dong Yi <dongx.yi@intel.com>	2022-01-12 08:20:23 +00:00
Konrad Sztyber	2bb8a83e0a	bdev/malloc: complete requests through poller Requests that are completed immediately (i.e. those not using the accel engine) are now queued and their completion is delayed to the completion poller. It ensures that they're not completed from the context of a submission, which gets rid of an spdk_thread_send_msg() call. It significantly improves performance on some workloads. For instance, 4k zcopy reads (queue depth 128) on an malloc bdev exposed through NVMe/TCP went from 204k IOPS to 485k IOPS. Signed-off-by: Konrad Sztyber <konrad.sztyber@intel.com> Change-Id: I196f55fc07d167f1ed117d2430e9c37f9d05f70d Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10805 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2022-01-12 08:20:11 +00:00
Konrad Sztyber	0f0c16a76a	bdev/malloc: remove bdev_malloc_(reset\|flush) The only thing these functions were doing was completing the IO, so it could just be inlined. Signed-off-by: Konrad Sztyber <konrad.sztyber@intel.com> Change-Id: I5fbd9df763dd68953b1bda9c7752c57ef9ee5dd6 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10804 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2022-01-12 08:20:11 +00:00
Konrad Sztyber	0a49fbd241	bdev/malloc: completion poller This poller is registered on each IO channel and can be used to schedule asynchronous completion of a request. This can be especially useful for requests that can be completed immediately. For now, nothing enqueues the requests to be completed through this poller - this will be changed in the following patch. Signed-off-by: Konrad Sztyber <konrad.sztyber@intel.com> Change-Id: If6b26541907bb46402fc0904216bff74dad57b88 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10803 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2022-01-12 08:20:11 +00:00
Konrad Sztyber	fcd5f60144	bdev/malloc: malloc IO channel It'll allow the malloc bdev to store per-thread data. For now, it's only used to keep the pointer to the accel library's IO channel, more fields will be added in subsequent patches. Signed-off-by: Konrad Sztyber <konrad.sztyber@intel.com> Change-Id: I604a38877ae8d6075b911f5a484d1793d4bc2ddb Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10802 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2022-01-12 08:20:11 +00:00
Konrad Sztyber	d1024c4bc6	bdev/delay: zero-copy support This patch adds support for zero-copy operations in the delay bdev. They use the same delay values as regular IO operations: - (avg\|p99)_read_latency for zcopy_start with populate=true, - (avg\|p99)_write_latency for zcopy_end with commit=true. All other zcopy operations (e.g. zcopy_start with populate=false) are not delayed. Signed-off-by: Konrad Sztyber <konrad.sztyber@intel.com> Change-Id: I8b32c1d99f9f2b36b16617122881ea95d02ecc87 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10798 Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2022-01-12 08:20:11 +00:00
Shuhei Matsumoto	6ac23b3e60	bdev/nvme: Clear I/O path cache if a path whose ns is optimized is restored If a path whose namespace is optimized is restored, the corresponding I/O path cache should be cleared and the path should be chosen as the optimal path. This bug was found by a system test. Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Change-Id: Ibc3983dbff3418adb090a09df32c2a92a8910d05 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/11004 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Monica Kenguva <monica.kenguva@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2022-01-10 22:18:46 +00:00
Shuhei Matsumoto	3308bdf1b9	bdev/nvme: Rename functions for a full ctrlr reset sequence Rename a few functions for a full ctrlr reset sequence to clarify what we do and make the following patches easier. Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Change-Id: I051e3ab68c3cd77fd6040a2d069d50a700123ae6 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10920 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2022-01-10 22:18:46 +00:00
Shuhei Matsumoto	521a9bb22c	bdev/nvme: Fix race between failover and add secondary trid We sort secondary trids to avoid using disconnected trids for failover. However the sort had a bug. This bug was found by running test/nvmf/host/multipath.sh in a loop. Verify the fix by adding unit test. Fixes #2300 Signed-off-by: Shuhei Matsumoto <shuheimatsumoto@gmail.com> Change-Id: I22b0ede4d2ef98b786c3e0d1f5337a2d568ba56d Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10921 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com>	2022-01-10 22:18:46 +00:00
Jim Harris	b68f2eeb0b	bdev_nvme: add bdev_nvme_start_discovery RPC This patch adds the framework for a discovery service in the bdev/nvme module. Users can specify an IP/port of a discovery service. The bdev/nvme module will connect to a discovery controller, get the discovery log page, and then register for AERs. It will connect to each subsystem specified in the initial log page. AER completions will trigger fetching the log page again, at which point new subsystems will be connected to, or removed subsystems will be detached. This patch does the following: * Adds the new start_discovery RPC * Connects to the discovery controller * Gets the discovery log page * Registers for AERs * Detach from discovery controllers at shutdown Subsequent patches in this series will: * Connect to subsystems listed in discovery log page * Detach from subsystems that were listed in earlier discovery log pages but subsequently removed * Add a stop_discovery RPC Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: I54bfa896a48c5619676f156b5ea9f2d1f886c72f Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10694 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com>	2022-01-10 15:23:39 +00:00
Alexey Marchuk	833a5c9d2b	bdev/nvme: Remove ctrlr_ch from group's list in error case If qpair creation failed, ctrlr_ch remains in group->ctrlr_ch_list but memory for ctrlr_ch is freed. Next attempt to get ctrlr's io channel will modify data in already freed memory and may corrupt another allocation. Signed-off-by: Alexey Marchuk <alexeymar@mellanox.com> Change-Id: I85002f2e6ac86a0ffda6dabfa57e79b59074fb5a Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10840 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Shuhei Matsumoto <shuheimatsumoto@gmail.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com>	2021-12-27 08:43:03 +00:00
Alexey Marchuk	17e9f58f1f	bdev/nvme: Handle failed IO qpair creation It is possible that the application calls get_io_channel during nvme controller reset. In that case the IO qpair won't be created and the application will get a NULL pointer. It is possible to repeat get_io_channel later but there is no such indication for the application, so it can't distinguish between a real failure and a "try again" case during controller reset. This patch ignores the IO qpair creation error if the controller is resetting. The IO qpair will be created when reset completes. Signed-off-by: Alexey Marchuk <alexeymar@mellanox.com> Change-Id: Id39202f5a6878453ff54e35df91d5dc49a5f046a Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10828 Community-CI: Mellanox Build Bot Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Shuhei Matsumoto <shuheimatsumoto@gmail.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2021-12-27 08:43:03 +00:00
Jim Harris	ef8f297ba4	bdev_nvme: allow bdev_nvme_create() to take a NULL names arg We will want to use bdev_nvme_create() to attach to controllers identified through discovery. In this case, we won't be reporting bdev names back to an RPC caller, so there's no need to allocate an array of names to be filled out since they won't be used. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: Ia386d034df2c2d5a60f9aa18338ba415ec03d763 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10689 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>	2021-12-21 08:15:47 +00:00
Jim Harris	986f74aead	bdev_nvme: split fini ctrlr destruction to separate function We will need to add another step in the fini path for stopping discovery pollers, so this patch prepares for that. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: Ifecbbac60262f3aae7f7a7ced09b7a600df7c2e8 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10590 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>	2021-12-21 08:15:47 +00:00
Konrad Sztyber	54efe6552b	bdev/delay: add missing write_object_end in config_json One of the objects wasn't enclosed with spdk_json_write_object_end(), causing the resulting configuration to be broken. Signed-off-by: Konrad Sztyber <konrad.sztyber@intel.com> Change-Id: Ib0311e002e43d4ad01c61feb6af54cb4212b477b Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10755 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2021-12-20 18:14:39 +00:00
Shuhei Matsumoto	215518069a	bdev/nvme: nvme_ctrlr_create() gets prchk_flags from nvme_async_probe_ctx Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: Id3deca8e0aba23299347a6aee6f0f44ee683556e Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10555 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com>	2021-12-08 08:31:24 +00:00
Shuhei Matsumoto	619acff501	bdev/nvme: Delete unused nvme_probe_ctx We set cb_ctx to NULL when calling spdk_nvme_probe_async(). It looks that nvme_probe_ctx has not been used anywhere for a long time. nvme_probe_ctx is not public data structure. Remove nvme_probe_ctx to simplify the code and make the following patches easier. Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: I7dd5f970a7fde1c9c189fae3c8f28f84d7aed991 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10554 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com>	2021-12-08 08:31:24 +00:00
Shuhei Matsumoto	bf88d1d4a6	bdev/nvme: Factor out the failover trid operation into a helper function This refactoring will be helpful for the following patches to unify ctrlr reset and failover and failover trid also when reconnecting ctrlr. Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: I4623a5dd310ac7516c270ccd3b0541c27cc880d8 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10443 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>	2021-12-08 08:31:24 +00:00
Shuhei Matsumoto	ffabc8ac29	bdev/nvme: Inline bdev_nvme_failover_start() into bdev_nvme_failover() Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: I70593de284f5623db9e30d94b03b6576bd6ca29b Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10442 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>	2021-12-08 08:31:24 +00:00
Shuhei Matsumoto	696ad465d7	bdev/nvme: Remove the failover_in_progress flag from struct nvme_ctrlr The failover_in_progress flag is used to decide the return value of bdev_nvme_failover(). bdev_nvme_delete() calls bdev_nvme_failover() with remove=true to remove nvme_ctrlr->active_path_id. However bdev_nvme_failover() returns zero if nvme_ctrlr->failover_in_progress is true. bdev_nvme_failover() may return zero even if it does not remove nvme_ctrlr->active_path_id. The following will be better. bdev_nvme_failover() returns -EBUSY if nvme_ctrlr->resetting is true, and the caller repeats calling bdev_nvme_failover() until the target trid becomes alternative path or bdev_nvme_failover() returns zero. To do that, the failover_in_progress flag is not necessary any more. Removing the failover_in_progress will also simplify the following patches to unify ctrlr reset and failover. Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: I57ab944beb1d06ea4def144c81c69705860de35f Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10441 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>	2021-12-08 08:31:24 +00:00
Shuhei Matsumoto	74f18d6a07	bdev/nvme: Factor out checking if nvme_ctrlr can be unregistered Checking if nvme_ctrlr can be unregistered is not so simple and a few changes will be added. So factoring out the check into a helper function will be valuable. Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: I310c7e3ad2dae9583df4db575d342c2cb111f3f3 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10461 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>	2021-12-08 08:31:24 +00:00
Shuhei Matsumoto	7329c1e683	bdev/nvme: Refine and factor out checking if nvme_ctrlr is available or failed When a I/O or admin passthrough failed, if the corresponding nvme_ctrlr is not available, we should failover to another path. When no path was found, if there is at least one nvme_ctrlr which is not failed, we should wait until it is recovered. We should improve error recovery not only for multipath (multipath is "multipath") but also for failover (multipath is omitted or "failover"). To do this easily, clarify the conditions of availability and failure of nvme_ctrlr and realize them by helper functions. Use new helper functions for other cases to improve readability too. Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: I716731f72811d2ec4dfc91f9eadb191d75739af6 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10381 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>	2021-12-08 08:31:24 +00:00
Shuhei Matsumoto	7cc66c0ab1	bdev/nvme: Check if ns can be shared when configuring multipath We had not checked the bit 0 of the Namespace Multipath I/O and Namespace Sharing Capabilities (NMIC) field in the Identify Namespace data structure. If the bit 0 of the NMIC is zero, it is likely that namespaces are not identical. We should check if the value of the NMIC first, and do it in this patch. Additionally, it is not usual if the bit 0 of the CMIC and the bit 0 of the NMIC do not match. So in unit tests rename the parameter multi_ctrlr by multipath for ut_attach_ctrlr() and use it for the value of the NMIC. Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: I6aa7cbcc99be2507dbf18930f7b585a9ea7d0f90 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10380 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>	2021-12-08 08:31:24 +00:00
Shuhei Matsumoto	819fd52907	bdev/nvme: Delete already created qpairs if connect qpair failed while resetting ctrlr bdev_nvme_reset() deletes all qpairs, resets a ctrlr, and then creates all qpairs. Any qpair may fail to be created, and then the reset request may fail. However, already created qpairs were left. Let's delete the already created qpairs and then fail the reset request. This will make it easier for us to control reconnect, delay reconnect by a few seconds, or stop reconnect after repeated failures and then delete ctrlr. Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: I414e2281b4bf0cbd1cf461d8fc64a22f43d26d13 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/9896 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>	2021-12-08 08:31:24 +00:00
Shuhei Matsumoto	8afa746b4d	bdev/nvme: Use new APIs in a reset ctrlr sequence Replace the spdk_nvme_ctrlr_reset_async() and spdk_nvme_reset_poll_async() calls by the spdk_nvme_ctrlr_disconnect(), spdk_nvme_ctrlr_reconnect_async(), and spdk_nvme_ctrlr_reconnect_poll_async() calls in a reset ctrlr sequence. spdk_nvme_ctrlr_disconnect() can fail if ctrlr is already resetting or removed. But neither case is possible. Reset is controlled and the callback to the hot remove is called when the ctrlr is hot removed. So we assume spdk_nvme_ctrlr_disconnect() always succeeds. Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: I1299e198597b2a2110f80b9a868e2dae015682ee Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10092 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2021-12-08 08:31:24 +00:00
Adam Aronov	d39cbc1374	rpc: added num_io_queues parameter to bdev_nvme_attach_controller Fixes issue #2243 Signed-off-by: Adam Aronov <aaronov@infinidat.com> Change-Id: Ia8739102dbff9f775abf8e91fa47ccf81533d2c0 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10439 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2021-12-06 08:35:03 +00:00
Shuhei Matsumoto	1d524bc384	bdev/nvme: Remove unnecessary error check from bdev_nvme_reset_ctrlr() spdk_for_each_channel() always passes status=0 to its completion callback if each channel completes the requested function successfully. bdev_nvme_reset_destroy_qpair() always succeeds. Hence bdev_nvme_reset_ctrlr() does not have to check if the passed status is not zero. The following patches will aggregate multiple flags into a single state for nvme_ctrlr. This change will simplify these. Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: I1c30c9b20c96886516029e69e90dc23d777a69b4 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10077 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>	2021-12-01 09:20:09 +00:00
Shuhei Matsumoto	f9fba507fe	bdev/nvme: Redirect the reset ctrlr operation into nvme_ctrlr->thread In the following patches, we want to retry reconnect if reconnect failed in a reset ctrlr sequence but we want to delay the retry. While we wait for the delayed retry, we want to quiesce ctrlr completely. As part of quiesce ctrlr operations, we want to pause adminq poller but we need to do it on the nvme_ctrlr->thread. If a reset ctrlr sequence runs on the nvme_ctrlr->thread, we can avoid redirecting the pending destruct request at completion too. So we redirect the reset ctrlr sequence into the nvme_ctrlr->thread. Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: I538b962e2a7b5cf00ebbac2a1e888482ddeeee61 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10075 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>	2021-12-01 09:20:09 +00:00
Tan Long	83996cc3b9	bdev/rbd: Fix the decode error in bdev_rbd_register_cluster Incorrect decode function used for the param "config_file" in rpc_bdev_rbd_register_cluster Signed-off-by: Tan Long <tanl12@chinatelecom.cn> Change-Id: I6286c5d0d8396a1b548095975924087ba4ee92d2 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10444 Community-CI: Mellanox Build Bot Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com>	2021-11-30 09:08:07 +00:00
Josh Soref	1960ef167a	spelling: module Part of #2256 * calculated * changing * deferred * deinitialize * initialization * particular * receive * request * retrieve * satisfied * succeed * thread * unplugged * unregister Change-Id: I13e38f9160cb1a15a87cb5974785a34604124fa3 Signed-off-by: Josh Soref <jsoref@gmail.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10406 Community-CI: Mellanox Build Bot Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com>	2021-11-30 09:05:32 +00:00
Shuhei Matsumoto	50b10bc20e	bdev/nvme: bdev_nvme_reset_io() redirect to the orig_thread at completion In the following patches, bdev_nvme_reset() will execute the reset ctrlr operation on the nvme_ctrlr->thread until completion as bdev_nvme_admin_passthru() does. Hence change the callback bdev_nvme_reset_io_continue() to redirect to the orig_thread by using bio. Furthermore, use bio->cpl.cdw0 to store the completion status of the reset processing. bdev_nvme_reset() does not use bio->cpl. Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: I361cc44494190ba83ad6e360788d78851416c46c Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10074 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>	2021-11-23 08:46:36 +00:00
Shuhei Matsumoto	6e0a60aecd	bdev/nvme: reset_controller RPC redirect to the orig_thread at completion In the following patches, bdev_nvme_reset() will execute the reset ctrlr operation on the nvme_ctrlr->thread until completion as bdev_nvme_admin_passthru() does. Hence change the callback rpc_bdev_nvme_reset_controller_cb to redirect to the orig_thread by using a dynamically allocated context. Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: I8ee61857ac034024d00190875740a675ef1db8b0 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10073 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>	2021-11-23 08:46:36 +00:00
Shuhei Matsumoto	b4447abf70	bdev/nvme: Retry failed admin passthru up to retry_count times This patch supports admin passthrough retry when we get any error with DNR=0 but ABORTED_BY_REQUEST up to retry_count times. Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: I1bf29570791fdbe8651fa70c4c8685bb740fb86b Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/9944 Community-CI: Mellanox Build Bot Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2021-11-23 08:46:36 +00:00
Shuhei Matsumoto	a9a86a14c1	bdev/nvme: Retry admin passthru immediately if it got ctrlr path error This patch supports admin passthrough retry when we get ctrlr path error at completion. Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: Ice0045b84054ec66a9db9ef23e21786d2c082b1d Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/9943 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>	2021-11-23 08:46:36 +00:00
Shuhei Matsumoto	35a2f4e22e	bdev/nvme: Retry admin passthru a second later if any ctrlr may become available When resetting ctrlr, adminq is disconnected first. If adminq is disconnected, admin passthrough request is rejected with -ENXIO. But resetting ctrlr may succeed. If resetting ctrlr succeeds, adminq is connected again, and admin passthrough request will be submitted successfully. On the other hand, if ctrlr is failed, admin passthrough request is rejected with -ENXIO. But when resetting ctrlr, ctrlr is set to unfailed. Hence bdev_nvme_admin_passthru() skips any ctrlr which is resetting or failed, and calls bdev_nvme_admin_passthru_complete() with -ENXIO if no available ctrlr is found. bdev_nvme_admin_passthru_complete() queues admin passthrough request and retry it one second later if ctrlr is resetting. Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: Ic748dc4faf29ebf717ae5c29dcf7c55fe2ea9243 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/9942 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>	2021-11-23 08:46:36 +00:00
Alexey Marchuk	5e1e850bdc	bdev/malloc: Add optimal IO boundary Allow to specify optimal IO boundary for malloc bdev, it can be used to test split of IO requests on generic bdev layer Signed-off-by: Alexey Marchuk <alexeymar@mellanox.com> Change-Id: Ic3529dc00cf852ea5cf40d0553d846a698fff6c7 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10068 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Konrad Sztyber <konrad.sztyber@intel.com>	2021-11-18 08:21:43 +00:00
GangCao	3021eb3ce3	module/bdev: move the NULL check before dereference To fix the Klocwork issue. Change-Id: I9512f1303890b00964a902e28df2395856d3ed32 Signed-off-by: GangCao <gang.cao@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10200 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Dong Yi <dongx.yi@intel.com>	2021-11-17 10:58:30 +00:00

882 Commits