numam-spdk

Author	SHA1	Message	Date
Seth Howell	58216dd07e	lib/nvme: fix mem leak in req submit. Signed-off-by: Seth Howell <seth.howell@intel.com> Change-Id: If64c06177605a8f57d87ba22b86fe58ddebd6f7a Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3921 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Michael Haeuptle <michaelhaeuptle@gmail.com> Reviewed-by: Paul Luse <paul.e.luse@intel.com>	2020-09-02 07:38:38 +00:00
GangCao	4803dc36fc	Blobfs: add an assert for the memory allocation Change-Id: Ie9c75206b61242c71ce6f4f408bbdd44f9633077 Signed-off-by: GangCao <gang.cao@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3998 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2020-09-02 07:38:23 +00:00
Alexey Marchuk	3aaf8e0a0b	nvmf: Stop pg poller when all qpairs are destroyed We stopped the poller to early, so we were not able to reap all completions on ibv CQ, so RDMA qpair was not freed. This patch stops the poller when all references to poll group are released (all qpairs are destroyed) Fixes #1578 Change-Id: I15c1697db13aef9da7567c7312476306c3ee1d62 Signed-off-by: Alexey Marchuk <alexeymar@mellanox.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3962 Reviewed-by: Ziye Yang <ziye.yang@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Community-CI: Mellanox Build Bot Community-CI: Broadcom CI Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2020-09-01 07:51:45 +00:00
Ziye Yang	03aa8995e9	lib/sock: Fix the coredump issue in sock_map_realese When tested on Linux 5.8 kernel and configure spdk with debug mode (--enable-debug), and test SPDK NVMe-oF tcp transport, and we see the coredump in sock_map_release with the following statements: assert(entry->ref > 0); After debug, I can confirm that the placement_id value got from the following function (sock->net_impl->get_placement_id) changes. It means that: When the sock is added into the poll group (spdk_sock_group_add_sock), we get the placement_id (named as Value(begin)); and when the sock is removed from the poll group (spdk_sock_group_remove_sock), we get the plaemednt_id on the same sock (named as Vaule(end)). I found that Value(begin) ! = Value(end). So our solution is for a socket, we will get placement_id once, then we can solve this issue. Signed-off-by: Ziye Yang <ziye.yang@intel.com> Change-Id: Ia1d0cf39247b53410260561aca5af38130cc0abb Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3983 Community-CI: Mellanox Build Bot Community-CI: Broadcom CI Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>	2020-09-01 07:51:27 +00:00
Jim Harris	3fe748f728	blob: move bs_release_cluster We will use it earlier in this file in a future patch. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: I554f2073185d466bd0b4e98bdeec721f763c1b44 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3969 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com>	2020-09-01 07:50:33 +00:00
Jim Harris	8ba7450e8c	blob: simplify check for available cluster count Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: I56a9ca114b84ded4ac23a9eaacadaa6df8f031cc Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3968 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com>	2020-09-01 07:50:33 +00:00
Jim Harris	f5b949d5fd	blob: simplify free cluster counting in inflate path Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: I5716290476d4abdf8646a36206f07acb5b2dd4ec Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3967 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com>	2020-09-01 07:50:33 +00:00
Jim Harris	2d87587fe5	blob: claim clusters inline during initialization When claiming clusters as part of blobstore initialization or recovery, just call spdk_bit_array_set directly rather than going through the bs_claim_cluster function. We will be modifying how runtime cluster allocation works so need to separate the two use cases. This code is very small so inlining it has minimal code impact. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: Iaaa1c817e57b4a2eea62eb4683407364bac1fcc0 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3966 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com>	2020-09-01 07:50:33 +00:00
Jim Harris	b7d508f5e5	blob: use spdk_bit_array_load/store_mask functions These functions were added during FTL development and are more efficient than the roll-your-own implementations blobstore had previously. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: Ie09e5c305e6e171af0258e805f2aac3b88822b5e Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3965 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com>	2020-09-01 07:50:33 +00:00
Shuhei Matsumoto	70a2063998	lib/iscsi: Remove a DEBUG_LOG which shows error when target is not found This DEBUG_LOG causes misunderstanding to us. Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: I7f538a796fa025fe79210b26225650bd11065511 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3938 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com>	2020-08-31 15:22:44 +00:00
Vitaliy Mysak	d267d0e874	lib/log: add RPC to toggle timestamps Allow toggling log timestamps on and off by adding new RPC call. Change-Id: I34c84bf89fae352ade266fbf7fd20594ff67bced Signed-off-by: Vitaliy Mysak <vitaliy.mysak@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/2024 Community-CI: Mellanox Build Bot Community-CI: Broadcom CI Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>	2020-08-31 07:46:36 +00:00
yidong0635	b2c86b35db	vhost/vhost_scsi: Fix coredump issue for non-scsi device. Remove assert and add exit codes instead. That in non-debug mode, these could lead coredump. We don't want the vhost target be crashed after recieved invalid commands. fixes issue: #1575 Signed-off-by: yidong0635 <dongx.yi@intel.com> Change-Id: Ifef6d8f9c32150213bc2c80787e92d428d4c49c3 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3951 Community-CI: Mellanox Build Bot Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: JinYu <jin.yu@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2020-08-28 07:19:27 +00:00
Vitaliy Mysak	ea2000cf56	event: disable cpumask check cpumask can be changed by spdk_thread_set_cpumask() during the time that event takes before it arrives on _schedule_thread() function, which would make the function assert(false), even though that is ~ok~. Currently, that can happen right after thread is created or between two successive calls to spdk_thread_set_cpumask(). But most importantly, it will constantly happen if we introduce rescheduler. This patch just disables the check for now. Change-Id: Ie6dfe22d6eff2c908c367d1311436cc6769a6960 Signed-off-by: Vitaliy Mysak <vitaliy.mysak@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3905 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2020-08-27 08:49:18 +00:00
Shuhei Matsumoto	767df1289a	lib/iscsi: Increment StatSN when sending asynchronous logout request Without this increment, the following warning messages were collected when processing the corresponding logout request. iscsi.c:4426:iscsi_pdu_hdr_handle: DEBUG: opcode 6 * iscsi.c:4398:iscsi_update_cmdsn: DEBUG: StatSN(5) advanced iscsi.c:2519:iscsi_pdu_hdr_op_logout: DEBUG: reason=0, ITT=74000030, cid=0 iscsi.c:2531:iscsi_pdu_hdr_op_logout: DEBUG: CmdSN=92, ExpStatSN=5, StatSN=4, ExpCmdSN=92, MaxCmdSN=155 * iscsi.c:2546:iscsi_pdu_hdr_op_logout: DEBUG: StatSN(5/4) might have dropped iscsi.c:4506:iscsi_pdu_payload_handle: DEBUG: opcode 6 Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: Iee2677317dd8974abdb69141eed7fb5e538933ba Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3940 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ziye Yang <ziye.yang@intel.com>	2020-08-27 08:37:00 +00:00
Shuhei Matsumoto	438fa38e4a	lib/iscsi: Check if data comes really after processing logout request When the PDU receive handler processes the header of the logout request PDU, conn->is_logged_out is set to true. However, if conn->is_logged_out is true, conn->pdu_recv_state is set to ERROR before the PDU receive handler completes processing the logout request PDU. Then if conn->pdu_recv_state is ERROR, conn->state is set to EXITING after returning from the PDU receive handler. Response PDUs are sent asynchronously now and may not be sent even after returning from the PDU receive handler. On the other hand, outside the PDU receive handler, the current connection is closed if conn->state is EXITING. Hence logout response PDU may not be sent to the initiator. For the case that the initiator logs out and then reconnects when receiving asynchronous logout request, missing logout response is critical because initiator waits until receiving logout request and gets timeout. This patch moves the check if PDU comes after logout to the place just after getting a PDU header. At the new location, data segment of the PDU is not received yet. But logout request PDU does not have data segment and initiator will not send additional PDU after sending logout request PDU, and by this patch, iSCSI target will continue to stop receiving any new PDU after processing logout request. Furthermore, even if there is any remaining data in the kernel buffer, the kernel will discard or flush it when closing the socket. Fixes issue #1571 Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: I9554f4d54f3db80bf86abd6bffe81bac8c234531 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3928 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ziye Yang <ziye.yang@intel.com>	2020-08-27 08:37:00 +00:00
Shuhei Matsumoto	6206e46883	lib/nvme: Set ANA transition time value ANA transition time shall be non-zero if controller supports ANA reporting. Linux NVMe host sets this value to 10, and we don't have any reason to change from that. Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: I61396695dacf47fad40e3cea3311e555729d9e3e Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3909 Community-CI: Mellanox Build Bot Community-CI: Broadcom CI Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com>	2020-08-27 08:36:51 +00:00
Jacek Kalwas	477d9500a3	env: fix pci driver registration Registration macro now generates function based on driver's name. It allows to have multiple registration within single source file. Similar pattern is used e.g. by SPDK_NVMF_TRANSPORT_REGISTER. Signed-off-by: Jacek Kalwas <jacek.kalwas@intel.com> Change-Id: Ied0887e8dae7fe9ca1517313be5eff8f218b7e98 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3895 Community-CI: Mellanox Build Bot Community-CI: Broadcom CI Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2020-08-27 08:36:43 +00:00
Seth Howell	3f190fe170	nvmf/subsystem: find intermediate state in helper function. This will be used in another place later. This patch is part of a series aimed at improving recovery when we are fail to change the subsystem state. Signed-off-by: Seth Howell <seth.howell@intel.com> Change-Id: I24bfbeb3d006584003164540d6ede540dbcafa86 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3392 Community-CI: Mellanox Build Bot Community-CI: Broadcom CI Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: <dongx.yi@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Paul Luse <paul.e.luse@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Michael Haeuptle <michaelhaeuptle@gmail.com>	2020-08-27 08:36:09 +00:00
Jim Harris	150339cc59	cpuset: fix parse_mask The loop here was counting the bytes in the cpus array, but the lcores are represented by bits. While here, add a unit test that exposes this bug and demonstrates it is now fixed with this patch. Fixes #1570. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: I3a1fc48a8085254f41587e3b3d5d732154b90134 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3931 Community-CI: Mellanox Build Bot Reviewed-by: Paul Luse <paul.e.luse@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Ziye Yang <ziye.yang@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2020-08-27 08:34:07 +00:00
Seth Howell	309200661f	lib/nvme: pass up rc from failed connect in reset. Signed-off-by: Seth Howell <seth.howell@intel.com> Change-Id: I8041728de7bb7bdb35fb1570f3c2fb20feef8664 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3877 Community-CI: Mellanox Build Bot Community-CI: Broadcom CI Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>	2020-08-26 09:47:24 +00:00
Seth Howell	316f92d118	lib/nvme: pass up fabric connect rc to app. This will allow applications to understand why they were unable to connect. Signed-off-by: Seth Howell <seth.howell@intel.com> Change-Id: Ic04c7e72098c6ec1823de7d6a07d90150ef5ac20 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3836 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com>	2020-08-26 09:47:24 +00:00
yupeng	2b16cfff9e	bdev: free g_bdev_examine_allowlist before the app exit Add a bdev_examine_allowlist_free function, which releases the members in g_bdev_examine_allowlist. Invoke it in bdev_mgr_unregister_cb. Signed-off-by: Peng Yu <yupeng0921@gmail.com> Change-Id: I47faf6959066da6679716b2f2abfab8ac8b8dd79 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3880 Community-CI: Mellanox Build Bot Community-CI: Broadcom CI Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2020-08-25 08:02:43 +00:00
Jim Harris	178291a516	nvme: read all uevents Currently when the uevent processing code finds a non-uio/vfio uevent, it just stops its loops and returns. This means that if there are a lot of non-uio/vfio uevents, the netlink socket buffer can build up until its full because only one non-uio/vfio event gets drained per spdk_nvme_probe() call (which may be very infrequently). So modify parse_event so that it does not indicate error when a non-uio/vfio event is found. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: Ic8a40f71ee89d597ce46129eac889fe5b7ef5171 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3876 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>	2020-08-24 07:38:33 +00:00
Jim Harris	79f508b047	iscsi: set last_nopin when connection is established This ensures we don't send a nopin immediately after a connection is established, in case the nopin poller fires before the connection reaches full feature phase. Fixes #1441. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: Ieba9476bec0e9b7f85e60b9113ae8364eda5bda3 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3902 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Ziye Yang <ziye.yang@intel.com>	2020-08-24 07:38:24 +00:00
Xiaodong Liu	d84ae74d29	lib/event: fix a wrong deletion Change-Id: I1b88174837c8e03d9e505b6a6270cca1bffa9806 Signed-off-by: Xiaodong Liu <xiaodong.liu@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3878 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Jacek Kalwas <jacek.kalwas@intel.com>	2020-08-24 07:38:10 +00:00
Tomasz Zawadzki	846ca67594	lib/blob: remove requirement for md_page alignment The md_page alignment is not really required for md_page buffers. Allocating 4k aligned buffers all the time, causes memory to be heavily fragmented. Due to DPDK keeping track of the allocation in the same DMA region as the allocation themselves. Removing this alignment requirement will help DPDK when searching for the right part of memory in the heap. Signed-off-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Reported-by: Mike Cui Change-Id: If2f4ca2be38d432d5740f6145b5e0ff46237806b Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3853 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>	2020-08-24 07:38:00 +00:00
Ziye Yang	d4d2e317b5	nvme/tcp: Make the return value consistent. We should make nvme_tcp_ctrlr_connect_qpair always return negative value if this function fails. Signed-off-by: Ziye Yang <ziye.yang@intel.com> Change-Id: I457e704e39d7a3acd298fd48e89e8ea51e2ed4ad Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3809 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>	2020-08-24 07:37:13 +00:00
Changpeng Liu	5adf099eed	nvme: don't send Identify NS ID Descriptor List to inactive NS The specification says it will return INVALID FIELD if the NS is in inactive state. Fix issue #1551. Change-Id: I1b32f023ed665d410f4705e439068699e2b2f8de Signed-off-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3860 Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2020-08-21 08:45:31 +00:00
Alexey Marchuk	fdf2490a32	nvmf/rdma: Don't destroy qpair if rdma_accept fails Failed qpair will be destroyed on generic nvmf layer during handling of error code returned from spdk_nvmf_poll_group_add. The current approach leads to heap-use-after-free. Change-Id: I99331150fa36a3c3c18176589afb973dee449b3a Signed-off-by: Alexey Marchuk <alexeymar@mellanox.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3538 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>	2020-08-21 08:26:15 +00:00
paul luse	c3fd3e95bb	lib/accel: change task alloc from global mempool->per chan list The one large global mempool was a waste of memory for apps that don't use the accel framework as its always allocated a pool sized to handle a heavy load with multiple threads. Instead move to a per channel list of just 1024 tasks greatly decreasing the memory footprint but still able to scale as more threads are added. Also renamed all accel_req to acccel_taak and simply task to accel_task as this was being touched anyways and not consistent. fixes issue #1510 Signed-off-by: paul luse <paul.e.luse@intel.com> Change-Id: I0e93ca6270323e2df4b739711c5d9b667a52e1eb Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3740 Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2020-08-21 08:25:04 +00:00
Alexey Marchuk	8bec9feb76	nvme/rdma: Remove unused spdk_nvme_send_wr_list nvme_rdma_qpair::sends_to_post is not used, remove it and spdk_nvme_send_wr_list structure Signed-off-by: Alexey Marchuk <alexeymar@mellanox.com> Change-Id: If9c42736d4e796a947bbfe80f59efd2fd7f77859 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3822 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>	2020-08-21 08:24:43 +00:00
Alexey Marchuk	58f43df1f5	nvmf/rdma: Handle several ibv events in a row Currently rdma acceptor handles only one ibv event per poll Taking into account the default acceptor poll rate (10ms), it can take a long time to handle e.g. LAST_WQE_REACHED events when we close huge amount of qpairs at the same time. This patch allows to handle up to 32 ibv events per acceptor poll. Change-Id: Ic2884dfc5b54c6aec0655aaa547b491a9934a386 Signed-off-by: Alexey Marchuk <alexeymar@mellanox.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3821 Community-CI: Mellanox Build Bot Community-CI: Broadcom CI Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>	2020-08-21 08:24:43 +00:00
Seth Howell	ce83fc2aff	lib/nvme: remove qpair from ctrl list in connect fail path This is an oversight that can cause issues with looping through the list if we end up allocating the same qpair twice. Signed-off-by: Seth Howell <seth.howell@intel.com> Change-Id: I513ea35398f4b724366c21be144531fbfbdb4347 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3835 Community-CI: Mellanox Build Bot Community-CI: Broadcom CI Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2020-08-21 08:24:18 +00:00
Jim Harris	ed7848f2df	blob: handle overlapping open case We only create one spdk_blob object for a given blob, and just increase the ref_count if it is opened multiple times. bs_open_blob would do the lookup for existing opened blobs. But if the blob is opened again, before the previous open operation has completed, we would end up with two spdk_blob objects for the same blob. Solution is to do another lookup when the open operation completes. If we find the blob, free the one we just finished opening and return the existing one instead. Also added unit test that failed on the existing code but passes now with this patch. Signed-off-by: Jim Harris <james.r.harris@intel.com> Reported-by: Mike Cui Change-Id: I00c3a913b413deddf06f0b63f7a669efb2b5658f Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3855 Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2020-08-21 08:24:09 +00:00
yidong0635	361cddfd63	nvme/nvme_pcie: Remove unused codes. pctrlr->cmb.mem_register_addr and pctrlr->cmb.mem_register_size are assigned after spdk_mem_register. if spdk_mem_register is failed , ctrlr_map_cmb hasn't been executed. they are not be used. So remove them. Signed-off-by: yidong0635 <dongx.yi@intel.com> Change-Id: I3d1996eee8b5260b79c4c3e0a2e1d376da2343b7 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3856 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>	2020-08-21 08:24:00 +00:00
Changpeng Liu	01b6bd8a92	nvmf: fix the associate timeout value SPDK poller uses microsecond as the input parameter, so we need to change the correct value when opts.association_timeout is expressed by millisecond. Change-Id: Ia674f0115ea176b998e4c0c70b8ce75b28984701 Signed-off-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3861 Community-CI: Mellanox Build Bot Community-CI: Broadcom CI Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2020-08-21 08:23:45 +00:00
Changpeng Liu	5d5a9077a3	nvme: don't assert on custom transport Change-Id: I2d425c127dc070f7bb508f5a61e6304f6042fdf7 Signed-off-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3857 Community-CI: Mellanox Build Bot Community-CI: Broadcom CI Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>	2020-08-21 08:23:19 +00:00
Shuhei Matsumoto	1da94ed7b8	rpc/nvmf: Add ana_reporting parameter to nvmf_create_subsystem RPC Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: I9adc8373050e68872a4d9e89518c137e61005254 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3852 Community-CI: Mellanox Build Bot Community-CI: Broadcom CI Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com>	2020-08-21 08:22:40 +00:00
Shuhei Matsumoto	6f2265734d	lib/nvmf: Optionalize ANA reporting feature After supporting ANA reporting by default, Linux kernel 5.3 reported error when parsing NVMe ANA log. The newer kernel fixed the issue but we should optionalize ANA reporting feature to avoid error for Linux kernel 5.3 or before. Add a bool variable ana_reporting to struct spdk_nvmf_subsystem and disable ANA reporting and initialization of related variables if it is false. We can expose MNAN (Maximum Number of Allowed Namespaces) even if ANA reporting is disabled. But MNAN is not required if ANA reporting is disabled. So do not set MNAN if it is false too. Add a public API spdk_nvmf_subsystem_set_ana_reporting() to set ana_reporting by the nvmf_create_subssytem RPC. The next patch will add ana_reporting to nvmf_create_subsystem RPC. Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: Icc77773b4c9513daba2f1a9fdaf951d80574f379 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3850 Community-CI: Mellanox Build Bot Community-CI: Broadcom CI Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Monica Kenguva <monica.kenguva@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2020-08-21 08:22:40 +00:00
Shuhei Matsumoto	4cc04a1251	lib/nvmf: Add nvmf_subsystem_get_controllers RPC Add an new RPC, nvmf_subsystem_get_controllers to retrieve the list of NVMe-oF controllers of an NVMe-oF subsystem. One of the main use cases will be to get identification information of NVMe-oF controllers to configure their ANA states dynamically. Pause and resume the subsystem to access the controllers safely. One subtle issue remains. The JSON RPC returns success even if resuming the subsystem fails. Write FIXME explicitly to address this. Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: Ibf8d1cf56850a705e343b86022d101b4c7204199 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3848 Community-CI: Mellanox Build Bot Community-CI: Broadcom CI Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2020-08-21 08:22:40 +00:00
Michael Haeuptle	43ad7febb2	lib/nvmf: Fixes stuck subsystem RPC A subsystem RPC is not transitioned to a paused state when there are ios outstanding (tracked by subsystem poll group). In general AERs, are not tracked as outstanding IOs. However, there are 3 paths in nvmf_ctrlr_async_event_request which do not adjust the outstanding io count. If we get into any of these 3 paths, the subsystem pause can hang forever. The issue was reproduced with hot plug stress testing under load. We can get into the second path (SPDK_NVME_ASYNC_EVENT_TYPE_NOTICE) under these circumstances: - An AER completion is sent to the initiator due to a namespace change (e.g. hot remove/add) - In this case, type is set to SPDK_NVME_ASYNC_EVENT_TYPE_NOTICE - The initiator sends a new AER admin command, hitting the second path where we return without adjusting the outstanding ios. Fixes: 1552 Change-Id: I45f781966cc1e9a601b2305c7985a21154d802e8 Signed-off-by: Michael Haeuptle <michael.haeuptle@hpe.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3854 Community-CI: Mellanox Build Bot Community-CI: Broadcom CI Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Seth Howell <seth.howell@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: JinYu <jin.yu@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>	2020-08-20 09:31:17 +00:00
Sochin Jiang	db3d1201a4	lib/blob: fix a data corruption bug There is a fatal bug that could easily cause data corruption when using thin-provisioned blobs. In blob_request_submit_rw_iov(), we first get lba by calling blob_calculate_lba_and_lba_count(), blob_calculate_lba_and_lba_count() calculates different lbas according to the return of bs_io_unit_is_allocated(). Later, we call bs_io_unit_is_allocated() again to judge whether the specific cluster is allocated, the problem is it may have be allocated here while not be allocated when calling blob_calculate_lba_and_lba_count() before. To ensure the correctness of lba, we can do lba recalculation when bs_io_unit_is_allocated() returns true, or make blob_calculate_lba_and_lba_count() return the result of bs_io_unit_is_allocated(), use the second solution in this patch. By configuring more than one cpu core, md thread will run in a separate SPDK thread, this data corruption scenario could be easily reproduced by running fio verify in VMs using thin-provisioned Lvols as block devices. Signed-off-by: Sochin Jiang <jiangxiaoqing.sochin@bytedance.com> Change-Id: I099865ff291ea42d5d49b693cc53f64b60881684 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3318 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com>	2020-08-20 09:26:23 +00:00
Ziye Yang	0d3cc15a62	nvme/tcp: Correct the incapsule data usage According to page35 in recent NVMe-oF spec ( NVMe-over-Fabrics-1.1-2019.10.22-Ratified), ioccsz is used to restrict the incapsule size of I/O command, so do not restrict the NVMe-oF OPC command and also the admin command. We accidently trigger an bug in kernel since we do not send the fabrics command with the incapsule and make the kernel coredump, though the kernel has bugs. Signed-off-by: Ziye Yang <ziye.yang@intel.com> Change-Id: I869a2c8ab7b9c2ac1e5cc5b603920662591c2c64 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3837 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: <dongx.yi@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>	2020-08-20 09:26:06 +00:00
yupeng	2d30df9b0b	bdev: add bdev_examine_bdev API The bdev_examine_bdev api will examine a bdev explicitly. After disabling the auto_examine feature, a user could call bdev_examine_bdev to examine a specific bdev he/she wants. Signed-off-by: Peng Yu <yupeng0921@gmail.com> Change-Id: Ifbbfb6f667287669ddf6175b8208efee39762933 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3219 Community-CI: Mellanox Build Bot Community-CI: Broadcom CI Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Michael Haeuptle <michaelhaeuptle@gmail.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2020-08-20 08:56:53 +00:00
Ziye Yang	2031f8f70d	nvme: set the error code if we cannot send keep alive command. If the transport is broken, we should set errno code in spdk_nvme_ctrlr_process_admin_completions instead of keeping silence. Signed-off-by: Ziye Yang <ziye.yang@intel.com> Change-Id: Ie73763e1329e12a8c82a0223d360991f86c39be3 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3773 Community-CI: Broadcom CI Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>	2020-08-19 07:29:26 +00:00
Seth Howell	518a1e013a	lib/nvme: make fabrics connect timeout configurable. Signed-off-by: Seth Howell <seth.howell@intel.com> Change-Id: If829d399882ef948d95673c17e5689c91386c21d Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3795 Community-CI: Mellanox Build Bot Community-CI: Broadcom CI Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2020-08-19 07:29:19 +00:00
Seth Howell	b3bb3a1bbf	lib/nvme: change timeout in wait_for_completions to usec This allows for much more granular control over the timeout. Signed-off-by: Seth Howell <seth.howell@intel.com> Change-Id: Ib23de21e60eec4207c55320579699edf284f4e16 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3794 Community-CI: Mellanox Build Bot Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2020-08-19 07:29:19 +00:00
Ziye Yang	85ff3fcea6	rdma: Do not use the poller to handle the qpair exiting. Generally, this patch did the following work: Remove the destruct poller. I think that we do not need this, the destruct poller is specially for Softwaare RoCE case. Since SoftRoCE will not have IBV_EVENT_QP_LAST_WQE_REACHED event, we will not wait the last_wqe_reached flag when srq is enabled. So we can avoid using the poller. And the purpose of this patch is to solve the coredump issue. For example, if we run rdma local test such as, e.g., test/nvmf/host/bdevperf.sh --transport=rdma The coredump reason: the qpair is freed twice. Because for RDMA transport, we do not really remove the qpair from the group if the upper layer does it. The first time is called by nvmf_rdma_destroy_drained_qpair in nvmf_rdma_poller_poll, and the second time is called by nvmf_rdma_qpair_reject_connection in in nvme_rdma_close_qpair. Since nvme_rdma_close_qpair will always called, so we need make sure that the qpair will be close after calling this function. Otherwise we will have the double free qpair. So our approach here is add a flag ("to_close")in rqpair structure and make sure the rqpair be freed after the "to_close" is set nvme_rdma_close_qpair Signed-off-by: Ziye Yang <ziye.yang@intel.com> Change-Id: I6f97debbcd29bbb7c6e3f9725907b4102a1d2892 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3661 Community-CI: Broadcom CI Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Seth Howell <seth.howell@intel.com>	2020-08-19 07:28:36 +00:00
Shuhei Matsumoto	05cd697757	lib/iscsi: Add MaxR2TPerConnection to iSCSI options Add MaxR2TPerConnection to iSCSI global options and make it configurable by JSON RPC. Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: Ida95e5c7dac301a22520656709e1aa4d611f31ef Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3777 Community-CI: Broadcom CI Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2020-08-18 07:27:45 +00:00
Shuhei Matsumoto	5af42000c1	lib/iscsi: Make max outstanding R2Ts per connection configurable By the recent refactoring, we have no static size array for outstanding R2Ts per connection. It looks that we do not have any critical reason to prohibit us from making max outstanding R2Ts per connection configurable. There are some use cases to use large write I/O intensively (e.g. 128KB). Let such use cases change the value of max R2Ts per connection by their responsibility to do performance tuning. Maximum outstanding R2Ts per task are defined both for iSCSI target and NVMe-TCP target but maximum outstanding R2Ts per connection is unique for iSCSI target. The next patch will add the corresponding iSCSI option. Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: I4f6fd3c750a9a0a99bcf23064fe43a3389829aa9 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3776 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2020-08-18 07:27:45 +00:00

1 2 3 4 5 ...

7495 Commits