numam-spdk

Author	SHA1	Message	Date
Tomasz Zawadzki	3dadb79e37	lib/blob: add EXTENT_RLE descriptor description Since further patches will be adding new descriptors that are related to cluster layout throughout the blobstore, add description for existing descriptor too. Signed-off-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Change-Id: I722eb633445685789d5185ed59dfc910f76b109f Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/481724 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Paul Luse <paul.e.luse@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2020-01-27 18:06:43 +00:00
Tomasz Zawadzki	c33840b7e6	lib/blob: add option to enable extent pages This is an additional option that can be passed when creating a blob. When opts->enable_extent_pages is set to false (current default), only EXTENT_RLE should be persisted on sync. During blob load, when EXTENT_RLE is present in md, blob->extent_rle_found is set to true. When opts->enable_extent_pages is set to true, only EXTENT_TABLE and EXTENT_PAGES should be persisted on sync. During blob load, when EXTENT_TABLE is present in md, blob->extent_table_found is set to true. It is possible to find neither EXTENT_* descriptor when loading a blob. This means that blob length is 0 and EXTENT_RLE was supposed to be used. Yet none were persisted due to lack of clusters. In such case blob->use_extent_table is set to true after finishing blob load. When parsing metadata ends, if extent_table_found is set - then support for extent_table is enabled. All other cases disable it. At this time path for Extent Pages is not implemented, so it should not be used. Later in the series, it will become the default path for serialization. Signed-off-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Change-Id: I2146da6130a0645e686ab02a3b5d2d86a7d35a1f Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/479853 Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Paul Luse <paul.e.luse@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2020-01-27 18:06:43 +00:00
Ben Walker	f84c916c41	nvmf/tcp: Correctly kick the recv state machine when a request is freed When a command arrives and no requests are available, the socket recv state machine sits in the RECV_STATE_AWAIT_REQ state until another network event occurs. If this I/O was the last one sent, this leaves the target hung. To fix this, when a request is completed, kick the state machine to make forward progress. In practice, this can only occur once the pdu send acknowledgements are asynchronous relative to arriving commands. That only begins happening with the use of MSG_ZEROCOPY. When MSG_ZEROCOPY is turned on, it's possible receive the next PDU in a chain for a command prior to seeing the acknowledgement that the response that triggered that PDU actually sent. Change-Id: I556f31ad56970d36aa3538cfde375d35f3d4e551 Signed-off-by: Ben Walker <benjamin.walker@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/480002 Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2020-01-27 17:42:24 +00:00
Ben Walker	48a547fd82	nvmf/tcp: Wait for R2T send ack before processing H2C Previously, the R2T was sent and if an H2C arrived prior to seeing the R2T ack, it was processed anyway. Serialize this process. In practice, if the H2C arrives with a correctly functioning initiator, that means the R2T already made it to the initiator. But because the PDU hasn't been released yet, immediately processing the PDU requires an extra PDU associated with the request. Basically, making this change halves the worst-case number of PDUs required per connection. In the current sock layer implementations, it's not actually possible for the R2T send ack to occur after that H2C arrives. But with the upcoming addition of MSG_ZEROCOPY and other sock implementations, it's best to fix this now. Change-Id: Ifefaf48fcf2ff1dcc75e1686bbb9229b7ae3c219 Signed-off-by: Ben Walker <benjamin.walker@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/479906 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>	2020-01-27 17:42:24 +00:00
Ben Walker	033ef363a9	nvmf/tcp: Inline spdk_nvmf_tcp_pdu_set_buf_from_req This function was only called from one spot. Change-Id: I856f564d3ef6c6157be7a32a2cd812c702516a8d Signed-off-by: Ben Walker <benjamin.walker@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/482003 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Alexey Marchuk <alexeymar@mellanox.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2020-01-27 17:42:24 +00:00
Ben Walker	fdfb7908b5	nvmf/tcp: Rename next_expected_r2t_offset to h2c_offset This seems like a more descriptive name Change-Id: Ia616865b3fb36d8f9ccc5fb2ca6185bdd8543cf8 Signed-off-by: Ben Walker <benjamin.walker@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/482002 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Reviewed-by: Alexey Marchuk <alexeymar@mellanox.com>	2020-01-27 17:42:24 +00:00
Ben Walker	a2adca79d9	nvmf/tcp: Set up math to always use 1 R2T per nvme command With our target design, there's no advantage to sending multiple R2T PDUs per nvme command. This patch starts by setting up the math so that at most 1 R2T PDU is required per request. This can be guaranteed because the maximum data transfer size (MDTS) is pre-negotiated in NVMe-oF to a reasonable size at start up. It then proceeds to simplify all of the logic around mapping requests to PDUs. It turns out that the mapping is now always 1:1. There are two additional cases where there is no request object at all but a PDU is still needed - the connection response and termination request. Put an extra PDU on the queue object for that purpose. This is a major simplification. Change-Id: I8d41f9bf95e70c354ece8fb786793624bec757ea Signed-off-by: Ben Walker <benjamin.walker@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/479905 Community-CI: SPDK CI Jenkins <sys_sgci@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Alexey Marchuk <alexeymar@mellanox.com>	2020-01-27 17:42:24 +00:00
Ben Walker	399529aaa1	nvmf/tcp: Set max h2c size equal to max I/O size We can always accept up to the maximum I/O size in an H2C, so eliminate the #define. Change-Id: I349dab5f9b6ec482a7c580b1396e03c8d30a250b Signed-off-by: Ben Walker <benjamin.walker@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/482278 Community-CI: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Alexey Marchuk <alexeymar@mellanox.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2020-01-27 17:42:24 +00:00
Ben Walker	4dba507224	nvmf/tcp: Simplify qpair resource initialization The resources allocated to a queue pair do not need to be directly correlated to the queue size requested by the initiator in NVMe-oF, as long as enough resources are present. The RDMA transport, for instance, does complex pooling of the resources behind the scenes when using a shared receive queue. Simplify the resource allocation for a TCP qpair to just always allocate the max allowed queue size right away. This is a configurable parameter, so system administrators can adjust for their needs. The initiator may then request a queue size less than or equal to that, which will only be enforced by queue depth counting and not impact the actual number of resources allocated on the target. This change relies on the MaxC2HSize being equal to the Maximum Data Transfer Size (MDTS) reported. That is the default configuration, but MDTS is configurable. Changing the MDTS with this patch to a value larger than 128k will cause the target to break. This is addressed in the next patch in this series. Change-Id: Ibd4723785c6a4d8d444f9b7bbfa89f98de2320f5 Signed-off-by: Ben Walker <benjamin.walker@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/479733 Community-CI: SPDK CI Jenkins <sys_sgci@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Alexey Marchuk <alexeymar@mellanox.com>	2020-01-27 17:42:24 +00:00
Ben Walker	444cf90c72	nvmf/tcp: Change qpair's state_cntr array to uint32_t These values do not need to be negative. Change-Id: Id9f798cf1c9da354448f9c6fbb90e599f877bb32 Signed-off-by: Ben Walker <benjamin.walker@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/482277 Community-CI: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Alexey Marchuk <alexeymar@mellanox.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2020-01-27 17:42:24 +00:00
Ben Walker	5a7b33ec67	nvmf/tcp: In _pdu_write_done, free pdu before calling user callback By releasing the just-completed PDU prior to calling the callback, for flows that immediately submit another PDU inside the callback, the just-released PDU can be immediately reused. This reduces the number of PDUs required in the pool to continue forward progress to half of the previous value, while also making it more CPU cache friendly. Change-Id: I8031b8f9f57ac05f261d96433d9899fe5e31d318 Signed-off-by: Ben Walker <benjamin.walker@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/479904 Community-CI: SPDK CI Jenkins <sys_sgci@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Alexey Marchuk <alexeymar@mellanox.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ziye Yang <ziye.yang@intel.com> Reviewed-by: Or Gerlitz <gerlitz.or@gmail.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>	2020-01-27 17:42:24 +00:00
Jim Harris	dc3717296e	bdev: handle unlock v. lock race When we unlock a range, we remove the range from the locked bdev list before doing the for_each_channel iteration to remove the range from each channel. But at the same time, right after removing from the locked list, a new lock on that range could start. In that case, we also do a for_each_channel to add the range to each channel, and that will race with the for_each_channel remove. When the lock start wins, it finds the range already in the channel, but doesn't set the owner_range which results in a seg fault when the for_each_channel completes. The fix is actually rather simple. We just add the locked_ctx to the comparison when checking if the range is already in the channel. If the locked_ctx matches, then we know it was added as part of initializing a new channel. If it doesn't, then we create a new range object pointing to the new locked_ctx. The first one will get removed when the remove for_each_channel catches up. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: I94f8b20376dd437f404add35744d42fc148303ff Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/482620 Community-CI: Broadcom SPDK FC-NVMe CI <spdk-ci.pdl@broadcom.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Paul Luse <paul.e.luse@intel.com> Reviewed-by: Maciej Szwed <maciej.szwed@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2020-01-27 17:39:52 +00:00
Jim Harris	da11a46466	bdev: start lock process on original channel If a locking operation has to wait because of an existing lock, we queue the lock context. When the existing lock finishes unlocking, we restart the queued lock context. But we have to make sure we restart the lock context on the same thread it was originally submitted, since it has a channel associated with it. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: I555515f3adfc3c13a86584c601ed541d605980b7 Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/482463 Community-CI: Broadcom SPDK FC-NVMe CI <spdk-ci.pdl@broadcom.com> Reviewed-by: Maciej Szwed <maciej.szwed@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Alexey Marchuk <alexeymar@mellanox.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2020-01-27 17:39:52 +00:00
Maciej Szwed	a83644fe2b	bdev: Lock LBA range for fused command execution Signed-off-by: Maciej Szwed <maciej.szwed@intel.com> Change-Id: I577f961484b2ebf350f4f795eda1a018c5f0fd7a Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/481710 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Broadcom SPDK FC-NVMe CI <spdk-ci.pdl@broadcom.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2020-01-27 17:39:52 +00:00
Tomasz Kulasek	9a80e954f7	lib/nvmf: report support for fused compare and write Change-Id: Ib073719a59972240a68b1a4ad4951820c7ea5323 Signed-off-by: Tomasz Kulasek <tomaszx.kulasek@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/476136 Community-CI: Broadcom SPDK FC-NVMe CI <spdk-ci.pdl@broadcom.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2020-01-27 17:39:52 +00:00
Maciej Szwed	ff8a425182	nvmf: Return ACWU and NACWU values in indentify structures For ACWU we always set value 1 because bdev holds information specific for namespace only. This value actually does not matter because we also set NACWU which makes ACWU irrelevant. We set ACWU because NVMe specs requires ACWU != 0 if fused commands are supported. Signed-off-by: Maciej Szwed <maciej.szwed@intel.com> Change-Id: Ida4357026d3b32677fc824b3cd878e7ad8ef2680 Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/477915 Community-CI: Broadcom SPDK FC-NVMe CI <spdk-ci.pdl@broadcom.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2020-01-27 17:39:52 +00:00
Maciej Szwed	c13733915b	bdev: Add spdk_bdev_get_acwu function This function is required for NVMf implementation for compare and write fused command. Signed-off-by: Maciej Szwed <maciej.szwed@intel.com> Change-Id: If41611f5c0b8e4ed8eec66f09858c724f1800d59 Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/477914 Community-CI: Broadcom SPDK FC-NVMe CI <spdk-ci.pdl@broadcom.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Paul Luse <paul.e.luse@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2020-01-27 17:39:52 +00:00
Maciej Szwed	71beb568d6	nvmf: Add call support for compare and write cmd in spdk_nvmf_ctrlr_process_io_cmd Add call for spdk_nvmf_bdev_ctrlr_compare_and_write_cmd function in spdk_nvmf_ctrlr_process_io_cmd function when fused command is discovered. This patch also removes redundant defines for fused flags. Signed-off-by: Maciej Szwed <maciej.szwed@intel.com> Change-Id: I61971a56577ab32b52e1fde1e572f718a9a2d9aa Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/476621 Community-CI: Broadcom SPDK FC-NVMe CI <spdk-ci.pdl@broadcom.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2020-01-27 17:39:52 +00:00
Maciej Szwed	87be077d0b	nvmf: Add spdk_nvmf_ctrlr_process_io_fused_cmd Move fused cmd related code from spdk_nvmf_ctrlr_process_io_cmd to separate function. Signed-off-by: Maciej Szwed <maciej.szwed@intel.com> Change-Id: Ic662a968b054f05db7f6e1cf4fa9aa13f6fb7c40 Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/481942 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Broadcom SPDK FC-NVMe CI <spdk-ci.pdl@broadcom.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2020-01-27 17:39:52 +00:00
Maciej Szwed	941d9e7aa8	nvmf: Add support for compare op command This patch introduces new spdk_nvmf_bdev_ctrlr_compare_cmd function which implements support for compare operation. Signed-off-by: Maciej Szwed <maciej.szwed@intel.com> Change-Id: Iadf402a6441a78ea0e6468f1066c6b0e10e63b9b Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/477782 Community-CI: Broadcom SPDK FC-NVMe CI <spdk-ci.pdl@broadcom.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2020-01-27 17:39:52 +00:00
Maciej Szwed	05e7f56c3a	nvmf: Add spdk_nvmf_bdev_ctrlr_compare_and_write_cmd function This patch introduces new function that is a part of upcoming support for fused commands. Signed-off-by: Maciej Szwed <maciej.szwed@intel.com> Change-Id: I019c587bee7fd0f745ec17c141baf4cb7bf86645 Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/476611 Community-CI: Broadcom SPDK FC-NVMe CI <spdk-ci.pdl@broadcom.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2020-01-27 17:39:52 +00:00
Tomasz Kulasek	67c9c1c5d8	lib/nvmf: add fused operations Change-Id: If3162a5683d1c57011f9a66cbcfe47ba161734bf Signed-off-by: Tomasz Kulasek <tomaszx.kulasek@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/476138 Community-CI: Broadcom SPDK FC-NVMe CI <spdk-ci.pdl@broadcom.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2020-01-27 17:39:52 +00:00
Maciej Szwed	adf90938b1	bdev: Add spdk_bdev_io_get_nvme_fused_status function Added new function for getting NVMe specific return code for fused commands. Also changed one of the return codes in fused commands so that we could distinguish error cases. Signed-off-by: Maciej Szwed <maciej.szwed@intel.com> Change-Id: I86417ea4f5b8f3e6496162be3d6c6128076e35d4 Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/481666 Community-CI: Broadcom SPDK FC-NVMe CI <spdk-ci.pdl@broadcom.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>	2020-01-27 17:39:52 +00:00
zhangjf	2a00a12892	vhost_blk: need init task when resubmit the blk request Change-Id: I10fca86be6a2834fe3238d8881a4645ac810a201 Signed-off-by: zhangjf <zjfhappy@126.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/482346 Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2020-01-24 08:07:13 +00:00
Seth Howell	85fcc49fd4	nvmf/rdma: fix call to spdk_nvmf_rdma_listen This change fixes a merge incompatibility between commits 50cb6a04acf3f77863cc7fe7753dabd79beaab57 and 708ed4fb6e8c41d6033ce26b349171aa77703061. Change-Id: I5bc71a3c214667f01de66857cf61b9eb25f6cf6b Signed-off-by: Seth Howell <seth.howell@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/482586 Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2020-01-23 16:38:46 +00:00
Seth Howell	ca693eaba8	lib/nvme: fix cm event handling during rdma qpair shutdown. In the event that we have more than one event outstanding for a qpair at the time of destruction, we need to ack all of the events, Luckily the synchronization is already there in the form of the ctrlr lock. Signed-off-by: Seth Howell <seth.howell@intel.com> Change-Id: Ib297598f2e28d9b9bd83e904f950795a61fa883a Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/479171 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Alexey Marchuk <alexeymar@mellanox.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2020-01-23 15:14:55 +00:00
Seth Howell	50cb6a04ac	lib/nvmf: handle RDMA_CM_EVENT_ADDR_CHANGE This allows features like transparent failover on target RNICs. Signed-off-by: Seth Howell <seth.howell@intel.com> Change-Id: Iab494ad3e9e4efea4db9cbb30bc18ea5b584f345 Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/478879 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: <jacek.kalwas@intel.com> Reviewed-by: Alexey Marchuk <alexeymar@mellanox.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2020-01-23 15:14:55 +00:00
paul luse	82a4c84fec	lib/bdev: Add support for new per bdev_io auxiliary buffer. A new API was added `spdk_bdev_io_get_aux_buf` allowing the caller to request an auxiliary buffer for its own private use. The API is used in the same manner that `spdk_bdev_io_get_buf` is used and the length of the buffer is always the same as the bdev_io primary buffer. 'spdk_bdev_io_put_aux_buf' is called to free the auxiliary buffer. The initial use case is crypto, in the next patch in series it is used. No UT were added as the logic isn't that complicated and it is fully tested with each run of crypto. Fixed a comment typo also (not mine for once). Signed-off-by: paul luse <paul.e.luse@intel.com> Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: Ib1939fcbc8e5db36fd909ef26771a725a551e8e6 Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/478383 Community-CI: SPDK CI Jenkins <sys_sgci@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2020-01-23 02:36:51 +00:00
Ziye Yang	74ce72edca	lib/iscsi: Using async writev for ISCSI_OP_LOGOUT_RSP PDU Change-Id: I9d6d547645930c5075dca7d1e8c566634cda8e73 Signed-off-by: Ziye Yang <ziye.yang@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/482028 Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Community-CI: SPDK CI Jenkins <sys_sgci@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2020-01-23 02:29:58 +00:00
Ziye Yang	16d5a6155a	lib/iscsi: Using async writev for ISCSI_OP_LOGIN_RSP PDU Change-Id: Ia69c996c731dfd89702bbb28468d8798c391034d Signed-off-by: Ziye Yang <ziye.yang@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/481922 Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Community-CI: SPDK CI Jenkins <sys_sgci@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2020-01-23 02:29:58 +00:00
Ziye Yang	67067ea4de	lib/iscsi: Add a helper function iscsi_conn_params_update Purpose: To reduce the duplicated code. Signed-off-by: Ziye Yang <ziye.yang@intel.com> Change-Id: I6f6e79af602281ed50fa0fde7651238065c9bd31 Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/482291 Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Community-CI: SPDK CI Jenkins <sys_sgci@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2020-01-23 02:29:58 +00:00
Ziye Yang	b03612bf4d	lib/iscsi: Using async writev for ISCSI_OP_TEXT_RSP PDU To avoid partial write issue of this PDU. Signed-off-by: Ziye Yang <ziye.yang@intel.com> Change-Id: Id9b22da844c75ae53c6881850d192b40ac4098ac Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/481948 Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Community-CI: SPDK CI Jenkins <sys_sgci@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2020-01-23 02:29:58 +00:00
Ziye Yang	e199f1a5b4	lib/iscsi: adjust the location of spdk_iscsi_param_free in two functions. Purpose: To prepare for the further patch submission. Since we do not need to keep this variable too late. Signed-off-by: Ziye Yang <ziye.yang@intel.com> Change-Id: Ibaa100925e1ea317253d4fe7e560917e063fcf6b Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/482290 Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Community-CI: SPDK CI Jenkins <sys_sgci@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2020-01-23 02:29:58 +00:00
Ziye Yang	d8d1168c06	lib/iscsi: Add real callback for DATAIN PDU complete Since only after DATAIN pdu sending out, we can have free slot to handle queued data in tasks. Signed-off-by: Ziye Yang <ziye.yang@intel.com> Change-Id: I49a52597e8660453ea90c5960d020eb53f81265d Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/482048 Community-CI: SPDK CI Jenkins <sys_sgci@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2020-01-23 02:29:58 +00:00
Ziye Yang	098d32273a	lib/iscsi: Add two parameters in spdk_iscsi_conn_write_pdu This is prepared for the further call back usage. Change-Id: Iccf304c87e67debfb4e7c330acc9cc233cc3ec48 Signed-off-by: Ziye Yang <ziye.yang@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/481917 Community-CI: SPDK CI Jenkins <sys_sgci@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2020-01-23 02:29:58 +00:00
Ziye Yang	d648dde682	lib/iscsi: Use asychronized writev for sending data on sockets This patch eliminates the flushing logic and simplies the writev logic. And this patch can also improve the performance. We support async write for PDUs other than login response, logout response, and text response in this patch. We will support async write also for them later in this patch series. Signed-off-by: Ziye Yang <ziye.yang@intel.com> Change-Id: I243f598f297d594da0bb18466bc47dab918ed3ee Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/481686 Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Community-CI: SPDK CI Jenkins <sys_sgci@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2020-01-23 02:29:58 +00:00
Ziye Yang	377a016f69	lib/iscsi: add the conn in spdk_iscsi_pdu Purpose: Prepare for the async writev usage for spdk iSCSI target application. Change-Id: Iff0e932159b0ad80be32aed3fc543b67d8fb8f51 Signed-off-by: Ziye Yang <ziye.yang@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/481644 Community-CI: SPDK CI Jenkins <sys_sgci@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>	2020-01-23 02:29:58 +00:00
Changpeng Liu	5e8a3a77b6	vhost: make SPDK internal vhost library can work compatible with live recovery We will not enable the live recovery feature for SPDK internal vhost library, so we unmask the protocol flag for internal vhost library. For the purpose to make it can be compiled with latest DPDK version, some mandatory APIs are required, so add them here. Change-Id: I34fab7ed90c86a0fb612852a47f6cadeb8a072f3 Signed-off-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/482069 Community-CI: SPDK CI Jenkins <sys_sgci@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>	2020-01-22 13:54:19 +00:00
Jan Kryl	2167c68d18	lib/nvmf: nvmf target stops to listen when subsystem is destroyed There is a spdk_nvmf_tgt_listen() which opens a port for specified transport (trid) which opens possibility to accept new connections from initiators. However there is no counterpart of this function (i.e. spdk_nvmf_tgt_stop_listen()), which would stop listening. Instead the current code relies on spdk_nvmf_subsystem_destroy() to stop the listener, which seems to be wrong. Fixes #1129 Change-Id: I6e73d8c234dc451f0fee8394132eae34cd4f4756 Signed-off-by: Jan Kryl <jan.kryl@mayadata.io> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/479873 Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Alexey Marchuk <alexeymar@mellanox.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2020-01-22 13:53:34 +00:00
Or Gerlitz	8e8a5f7c28	nvme/tcp: Use writev_async for sending data on sockets Amortize the writev syscall cost by using the writev_async socket API. This allows the socket layer to batch writes into one system call and also apply further optimizations such as posix's MSG_ZEROCOPY when they are available. As part of doing so we remove the error return in the socket layer writev_async implementation for sockets that don't have a poll group. Doing so eliminates the send queue processing. Signed-off-by: Or Gerlitz <ogerlitz@mellanox.com> Change-Id: I5432ae322afaff7b96c22269fc06b75f9ae60b81 Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/475420 Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2020-01-22 13:53:09 +00:00
Or Gerlitz	e61b0904a8	sock/posix: Add flush Initiator drivers (e.g nvme/tcp) don't use poll groups but rather directly poll the qpair. In this case we want to allow the polling function (e.g _qpair_process_completions()) to flush async writes pending on the socket. Signed-off-by: Or Gerlitz <ogerlitz@mellanox.com> Change-Id: Ibd8c73691213d58e287b7110d0f5a381a89a64d0 Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/475419 Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2020-01-22 13:53:09 +00:00
Tomasz Zawadzki	1fdee03c3c	lib/blob: split loading next md_chain to separate function Replaying md through _spdk_bs_load_replay_md_cpl() starts with md page 0 in search of first valid md page starting a chain for particular blob. When it is found, next pages read are from the current pages `next` page - next in chain. After whole chain is read, it goes back to first page in chain and starts search for next valid chain from there. This patch adds separation between reading particular chain, and moving to the next one. Moving on to the next one happens in _spdk_bs_load_replay_md_chain_cpl(). Further in the series, extent pages will be added in the metadata. Those are not within any particular blobs chain of metadata, but spread out over the md region. It is not enough to read all md and read extent pages. In case of power failure, only extent pages known to be valid are the ones which are pointed to by some valid md chain. In futher patches, a step will be added after reading particular valid md chain to go read extent pages pointed by it. Signed-off-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Change-Id: I6e7cd64af66ce5db0abd2ad5962d604ac2b30994 Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/481900 Community-CI: SPDK CI Jenkins <sys_sgci@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Paul Luse <paul.e.luse@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2020-01-22 13:52:49 +00:00
Tomasz Zawadzki	bb25821c7e	lib/blob: move finishing unload to _spdk_bs_unload_finish() Moved finishing of unloading to separate function, which is now called on every failure and success when unloading the blobstore. Signed-off-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Change-Id: I34539b78c5cc63a6fe5891014cba89b9eb62d4df Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/482009 Community-CI: SPDK CI Jenkins <sys_sgci@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Paul Luse <paul.e.luse@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2020-01-22 13:52:49 +00:00
Tomasz Zawadzki	f7bd1e1eb9	lib/blob: check bserrno on each step of bs_load Before this change it was possible to fail at writing out some of used md pages. bserrno output of those was not verified. This patch adds it at every step. With that two function don't need (and never needed) to pass the bserrno: _spdk_bs_load_write_used_md() spdk_bs_load_complete() Signed-off-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Change-Id: I1a61763f03665ba1b00e5949ef0cf37eefaaf08f Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/482008 Community-CI: SPDK CI Jenkins <sys_sgci@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Paul Luse <paul.e.luse@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2020-01-22 13:52:49 +00:00
Tomasz Zawadzki	cf5df9b41d	lib/blob: remove seq argument from _spdk_bs_load_ctx_fail() This is simplification of load path. seq is save in ctx already, no need to pass it to the function. Signed-off-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Change-Id: Ief0ddc1826c461adbad71ba1a3897c510ec2a971 Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/482007 Community-CI: SPDK CI Jenkins <sys_sgci@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Paul Luse <paul.e.luse@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2020-01-22 13:52:49 +00:00
Seth Howell	9436ab59ba	nvme/rdma: inline buffers for all host to ctrlr ops Not inlining all host to controller operations breaks the target within the context of fused commands. This issue was discovered when enabling the compare-and-write fused command. Only the write command buffer was being inlined which caused the write to jump the compare in the transport specific state machine on the target side before our fused command checks in the generic code. Change-Id: I9e52ae6160e01ffd36d20429ffc8459491c729ef Signed-off-by: Seth Howell <seth.howell@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/482001 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2020-01-22 13:41:47 +00:00
Changpeng Liu	2a8281fdfd	thread: free message event after executing the callback successfully We should check the thread's state at the end of message callback, or we may leak the message memory in case the thread was set to exit state. Change-Id: Ifb67c3b5c39440c411eca1d045c11e8aa6c514cc Signed-off-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/482206 Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Community-CI: SPDK CI Jenkins <sys_sgci@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2020-01-22 01:25:03 +00:00
Jacek Kalwas	de4bf95443	lib/nvmf: put explicit transport name on dump Signed-off-by: Jacek Kalwas <jacek.kalwas@intel.com> Change-Id: Ie37c1d713f0e1b0767c4b40c1055b86d9de220af Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/481954 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Seth Howell <seth.howell@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Community-CI: SPDK CI Jenkins <sys_sgci@intel.com>	2020-01-21 23:16:57 +00:00
Jacek Kalwas	0651753ce8	lib/nvmf: introduce function to get transport name Signed-off-by: Jacek Kalwas <jacek.kalwas@intel.com> Change-Id: Ide89666bfd856d42ca5cb535e8a29716f787ea3f Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/481953 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Seth Howell <seth.howell@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2020-01-21 23:16:57 +00:00
Jacek Kalwas	7a35a678b8	lib/nvme: extend trtype to str with CUSTOM Signed-off-by: Jacek Kalwas <jacek.kalwas@intel.com> Change-Id: I519bb6bf0e930e0cd977ef4b5133bbdd7ca8af86 Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/481952 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Seth Howell <seth.howell@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2020-01-21 23:16:57 +00:00

1 2 3 4 5 ...

6607 Commits