numam-spdk

Author	SHA1	Message	Date
Darek Stojaczyk	74243e36b9	vhost: reorder spdk_vhost_session_send_event Put it next to other functions in this call chain. Change-Id: Ieafd91c6cfefec134594aec8671eb4efdac15dfe Signed-off-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/459164 Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2019-07-16 10:57:46 +00:00
Darek Stojaczyk	5e63804146	vhost: remove spdk_ prefix from some static function spdk_ prefix should be only used on public API functions. Change-Id: I663b107bd6b1c92c2c6263f2ec7c763d9812e7fe Signed-off-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/459163 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>	2019-07-16 10:57:46 +00:00
Darek Stojaczyk	4de67bbf6d	vhost: inline spdk_vhost_event_async_send_foreach_continue Despite its name, this function is defined as static and is only used in one place, so inline it. Change-Id: I4e217b3baae9b735761f5497f06b681a118860e9 Signed-off-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/459162 Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2019-07-16 10:57:46 +00:00
Darek Stojaczyk	98af6aba4d	vhost: remove vsession->ev_ctx It's no longer used. Change-Id: Iffa385e18ba7a979d7a384f420f546207774dea3 Signed-off-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/459161 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>	2019-07-16 10:57:46 +00:00
Darek Stojaczyk	f43a485299	env_dpdk/pci: make spdk_pci_device_detach() synchronous again By making dpdk device detach asynchronous we have actually broken some cases where devices are re-attached immediately after and fail since they were not detached yet, so now we're making device detach synchronous again. For that we'll simply wait inside spdk_pci_device_detach() for the background dpdk thread to perform all necessary actions before we return. We'll also print an error msg if DPDK failed the detach (probably because of some internal error). Change-Id: I7657ac1b169169eae3325de2d28c2cc311e7d901 Signed-off-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/460286 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: <jacek.kalwas@intel.com>	2019-07-16 10:56:28 +00:00
Darek Stojaczyk	79b5618168	env_dpdk/pci: don't defer device detach while on the dpdk thread By making dpdk device detach asynchronous we have actually broken some cases where devices are re-attached immediately after and fail since they were not detached yet. We'll need to make detach synchronous again, and for that we'll wait for the background dpdk thread to perform all necessary actions before we return from spdk_pci_device_detach(). However, device detach could be triggered from the very same dpdk background thread as well. Waiting there would cause a deadlock, so now we'll schedule asynchronous device detach to the dpdk thread only if we're not on that thread already. This patch itself serves also as an optimization. Change-Id: I86b7ac1b669169eee3325de2d28c2cc313e7d901 Signed-off-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/460285 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>	2019-07-16 10:56:28 +00:00
Darek Stojaczyk	cf35beccf4	env_dpdk/memory: include rte_memory.h Latest DPDK moved some definitions around and we don't compile with it right now. Adding the missing include fixes it. Change-Id: I9b0a915632996acfedbcf3d0f03feed986889a2d Signed-off-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/460905 Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Paul Luse <paul.e.luse@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2019-07-16 10:54:01 +00:00
Darek Stojaczyk	4617707d07	reduce: switch to spdk_malloc() spdk_dma_malloc() is about to be deprecated. Change-Id: I140e10b2fd07efb48e664cfa00e1d60f604abd21 Signed-off-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/449797 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>	2019-07-16 10:51:47 +00:00
Jacek Kalwas	114a067738	nvmf/rdma: pd null check In case of pd allocation by nvmf hooks there is a lack of null check as opposed to pd allocation by ibv_alloc_pd. Signed-off-by: Jacek Kalwas <jacek.kalwas@intel.com> Change-Id: Iead6e0332bdee3da4adb6e657af298215c4e2196 Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/461576 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com>	2019-07-16 01:29:03 +00:00
Seth Howell	dc53a9de36	lib/ftl: remove local phase variable Since the local is only used in the SPDK_DEBUGLOG call, it was causing the build to fail when the configure options --disable-debug and --enable-werror were supplied together. This can be seen in the most recent nightly builds. Change-Id: I32112cf832a705292783da4e841badaeed17dbb6 Signed-off-by: Seth Howell <seth.howell@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/461746 Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2019-07-16 01:21:55 +00:00
Darek Stojaczyk	69642141bb	blobstore: fix unused variable warning on non-debug builds gcc complains: blobstore.c: In function ‘_spdk_blob_load_cpl’: blobstore.c:978:12: warning: unused variable ‘max_md_lba’ [-Wunused-variable] Change-Id: If2875d2d83edce6d1b544d6a4f51e78fa760d752 Signed-off-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/461750 Reviewed-by: Seth Howell <seth.howell5141@gmail.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2019-07-16 01:20:30 +00:00
James Bergsten	5acf617c6e	nvme: add functions to pretty-print commands and completions This change attempts to address the Trello request to decode I/O errors in NVMe hello_world example. See https://trello.com/c/MzJJw7hM/2-decode-io-errors-in-nvme-helloworld-example As part of this change, spdk_nvme_cpl_get_status_string was declared in nvme.h, and spdk_nvme_qpair_print_command and spdk_nvme_qpair_print_completion were renamed and added to nvme.h, allowing all three to be used "externally." To test the failing paths, two compile time defines were added to force a write or read error (bad LBA) respectively. As the example does a read after write, if the write fails, the example fails. Signed-off-by: James Bergsten <jamesx.bergsten@intel.com> Change-Id: Ib94b4a02495eb40966e3f49517a5bdf64485538a Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/457076 Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2019-07-15 07:47:03 +00:00
Richael Zhuang	d4cbbf1751	nvme: use atomic builtins for g_signal_lock The __sync builtin based implementation generates full memory barriers on some non-x86 platforms. Replacing it with C11 atomic builtins gives: ·arm and ppc go from a full barrier to a half barrier ·x86 code stays the same as before Signed-off-by: Richael Zhuang <richael.zhuang@arm.com> Change-Id: Ib6624ef8e45af497b9eced6ecfa7710bcc88a733 Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/461590 Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2019-07-15 06:01:37 +00:00
Tomasz Zawadzki	672d42b284	lib/blob: fix check against lba during blob load md_start and md_len are values in pages rather than lba. Those should not be compared against lba of currently loaded md page. This patch changes assert to verify if the lba of current page does not exceed max lba where md is expected to be. Signed-off-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Change-Id: Id445eb9871f82f7fe367bfc396f1b495591511c1 Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/460976 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com>	2019-07-15 04:22:23 +00:00
Tomasz Zawadzki	6ced601526	lib/blob: only validate blobid of first page during bs_load Blob id only is matched to the very first page of md for that particular blob. During loading blobstore, we shouldn't verify further pages in chain against the blobid. Signed-off-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Change-Id: Ifc7863ddcb403aedc264c14e6b4c3915bd30dc41 Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/460607 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com> Reviewed-by: Paul Luse <paul.e.luse@intel.com>	2019-07-15 04:22:23 +00:00
Evgeniy Kochetov	9d5037275d	nvmf: Add BDEV IO pending statistics This patch adds statistics for BDEV IO pending state in NVMf subsystem which may help to detect lack of resources and configure pool size correctly. Signed-off-by: Evgeniy Kochetov <evgeniik@mellanox.com> Change-Id: I6c60c27efe3efed194b2d2c46a707af7c2808fe9 Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/445290 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com>	2019-07-12 12:46:29 +00:00
Evgeniy Kochetov	da999b69b8	nvmf: Add queue pair counts statistics This patch adds number of admin and IO queue pairs per poll group in NVMf statistics. It can be useful to troubleshoot load sharing issues. Signed-off-by: Evgeniy Kochetov <evgeniik@mellanox.com> Change-Id: I2a9c0fc99cf5d0729eb130d30540ae52b5207fc9 Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/445288 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com>	2019-07-12 12:46:29 +00:00
Evgeniy Kochetov	fca6ff8f75	rpc: Add nvmf_get_stats RPC method This patch adds nvmf_get_stats RPC method and basic infrastructure to report NVMf global and per poll group statistics in JSON format. Signed-off-by: Evgeniy Kochetov <evgeniik@mellanox.com> Change-Id: I13b83e28b75a02bc1dcb7b95cbce52ae10ff0f7b Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/452298 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com>	2019-07-12 12:46:29 +00:00
Konrad Sztyber	d354d0a342	lib/ftl: scrub non-volatile cache after recovery If the data from non-volatile cache was recovered, but the state of the cache isn't clean (i.e. no range overlap, two different phases at max), scrub it, so that subsequent recovery can be performed successfully. Change-Id: Ic8b5cbb6e02444bc99d4700bfe3dfbb33f06ef24 Signed-off-by: Konrad Sztyber <konrad.sztyber@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/459622 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com> Reviewed-by: Mateusz Kozlowski <mateusz.kozlowski@intel.com>	2019-07-12 12:39:38 +00:00
Konrad Sztyber	45372c5768	lib/ftl: separate non-volatile scrub function The cache needs to be scrubbed during the initial device creation as well as after power loss recovery. This patch extracts the scrubbing code into a separate function. Change-Id: I2cb32e6993a3531470f29f466d990f0d96e45def Signed-off-by: Konrad Sztyber <konrad.sztyber@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/459621 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com> Reviewed-by: Wojciech Malikowski <wojciech.malikowski@intel.com> Reviewed-by: Mateusz Kozlowski <mateusz.kozlowski@intel.com>	2019-07-12 12:39:38 +00:00
Konrad Sztyber	8d1bb260ea	lib/ftl: separate non-volatile header write function The header is being written from multiple places, so having a discrete function serializing and writing it at the appropriate place in the cache makes sense. Change-Id: I7a1e6ebd05e8a4974d141f04202803f507b978e4 Signed-off-by: Konrad Sztyber <konrad.sztyber@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/459620 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com> Reviewed-by: Wojciech Malikowski <wojciech.malikowski@intel.com> Reviewed-by: Mateusz Kozlowski <mateusz.kozlowski@intel.com>	2019-07-12 12:39:38 +00:00
Konrad Sztyber	78154d9558	lib/ftl: allow flushing active bands This patch adds a function that marks all active open bands to be flushed and once all of them are closed it notifies the caller. This needs to be done before the data from non-volatile cache can be scrubbed to make sure it's stored on bands the device can be restored from. Change-Id: I9658554ffce90c45dabe31f294879dc17ec670b9 Signed-off-by: Konrad Sztyber <konrad.sztyber@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/459619 Reviewed-by: Wojciech Malikowski <wojciech.malikowski@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2019-07-12 12:39:38 +00:00
Konrad Sztyber	78097953f7	lib/ftl: notify init/fini callbacks on proper threads This patch makes sure we're on the thread that requested creation / deletion of the device when calling the notification callback. Change-Id: Ia11a8054692874f6b57d4ebe3e3cb290c58e83b6 Signed-off-by: Konrad Sztyber <konrad.sztyber@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/459618 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Mateusz Kozlowski <mateusz.kozlowski@intel.com> Reviewed-by: Wojciech Malikowski <wojciech.malikowski@intel.com> Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com>	2019-07-12 12:39:38 +00:00
Konrad Sztyber	0f7080e779	lib/ftl: helper function to check for nv_cache Added ftl_dev_has_nv_cache to check if the FTL is configured to use non-volatile cache or not. It makes these checks a bit more readable. Change-Id: I0140df184d89a675e40bd5056718cd64301c553e Signed-off-by: Konrad Sztyber <konrad.sztyber@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/459617 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Wojciech Malikowski <wojciech.malikowski@intel.com> Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com> Reviewed-by: Mateusz Kozlowski <mateusz.kozlowski@intel.com>	2019-07-12 12:39:38 +00:00
Konrad Sztyber	17da389ecc	lib/ftl: delay writing band's metadata Wait until all user writes are completed before writing band's metadata. Otherwise in case of power loss, user data might not get written while the metadata does, which would result in data loss. Change-Id: I419862960c072e38265b91d0d0498ff0c6f9f29e Signed-off-by: Konrad Sztyber <konrad.sztyber@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/459615 Reviewed-by: Wojciech Malikowski <wojciech.malikowski@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Mateusz Kozlowski <mateusz.kozlowski@intel.com> Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2019-07-12 12:39:38 +00:00
Konrad Sztyber	4d7c81625c	lib/ftl: non-volatile cache data recovery Use the data placed on the non-volatile cache to perform recovery in case the device wasn't shut down cleanly. The write phase ranges are read and their data is copied onto the OC device. The code added in this patch will correctly copy the data from overlapping ranges, however it won't do anything about these overlapping areas, so subsequent power loss happening quickly after recovery might result in data loss. Change-Id: Ib4c66092cee858496ec66f789fcfb1e7e32f5c20 Signed-off-by: Konrad Sztyber <konrad.sztyber@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/458105 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Mateusz Kozlowski <mateusz.kozlowski@intel.com> Reviewed-by: Wojciech Malikowski <wojciech.malikowski@intel.com> Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com>	2019-07-12 12:39:38 +00:00
Konrad Sztyber	81e3797452	lib/ftl: distinct non-volatile cache recovery phase Change-Id: I6936905d4a031508a85729e61ac72a352a490e14 Signed-off-by: Konrad Sztyber <konrad.sztyber@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/458104 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Mateusz Kozlowski <mateusz.kozlowski@intel.com> Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com>	2019-07-12 12:39:38 +00:00
Konrad Sztyber	6db41a006e	lib/ftl: non-volatile cache recovery scan Scan the cache to find ranges of blocks written with the same phase. This prepares the structures needed to perform data recovery from the non-volatile cache. Change-Id: I0c901d010d6ca76feabca13116d831c1d9931833 Signed-off-by: Konrad Sztyber <konrad.sztyber@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/458103 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com> Reviewed-by: Wojciech Malikowski <wojciech.malikowski@intel.com>	2019-07-12 12:39:38 +00:00
Konrad Sztyber	773b7003bc	lib/ftl: add comments to ftl_restore's fields The structures in this module had no comments, so it was a bit hard to understand what they're used for. Change-Id: I439c8a792f02b929006c60933e6b272751b1a675 Signed-off-by: Konrad Sztyber <konrad.sztyber@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/458102 Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Mateusz Kozlowski <mateusz.kozlowski@intel.com> Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2019-07-12 12:39:38 +00:00
Konrad Sztyber	0f0af48009	lib/ftl: keep reloc traffic out of non-volatile cache Moving data from one band to the other doesn't need to be stored on the non-volatile cache. Not only does it add unnecessary traffic to the cache (wearing it out and reducing its throughput), but it requires us to synchronize it with user writes to the same LBAs. To avoid all that, this patch adds the FTL_IO_BYPASS_CACHE flag to all writes coming from the reloc module. However, to be sure that the moved data is stored on disk and can be restored in case of power loss, we need to make sure that each free band has all of its data moved to a closed band before it can be erased. It's done by keeping track of the number of outstanding IOs moving data from a particular band (num_reloc_blocks), as well as the number of open bands that contain data from this band (num_reloc_bands). Only when both of these are at zero and the band has zero valid blocks can it be erased. Change-Id: I7c106011ffc9685eb8e5ff497919237a305e4478 Signed-off-by: Konrad Sztyber <konrad.sztyber@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/458101 Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Mateusz Kozlowski <mateusz.kozlowski@intel.com> Reviewed-by: Wojciech Malikowski <wojciech.malikowski@intel.com> Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2019-07-12 12:39:38 +00:00
Konrad Sztyber	4d113ee5d3	lib/ftl: allow writes bypassing non-volatile cache Some of the writes don't need to go through the non-volatile cache (e.g. relocations, data recovery from the cache). This patch adds an IO flag to indicate that the write shouldn't be stored on the non-volatile cache. Change-Id: I3d485fe14cf25b3074832f26491ba0cb12ff0e58 Signed-off-by: Konrad Sztyber <konrad.sztyber@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/458100 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Mateusz Kozlowski <mateusz.kozlowski@intel.com> Reviewed-by: Wojciech Malikowski <wojciech.malikowski@intel.com>	2019-07-12 12:39:38 +00:00
Konrad Sztyber	9a42d7fc30	lib/ftl: initialize LBA when allocating internal IOs Initialize children IOs with the appropriate LBA of its parent when allocating internal IOs. Change-Id: I191ad741b9d88d7f18cae05982e0a06a8f371f78 Signed-off-by: Konrad Sztyber <konrad.sztyber@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/458099 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Paul Luse <paul.e.luse@intel.com> Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Mateusz Kozlowski <mateusz.kozlowski@intel.com> Reviewed-by: Wojciech Malikowski <wojciech.malikowski@intel.com>	2019-07-12 12:39:38 +00:00
Konrad Sztyber	4fd4e3db5f	lib/ftl: track non-volatile cache's write sequence This patch adds tracking of the phase of the writes to the non-volatile cache. The phase is changed each time the whole buffer is filled. Along with every block's LBA, current phase is stored in its metadata. This allows for replaying the sequence of writes when recovering the data from the cache after (unclean) shutdown. Since there are only three possible phases to be stored on the device at a time, phase is defined as a 2-bit counter cycling through 1 -> 2 -> 3 -> 1, with 0 marking blocks that were never written. Change-Id: Id47880367934027fd102c32f183110acc9d4c62a Signed-off-by: Konrad Sztyber <konrad.sztyber@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/458098 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Mateusz Kozlowski <mateusz.kozlowski@intel.com>	2019-07-12 12:39:38 +00:00
Konrad Sztyber	c69529d452	lib/ftl: block nv_cache until header is written After filling whole non-volatile cache, block all further writes until the header with metadata is written. This means that metadata stored on the device will always be up-to-date with the most recent write sequence. Change-Id: I15b724b52814289622374ce77e5c3b23173a75c6 Signed-off-by: Konrad Sztyber <konrad.sztyber@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/458097 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Wojciech Malikowski <wojciech.malikowski@intel.com>	2019-07-12 12:39:38 +00:00
Konrad Sztyber	2c96745563	lib/ftl: check non-volatile cache's DIF type Check the type of DIF used by the bdev specified as the non-volatile write cache. If it's anything other than SPDK_DIF_DISABLE, fail the initialization, as we don't support any other type yet. Change-Id: Ie8bc1729558e055989d7925bc55f6307ee738f0e Signed-off-by: Konrad Sztyber <konrad.sztyber@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/458096 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Mateusz Kozlowski <mateusz.kozlowski@intel.com> Reviewed-by: Wojciech Malikowski <wojciech.malikowski@intel.com>	2019-07-12 12:39:38 +00:00
Konrad Sztyber	77ddc70e1c	lib/ftl: restore non-volatile cache's metadata When restoring the device, read the first block of the non-volatile cache containing its metadata header and verify that it's indeed a device that was used as write cache. Change-Id: Idf113a9e8eb73160a2d9e6e882c9e026d3fafb3e Signed-off-by: Konrad Sztyber <konrad.sztyber@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/458095 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Mateusz Kozlowski <mateusz.kozlowski@intel.com> Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>	2019-07-12 12:39:38 +00:00
Konrad Sztyber	1243c9306d	lib/ftl: prepare non-volatile cache area When creating FTL device using non-volatile cache, zero out the non-volatile cache and store metadata (device's UUID, size of the cache) in the first block. Change-Id: Id8f212aef756e86e8a215582ab7c32a635e18938 Signed-off-by: Konrad Sztyber <konrad.sztyber@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/458094 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Mateusz Kozlowski <mateusz.kozlowski@intel.com>	2019-07-12 12:39:38 +00:00
Vitaliy Mysak	6b654ab900	bdev: prevent early spdk_bdev_init_complete() In case some module has `async_init = true` and some other module that comes after it fails to initialize, then callback from asynchronously initialized module may call `spdk_bdev_init_complete()` first, then failed module will call `spdk_bdev_init_complete()` later. This currently results in NULL dereference because first call to `spdk_bdev_init_complete()` sets `g_init_cb_fn = NULL`. This change prevents first call to `spdk_bdev_init_complete()` by saying that failed module is not finished with initialization. This patch fixes #847 Change-Id: Ib6b231d5ea27896ad88d7f11b8732921077b3d4d Signed-off-by: Vitaliy Mysak <vitaliy.mysak@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/461230 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com>	2019-07-12 04:14:58 +00:00
paul luse	0b3fb2403e	lib/reduce: fix bug with adding up req->decomp_iovcnt In the memcpy elimination patches, the same bug exists in 3 places. When building req->decomp_iov using the host buffers, req->decomp_iovcnt was being incremented in the loop and also being used as part of the index messing everything up. Signed-off-by: paul luse <paul.e.luse@intel.com> Change-Id: I485ac32502801c1e11b8392b2df7eba06b4f5a9b Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/461053 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com>	2019-07-12 04:05:52 +00:00
paul luse	b9bc6254a8	lib/reduce: fix critical issue with reduce optimization The first optimization to eliminate memcpy was too aggressive and did so for the read-modify-write operation as well. This didn't affect the fio tests used at the time but bdevio catches it right away. When overwriting a chunk with data, we first need to read the old data before applying the new. This patch uses the scratch buffer for old data as sending it to the user buffer results in it not being written at the end of the read-modify-write. There is at least one more bug fix coming after this, also found with bdevio but passed with fio. Signed-off-by: paul luse <paul.e.luse@intel.com> Change-Id: I8fe074056434bb4757c68077e2df446861edfd94 Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/461032 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>	2019-07-12 04:05:52 +00:00
Shuhei Matsumoto	b860c8dbce	bdev/gpt: call get buffer function before forwarding read I/O to the base bdev iSCSI target does not allocate data buffer on read, and delegates allocation to the bdev. When the bdev is a split vbdev, the split vbdev does not allocate data buffer and delegates allocation to the backend bdev. In this case, iSCSI target expects the buffer is allocated until notifying completion to the split vbdev. However, the split vbdev notifies completion to the backend bdev when calling the callback of iSCSI target. The backend bdev frees the buffer immediately, but iSCSI target still uses the buffer. If the buffer is reused by another I/O, data corruption will occur. For this issue, vbdev_gpt_submit_request() calls spdk_bdev_io_get_buf() when the I/O is read, and its callback vbdev_gpt_get_buf_cb calls _vbdev_gpt_submit_request() then. This will ensure the buffer is allocated before forwarding I/O to the backend bdev. Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: Ifb2eac500276ab5012123b7d6f7eb033d87ad17c Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/461350 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com>	2019-07-12 02:25:57 +00:00
Shuhei Matsumoto	25532d08f8	bdev/split: call get buffer function before forwarding read I/O to the base bdev iSCSI target does not allocate data buffer on read, and delegates allocation to the bdev. When the bdev is a split vbdev, the split vbdev does not allocate data buffer and delegates allocation to the backend bdev. In this case, iSCSI target expects the buffer is allocated until notifying completion to the split vbdev. However, the split vbdev notifies completion to the backend bdev when calling the callback of iSCSI target. The backend bdev frees the buffer immediately, but iSCSI target still uses the buffer. If the buffer is reused by another I/O, data corruption will occur. For this issue, vbdev_split_submit_request() calls spdk_bdev_io_get_buf() when the I/O is read, and its callback vbdev_split_get_buf_cb calls _vbdev_split_submit_request() then. This will ensure the buffer is allocated before forwarding I/O to the backend bdev. Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: Icfd0663b548479ac0bf6b5b49420f144142e3300 Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/461348 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com>	2019-07-12 02:25:57 +00:00
Ben Walker	88da8a91f9	nvmf: spdk_nvmf_subsystem_remove_ns is no longer asynchronous Now that the resume path can correctly handle the case where a namespace was removed and a new one added with the same nsid, this no longer needs to be asynchronous. Change-Id: I693045e66a7d4e75255b526d8f5ca5ef8695533e Signed-off-by: Ben Walker <benjamin.walker@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/459606 Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Seth Howell <seth.howell5141@gmail.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2019-07-11 11:19:53 +00:00
Shuhei Matsumoto	316d5c7c79	bdev/part: Remap DIF reference tag for read/write I/O When using stacked virtual bdev (e.g. split virtual bdev), block address space will be remapped during I/O processing and so reference tags have to be remapped accordingly. This patch adds a new helper function spdk_bdev_part_remap_dif and calls it before submitting write I/O or after completing read I/O. Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: Idfc6081893861d412c19a9edfb348a7faa7e8c5b Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/461106 Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2019-07-11 11:14:22 +00:00
Shuhei Matsumoto	5a31186745	bdev/part: Consolidate getting remapped offset in spdk_bdev_part_submit_request All IO types but reset have used the remapped offset to submit I/O to the base bdev. Previously each IO type had got the remapped offset by itself. Consolidating it into a place will improve readability and will be helpful for the next patch. Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: I29465e92d8fb62e45cfc97c52fedaa661b2f0602 Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/461105 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com>	2019-07-11 11:14:22 +00:00
Shuhei Matsumoto	7e70c3d18f	dif: Add spdk_dix_remap_ref_tag to remap ref. tag for separate metadata payload When using stacked virtual bdev (e.g. split virtual bdev), block address space will be remapped during I/O processing and so reference tags will have to be remapped accordingly. This patch adds an API, spdk_dix_remap_ref_tag, to satisfy the case. UT code is added together in this patch. Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: I55cc45c475d4e86e736f5712baf02fcabfde3c82 Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/461104 Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2019-07-11 11:14:22 +00:00
Shuhei Matsumoto	f4a62a3993	dif: Add spdk_dif_remap_ref_tag to remap ref. tag for extended LBA payload When using stacked virtual bdev (e.g. split virtual bdev), block address space will be remapped during I/O processing and so reference tags will have to be remapped accordingly. The use case is explained in detail as follows: - Format a single NVMe SSD with DIF enabled. - Create a NVMe bdev on the NVMe SSD with DIF enabled. - Create four split vbdevs on the NVMe bdev. - Add the split vbdevs to a NVMe-oF target. - Application is aware of block address space of the split vbdevs. - Application submits read/write I/O to the NVMe-oF target. Case 1: - Configure NVMe-oF target to DIF pass-through. Case 2: - Configure NVMe-oF target to DIF insert/strip. For case 1, - Application inserts DIF for write I/O and verifies DIF for read I/O. - The split vbdevs remap reference tags of DIF both for read and write I/O because application expects reference tags are based on the block address space of split vbdevs. - The NVMe bdev processes read/write I/Os without remapping reference tags because reference tags are already based on the block address space of the NVMe bdev. For case 2, - NVMe-oF target inserts DIF for write I/O, and verifies and strips DIF for read I/O. - The split vbdevs remap reference tags of DIF both for read and write I/O because NVMe-oF target expects reference tags are based on the block address space of split vbdevs. - The NVMe bdev processes read/write I/Os without remapping reference tags because reference tags are already based on the block address space of the NVMe bdev. This patch adds two APIs, spdk_dif_ctx_set_remapped_init_ref_tag and spdk_dif_remap_ref_tag to satisfy the use case. UT code is added together in this patch. Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: Ib3101129225b334d2f578eab75197790b1818770 Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/461103 Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2019-07-11 11:14:22 +00:00
Maciej Szwed	1b4c99a2ba	bdev: Introduce new bdev mutex for accessing bdevs list In a future patch, the new spdk_bdev_open_ext function will call spdk_bdev_get_by_name, and the bdev can be removed after that call and before the old spdk_bdev_open routine is called. We need to add a mutex which will prevent that. Any future code should use this mutex when accessing the bdevs list to get a bdev and perform some operation on it. Signed-off-by: Maciej Szwed <maciej.szwed@intel.com> Change-Id: I785a1791346aebdd394fc51ad0e7fbfbabf317c9 Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/458457 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Paul Luse <paul.e.luse@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com>	2019-07-11 10:22:06 +00:00
Tomasz Zawadzki	69a8877e82	lib/blob: do not allow xattr to exceed maximum descriptor length Length of xattr descriptor is equal to length of xattr struct, xattr name and the length of the stored value. There is no limit to how much can be stored in memory for xattr. On disk, xattr size is limited to a single page and within that to the max descriptors that can fit in it. This size is known at compile time. Before this patch it was possible to add an xattr exceeding what could be written to disk. This caused issues when serializing the metadata during spdk_blob_sync_md() or spdk_blob_close(): those calls failed without specific info to the user and did not actually write such a descriptor. Since the maximum length of an xattr descriptor is known at compile time, this patch compares against this value when setting the xattr. It will immediately report back to the user with an error, and will not store the xattr in memory (thus not serialize it). This patch should not affect any backward compatibility for blobs. Too large xattrs weren't written to disk before, API for blobstore stays the same - only reporting ENOMEM when it should. Signed-off-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Change-Id: I6f4af4d079e47f084e20d7a4969d9a78ec1f8610 Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/460450 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Maciej Szwed <maciej.szwed@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com>	2019-07-11 10:05:41 +00:00
Shuhei Matsumoto	7ee58b90e1	nvmf/tcp: Set DIF context to PDU when processing in-capsule, C2H, or H2C data Set DIF context of the corresponding request to PDU when - processing in-capsule data of the command, - processing data of C2H PDU, or - processing data of H2C PDU. Change-Id: I3a668a55be21dbe2ee6ecf26476290670bd7b4a8 Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/458929 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Ziye Yang <ziye.yang@intel.com>	2019-07-11 05:30:28 +00:00
Shuhei Matsumoto	e3e023cfd3	nvmf/tcp: Increase in-capsule buffer size to fill DIF fields When NVMe/TCP initiator transfers in-capsule data, NVMe/TCP has to process it as in-capsule data. If DIF insert/strip is enabled, in-capsule data size will be increased by NVMe/TCP target to insert metadata. However size of in-capsule data buffer had not been increased, and buffer overflow occurred when NVMe/TCP initiator transfers in-capsule data to NVMe/TCP target with DIF insert/strip being enabled. This patch increases size of in-capsule data buffer size to store metadata. 16 byte metadata per 512 byte data block is the current maximum ratio of metadata per block. Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: I88b127efd7a945bde167a95df19a0b9175cb8cd0 Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/461333 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com>	2019-07-11 05:30:28 +00:00
Shuhei Matsumoto	9d4ee5f344	nvmf/tcp: Fix wrong data offset in nvmf_tcp_pdu_payload_insert_dif We updated readv_offset before generating DIF to avoid adding the temporary variable _rc in the previous patch, but that caused write error when inserting DIF. Fix the bug in this patch. Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: Id0788280a83cbea2554c851db77751432fc00cba Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/461116 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com>	2019-07-11 05:30:28 +00:00
Shuhei Matsumoto	2c9b0af271	nvmf/tcp: Get DIF context when handling capsule command header When handling the capsule command header, call spdk_nvmf_request_get_dif_ctx by passing the NVMf request and the reference to the DIF context, and set the flag dif_insert_or_strip of the NVMf/TCP request to true. spdk_nvmf_request_get_dif_ctx returns false immediately when the corresponding NVMf controller disables DIF insert/strip. Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: I16f6b322f2692d5f9653d011a490e7929ec37365 Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/458928 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com>	2019-07-11 05:30:28 +00:00
Shuhei Matsumoto	1c7f92f075	nvmf: Hide DIF setting of the backend bdev if DIF insert/strip is enabled When the NVMf controller's flag dif_insert_or_strip is enabled, DIF is inserted for write I/O and stripped for read I/O, and the corresponding NVMe-oF initiator should not be aware of the DIF setting of the backend bdev. Hence this patch hides the DIF setting of the backend bdev when the flag dif_insert_or_strip is enabled. Change-Id: I3c14880c2e94cba7f76b1bca78afb36bfe884e26 Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/456731 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com>	2019-07-11 05:30:28 +00:00
Shuhei Matsumoto	4ff3665ce9	nvmf: Check DIF insert/strip setting of NVMf controller when getting DIF context The first idea was that the caller of spdk_nvmf_request_get_dif_ctx() should check if the current transport enables DIF insert/strip before calling spdk_nvmf_request_get_dif_ctx(). But after the previous patch, the NVMf controller now knows if DIF insert/strip is enabled. Hence spdk_nvmf_request_get_dif_ctx() checks if the NVMf controller enables DIF insert/strip at its head. Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: I78253d356b694800c3a9a9608514df58e0c631a6 Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/461314 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com>	2019-07-11 05:30:28 +00:00
Shuhei Matsumoto	91da9aaafe	nvmf: Add a flag dif_insert_or_strip to struct spdk_nvmf_ctrlr Add a flag dif_insert_or_strip to struct spdk_nvmf_ctrlr that indicates whether DIF insert/strip is done. Copy the DIF insert/strip setting of the corresponding transport options to the flag at NVMf controller creation. The purpose of this patch is to make DIF insert/strip not a per-transport option but a per-controller option because we may want to be able to control DIF insert/strip per controller at some point. This patch will also clean up the implementation and align the indentation around the change. Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: I57f65960b430e55f4021ed514aacd85581ff9993 Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/461313 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com>	2019-07-11 05:30:28 +00:00
Karol Latecki	a4b0a2b6fd	bdev/crypto: add more descriptive rpc error messages Improve error messages where possible Change-Id: I2c75cea66dbd635d89e7f27aef59a38c5533b349 Signed-off-by: Karol Latecki <karol.latecki@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/460966 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Paul Luse <paul.e.luse@intel.com> Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Vitaliy Mysak <vitaliy.mysak@intel.com>	2019-07-10 08:29:21 +00:00
Karol Latecki	d9580c759e	aio/rpc: Add more descriptive error messages for aio bdevs Improve error messages where possible Change-Id: I104998a666789c4e724d153c2cd14ee05c71b699 Signed-off-by: Karol Latecki <karol.latecki@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/460157 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Paul Luse <paul.e.luse@intel.com> Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Vitaliy Mysak <vitaliy.mysak@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com>	2019-07-10 08:29:21 +00:00
Karol Latecki	c4a1c90a4c	aio/rpc: make filename an obligatory argument Filename should not be an optional argument. Making it obligatory removes the need for further checks as it should then be checked in json decode. Change-Id: Ia779c2623db8d5cdde3983507e3b2b3cfb7e971f Signed-off-by: Karol Latecki <karol.latecki@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/460958 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Paul Luse <paul.e.luse@intel.com> Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Vitaliy Mysak <vitaliy.mysak@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com>	2019-07-10 08:29:21 +00:00
Karol Latecki	37c04b7be8	lib/bdev: do not allow bdev name to be an empty string It looks like currently we only check bdev names for NULL, but not for "empty" string. For example this rpc command: sudo scripts/rpc.py construct_aio_bdev aio_disk "" 512 Will result in construction of AIO bdev with empty name: sudo scripts/rpc.py get_bdevs [...] "name": "", "aliases": [], [...] Change-Id: I41204096c8cf210a4dc40a8225d1c9dad353f533 Signed-off-by: Karol Latecki <karol.latecki@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/460150 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Paul Luse <paul.e.luse@intel.com> Reviewed-by: Vitaliy Mysak <vitaliy.mysak@intel.com> Reviewed-by: Maciej Szwed <maciej.szwed@intel.com>	2019-07-10 08:29:21 +00:00
Karol Latecki	f155fedcdb	null/rpc: Add more descriptive error messages for null bdevs Improve error messages where possible. Change-Id: I9d1e4dee106712ecd7a40cfd1eeaf74ccf6d0d1d Signed-off-by: Karol Latecki <karol.latecki@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/460121 Reviewed-by: Vitaliy Mysak <vitaliy.mysak@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Reviewed-by: Paul Luse <paul.e.luse@intel.com> Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2019-07-10 08:29:21 +00:00
Karol Latecki	5e28673bc5	bdev/rpc: Add descriptive error messages for malloc bdevs Provide more error codes and/or messages than just a generic "32602 invalid parameters" error. Change-Id: I1777f454faef336b10af24dda50a2d5b5e73727f Signed-off-by: Karol Latecki <karol.latecki@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/459948 Reviewed-by: Vitaliy Mysak <vitaliy.mysak@intel.com> Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2019-07-10 08:29:21 +00:00
Vitaliy Mysak	772db556af	lib/util: fix spdk_strerror() empty string return If __USE_GNU is set, spdk_strerror() returns an empty string instead of "Unknown error %d" if an unknown error code is provided. The reason is that on unknown errors, `strerror_r()` will return the provided buffer (in our case, `buf` is returned), then `snprintf()` will write to `buf` having `buf` as input argument because `new_buffer` == `buf`, which results in an empty string. This patch fixes the above issue by first checking if `buf` == `new_buffer`. Change-Id: I838ebf47d115b58cee3145991243bc9ebaeb651d Signed-off-by: Vitaliy Mysak <vitaliy.mysak@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/460825 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Paul Luse <paul.e.luse@intel.com> Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com>	2019-07-10 08:29:21 +00:00
Evgeniy Kochetov	7535cdbd62	rpc: Add thread_get_stats RPC method SPDK threads collect busy and idle time statistics. This commit adds thread_get_stats RPC method to retrieve these values. Signed-off-by: Evgeniy Kochetov <evgeniik@mellanox.com> Change-Id: I8ed8041c6164eb0c0a9336f4e50b5f26a3f20190 Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/445285 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Seth Howell <seth.howell5141@gmail.com> Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Paul Luse <paul.e.luse@intel.com>	2019-07-10 04:28:14 +00:00
Ziye Yang	750a4213ef	nvmf: add spdk_nvmf_get_optimal_poll_group This patch does the following work: 1. It is optimized for NVMe/TCP transport. If the qpairs' sockets have the same NAPI_ID, then the qpairs will be handled by the same polling group. 2. We add a new connection scheduling strategy, named ConnectionScheduler in the configuration file. It will be used to select a different scheduler according to the customers' input. Signed-off-by: Ziye Yang <ziye.yang@intel.com> Change-Id: Ifc9246eece0da69bdd39fd63bfdefff18be64132 Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/454550 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com>	2019-07-10 02:30:41 +00:00
Ziye Yang	960460f0d1	nvmf: add spdk_nvmf_transport_get_optimal_poll_group Add the optimal poll group get function. Signed-off-by: Ziye Yang <ziye.yang@intel.com> Change-Id: Ia9e57c6924a6563d79269cf535814883e83698cd Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/454549 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com>	2019-07-10 02:30:41 +00:00
Ben Walker	09ef0593d4	nvmf: Leverage bdev uuid to correctly detect remove+add ns while paused Change-Id: Idbf00956394f7ee7ff7e27f2627785cd7146b01f Signed-off-by: Ben Walker <benjamin.walker@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/459605 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Paul Luse <paul.e.luse@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Seth Howell <seth.howell5141@gmail.com>	2019-07-10 01:59:05 +00:00
Ben Walker	85e9760161	nvmf: Capture ns_info onto stack in poll_group_update_subsystem By capturing this pointer onto the stack, we inform the compiler that we don't expect it to change. That allows the compiler to generate more efficient code. Change-Id: I0f3ff9373662198e915269c4498e4902a2cdb808 Signed-off-by: Ben Walker <benjamin.walker@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/459754 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Paul Luse <paul.e.luse@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Seth Howell <seth.howell5141@gmail.com>	2019-07-10 01:59:05 +00:00
Ben Walker	ab3abc15aa	nvmf: Capture channel variable to stack when updating poll groups This signals to the compiler and analysis programs that this won't change during iteration, so it may produce better code. Change-Id: I478c0c9445d4ddf8a69ab1b3deaf628b82a0eaea Signed-off-by: Ben Walker <benjamin.walker@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/459753 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Paul Luse <paul.e.luse@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Seth Howell <seth.howell5141@gmail.com>	2019-07-10 01:59:05 +00:00
Ben Walker	75b4f332f4	bdev: All bdevs now have a UUID. For devices that don't have a UUID, the UUID is generated at registration time. That means that some devices will not have the same UUID from run to run, but this seems no worse than having no UUID at all. Change-Id: Icf6b8517ffcffabafa2b73176dc03d896d0017fe Signed-off-by: Ben Walker <benjamin.walker@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/459604 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>	2019-07-10 01:59:05 +00:00
Changpeng Liu	5317a9f795	rpc/nvmf: add RPC support to add the persistent configuration file for one NS Change-Id: Ic4963d3e55cffceca35d18ba8d406658e51a189a Signed-off-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/455913 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>	2019-07-10 01:40:26 +00:00
Changpeng Liu	7b74274fbf	nvmf: add parameter check when loading reservation information from a JSON file Change-Id: Id217212fd82e57a4cfb32f62f11798c72187879e Signed-off-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/460794 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>	2019-07-10 01:40:26 +00:00
Shuhei Matsumoto	390cffb64e	rpc: Add dif_insert_or_strip parameter to nvmf_create_transport RPC Add a new optional parameter dif_insert_or_strip to nvmf_create_transport RPC. The .INI config file will be deprecated and dif_insert_or_strip is not supported in the .INI config file. Change-Id: Ibf38b599cff75eeb0056dd2125d6ec10d444f339 Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/458927 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Paul Luse <paul.e.luse@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com>	2019-07-10 00:43:02 +00:00
Shuhei Matsumoto	aa322721cb	nvmf: Add dif_insert_or_strip to transport options This is a place holder and subsequent patches will use the option dif_insert_or_strip and provide JSON RPCs to configure it. Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: I7e3fbb1d49c47647a9a0a1a2149152801591b283 Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/456452 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Paul Luse <paul.e.luse@intel.com> Reviewed-by: Ziye Yang <ziye.yang@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com>	2019-07-10 00:43:02 +00:00
Shuhei Matsumoto	ddb680ebab	nvmf: Add helper function to get DIF context from NVMf request Add a helper function to get DIF context when the passed NVMf request is for I/O queue, NVMe read, write, or compare command, and its NSID is valid. Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: I796c20607c7b64a8be85da5131c5ea95ffd9f8e4 Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/458713 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Paul Luse <paul.e.luse@intel.com>	2019-07-10 00:43:02 +00:00
Shuhei Matsumoto	9b04e29173	nvmf: Add helper function to get DIF context from bdev and NVMe cmd Add a helper function to get necessary DIF information and set them into the passed DIF context and return. This function will be called only when the specific requirement is satisfied and the caller will be added in the next patch. Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: Ic435886ca936a211f34278b813f547ffa43b9000 Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/458712 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Ziye Yang <ziye.yang@intel.com>	2019-07-10 00:43:02 +00:00
Jeffry Molanus	9bba21c969	app.c: --huge-dir has no effect `struct option` is set incorrectly for long_opt --huge-dir, causing the value to be ignored. Change-Id: I5bb84f391e1ac551b2a91c43fe8da658ae54f115 Signed-off-by: Jeffry Molanus <jeffry.molanus@gmail.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/460581 Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2019-07-09 05:33:03 +00:00
Wojciech Malikowski	fe73e3072c	lib/env: Added parent field to spdk_pci_device VMD introduces a parent/child relationship between PCI devices. The parent field allows associating an NVMe disk with a VMD device. Change-Id: Ie363dbe83fefbe05e3347888dc6bd361a235da4a Signed-off-by: Wojciech Malikowski <wojciech.malikowski@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/459637 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>	2019-07-09 04:04:16 +00:00
Shuhei Matsumoto	7bfbc388d7	nvmf/tcp: Pass extended LBA based length as I/O length to NVMf controller When DIF is inserted or stripped, - in the TCP transport layer, we can use LBA based length throughout, but - in the NVMf controller layer and BDEV layer, extended LBA based length must be used, and NVMf controller gets the length from tcp_req->req.length. Hence by adding and using two variables, elba_length and orig_length to struct spdk_nvmf_tcp_req, set the extended LBA length to tcp_req->req.length before calling spdk_nvmf_request_exec(), and then restore the original LBA based length to tcp_req->req.length after calling spdk_nvmf_tcp_req_complete(). Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: I9309b8923c6386644c4fd8ef3ee83a19f5d21ce5 Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/458926 Reviewed-by: Ziye Yang <ziye.yang@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2019-07-09 03:39:25 +00:00
Shuhei Matsumoto	51b643648c	nvmf/tcp: Increase buffer to insert/strip DIF in spdk_nvmf_tcp_req_parse_sgl If tcp_req->dif_insert_or_strip, increase the length from LBA based to extended LBA based by using its own DIF context. Change-Id: Ie9f5cf757328dda795b43a7b6c70a72259865115 Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/458925 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com> Reviewed-by: Ziye Yang <ziye.yang@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com>	2019-07-09 03:39:25 +00:00
Shuhei Matsumoto	536bd70eb4	nvmf/tcp: Use cached length variable in spdk_nvmf_tcp_req_parse_sgl The next patch will extend the length from LBA based to extended LBA based and use it as buffer length to insert or strip DIF. So cache sgl.unkeyed.length at the top of spdk_nvmf_tcp_req_parse_sgl and use it throughout. Besides, one unrelated change-the-line to improve the readability is included. Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: I2a1dc9379bb5671ec80b5b478504c9879a4f0fff Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/458924 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com>	2019-07-09 03:39:25 +00:00
Shuhei Matsumoto	975239c29d	nvmf/tcp: Insert DIF to the newly read data to create extended LBA payload Generate and insert DIF to each data block when reading more than a single byte. This update is very similar with the use case of spdk_dif_generate_stream in iSCSI target. Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: I063919a32153ac0daf6d6eb1836c0d5995b65d33 Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/459092 Reviewed-by: Ziye Yang <ziye.yang@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2019-07-09 03:39:25 +00:00
yidong0635	ff0a7dfc42	nvme: Handle CQ polling failures by marking the controller as failed. nvme_transport_qpair_process_completions calls nvme_rdma_qpair_process_completions, which in some cases returns -1 due to "CQ errors". Handle CQ polling failures by marking the controller as failed, so that a completion with an error will be treated as a controller failure. Requests will be aborted after the retry counter is exceeded. Otherwise, the code will keep on reporting errors without recovery. This is to fix issue #850. Change-Id: I0b324232310e107bf7fd5722aca54d402a19b14d Signed-off-by: yidong0635 <dongx.yi@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/460569 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com>	2019-07-09 01:43:02 +00:00
yidong0635	16fdf46600	bdev: Fix warning about scanbuild error on fedora30. In file included from bdev_ut.c:43: /root/yidong/spdk/lib/bdev/bdev.c:4373:9: warning: Access to field 'bdev' results in a dereference of a null pointer (loaded from variable 'desc') return desc->bdev; ^~~~~~~~~~ This is related to issue #822. Change-Id: I8cd2bafadeff9846169bc9ca67b3c4110e9c0da8 Signed-off-by: yidong0635 <dongx.yi@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/459529 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Seth Howell <seth.howell5141@gmail.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com>	2019-07-09 00:38:22 +00:00
Andrey Kuzmin	fa6bfa80af	Nvme: check spdk_nvme_qpair_process_completions return value. nvme_tcp_qpair_process_completions returns -1 on socket I/O error. Unless the caller checks this return value (which spdk_nvme_wait_for_completion_robust_lock currently doesn't), on connection loss or any other fatal connection error spdk_nvme_wait_for_completion will never exit the completion check loop. Change-Id: I92bb349beb071db312e6c31b84db2a7b51ec486c Signed-off-by: Andrey Kuzmin <akuzmin@jetstreamsoft.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/460657 Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2019-07-09 00:27:54 +00:00
paul luse	06f6c90626	bdev/crypto: add IO queueing for out of mem condition via bdev layer Also made on the prints a DEBUG message instead and noticed the really name that was being registered by this component so updated it to make it look like the rest of SPDK. Signed-off-by: paul luse <paul.e.luse@intel.com> Change-Id: I747a846cb365e7db49be50db941e83fb1b265ea0 Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/460244 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2019-07-08 09:24:29 +00:00
Changpeng Liu	1edc5f0040	nvmf: restore the loaded reservation information to NS Load reservation information based on ptpl configuration file, and restore the information to NS data structure. Change-Id: I5f46d49a6d1e6e49aab93ca7cd654469a3a08659 Signed-off-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/455912 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>	2019-07-08 08:21:03 +00:00
Shuhei Matsumoto	8448adaefa	nvmf/tcp: Verify DIF before sending C2H data in spdk_nvmf_tcp_send_c2h_data If DIF mode is local and C2H data is extended LBA payload, DIF should be verified just before sending the payload. Add a helper function nvmf_tcp_pdu_verify_dif and call it in spdk_nvmf_tcp_send_c2h_data after completing nvme_tcp_pdu_set_data_buf. When nvmf_tcp_pdu_verify_dif returns error, treat the error as fatal transport error because the error is caused by the target itself. Handle the fatal NVMe/TCP transport error by terminating the connection as described in the NVMe specification. On the other hand, data digest error is treated as a non-fatal transport error because the error is caused outside the target. This is reasonable. Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: I9680af2556c08f5888aeaf0a772097e4744182be Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/458921 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com>	2019-07-08 03:33:07 +00:00
Shuhei Matsumoto	457afd77b1	bdev/split: Fix orphan'ed config when removing the base bdev first When we create a base bdev and then create a split vbdev on top of the base bdev, if we delete the base bdev first, we have no way to remove the configuration of the split vbdev. Hence even if we create a base bdev again, we cannot create any split vbdev on top of the base bdev again. The meaning of the flag `removed` of `struct spdk_vbdev_split_config` is not clear, and there will be no issue even if the flag `removed` is removed. Hence remove the flag `removed` in this patch. Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: I053c95e647721004cecfe4fd8b0f1ff5bb9bf38a Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/460580 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com>	2019-07-08 03:30:39 +00:00
Darek Stojaczyk	fcbbcf4905	bdev: cleanup child iov rewind code When we run out of bdev_io's child iovs and we had to round down I/O size to nearest block size boundary, we used to decrease the existing child_iovcnt and set a new "child_iov_run_out" flag to terminate the uppermost splitting loop. We can get rid of that new flag by just not decreasing child_iovcnt when rewinding the last few iovs - it will make the uppermost loop naturally terminate using the existing checks. Change-Id: Ie40c7ce135e7fb8fe284afdf7beeebd10af85cb7 Signed-off-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/459911 Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2019-07-05 12:11:46 +00:00
Hailiang Wang	5926236661	bdev/raid: fix a warning of freed memory Compilation Warning on fedora30. In file included from bdev_raid_ut.c:38: spdk/lib/bdev/raid/bdev_raid.c:325:11: warning: Use of memory after it is freed. raid_ch->base_channel[pd_idx], ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~ This is related to issue #822. Change-Id: I6432772fb38ca02bc4f0a02a36ed3fe61b8607c7 Signed-off-by: Hailiang Wang <hailiangx.e.wang@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/460069 Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com> Reviewed-by: yidong0635 <dongx.yi@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2019-07-05 12:10:07 +00:00
paul luse	63d9d2e2b0	lib/reduce: eliminate RMW on writes with chunk_size length Signed-off-by: paul luse <paul.e.luse@intel.com> Change-Id: I6545a91e2ae4805f7bd1d92baa6dcbce0f1f8fba Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/459864 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2019-07-05 11:56:03 +00:00
paul luse	d23c36a169	lib/reduce: eliminate two more memcpy operations For callers of _reduce_vol_compress_chunk() Signed-off-by: paul luse <paul.e.luse@intel.com> Change-Id: I9ac1da8f9bcfd902fe58e4c5ffc20ce16e9bafcd Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/459863 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com>	2019-07-05 11:56:03 +00:00
paul luse	89a9a50497	lib/reduce: eliminate memcpy in read decompression path This is the first in a series of patches to eliminate memcpy ops in the comp/decomp paths. Currently the lib uses 2 scratch buffers and copies all data in and out of them to the user buffers following a comp/decomp. This patch replaces the memcpy in one of the paths by constructing an iovec array that points to a combination of the scratch buffer and user buffer so that user data decompresses directly into the user buffer and any data in the chunk that isn't needed by the user will be sent to the scratch buffer. Signed-off-by: paul luse <paul.e.luse@intel.com> Change-Id: Ib1956875729a82d218527bc81795f750d1df2b89 Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/459662 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>	2019-07-05 11:56:03 +00:00
Shuhei Matsumoto	8b539eb553	nvme: Set appropriate value to max_xfer_size and max_sge SPDK NVMe-oF initiator driver could not transfer IO whose size is more than 128KiB even if NVMe-oF target allows IO whose size is more than 128KiB both for RDMA and TCP transport. Some use cases need to transfer IO larger than 128KiB. For RDMA transport, max_mr_size by ibv_query_device of RDMA devices indicates the maximum size of a single memory region and is independent from the actual I/O size, and is very likely to be larger than 2 MiB which is the granularity we currently register memory regions. Actually some RDMA NICs return UINT64_MAX for max_mr_size by ibv_query_device. Hence use UINT32_MAX and let the generic layer use the controller data to moderate this value. On the other hand, for TCP transport, there is no limit for maximum IO size and hence use UINT32_MAX. Besides, for RDMA transport, max_sges should be the minimum of max_sge got by querying RDMA devices and NVME_RDMA_MAX_SGL_DESCRIPTORS. Hence do this change together in this patch. Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: Idc813afd3e525bf5f370c0fcd2623f9c146a5528 Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/459218 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Seth Howell <seth.howell5141@gmail.com> Reviewed-by: Ziye Yang <ziye.yang@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com>	2019-07-05 06:35:41 +00:00
Shuhei Matsumoto	cf3c54bc03	nvme: Ensure max_sges not to exceed what controller supports in generic layer Previously comparing the transport supported value and the target value was done in RDMA transport layer. However this comparison should be done in the generic layer like the maximum IO transfer size. Hence change the comparison to do in the generic layer in this patch. Besides, for MSDBD, the value 0 indicates no limit but we had handled this as maximum number of SGS entries was 0 by mistake. This patch fixes the bug together. Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: I54365cf114169b10180ec2c659f9c7302672674c Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/459574 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Paul Luse <paul.e.luse@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com>	2019-07-05 06:35:41 +00:00
Chunyang Hui	993ab4908c	RocksDB: Remove static and assert for SpdkInitializeThread RocksDB spdk-v5.13.4 and spdk-v5.18.4 still need to call SpdkInitializeThread in its env init. Static will trigger make error. Thus removed. For removing assert, we already have enough check to make sure the allocate won't happen twice. The assert here is redundant. Signed-off-by: Chunyang Hui <chunyang.hui@intel.com> Change-Id: I058c580349398b83fed8a8408b089e065b5d2988 Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/460465 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>	2019-07-05 04:19:11 +00:00
Ziye Yang	57efada508	nvmf/tcp: reorg the structure of struct spdk_nvmf_tcp_req I used pahole to see whether the alignment of the structure is reasonable. After reorganization, we can save 16 bytes and 1 cacheline according to the information from pahole. Signed-off-by: Ziye Yang <ziye.yang@intel.com> Change-Id: I1347e7c582fe2b00707e2841690b87d53cc61e33 Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/460572 Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2019-07-05 04:18:41 +00:00
Darek Stojaczyk	89021c6c6c	nvme/rpc: switch to spdk_malloc(). spdk_dma_malloc() is about to be deprecated. Change-Id: I6fd1106c2278c2ef8899c822e920252f62266547 Signed-off-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/459550 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Paul Luse <paul.e.luse@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com>	2019-07-05 03:48:51 +00:00
Chunyang Hui	fbd2f3fd2e	opal: add support for getting locking range info Change-Id: I8e3e39673c260f823a9703e86006b5334dedc987 Signed-off-by: Chunyang Hui <chunyang.hui@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/457576 Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2019-07-05 02:23:28 +00:00
Chunyang Hui	505dbf59ff	Opal: Add locking range support Change-Id: I4974d4134aed3b63e204b79c9292ce940e32d40c Signed-off-by: Chunyang Hui <chunyang.hui@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/455175 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>	2019-07-05 02:23:28 +00:00
Chunyang Hui	755b4390f9	Opal: Add activate locking SP method Change-Id: I4189bdefdb5a6651bb73bd32e61c16e899b2ae5a Signed-off-by: Chunyang Hui <chunyang.hui@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/454211 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com>	2019-07-05 02:23:28 +00:00
Shuhei Matsumoto	bcfb2b2b9c	bdev/passthru: Pass-through metadata and DIF setting of base bdev Allow I/O requests using metadata and DIF if base bdev supports them. Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: Ie1b4b301a3d72d3fbd6e459ee2ab7d1a85425162 Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/460394 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com>	2019-07-04 09:23:07 +00:00
Shuhei Matsumoto	3ff1ff004e	nvme/tcp: Minor cleanups for SGL operations Using naming rules consistent with other related libraries is helpful to ensure the quality as verified by this patch series. This patch changes a few parts to use iov and iovcnt for SGL operations. Besides, name of an array points to the head of the array and is constant. So copying name of array to an another pointer is not necessary and can be removed. Change-Id: I2324f28126b3088098c1c767cf6c060f22c175c3 Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/455629 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com> Reviewed-by: Paul Luse <paul.e.luse@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Maciej Szwed <maciej.szwed@intel.com> Reviewed-by: Ziye Yang <ziye.yang@intel.com>	2019-07-04 08:58:40 +00:00
Shuhei Matsumoto	127cfac020	nvmf/tcp: Use nvme_tcp_pdu_set_data_buf for incapsule data Previously we had used nvme_tcp_pdu_set_data() for incapsule data. This patch changes handling incapsule data to use nvme_tcp_pdu_set_data_buf() as same as H2C and C2H. This unification is necessary to support DIF insert and strip in NVMe/TCP target later. Change-Id: I02cae8db94e51cf79a354dd64ad45f0e491ec08e Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/455920 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Ziye Yang <ziye.yang@intel.com>	2019-07-04 08:58:40 +00:00
Shuhei Matsumoto	3184884f9d	nvmf/tcp: Properly handle multiple iovecs in processing H2C and C2H NVMe/TCP target had assumed the size of each iovec was io_unit_size. Using nvme_tcp_pdu_set_data_buf() instead removes the assumption and supports any alignment transparently. Hence this patch moves nvme_tcp_pdu_set_data_buf() to include/spdk_internal/nvme_tcp.h and replaces the current code to use it. Besides, this patch simplifies spdk_nvmf_tcp_calc_c2h_data_pdu_num() because the sum of iov_len of iovecs is equal to the variable length now. We cannot separate code movement (lib/nvme/nvme_tcp.c to include/spdk_internal/nvme_tcp.h) and code replacement (lib/nvmf/tcp.c) because the moved functions are static and the compiler gives a warning if they are not referenced in lib/nvmf/tcp.c. The next patch will add UT code. Change-Id: Iaece5639c6d9a41bd35ee4eb2b75220682dcecd1 Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/455625 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ziye Yang <ziye.yang@intel.com> Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com>	2019-07-04 08:58:40 +00:00
Ziye Yang	b09bd95ad3	sock: update spdk_sock_group_add_sock And also add spdk_sock_group_get_ctx function Change-Id: I2a2a58b0588ff7d99d3538ea0a633a3b8c7a234b Signed-off-by: Ziye Yang <ziye.yang@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/454538 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Maciej Szwed <maciej.szwed@intel.com>	2019-07-04 08:21:05 +00:00
Ziye Yang	8bb174f87d	sock: add function spdk_sock_get_optimal_sock_group Also add the mapping table and the operations between placement_id and sock_group Change-Id: I31868e241fdd20252c2d79792ff1239e6d23afb8 Signed-off-by: Ziye Yang <ziye.yang@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/454537 Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2019-07-04 08:21:05 +00:00
Changpeng Liu	bdb90726ee	scsi: fix error break when checking SCSI reservation We should return for the registrant case when the reservation holder exists. Change-Id: Ie3cf31554eafdad03294aef2eeb6eaef1536b8c3 Signed-off-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/460305 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Liang Yan <liang.z.yan@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com>	2019-07-04 08:15:44 +00:00
Shuhei Matsumoto	666a0b5cb4	iscsi: Assign not pointer but instance of spdk_cpuset in struct spdk_iscsi_portal_grp This will reduce potential malloc failures. Change-Id: I9b1965e0be95af4c0496dfbae80c86b25c460c94 Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/459718 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com>	2019-07-04 00:30:22 +00:00
Shuhei Matsumoto	752fa1ca27	thread: Assign not pointer but instance of spdk_cpuset in struct spdk_thread This will reduce potential malloc failures. Change-Id: Ie67554fec877e33bbd1044fc61eb4d79df306168 Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/459717 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Paul Luse <paul.e.luse@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com>	2019-07-04 00:30:22 +00:00
Shuhei Matsumoto	6de3d418df	cpuset: Expose internal of struct spdk_cpuset in header file This will make other structures to allocate struct spdk_cpuset statically and will reduce potential malloc failures. Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: I067ec2c79824b04796a8b6f717e610727a861461 Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/459716 Reviewed-by: Tomasz Kulasek <tomaszx.kulasek@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2019-07-04 00:30:22 +00:00
Shuhei Matsumoto	12d6dce2aa	nvmf: Use not malloc'ed but fixed size string for host NQN Maximum size of NQN is already defined to be SPDK_NVMF_NQN_MAX_LEN, and hence use fixed size string whose size is SPDK_NVMF_NQN_MAX_LEN + 1 for spdk_nvmf_vhost::nqn. This change will reduce the potential malloc failure. Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: I2b9c7cc21200b3e88b5485ebfdcd5040bc6e3589 Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/459742 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Paul Luse <paul.e.luse@intel.com> Reviewed-by: Seth Howell <seth.howell5141@gmail.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com>	2019-07-04 00:30:22 +00:00
yidong0635	21740a7cac	ftl_reloc: Fix scanbuild warning about moves. /spdk/lib/ftl/ftl_reloc.c:507:8: warning: Assigned value is garbage or undefined move = moves[i]; ^ ~~~~~~~~ lib/ftl/ftl_reloc.c:508:11: warning: Access to field 'state' results in a dereference of a null pointer (loaded from variable 'move') switch (move->state) { ^~~~~~~~~~~ Change-Id: I9cc1c2b52a93957bb4c56b1ed463c23289b5a43d Signed-off-by: yidong0635 <dongx.yi@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/460120 Reviewed-by: Paul Luse <paul.e.luse@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2019-07-04 00:25:17 +00:00
Wojciech Malikowski	e0bf5e3e4f	lib/ftl: Enable ANM events handling Added ANM events processing by relocation module. Change-Id: I6d20b2dd66309fd7cf0fddb44b6027848b29446b Signed-off-by: Wojciech Malikowski <wojciech.malikowski@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/455253 Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Mateusz Kozlowski <mateusz.kozlowski@intel.com> Reviewed-by: Konrad Sztyber <konrad.sztyber@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2019-07-03 04:28:13 +00:00
Wojciech Malikowski	fdf3c5a30f	lib/ftl: Temporarily disable relocation on open bands Handling ANMs on open band leads to many corner cases in FTL and on the other hand such event should be very rare. Disable it until we will have stable test results from current implementation with extended dirty shutdown tests. Change-Id: Id438c7274ed2be1712bf581d6aabfc27bcbd53dc Signed-off-by: Wojciech Malikowski <wojciech.malikowski@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/459434 Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Konrad Sztyber <konrad.sztyber@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2019-07-03 04:28:13 +00:00
Changpeng Liu	77c1f90e98	iscsi: change ERRLOG to DEBUGLOG for read socket error Since spdk_iscsi_conn_read_data() can print error log, so we don't need to print again in the caller, existing code will print error log for LOGOUT and DISCOVERY cases. Fix issue #845. Change-Id: I547d3d667b6412ab6a59c9b401d0f28c5026307d Signed-off-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/460110 Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Paul Luse <paul.e.luse@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2019-07-02 22:56:01 +00:00
Shuhei Matsumoto	3d1995c35b	thread: Use not malloc'ed but fixed size string for IO device name 256 bytes will be enough but not too large for the name of SPDK IO device. Use fixed size string for the name of SPDK IO device and reduce the potential malloc failure. If the length of passed name is longer than 256, it will be cut off without error. Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: I618b82a1d07769df7c775280fbf364cbcfdde403 Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/459721 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Seth Howell <seth.howell5141@gmail.com> Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com> Reviewed-by: Paul Luse <paul.e.luse@intel.com>	2019-07-02 07:04:58 +00:00
Shuhei Matsumoto	09013306c3	thread: Use not malloc'ed but fixed size string for thread name 256 bytes will be enough but not too large for the name of SPDK thread. Use fixed size string for the name of SPDK thread and reduce the potential malloc failure. If the length of passed name is longer than 256, it will be cut off without error. Change-Id: I13a24997a73a8365c8bf5e093f2bd78861ba6660 Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/459720 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Paul Luse <paul.e.luse@intel.com> Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com>	2019-07-02 07:04:58 +00:00
Ziye Yang	404d27263f	sock: add get_placement_id function. Placement_id is related to getsockopt with optname=SO_INCOMING_NAPI_ID. Some testing platforms do not support this macro, so use ifdef to avoid sending it to the kernel. Change-Id: I9e49e6e15810af0cd5085b92469c15a53ac09ada Signed-off-by: Ziye Yang <ziye.yang@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/454468 Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Maciej Szwed <maciej.szwed@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2019-07-02 06:47:36 +00:00
Changpeng Liu	af6ed1e94a	nvmf: update the reservation information for ACQUIRE/RELEASE commands Change-Id: Ibfebffa4d683da08ae8f9350cce144fafe6a5538 Signed-off-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/455910 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2019-07-02 00:06:59 +00:00
Changpeng Liu	196d4f704a	nvmf: enable ptpl feature with reservation register command Add file based reservation information definition, the data structure can be used to store all the reservation information to a json based configuration file, and enable this feature with REGISTER command. Change-Id: Ic93cfc5934a4ad96f11b96ec77bacb877edf6c10 Signed-off-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/455909 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2019-07-02 00:06:59 +00:00
Mateusz Kozlowski	d679b0ec6a	lib/ftl: Remove num_pad_bands counter from restore Base off restore completion on list population rather than another counter. Signed-off-by: Mateusz Kozlowski <mateusz.kozlowski@intel.com> Change-Id: I8f9d8f13aea42e1c350640efd84ff6c247eded0a Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/457606 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Konrad Sztyber <konrad.sztyber@intel.com> Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com> Reviewed-by: Wojciech Malikowski <wojciech.malikowski@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2019-07-01 14:49:14 +00:00
Shuhei Matsumoto	a57daa6976	env: Add an API to lookup the memory pool created by the primary process Add spdk_mempool_lookup to lookup the memory pool created by the primary process. This will be utilized in SPDK multi process application future. Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: I90505b6566dfc93ef5957ef4c73b1a6438c30742 Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/459739 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Seth Howell <seth.howell5141@gmail.com> Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2019-07-01 14:47:30 +00:00
Mateusz Kozlowski	60c8845fd0	bdev/ftl: construct_ftl_bdev respects default ftl config Changed initialization of the ftl lib when using an rpc call to allow for usage of any default configuration parameters (currently only allow_open_bands is exposed). Signed-off-by: Mateusz Kozlowski <mateusz.kozlowski@intel.com> Change-Id: I73457dfcacc6b1adeffd13ecc6e98001749e00cf Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/459741 Reviewed-by: Konrad Sztyber <konrad.sztyber@intel.com> Reviewed-by: Wojciech Malikowski <wojciech.malikowski@intel.com> Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2019-07-01 14:46:26 +00:00
Pawel Kaminski	d270cd36ad	jsonrpc: Reorder spdk_jsonrpc_server_write_cb We'll use it in spdk_jsonrpc_parse_request() soon. Change-Id: I78ad2a931787b095e65053bea4dce663a92bb3b0 Signed-off-by: Pawel Wodkowski <pawelx.wodkowski@intel.com> Signed-off-by: Pawel Kaminski <pawelx.kaminski@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/459657 Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2019-07-01 13:09:52 +00:00
Hailiang Wang	3a65c8729b	lib/nvme: fix a warning of spdk_pci_addr->domain Compilation Warning on fedora30. In file included from nvme_ut.c:42: /home/vagrant/spdk_repo/spdk/test/common/lib/test_env.c:517:17: warning: The left operand of '>' is a garbage value if (a1->domain > a2->domain) { ~~~~~~~~~~ ^ This is related to issue #822. Change-Id: I2b61e821130b89af04db3c475e81d2e91a380a90 Signed-off-by: Hailiang Wang <hailiangx.e.wang@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/459923 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2019-07-01 13:07:48 +00:00
Darek Stojaczyk	dad4c43a88	vhost: add a single dpdk semaphore The semaphore was a part of struct spdk_vhost_session_fn_ctx so far, but since there's only one pthread waiting on that semaphore and hence only one event using it, we could just use a single global sem_t. Same thing with response code for those callbacks - there's only one needed. Going a step further, the function complete_session_event() was removed - it would only operate on global variables now, and its signature wouldn't make much sense after this refactor, so it's been inlined. This serves as cleanup. Change-Id: I63ef41d7e1564fff5e785de101d887bc1014aad9 Signed-off-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/459160 Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2019-07-01 12:50:57 +00:00
Darek Stojaczyk	5fb7330151	vhost: introduce g_vhost_init_thread Enforce spdk_vhost_fini() to be called on the same thread which called spdk_vhost_init(). We'll also use the newly added g_vhost_init_thread for other purposes later on. Change-Id: I99aebeda2d8ddaf42554aa422c32ed935634595f Signed-off-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/459159 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2019-07-01 12:50:57 +00:00
Darek Stojaczyk	ccdc0b615f	vhost: operate on poll groups instead of lcores With all the pieces in place we can finally remove the legacy cross thread messages from vhost. We replace spdk_vhost_allocate_reactor() with spdk_vhost_get_poll_group(). The returned poll_group has to be passed to spdk_vhost_session_send_event(), where it will be assigned to the session. After the session is started, that poll group will be used for all the internal vhost cross-thread messaging. Change-Id: I17f13d3cc6e2b64e4b614c3ceb1eddb31056669b Signed-off-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/452207 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2019-07-01 12:50:57 +00:00
Shuhei Matsumoto	f62d5ccbe6	nvme/tcp: Properly handle multiple iovecs in nvme_tcp_pdu_set_data_buf nvme_tcp_pdu_set_data_buf() has been used to process C2H and H2C for NVMe/TCP initiator. In this case, NVMe/TCP cuts out the part of the input data buffer and transfers the part, and repeats these cut and transfers until the whole data buffer is transferred. NVMe/TCP uses two SGLs, and use one to parse from the offset datao to datao + datal and another to append from the offset 0 to datal. However, the current nvme_tcp_pdu_set_data_buf() had used data_length as not data length of this transfer but total length of the whole transfers by mistake. Recently DIF library updated to properly handle very similar cases, and so this patch takes DIF library as a reference and corrects the implementation. The next patch will add UT code to verify the bug will be fixed. The code size is pretty large and so UT code is separated. Change-Id: Ibeed4de182b8b8740566e874e2757280dc21f9e8 Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/455623 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Ziye Yang <ziye.yang@intel.com>	2019-07-01 08:28:20 +00:00
Shuhei Matsumoto	a7b6d2ef00	nvme/tcp: Change parameters of nvme_tcp_pdu_set_data_buf to use in target This patch is the first patch of the patch series. The purpose of this patch series is to correct the bug of nvme_tcp_pdu_set_data_buf() when the multiple iovecs array is passed, to share nvme_tcp_pdu_set_data_buf() between NVMe/TCP initiator and target, and utilize nvme_tcp_pdu_set_data_buf() not only for C2H and H2C but also in-capsule data in NVMe/TCP target. This patch is necessary to satisfy the second requirement, to share nvme_tcp_pdu_set_data_buf() between NVMe/TCP initiator and target because struct nvme_tcp_req and struct spdk_nvmf_tcp_req are different. Four variables, iov, iovcnt, data_offset, and data_len are common, and hence this patch changes the parameters of nvme_tcp_pdu_set_data_buf() to accept them. The bug is fixed in the next patch and tested in after the next patch. Change-Id: Ifabd9a2227b25f4820738656e804d05dc3f874a5 Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/455622 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Paul Luse <paul.e.luse@intel.com> Reviewed-by: Ziye Yang <ziye.yang@intel.com>	2019-07-01 08:28:20 +00:00
Shuhei Matsumoto	f341e69a50	iscsi: Use not malloc'ed but fixed size string for portal port number Using malloc'ed string for string in iSCSI target has caused scan-build error. Define maximum port number of portal to be 32 and use fixed size string whose size is 33 for spdk_iscsi_portal_grp::port. This change will reduce the potential malloc failure. Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: Ie1fcdbd45ce000a9c1c53761195697555b8d030a Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/459709 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com>	2019-07-01 05:03:07 +00:00
Shuhei Matsumoto	8f64724e86	iscsi: Use not malloc'ed but fixed size string for portal IP address Using malloc'ed string for string in iSCSI target has caused scan-build error. Define maximum IP address of portal to be 256 and use fixed size string whose size is 257 for spdk_iscsi_portal_grp::host. This change will reduce the potential malloc failure. Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: Iceeae94e250ea426f72ff72355a213606308da51 Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/459708 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com>	2019-07-01 05:03:07 +00:00
Shuhei Matsumoto	d1961b5e41	iscsi: Use not malloc'ed but fixed size string for target name and alias Using malloc'ed string for string in iSCSI target has caused scan-build error. Maximum size of target name is already defined to be MAX_TARGET_NAME, and hence use fixed size string whose size is MAX_TARGET_NAME + 1 for spdk_iscsi_tgt_node::name. Change spdk_iscsi_tgt_node::alias together. This change will reduce the potential malloc failure. Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: Iac4cd6e9d60173ddeb68ca21ce712126c13bc3c4 Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/459707 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com>	2019-07-01 05:03:07 +00:00
Shuhei Matsumoto	975e48ae8a	iscsi: Use not malloc'ed but fixed size string for initiator address Using malloc'ed string for string in iSCSI target has caused scan-build error. Maximum size of initiator address is already defined to be MAX_INITIATOR_ADDR, and hence use fixed size string whose size is MAX_INITIATOR_ADDR + 1 for spdk_iscsi_initiator_mask::mask. This change will reduce the potential malloc failure. Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: Ic39e08986c9377800ce58a1cb5b8401c6b71cf96 Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/459706 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com>	2019-07-01 05:03:07 +00:00
Shuhei Matsumoto	4a3ad8371c	iscsi: Use not malloc'ed but fixed size for initiator name Using malloc'ed string for string in iSCSI target has caused scan-build error. Maximum size of initiator name is already defined to be MAX_INITIATOR_NAME, and hence use fixed size string whose size is MAX_INITIATOR_NAME + 1 for spdk_iscsi_initiator_name::name. This will also reduce the potential malloc failure. Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: Ic6bc172125fc6c9c0896499704d2a9b522106da0 Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/459705 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com>	2019-07-01 05:03:07 +00:00
Shuhei Matsumoto	1eba9812f2	iscsi: Simplify include relationships to avoid cyclic inclusion Including tgt_node.h in iscsi.h will prevent us from including iscsi.h in tgt_node.h. Subsequent patches will require tgt_node.h to refer the macro constants in iscsi.h. Hence - remove inclusion of tgt_node.h from iscsi.h, - add inclusion of spdk/scsi.h to iscsi.h, and - remove inclusion of spdk/scsi.h from tgt_node.h Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: I5ac808a83754c157e4140bcd2a83c4d210e30d91 Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/459704 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com>	2019-07-01 05:03:07 +00:00
Changpeng Liu	efd7b514d4	bdev: rewind child offset to last block size aligned iov Here is an example to describe the existing issue: There is a Write request with 64KiB data length, and this IO crosses the IO boundary. We assume that the parent IO will have 2 children requests, one is 33KiB length, the other one is 31KiB. Here is the view of parent iovs, the first 33KiB length data has 33 iovs: iov.[0].iov_length = 1024; . . iov.[31].iov_length = 256; iov.[32].iov_length = 768; . . iov.[64].iov_length = 1024; In function _spdk_bdev_io_split(), you can see that for the 33KiB length child request, the existing code will run out of child_iov space and return an error because the last data buffer is not block size aligned. Here we can rewind the existing offset to the last block size aligned buffer to avoid the error case. For backends which need aligned data buffers, such as the AIO backend, the request will go through spdk_bdev_io_get_buf() again to do the data copy; for backend devices such as NVMe with hardware SGL support, a 256 byte data segment is fine. Change-Id: I96ebdf29829d86f9b38fab28a7406eedc9fa44ef Signed-off-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/453604 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com>	2019-07-01 04:20:21 +00:00
Wojciech Malikowski	091bc429d7	lib/event/subsystems: Added VMD dependency to bdev subsystem Bdev initialization need to be done after VMD. Change-Id: Ia680ccbdb8fc6db1d3c09cf9d917105e183a3845 Signed-off-by: Wojciech Malikowski <wojciech.malikowski@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/459768 Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Paul Luse <paul.e.luse@intel.com> Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2019-07-01 03:54:45 +00:00
Wojciech Malikowski	a044e19470	lib/rocksdb: Optional VMD enumeration VMD section with Enable flag set to true need to be defined in config file to enumerate devices behind VMD. Change-Id: I0b35d93b224025050ae0c081af720ed816c9f0fa Signed-off-by: Wojciech Malikowski <wojciech.malikowski@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/459765 Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Paul Luse <paul.e.luse@intel.com> Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2019-07-01 03:53:06 +00:00
Wojciech Malikowski	e2fb1b80e1	lib/ftl: Check if any additional relocation was added Return immediately from ftl_reloc_add() if no new blocks were added to relocation. Change-Id: If80dfa725e0bb9f3b8987740012858a671c5ad90 Signed-off-by: Wojciech Malikowski <wojciech.malikowski@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/457626 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Konrad Sztyber <konrad.sztyber@intel.com>	2019-07-01 03:29:00 +00:00
Wojciech Malikowski	a74b79dc74	lib/ftl: Drop relocation for empty bands immediately Added check if band that is added to reloc have any valid blocks. Return immediately if there is no valid blocks. Change-Id: I2bce088e0ad71479c6899fff96845397d12e2e92 Signed-off-by: Wojciech Malikowski <wojciech.malikowski@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/457625 Reviewed-by: Konrad Sztyber <konrad.sztyber@intel.com> Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2019-07-01 03:29:00 +00:00
Wojciech Malikowski	c149f9597f	lib/ftl: Prevent from adding active reloc to pending queue In case ANM event occurs on band being relocated (band is on active reloc queue) we shouldn't add such band to pending queue. Change-Id: I92a8bee11309097e19afaea549460f1d4387e3e5 Signed-off-by: Wojciech Malikowski <wojciech.malikowski@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/458617 Reviewed-by: Konrad Sztyber <konrad.sztyber@intel.com> Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2019-07-01 03:29:00 +00:00
Wojciech Malikowski	76cff6da81	lib/ftl: Remove band from active/pending queue In case high priority band was added for relocation it should be removed from active/pending queue if it was already on one of them. Change-Id: Id0591b1d3a4174dd05eb1c32227e4d3b3a9cbcd0 Signed-off-by: Wojciech Malikowski <wojciech.malikowski@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/458057 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Konrad Sztyber <konrad.sztyber@intel.com>	2019-07-01 03:29:00 +00:00
Wojciech Malikowski	2cc6bd2a26	lib/ftl: Skip block with ongoing write during relocation In case ANM event occurs on open band there can be situation that reloc will try to read block on which there is ongoing write. This is happening because lba valid map is updated before write submission to allow sent consistent metadata to disk before all user writes are completed. Added write offset to the each chunk and add check to reloc if particular ppa is written on that chunk. Change-Id: Ic95a06e69381d2152a86984b65a0975afaff955d Signed-off-by: Wojciech Malikowski <wojciech.malikowski@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/458056 Reviewed-by: Konrad Sztyber <konrad.sztyber@intel.com> Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2019-07-01 03:29:00 +00:00
Wojciech Malikowski	bf4973087f	lib/ftl: Allow for relocating open band In case ANM event occurs on open band reloc need to be able to process such event. If band is not in closed state do not alloc lba map for it and do not set it to free state after relocation. Change-Id: I2f4a5770fef08271d222936ca19f3cc98e5e5be1 Signed-off-by: Wojciech Malikowski <wojciech.malikowski@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/457612 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Konrad Sztyber <konrad.sztyber@intel.com>	2019-07-01 03:29:00 +00:00
Wojciech Malikowski	f8a9112292	lib/ftl: Mark all lba map segments as cached for open bands Open bands need to have lba map segments set to cached state to prevent read lba map from disk during relocation events. Change-Id: Ib4f1ed19131fad174c1d2f70e4c02e83701e2a0a Signed-off-by: Wojciech Malikowski <wojciech.malikowski@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/457853 Reviewed-by: Konrad Sztyber <konrad.sztyber@intel.com> Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2019-07-01 03:29:00 +00:00
Wojciech Malikowski	bfd67f9405	lib/ftl: Initialize band tail metadata physical address Band tail PPA should be initialized when new FTL instance is created. Change-Id: Ie2fb72aa3f29eece0b6f8912998b33af3ba6b355 Signed-off-by: Wojciech Malikowski <wojciech.malikowski@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/457777 Reviewed-by: Konrad Sztyber <konrad.sztyber@intel.com> Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2019-07-01 03:29:00 +00:00
Wojciech Malikowski	4be37a57f4	lib/ftl: Consume ANM event on core thread Send ANM event to core thread for further processing. This will remove a need of locking in relocate module when ANM event occur. Change-Id: I0efb1f1b8c96c107cda5fe78e8ee5672cde39f11 Signed-off-by: Wojciech Malikowski <wojciech.malikowski@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/457611 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Mateusz Kozlowski <mateusz.kozlowski@intel.com> Reviewed-by: Konrad Sztyber <konrad.sztyber@intel.com> Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>	2019-07-01 03:29:00 +00:00
Ziye Yang	cdc0170c1b	nvmf/tcp: Add a maximal PDU loop number In our previous code, we will handle all the PDU until there is no incoming data from the network if we can continue the loop. However this is not quite fair when we are handling multiple connections in a polling group. This change sets a maximal number of NVMe/TCP PDUs we can handle for each connection, which can improve the performance. After some tuning, 32 should be a good loop number. Our iSCSI target uses 16. The following shows some performance data: Configuration: 1 Command used in the initiator side: ./examples/nvme/perf/perf -r 'trtype:TCP adrfam:IPv4 traddr:192.168.4.11 trsvcid:4420' -q 128 -o 4096 -w randrw -M 50 -t 10 2 target side, export 4 malloc bdev in a same subsystem Result: Before patch: Starting thread on core 0 ======================================================== Latency(us) Device Information : IOPS MiB/s Average min max TCP (addr:192.168.4.11 subnqn:nqn.2016-06.io.spdk:cnode1) from core 0: 51554.20 201.38 2483.07 462.31 4158.45 TCP (addr:192.168.4.11 subnqn:nqn.2016-06.io.spdk:cnode1) from core 0: 51533.00 201.30 2484.12 508.06 4464.07 TCP (addr:192.168.4.11 subnqn:nqn.2016-06.io.spdk:cnode1) from core 0: 51630.20 201.68 2479.30 481.19 4120.83 TCP (addr:192.168.4.11 subnqn:nqn.2016-06.io.spdk:cnode1) from core 0: 51700.70 201.96 2475.85 442.61 4018.67 ======================================================== Total : 206418.10 806.32 2480.58 442.61 4464.07 After patch: Starting thread on core 0 ======================================================== Latency(us) Device Information : IOPS MiB/s Average min max TCP (addr:192.168.4.11 subnqn:nqn.2016-06.io.spdk:cnode1) from core 0: 57445.30 224.40 2228.46 450.03 4231.23 TCP (addr:192.168.4.11 subnqn:nqn.2016-06.io.spdk:cnode1) from core 0: 57529.50 224.72 2225.17 676.07 4251.76 TCP (addr:192.168.4.11 subnqn:nqn.2016-06.io.spdk:cnode1) from core 0: 57524.80 224.71 2225.29 627.08 4193.28 TCP (addr:192.168.4.11 subnqn:nqn.2016-06.io.spdk:cnode1) from core 0: 57476.50 224.52 2227.17 663.14 4205.12 ======================================================== Total : 229976.10 898.34 2226.52 450.03 4251.76 Signed-off-by: Ziye Yang <ziye.yang@intel.com> Change-Id: I86b7af1b669169eee2225de2d28c2cc313e7d905 Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/459572 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2019-06-28 12:28:54 +00:00
Changpeng Liu	6c9b6abf5e	blobfs: make internal asynchronous APIs as public APIs SPDK blobfs has asynchronous APIs defined in blobfs_internal.h file, as users may want to use them, so we move them to the public .h file. Change-Id: I1835d97060101f6315a73cb8638b15ff7e13ba54 Signed-off-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/457547 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2019-06-28 09:50:50 +00:00
Changpeng Liu	1966f1eef3	blobfs: add writev/readv asynchronous APIs support Change-Id: Id1172f546852fcf25c6d13cb63f9d875b02e768c Signed-off-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/453493 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2019-06-28 09:50:50 +00:00
Takeshi Yoshimura	c74ea9fa8e	rocksdb: Fix null deref at blobfs calls I tried experimental binding of SPDK with Mongo-rocks. However, the binding sometimes invoke blobfs APIs without thread initializations. In that case, null dereferences occur. In other words, we need to carefully use blobfs not to invoke any threads that are not registered to blobfs. This patch simply adds a sanity check at every use of blobfs APIs. By doing this, we do not need to care about which threads can use blobfs APIs. Change-Id: I5b37b0267306a7c76d20e81c1773a6a33be7828c Signed-off-by: Takeshi Yoshimura <t.yoshimura8869@gmail.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/418966 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2019-06-28 08:37:54 +00:00
Anil Veerabhadrappa	9d1e666798	conf: parse "C2HSuccess" parameter for TCP transport only "C2HSuccess" is only valid for TCP transport. So this parameter should be looked up only for TCP transport. Without the change, spdk_nvmf_parse_transport() would bail out early for RDMA and other transports without ever creating them. Signed-off-by: Anil Veerabhadrappa <anil.veerabhadrappa@broadcom.com> Change-Id: I34bdff2f4ab930516743cd5dbf022d75e60fd85c Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/459571 Reviewed-by: Maciej Szwed <maciej.szwed@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2019-06-28 07:03:45 +00:00
Shuhei Matsumoto	0c26ea5a2b	dif: Factor out converting size from LBA based to extended LBA based In DIF library there are many functions that converts offset or length from LBA based to extended LBA based. Factor out them by adding a helper function _to_size_with_md(). Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: Id5576edacc8a07095726f659c4b53ac3aa83727d Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/459530 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com>	2019-06-28 04:13:02 +00:00
Shuhei Matsumoto	1d1c60e53d	dif: Add helper function to convert buffer range to extended LBA based This will be used to get extended LBA based range or length in NVMe/TCP target later. Change-Id: Id0f08bdaeea634dbc05b34a0f7914be21aef9aae Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/458706 Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Paul Luse <paul.e.luse@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2019-06-28 04:13:02 +00:00
Shuhei Matsumoto	767c046e77	dif: Add spdk_dif_update_crc32c_stream to update CRC32C by stream fashion Add spdk_dif_update_crc32c_stream to update CRC32C by stream fashion. spdk_dif_update_crc32c_stream utilizes the updated _dif_update_crc32c_split. A minor bug was found in UT for spdk_dif_update_crc32c and is fixed together in this patch. Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: I92358e845e8e2e17c6f288aa718b947e71e6e1fb Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/458919 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com>	2019-06-28 04:13:02 +00:00
Shuhei Matsumoto	4c2e52b935	dif: Process partial data block properly in _dif_update_crc32c_split For NVMe/TCP target, data segments which correspond to H2C or C2H PDU will have any alignment, and _dif_update_crc32c_split will have to process partial data block, particularly the following types: - start and end are both within a data block. - start is within a data block, and end is at the end of a block On the other hand, _dif_update_crc32c_split had assumed that passed block is always a complete block. This patch exposes offset_in_block, data_len, and guard as parameters of _dif_update_crc32c_split() and makes _dif_update_crc32c_split() process the above two types of data block properly. The next patch will utilize the updated _dif_update_crc32c_split to add spdk_dif_update_crc32c_stream(). Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: Iee29377ad49d4f209673fffb4de4a23a54f31766 Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/458918 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com>	2019-06-28 04:13:02 +00:00
Tomasz Zawadzki	699a5f35e5	net/vpp: switch to session.api Signed-off-by: Tomasz Kulasek <tomaszx.kulasek@intel.com> Signed-off-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Change-Id: I94ce2735c3d15dd7ee5e4ad33280e9996740e244 Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/417056 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2019-06-27 08:23:08 +00:00
Darek Stojaczyk	f9a6588f57	nvme: switch to spdk_malloc(). spdk_dma_malloc() is about to be deprecated. Change-Id: I6c308ee546c28c479ceb903bc1749bf5209dc6fe Signed-off-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/448172 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: <uma.willpower@gmail.com>	2019-06-27 04:34:50 +00:00
Shuhei Matsumoto	a019feec62	dif: Factor out setup operation of spdk_dif_generate/verify_stream spdk_dif_generate_stream() and spdk_dif_verify_stream() are very similar. Factoring out the common part into a function will improve the maintainability and is done in this patch. Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: I16ecd0860c75037d9182298d7513749dfe8e9b56 Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/458376 Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2019-06-27 00:36:25 +00:00
Shuhei Matsumoto	6db126c24c	dif: Add spdk_dif_verify_stream to verify DIF by stream fashion Add spdk_dif_verify_stream to verify DIF by stream fashion. spdk_dif_verify_stream utilizes the updated _dif_verify_split. spdk_dif_verify_stream is very similar with spdk_dif_generate_stream(). UT code demonstrates how it is realized. Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: I1c5d197cf4c0bbc82c8e7f4fa45ddc0b94051058 Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/458330 Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2019-06-27 00:36:25 +00:00
Shuhei Matsumoto	80f2ca0d90	dif: Process partial data block properly in _dif_verify_split For NVMe/TCP target, data segments which correspond to H2C or C2H PDU will have any alignment, and _dif_verify_split will have to process partial data block, particularly the following types: - start and end are both within a data block. - start is within a data block, and end is at the end of a block On the other hand, _dif_verify_split had assumed that passed block is always a complete block. According to the refactoring done in the last patch, this patch exposes offset_in_block, data_len, and guard as parameters of _dif_verify_split() and make _dif_verify_split() process the above two types of data block properly. The next patch will utilize the updated _dif_verify_split to add spdk_dif_verify_stream(). Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: Ic371d3ccefbd5fe8147a948a624013be2702128e Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/458329 Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2019-06-27 00:36:25 +00:00
Shuhei Matsumoto	d69dc28b00	dif: Separate _dif_verify_split into three parts For NVMe/TCP target, data segments which correspond to H2C or C2H PDU will have any alignment, and _dif_verify_split will have to process partial data block, particularly the following types: - start and end are both within a data block. - start is within a data block, and end is at the end of a block On the other hand, _dif_verify_split had assumed that passed block is always a complete block. To process the above types, separating guard computation, DIF copy and skipping metadata field, and DIF verification into three parts will be helpful and is done in this patch. Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: Ic4f1765e01507efa812dfaf7a8018666c6346f8e Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/458328 Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2019-06-27 00:36:25 +00:00
Konrad Sztyber	42287a6954	lib/ftl: error logging around getting chunk info Some of the errors were silent, making it hard to pinpoint the exact failing call. This patch adds SPDK_ERRLOGs for each error path. Change-Id: I71be6c97cab916ac52314e5f4e4d63358877bd96 Signed-off-by: Konrad Sztyber <konrad.sztyber@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/458426 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com>	2019-06-26 14:21:52 +00:00
Konrad Sztyber	18b1de97d8	lib/ftl: store metadata on non-volatile cache Send LBA along with the data block when mirroring writes to the non-volatile cache. The metadata buffer is retrieved from the metadata pool, so the maximum number of concurrent requests is limited to nv_cache.max_request_cnt, while the number of blocks in a single request is limited by nv_cache.max_request_size. Change-Id: If260302d16039183fb0fe073ef7419947532cfab Signed-off-by: Konrad Sztyber <konrad.sztyber@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/458093 Reviewed-by: Mateusz Kozlowski <mateusz.kozlowski@intel.com> Reviewed-by: Wojciech Malikowski <wojciech.malikowski@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2019-06-26 09:36:57 +00:00
Konrad Sztyber	11ff1f4a2b	lib/ftl: non-volatile cache metadata pool Initialize the memory pool for storing metadata (LBAs) when writing data to the non-volatile cache. The mempool's object count and size can be configured via nv_cache.max_request_cnt / nv_cache.max_request_size respectively. Change-Id: I376df9a75be13d4b29ba475f350edf402c868d48 Signed-off-by: Konrad Sztyber <konrad.sztyber@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/458092 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com> Reviewed-by: Wojciech Malikowski <wojciech.malikowski@intel.com>	2019-06-26 09:36:57 +00:00
Mateusz Kozlowski	8c2cff02b6	lib/ftl: Fix ppa pack function Address translation wasn't correct for >32 bit length packed address. This commit fixes the issue and adds a corresponding unit test. This patch fixes issue #774: https://github.com/spdk/spdk/issues/774 Signed-off-by: Mateusz Kozlowski <mateusz.kozlowski@intel.com> Change-Id: Idce67c47f2a9888f9e2ae2eadaf71ccc34e5c260 Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/457114 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Wojciech Malikowski <wojciech.malikowski@intel.com> Reviewed-by: Konrad Sztyber <konrad.sztyber@intel.com> Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2019-06-26 08:38:32 +00:00
Wojciech Malikowski	f056bc6524	lib/env_dpdk: Allow iterating over all detected PCI devices Added spdk_pci_get_first_device() and spdk_pci_get_next_device() to iterate over all devices on g_pci_devices list. Change-Id: I65079fb3e274195707dee64bc1fb8b4b72d07352 Signed-off-by: Wojciech Malikowski <wojciech.malikowski@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/450924 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2019-06-26 08:24:02 +00:00
Darek Stojaczyk	b9e8dc71f7	env_dpdk/pci: cleanup locks Put the locks inside cleanup_pci_devices(). This serves as cleanup. Change-Id: I040b28006e5584d1f33af26b63cafedbafe04fdb Signed-off-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/458934 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: John Kariuki <John.K.Kariuki@intel.com>	2019-06-26 08:24:02 +00:00
Darek Stojaczyk	fe511d03d2	env_dpdk/pci: reduce g_pci_mutex scope The global pci tailq is no longer modified on the dpdk thread, so on the spdk thread we can access it safely without any lock. The code is slightly more readable then. This shows that cleanup_pci_devices() is always wrapped with lock/unlock. We'll put the locks inside this function in the next patch. Change-Id: Ia4d386b78a87078761df0a3b953bfc4ff44102f8 Signed-off-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/458933 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2019-06-26 08:24:02 +00:00
Darek Stojaczyk	b941b2983a	env_dpdk/pci: don't hotplug devices directly on the dpdk intr thread To safely access the global pci device list on an spdk thread, we'll need not to modify this list on any other thread. When device gets hotplugged on a dpdk thread, it will be now inserted into a new global tailq that can be accessed only under g_pci_mutex. Then any subsequently called public pci function will add it to the regular device tailq. Change-Id: I9cb9d6b24fd731641fd764d0da71bedab38824c9 Signed-off-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/458932 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>	2019-06-26 08:24:02 +00:00
Darek Stojaczyk	cf0abd0e83	env_dpdk/pci: don't hotremove devices directly on the dpdk intr thread To safely access the global pci device list on an spdk thread, we'll need not to modify this list on any other thread. When device gets hotremoved on a dpdk thread, it will now set a new per-device `removed` flag. Then any subsequently called public pci function will remove it from the list. Change-Id: I0f16237617e0bea75b322ab402407780616424c3 Signed-off-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/458931 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>	2019-06-26 08:24:02 +00:00
Darek Stojaczyk	49c12890aa	env_dpdk/pci: remove thread safety from PCI APIs For VMD driver we'll need to introduce some way of iterating over all spdk pci device objects and we would like to achieve that with simple spdk_pci_get_first_dev()/get_next_dev() APIs. To make it thread safe though, we would have to expose some public pci mutex to be locked around the iteration and we don't want to do that, so we'll make PCI APIs usable from only a single thread - this will prevent any pci devices from being removed inbetween subsequent get_first/get_next calls. We currently have the following players accessing pci device state: 1) public APIs, obviously (on any thread right now) 2) VFIO hotremove callback (dpdk interrupt thread) 3) rte_eal_alarm for detaching rte_pci_devices (dpdk interrupt thread) 4) DPDK hotplug IPC (dpdk interrupt thread) There is g_pci_mutex providing the thread safety, but even today it doesn't protect #3 and #4, making the entire pci layer prone to data corruption. To make #3 and #4 safe, we would have to lock inside device init/fini callbacks (spdk_pci_device_init/fini), but those are called directly inside the public device attach/detach functions which already lock. So now, with the decision to drop thread safety from public pci APIs, we narrow down the locks inside public functions and introduce locks inside those lower-level init/fini callbacks. Change-Id: I5dcbc9cdcbab65ee76cd3c42890f596069ec9a8a Signed-off-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/458930 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2019-06-26 08:24:02 +00:00
Ziye Yang	016d933793	lib/virtio: change the definition of cookie Converting to the struct virtio_req is useless. Signed-off-by: Ziye Yang <ziye.yang@intel.com> Change-Id: I141268314d28cf87bdef529808c8e18bd1b41c9d Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/459360 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2019-06-26 08:03:37 +00:00
Maciej Szwed	e8356fd233	blobstore: Cleanup after power failure while creating snapshot Currently we are missing a cleanup routine for the case when a power failure interrupts snapshot creation. This patch adds such a routine. When we find a blob whose parent snapshot ID matches the newly created snapshot, we can finish the whole process during recovery by proceeding forward: setting the snapshot as read only, removing the xattr, and syncing. We should remove the snapshot only if there is no blob with a parent pointing at the snapshot. Fixes GitHub issue #760 Signed-off-by: Maciej Szwed <maciej.szwed@intel.com> Change-Id: I2f0e298164e07a2b4dfa5367e8878facef640702 Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/455216 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com>	2019-06-26 08:00:14 +00:00
paul luse	bfda995be2	bdev/compress: add RPC to specify PMD By default QAT will be selected if available however a new RPC can be used to either auto-select (default) or specify either ISAL or QAT. Change-Id: I37cf7640bbd8cef455583e1eccb8adb59cc419d8 Signed-off-by: paul luse <paul.e.luse@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/456693 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com>	2019-06-26 07:57:09 +00:00
Mateusz Kozlowski	ef6878072c	lib/ftl: Band seq is restored from head md only Fixed issue when restoring from a dirty shutdown - sometimes end md wasn't erased after a band was prepared from writing when a shutdown happened. This resulted in inconsistency between the new head md and old tail md, which was technically valid. Band sequence numbers would then be reused, causing a failure on any subsequent restore. Signed-off-by: Mateusz Kozlowski <mateusz.kozlowski@intel.com> Change-Id: Ic3e968be02bb814d6c85f0a3279403fe99337b86 Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/459287 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Konrad Sztyber <konrad.sztyber@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com>	2019-06-26 07:09:00 +00:00
Mateusz Kozlowski	38bb4bcdee	lib/ftl: Fix I/O alignment in dirty shutdown restore Changed to use 4k alignment in dirty shutdown I/Os. Otherwise the scatter gather lists used in QEMU for underlying file/block device would use an extra entry (e.g. 17 for 16 sector writes), and eventually some I/Os would write to offset 0 in underlying file, corrupting head metadata. Signed-off-by: Mateusz Kozlowski <mateusz.kozlowski@intel.com> Change-Id: If8c88ce708529b094a09c8ee952912cc22cd53b9 Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/458090 Reviewed-by: Konrad Sztyber <konrad.sztyber@intel.com> Reviewed-by: Wojciech Malikowski <wojciech.malikowski@intel.com> Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2019-06-26 07:09:00 +00:00
Mateusz Kozlowski	67d027ece9	lib/ftl: Fix lba_map cleanup during restore Band's lba_map needs to be set to NULL before restore completes, as it's not allocated on a per band basis and instead uses a pool from restore struct itself. Without the fix initializing a band for writing would hit an assert during proper allocation in ftl_band_alloc_lba_map. Signed-off-by: Mateusz Kozlowski <mateusz.kozlowski@intel.com> Change-Id: Icff4f54cbe722cb6030b9dfd55726b9b0d6c1e27 Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/458422 Reviewed-by: Konrad Sztyber <konrad.sztyber@intel.com> Reviewed-by: Wojciech Malikowski <wojciech.malikowski@intel.com> Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2019-06-26 07:09:00 +00:00
Shuhei Matsumoto	8c69654d5a	dif: Process unaligned data segment properly in DIF insert This patch makes spdk_dif_set_md_interleave_iovs() and spdk_dif_generate_stream() process unaligned start of data segment properly by using ctx->data_offset. Separating this patch into two may be required but this patch is small and aggregating into a patch is good to test. UT code demonstrates how it is realized. Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: Idb5250aba4e12a34102e5ce067d725c685681177 Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/458142 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com>	2019-06-26 06:56:25 +00:00
Shuhei Matsumoto	2819718176	dif: Add an API to update data offset of DIF context To process unaligned data segments properly when a whole data buffer is split into multiple data segments and each data segment has any alignment, we have to update only the data offset of the DIF context according to the progress. Hence this patch adds a new API spdk_dif_ctx_set_data_offset(). The API will be used in the next patch. Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: I346ab583518b80792ea40d34cf0c8536ecc3d904 Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/458141 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com>	2019-06-26 06:56:25 +00:00
yidong0635	17c006a7c3	lib/jsonrpc: Fix memory leaks about connection request. There are outstanding requests in spdk_jsonrpc_parse_request that are left behind when a connection is closed. Several functions call spdk_jsonrpc_server_conn_close, including spdk_jsonrpc_server_conn_remove and spdk_jsonrpc_server_shutdown. Some RPC methods call these functions to terminate connections, which leads to memory leaks. Try to free outstanding requests after deciding to terminate a connection, and do this following close(conn->sockfd). Fixes issue #784 and can resolve other similar memory leaks. Signed-off-by: yidong0635 <dongx.yi@intel.com> Change-Id: Icd287bd0c5670ee8ec32750b999f82b0fa89cf84 Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/458438 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com>	2019-06-26 06:26:50 +00:00
Or Gerlitz	6629202cbd	nvmf/tcp: Use the success optimization by default By now (5.1 is released), the Linux kernel initiator supports the success optimization and further, the version that doesn't support it (5.0) was EOL-ed. As such, lets open it up @ spdk by default. Doing so provides a notable performance improvement: running perf with iodepth of 64, randread, two threads and block size of 512 bytes for 60s ("-q 64 -w randread -o 512 -c 0x5000 -t 60") over the VMA socket acceleration library and null backing store, we got 730K IOPS with the success optimization vs 550K without it. IOPS MiB/s Average min max 549274.10 268.20 232.99 93.23 3256354.96 728117.57 355.53 175.76 85.93 14632.16 To allow for interop with older kernel initiators, we added a config knob under which the success optimization can be enabled or disabled. Change-Id: Ia4c79f607f82c3563523ae3e07a67eac95b56dbb Signed-off-by: Or Gerlitz <ogerlitz@mellanox.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/457644 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Ziye Yang <ziye.yang@intel.com> Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com>	2019-06-26 06:24:03 +00:00
Changpeng Liu	cf5c4a8a2e	nvmf: add ptpl activated flag to Namespace If users set the persist through power loss configuration file, the Namespace is capable of supporting the ptpl feature. Here we add a ptpl_activated flag to indicate whether users have enabled the feature. Users can use Set Features or Reservation Register commands to change the value. Change-Id: Iae3fd44085c5be5bf9574e49efa567e8212dee20 Signed-off-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/455906 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2019-06-26 01:54:10 +00:00
Chunyang Hui	863f17d609	reduce: check pmem buf before unmap Fixed issue #831 Change-Id: Id589290f3aa729572fa81daf735cecdc8e2adb84 Signed-off-by: Chunyang Hui <chunyang.hui@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/458563 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Paul Luse <paul.e.luse@intel.com> Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>	2019-06-25 13:39:09 +00:00
Hailiang Wang	73a171a07c	rdma: assert ibv_send_wr is not NULL Vhost testing crashed from Nightly testing, because a member access within null pointer of type 'struct ibv_send_wr'. Change-Id: If8f34f23864883ea73516d2d1fe3b30137c04316 Signed-off-by: Hailiang Wang <hailiangx.e.wang@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/458913 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2019-06-25 13:37:15 +00:00
Darek Stojaczyk	29093c7f01	event/app: don't call start_fn twice when json config is used When JSON config was used, app layer was calling the app start callback twice - once from internally-sent "start_subsystem_init" RPC, and once from the app layer itself. In case of JSON configs, the callback from within the RPC was actually called prematurely, as the real RPC server was still starting in the background at that point. We still need to start the app from that RPC in case of `--wait-for-listen` option, but for JSON configs it doesn't make sense. Just ignore it now and rely on json config load completion callback to start the app. Fixes #816 Change-Id: Ib54d624f3167137216c910b2d947bbd1dc5023b1 Signed-off-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/458351 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2019-06-24 15:54:49 +00:00
Darek Stojaczyk	a7d9fc4a4d	event: fix segv on json config read failure If reading the JSON config file has failed, we entered spdk_app_json_config_load_done(-ERRNO) and tried to close a client connection that was never initiated, which resulted in NULL dereference. To fix it, just check if client_conn != NULL before attempting to close it. Change-Id: I7340567c45e795f77110c2914e94ba83fa8d1bff Signed-off-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/458350 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2019-06-24 15:54:49 +00:00
Wojciech Malikowski	8233a5a8e1	event/subsystems/vmd: Added VMD subsystem Added a new VMD subsystem to enumerate devices behind VMD when the event framework is used. To enable VMD, the user needs to provide the Enable flag via the config file. Change-Id: I89bfe22b127c00d358dac7336ffb44b0c0f426ea Signed-off-by: Wojciech Malikowski <wojciech.malikowski@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/458443 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2019-06-24 09:59:00 +00:00
Shuhei Matsumoto	1750a0859b	dif: Process unaligned end of data buffer in spdk_dif_generate_stream() NVMe/TCP target may split a whole data payload into multiple H2C or C2H PDUs with any alignment. Hence to insert or strip DIF correctly to the split H2C or C2H PDUs, we have to bring the interim guard value of the last partial data block of the current H2C or C2H PDU to the first partial data block of the next H2C or C2H PDU. So we add last_guard to struct spdk_dif_ctx and use it in spdk_dif_generate_stream(). API spdk_dif_generate_stream() is not changed and UT code should pass without any change. Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: I12636c5ac7f619483402538faff4339a16c0e6b0 Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/457545 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com>	2019-06-24 07:02:07 +00:00
Shuhei Matsumoto	f6a91b3b40	dif: Process partial data block properly in _dif_generate_split() For NVMe/TCP target, data segments which correspond to H2C or C2H PDU will have any alignment, and _dif_generate_split will have to process partial data block, particularly the following types: - start and end are both within a data block. - start is within a data block, and end is at the end of a block According to the refactoring done in the last patch, this patch exposes offset_in_block, data_len, and guard as parameters of _dif_generate_split() and make _dif_generate_split() process the above two types of data block properly. The next patch will utilize the updated _dif_generate_split in spdk_dif_generate_stream(). Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: I4211e65ead7fc256a40748412c670e46f83b1731 Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/457544 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com>	2019-06-24 07:02:07 +00:00
Shuhei Matsumoto	27707953ec	dif: Separate _dif_generate_split into three parts For NVMe/TCP target, data segments which correspond to H2C or C2H PDU will have any alignment, and _dif_generate_split will have to process partial data blocks, particularly the following types: - start and end are both within a data block. - start is within a data block, and end is at the end of a block On the other hand, _dif_generate_split had assumed that the passed block is always a complete block. To process the above types, separating guard computation, DIF generation, and DIF copy into three parts will be helpful and is done in this patch. Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: I0171d9021837b9a4b425370293cef45dbe7500e8 Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/458225 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com>	2019-06-24 07:02:07 +00:00
Shuhei Matsumoto	f1344911ea	dif: Minor cleanup of spdk_dif_set_md_interleave_iovs Four variables head_unalign, tail_unalign, num_blocks, and offset_blocks became unuseful by the last patch. Hence reduce them to buf_len and buf_offset in this patch. Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: I04fc1e442be6569a96533cdfe36b27fcc78e98d4 Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/457876 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com>	2019-06-24 07:02:07 +00:00
Shuhei Matsumoto	8b24fc4a31	dif: Process unaligned end of data segment in spdk_dif_set_md_interleave_iovs() For NVMe/TCP target, data segments which correspond to H2C or C2H PDU will have any alignment. spdk_dif_set_md_interleave_iovs() have allowed reading data to have any alignment but had required data segment to be a multiple of block size. In other words, spdk_dif_set_md_interleave_iovs() had required that both ctx->data_offset and (data_offset + data_len) must be a multiple of the data block size. This patch refines the algorithm to remove the latter requirement. The update implies that spdk_dif_set_md_interleave_iovs support any data buffer whose size is less than a single data block. The update doesn't change parameters of spdk_dif_set_md_interleave_iovs and existing UT should be passed. This patch adds additional UT code to test these updates. Change-Id: I88c7d2a80a8d92b54863b6ad1c3a9d2761a6195d Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/457542 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com>	2019-06-24 07:02:07 +00:00
Shuhei Matsumoto	2bb4fb4c8c	dif: Factor out SGL append opeartion when splitting data buffer The subsequent patches will refine spdk_dif_set_md_interleave_iovs to change the data_len parameter to be the remaining length, and to remove alignment constraint completely. This patch is a preparation to subsequent patches. This patch doesn't change any behavior. Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: I0e9be7e66d313f3ec2bd8c55cce8bb18e4fff892 Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/457721 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com>	2019-06-24 07:02:07 +00:00
Changpeng Liu	5d718951a6	bdev: split requests first if the request has data buffer There is one existing example usage case to describe the issue: Users(e.g. Vhost-blk target with Windows Guest) call spdk_bdev_readv_blocks() to submit a 128KiB length data READ request, and the data buffer provides by vhost isn't aligned, but the backend block device requires aligned data buffer, so existing function call trace: spdk_bdev_readv_blocks()--> spdk_bdev_io_submit()--> spdk_bdev_io_get_buf() spdk_bdev_io_get_buf() will allocate buffer from large data buffer pool for 128KiB length, of course, it will return error with existing logic. So here, no matter what the data length is, we can go through the split process first for both READ and WRITE. However, there is one scenario that for iSCSI READ request, the iSCSI layer will not allocate data buffer for the request, so for this case if the IO boundary is required we should keep the logic as before. Change-Id: I67661f5fa4c3c7c561b45c86146759aa3477adbf Signed-off-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/453133 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com>	2019-06-24 04:51:46 +00:00
JinYu	77290bfe6b	nvme: fix the endless loop of aborting trackers The completion cb of outstanding_tr may submit a new request to the outstanding_tr list of the qpair, resulting in an endless loop. We only abort the remaining outstanding trackers. Fix #819 Change-Id: I342f52f4d1836f8ef620ef9e3add0b1986727282 Signed-off-by: JinYu <jin.yu@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/457755 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com>	2019-06-21 08:34:41 +00:00
paul luse	68fbb33b81	bdev/compress: add 2 recommended flags to the comp operation Per docs. Signed-off-by: paul luse <paul.e.luse@intel.com> Change-Id: I2b520ba9cce2e8914e5003095cdb0be61b417cb2 Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/458836 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com>	2019-06-21 08:20:01 +00:00

5796 Commits