numam-spdk

Author	SHA1	Message	Date
Ziye Yang	13eb8f2fb3	idxd: Replace the read_8 function pointer with another one Just remove this function pointer and add a new one, i.e., dump_sw_error. Because this function pointer is only used to read sw err info, we can hide it in the detailed idxd implementation. Signed-off-by: Ziye Yang <ziye.yang@intel.com> Change-Id: I42fe2220dae85df307b5af64e37acfd7f748915b Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/8707 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com>	2021-07-13 09:06:15 +00:00
Michal Berger	ca0339d8e5	nvme_cuse: Return ENOTTY in case unsupported ioctl is sent to a device Latest nvme-cli (>= 1.13) fails to issue commands towards SPDK's cuse ctrl device, e.g.: $ nvme get-feature /dev/spdk/nvme0 -f 1 -s 1 -l 100 nvme_cuse.c: 654:cuse_ctrlr_ioctl: ERROR: Unsupported IOCTL 0x4E40. get-namespace-id: Invalid argument The reason is because nvme-cli now also sends NVME_IOCTL_ID to the target device to determine if it's indeed a controller or a ns. In case kernel returns ENOTTY then nvme-cli considers the device to be a controller. Since cuse_ctrlr_ioctl() returns EINVAL in such a case the nvme-cli fails. To avoid this simply replace EINVAL with ENOTTY for the ioctls that may be not supported by ctrl or ns device. nvme-cli commit in question: `fa2b91da74` Signed-off-by: Michal Berger <michalx.berger@intel.com> Change-Id: I29003864bc2a5c1a8906d6d01beba3d6f4e31b0e Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/8531 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Krzysztof Karas <krzysztof.karas@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>	2021-07-13 09:00:05 +00:00
Krishna Kanth Reddy	78ecd30d8e	Fix Rocksdb db_bench build's Linker issue. Linker throws undefined references to spdk_app_start, spdk_app_stop, spdk_app_start_shutdown, spdk_app_fini, spdk_event_allocate, spdk_app_opts_init and spdk_event_call. Signed-off-by: Krishna Kanth Reddy <krish.reddy@samsung.com> Change-Id: I05da1b9d94ac40127b4f0e80d8a8e406f279d3bb Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/8677 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com>	2021-07-13 08:59:24 +00:00
Ben Walker	e1d06d9954	net: Remove library Now that we've deprecated the RPCs for a release, we can remove the whole library. Change-Id: I0f1a357fcfb3404efac39aa021928841c2f22ff1 Signed-off-by: Ben Walker <benjamin.walker@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/4305 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com>	2021-07-13 08:57:58 +00:00
Shuhei Matsumoto	efbd101b8b	nvme: Rename cmic.multi_host to cmic.multi_ctrlr of spdk_nvme_ctrlr_data Bit 1 in the CMIC of the Identify Controller Data Structure specifies if the NVM subsystem may have multiple controllers or not. However, multi_host indicated a particular use case in which the NVM subsystem is used by multiple hosts. multi_ctrlr is more appropriate. Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: I0246096a5cc44721aeff3ff6f96473a2abe11964 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/8719 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>	2021-07-13 08:57:33 +00:00
Tomasz Zawadzki	57a2f03eb6	lib/app: only print cpumask for thread within app core mask For cases where cpumask for a thread was not set, all bits were turned on for the whole length of the cpuset structure. This resulted in JSON RPC responses with a cpumask far longer than is useful. Now the response is limited to the application's core mask, as that makes sense so long as the number of cores cannot change. Signed-off-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Change-Id: Ib5cf271d3b219ba679f1abe498516796693a87dd Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/8288 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Konrad Sztyber <konrad.sztyber@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Maciej Szwed <maciej.szwed@intel.com> Reviewed-by: Paul Luse <paul.e.luse@intel.com>	2021-07-12 21:58:56 +00:00
Tomasz Zawadzki	fe2f80961c	scheduler_dynamic: start core selection from first core The round-robin logic is no longer necessary to spread the threads around the cores. Starting from core other than first is even counter-productive to bunching up threads. Signed-off-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Change-Id: I5fcee2bacc2d0b4af26336caf381ed954814d731 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/8085 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Reviewed-by: Paul Luse <paul.e.luse@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Konrad Sztyber <konrad.sztyber@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2021-07-12 21:58:56 +00:00
Tomasz Zawadzki	a5999f637a	scheduler_dynamic: prioritize lowest lcore id for active threads Before this patch _find_optimal_core() returned 1) any core that could fit the thread 2) if current core was over the limit, the least busy core 3) current core if no better candidate was found Combined with _get_next_target_core() round-robining the first core to consider, resulted in threads being unnecessarily spread over the cores. This patch only places threads on lower lcore id, or when current core is over limit then any core that can fit it. Next patch will remove round-robin logic to always start with lowest lcore id. Signed-off-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Change-Id: I54e373d3ca02a5633607d22978305baa1142f8bd Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/8112 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Paul Luse <paul.e.luse@intel.com> Reviewed-by: Krzysztof Karas <krzysztof.karas@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Maciej Szwed <maciej.szwed@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Konrad Sztyber <konrad.sztyber@intel.com>	2021-07-12 21:58:56 +00:00
Tomasz Zawadzki	d2ed0f45e7	scheduler_dynamic: scale up core load when moving thread Before this patch the idle time of a core was increased by the amount of busy time of the thread that was moved out. No assumption was made as to how the remaining threads would behave during the next scheduling period. This approach is fine, as over multiple scheduling periods we'd arrive at a point where threads could do no more work or all cores would be busy. Yet this requires multiple scheduling periods to sort out the threads. Later in the series core_load will be used to determine when to start moving threads out of the core. So changing this assumption will allow for faster responses to thread load, at the cost of sometimes briefly spreading threads too much. With this patch, we are assuming that threads remaining on the core will do proportionally the same amount of work during the next scheduling period. See an example illustrating the change: Before moving Thread1 Thread1 Busy 80 Idle 20 Load 80% Thread2 Busy 60 Idle 40 Load 60% Core Busy 140 Idle 60 Load 70% After moving Thread1 out (original code) Core Busy 140-80=60 Idle 60+80=140 Load 30% After moving Thread1 out (this patch) Core Busy 140-80=60 Idle 60-20=40 Load 60% Signed-off-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Change-Id: I1f347983449b2fde476dab360c4df689965ca3ea Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/8279 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Krzysztof Karas <krzysztof.karas@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Maciej Szwed <maciej.szwed@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Konrad Sztyber <konrad.sztyber@intel.com> Reviewed-by: Paul Luse <paul.e.luse@intel.com>	2021-07-12 21:58:56 +00:00
Tomasz Zawadzki	11c9b3960b	scheduler_dynamic: move thread to least busy core In cases when all cores are already doing too much work to fit a thread, active threads should still be balanced over all cores. When the current core is overloaded, place the thread on another that is less busy. The core limit is set to 95% to catch only cores that are fully busy. Decreasing that value would make spreading out the threads more aggressive. Changed thread load in one of the unit tests to reflect the 95% limit. Signed-off-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Change-Id: I3b3bc5f7fbd22725441fa811d61446950000cc46 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/8113 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Paul Luse <paul.e.luse@intel.com> Reviewed-by: Krzysztof Karas <krzysztof.karas@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Maciej Szwed <maciej.szwed@intel.com> Reviewed-by: Konrad Sztyber <konrad.sztyber@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2021-07-12 21:58:56 +00:00
Shuhei Matsumoto	cf8405fc24	bdev: Hold mutex while removing name from name tree We had not held mutex while removing bdev name or alias from bdev name tree for most cases. Fix these in this patch. spdk_bdev_unregister() already holds g_bdev_mgr.mutex when removing name, and so we do not need to change it. spdk_bdev_close() had not held g_bdev_mgr.mutex. What we want to lock is only when removing name from name tree, that is, calling bdev_name_del() in bdev_unregister_unsafe(). However, we need to keep hierarchical lock ordering. Hence get and free g_bdev_mgr.mutex outside of bdev->internal.mutex. Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: I4e2c8604e27c8603725efa9bc0bee2013eccb2ac Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/8527 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2021-07-12 15:30:39 +00:00
Shuhei Matsumoto	d06f1c498f	bdev: Hold mutex when adding bdev name to global bdev name tree We had not held mutex when adding bdev name to global bdev name tree in bdev_name_add(). Fix these in this patch. Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: I33813638f11da85263ec0c8849e566d247a45d43 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/8524 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ziye Yang <ziye.yang@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com>	2021-07-12 15:30:39 +00:00
Shuhei Matsumoto	20ba4a0dbe	bdev: bdev_name_add() checks if the name exists in the global name tree If the specified name already exists in the global bdev name tree, RB_INSERT() returns a pointer to it. Hence we do not have to call bdev_get_by_name() when using bdev_name_add(). Hence update bdev_name_add() to return -EEXIST if RB_INSERT() returns a non-NULL pointer, and then remove the bdev_get_by_name() calls. Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: I2d4554ef7e5286270417def64b638b803eecfca2 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/8573 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2021-07-12 15:30:39 +00:00
Ziye Yang	36b5a69bb0	trace: fix the snprintf warning issue. The compiler complains: /usr/include/bits/stdio2.h:71:10: note: ‘__builtin___snprintf_chk’ output between 4 and 19 bytes into a destination of size 7 71 \| return __builtin___snprintf_chk (__s, __n, __USE_FORTIFY_LEVEL - 1, So we change the array size from 7 to 20, so it is enough to put 19 bytes in. Fixes #issue 2014 Signed-off-by: Ziye Yang <ziye.yang@intel.com> Change-Id: I97dfbf9707d0e275382324fa7352b7a212b2aeb5 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/8694 Reviewed-by: Krzysztof Karas <krzysztof.karas@intel.com> Reviewed-by: Konrad Sztyber <konrad.sztyber@intel.com> Reviewed-by: <dongx.yi@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot	2021-07-12 14:06:43 +00:00
Ziye Yang	cd1261ae00	trace: fix compiler complaint about two variables In the nightly test, the compiler complains: trace.c: In function ‘_spdk_trace_record’: 00:07:12.523 trace.c:144:53: error: ‘argval’ may be used uninitialized in this function [-Werror=maybe-uninitialized] 00:07:12.523 memcpy(&buffer->data[offset], (uint8_t *)argval + argoff, 00:07:12.523 ^ 00:07:12.523 trace.c:145:36: error: ‘arglen’ may be used uninitialized in this function [-Werror=maybe-uninitialized] 00:07:12.523 spdk_min(curlen, arglen - argoff)); This patch fixes the issue. Fixes #issue 2034 Change-Id: I4c78d63bdc6a7d166990ae1d18a6abf183efdee1 Signed-off-by: Ziye Yang <ziye.yang@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/8709 Reviewed-by: Konrad Sztyber <konrad.sztyber@intel.com> Reviewed-by: Karol Latecki <karol.latecki@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Reviewed-by: Paul Luse <paul.e.luse@intel.com> Reviewed-by: Chengqiang Meng <chengqiangx.meng@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot	2021-07-09 19:19:24 +00:00
Jacek Kalwas	03ac99d13f	nvmf: set NGUID for given namespace based on bdev UUID If NGUID is not specified with nvmf_subsystem_add_ns json-rpc request then it is possible to expose the same NGUID as bdev nvme module attached. Signed-off-by: Jacek Kalwas <jacek.kalwas@intel.com> Change-Id: Ie0ed7189e55a5abd6bc0904fc356d26f62b50549 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/8628 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>	2021-07-09 07:02:11 +00:00
Jacek Kalwas	a410fb4438	nvme: introduce function to get nguid Signed-off-by: Jacek Kalwas <jacek.kalwas@intel.com> Change-Id: Ida07eca2e3cbc390d8ee481f63b20f5715a53631 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/8626 Community-CI: Mellanox Build Bot Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>	2021-07-09 07:02:11 +00:00
Konrad Sztyber	6cc3169677	lib/trace: chain entries to extend their buffer size This patch adds the ability to chain multiple trace entries together to extend the size of the argument buffer. This means that a tracepoint is no longer limited to the size of a single entry, so it can have any number of arguments, and their size is also not constrained to a single entry. Some limitations are still there: a tracepoint can have up to 5 arguments and strings are limited to 255 bytes. These constraints stem from the definitions of tracepoint structures, which could be easily modified to extend the limits if needed. To record a tracepoint requiring larger buffer, aside from reserving `spdk_trace_entry` structure, a series of `spdk_trace_entry_buffer` structures are allocated too. Each of them acts as a buffer for the arguments. To allow trace tools to treat the buffer structures similarly to regular entries, they also have the `tpoint_id` and `tsc` fields. The id is always assigned to `SPDK_TRACE_MAX_TPOINT_ID` to make sure that a buffer is never mistaken for an entry, while the value of `tsc` is always shared with the initial entry. This also provides a way for the trace tools to verify if an entry is part of a chained buffer. Signed-off-by: Konrad Sztyber <konrad.sztyber@intel.com> Change-Id: I51ceea6b6e57df95d4b8bd797f04edbc4936c180 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/8405 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ziye Yang <ziye.yang@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com>	2021-07-07 09:43:37 +00:00
Konrad Sztyber	0cf270910a	lib/trace: add argument variable in _spdk_trace_record It makes the code more readable. Additionally, to avoid partial updates to an entry, the check for the number of arguments was moved before it's filled in. Signed-off-by: Konrad Sztyber <konrad.sztyber@intel.com> Change-Id: I9ba01b1bcdc29267571badaebd4a9b34ffd7f728 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/8404 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ziye Yang <ziye.yang@intel.com>	2021-07-07 09:43:37 +00:00
Konrad Sztyber	c681d76fb4	lib/trace: extract getting next entry to a helper function It allows us to get rid of the `next_circual_entry` variable and will make it easier to retrieve multiple trace entries, which will be needed in subsequent patches. Signed-off-by: Konrad Sztyber <konrad.sztyber@intel.com> Change-Id: I4666c9da518c2ac0b376e10aa73d1c58cff91f13 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/8403 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Krzysztof Karas <krzysztof.karas@intel.com>	2021-07-07 09:43:37 +00:00
Jim Harris	4246e79c04	nvme: change nvme_transport_ctrlr_delete_io_qpair to void Returning an error from this function is not useful - there is nothing the caller can do with that information. So change the return value to void. Also add ERRLOG and assert if a transport actually returns a non-zero status, to force the transport implementer (which must be an out-of-tree transport) to make changes as necessary. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: I402afec045265db178af821d25b99a6dbe066eab Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/8659 Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2021-07-07 07:27:40 +00:00
Jim Harris	c081a84cd2	nvme: always return success from delete_io_qpair It is not uncommon for delete_io_qpair to fail, for example when a controller is hot removed. So even if SQ or CQ deletion fails, continue with freeing resources and report success back up the stack. There is really nothing the application can do to account for this failing anyways. Upcoming patches will add additional checks to ensure failing delete_io_qpair status never gets propagated to the caller. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: Iac007c1eba30f7a8c4936b3ffb6c837f28ee12ae Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/8658 Reviewed-by: Ziye Yang <ziye.yang@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2021-07-07 07:27:40 +00:00
Shuhei Matsumoto	ce60606dbc	iscsi: Fix data digest degradation by restoring the original code Due to the recent changes for non block size multiples write I/O, the data digest feature was degraded. If Linux iSCSI host enables data digest and tries to detect LU from SPDK iSCSI target, data mismatch error is detected and the connection is disconnected unexpectedly. The cause was that pdu->data_valid_bytes was not set for non-write response PDUs which have a data segment. iscsi_pdu_calc_data_digest() has been used only for non-write response PDUs. Hence we did not need to change iscsi_pdu_calc_data_digest(). Restore the original implementation of iscsi_pdu_calc_data_digest(). Additionally, to avoid future degradation, rename the related functions to iscsi_pdu_calc_partial_data_digest() and iscsi_pdu_calc_partial_data_digest_done(), and add comments for clarification. This fix was verified by the reporter. Fixes #2029. Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: I6babcd1b56e79d3fa3cd26b2dfaad87a52788e63 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/8635 Reviewed-by: Ziye Yang <ziye.yang@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot	2021-07-07 07:26:23 +00:00
Vasuki Manikarnike	a82e8478ea	lib/nvme: Do not retry aborts if ctrlr is failed. Fixes #2022 If queued aborts are present when trying to fail a ctrlr using spdk_nvme_ctrlr_fail(), then the abort command completion will attempt to retry one of the queued aborts. This eventually leads to a segfault that can be avoided by not retrying any queued aborts. Change-Id: I897dcb8809e16af8bdd39d4381ab531e1cc29822 Signed-off-by: Vasuki Manikarnike <vasuki.manikarnike@hpe.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/8585 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Reviewed-by: Ziye Yang <ziye.yang@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2021-07-06 19:44:59 +00:00
Changpeng Liu	e6464f32fa	nvmf: abort AERs when doing controller reset and shutdown The vfio-user target emulated NVMe device is treated as a PCIe NVMe SSD in the Guest VM, so when doing controller reset or shutdown, we should abort the AERs which are in the NVMf library. Users may switch from the kernel NVMe driver to the SPDK NVMe driver in the VM; without this fix, we will get "AERL exceeded" responses very frequently, because the AERs submitted by the previous driver will never be aborted at runtime. Change-Id: I0222ed509629ccb0e98217414dd9043857105686 Signed-off-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/8558 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2021-07-06 19:36:04 +00:00
Changpeng Liu	4fa3d99131	nvmf: don't start the association timer poller for vfio-user When users remove the kernel NVMe driver in the VM, after 120 seconds the SPDK NVMf target will disconnect the ADMIN queue pair due to the association timer timeout. For the vfio-user transport, the ADMIN queue pair connection is associated with the socket connection, so when probing the NVMe controller again, because there is no active ADMIN connection for fabric register R/W commands, it will cause a segmentation fault. Here we set the association timeout value to 0 for the vfio-user transport, so that the ADMIN connection will not be disconnected when shutting down the controller; the ADMIN queue pair will be disconnected when the socket connection breaks. Change-Id: I3613169229bae384405889653e50f581d30d7c07 Signed-off-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/8557 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2021-07-06 19:36:04 +00:00
Changpeng Liu	d5102d37b3	nvmf/vfio-user: process NVMe response cdw0 correctly The NVMf library will set cdw0 based on specific command, so we use it directly in vfio-user, otherwise, some NVMe commands such as AER can't work. Fix issue #2016. Change-Id: Ie1a80a92c0856b61822ee51ce5d8faaaf1d463de Signed-off-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/8556 Reviewed-by: Ziye Yang <ziye.yang@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot	2021-07-06 19:36:04 +00:00
Changpeng Liu	e34ad3e2c5	nvmf/vfio-user: add two debug logs Also fix one incorrect print log. Change-Id: I3254baf4bbff4acfc0ef43f628d025931e8589ea Signed-off-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/8555 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ziye Yang <ziye.yang@intel.com> Reviewed-by: John Levon <levon@movementarian.org> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Community-CI: Mellanox Build Bot	2021-07-06 19:36:04 +00:00
Changpeng Liu	2ccb76c30a	nvmf/vfio-user: remove unnecessary macros These macros are only valid for Fabric transports. Change-Id: Ia456eebdcdab28e81226c1b3a7211fcb41b5e481 Signed-off-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/8554 Community-CI: Mellanox Build Bot Reviewed-by: John Levon <levon@movementarian.org> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2021-07-06 19:36:04 +00:00
Changpeng Liu	c138dfd3c0	nvmf/vfio-user: don't allocate internal data buffers for vfio-user target Change-Id: I75f1f1a493a480aadbc233b4583616886559565c Signed-off-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/8474 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: <dongx.yi@intel.com> Reviewed-by: John Levon <levon@movementarian.org>	2021-07-06 19:36:04 +00:00
Shuhei Matsumoto	563f69ebe8	bdev: spdk_bdev_get_by_name() hold mutex itself while traversing bdev name tree spdk_bdev_register() and spdk_bdev_add_alias() had not held mutex when adding bdev name or alias to global bdev name tree. This bug caused unexpected error when traversing global bdev name tree. The next patch will fix the bug. This patch is a preparation for the fix. spdk_bdev_get_by_name() had not held mutex while traversing bdev name tree. The major callers to spdk_bdev_get_by_name() had held mutex when calling it. However, this was not clear. Factor out the internal of spdk_bdev_get_by_name() into a helper function bdev_get_by_name() and then change spdk_bdev_get_by_name() to lock and unlock when calling bdev_get_by_name(). Then replace spdk_bdev_get_by_name() call in spdk_bdev_alias_add() and bdev_register() by bdev_get_by_name() call. spdk_bdev_get_by_name() call in spdk_bdev_examine() is not changed. This is called only from JSON RPC and not related with the bug. So we want to fix only unlocked access to global bdev name tree. Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: I25f07694e569eec10dba6c3c8543f6ce77412fe8 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/8523 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ziye Yang <ziye.yang@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Paul Luse <paul.e.luse@intel.com>	2021-07-05 14:46:30 +00:00
Shuhei Matsumoto	680388d45d	bdev: Move spdk_bdev_get_by_name() up in a file Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: Ia081edc6d04f2293296d61ec2f229f9823149bbf Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/8522 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ziye Yang <ziye.yang@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Paul Luse <paul.e.luse@intel.com>	2021-07-05 14:46:30 +00:00
Jim Harris	ac3a42b15c	nvmf: retry connect commands internally when subsys not ready It is better to not fail connect commands when a subsystem is not ready. The host will not be expecting that and will typically treat it as a catastrophic failure (i.e. it won't retry the connect). So instead when this situation occurs, start a poller for the connect request. We will continue to retry processing it until the subsystem is ready to handle it. Fixes issue #1985. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: Id8835df8f0edf1e889fdd7e754e261c2a880cbb6 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/8571 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Reviewed-by: Ziye Yang <ziye.yang@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>	2021-07-05 14:45:34 +00:00
Jim Harris	65ef1f32a6	nvmf: check for null admin_qpair when updating subsystem pg It is possible for a controller to get added to the subsystem before its admin_qpair has been assigned. We need to account for that when traversing the subsystem's ctrlr list when determining ns and ana_changes that need to be reported for the ctrlr. Found while doing stress testing with connects and subsystem ns add/remove. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: Ie54dc6ac202faeaeace054e6599f2dea2f30211e Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/8570 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2021-07-05 14:45:34 +00:00
Jim Harris	e8e2b469ec	nvme: use spdk_strerror to report CQ transport errors Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: I910c5a63e1f35fa76dfb7c296361fb1af7209e6b Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/8569 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2021-07-05 14:45:34 +00:00
Alexey Marchuk	05c7a2cc0f	nvme/fabrics: Fix trid trstring populate After correct trstring initialization, it is overwritten with the trstring value of the current probe ctx. That leads to a problem when an initiator connects to a subsystem with listeners of different transport types (e.g. TCP and RDMA). If probe_ctx has TCP type, then the discovery probe initializes the probe trid with trtype=RDMA and trstring=TCP. As a result, SPDK creates a TCP controller with trtype=RDMA and we hit an assert in the nvme_tcp_qpair function. Change-Id: I9355450c40c58fa55b016220703f6f7ae36b2571 Signed-off-by: Alexey Marchuk <alexeymar@mellanox.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/8464 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Ziye Yang <ziye.yang@intel.com> Reviewed-by: Paul Luse <paul.e.luse@intel.com>	2021-07-05 14:45:11 +00:00
John Levon	2c34af8bff	nvmf: fix nvmf_tgt_accept() return code Pollers are supposed to return SPDK_POLLER_{BUSY,IDLE}. Signed-off-by: John Levon <john.levon@nutanix.com> Change-Id: I92bd184aaba9e3efb730b68a6024ebc9757ffd8b Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/8559 Reviewed-by: Paul Luse <paul.e.luse@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Konrad Sztyber <konrad.sztyber@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2021-07-05 14:37:07 +00:00
Weifeng Su	d651f8a238	nvme/nvme_cuse: Fix race condition in cuse session If we continuously set up and tear down a cuse session, an uninitialized cuse session may be torn down, causing a segmentation fault. The new function cuse_session_create performs the session creation under g_cuse_mtx to avoid this issue. Signed-off-by: Weifeng Su <suweifeng1@huawei.com> Change-Id: I2b32e81c0990ede00eea6d4ed3a7e44d534d4df3 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/8231 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ziye Yang <ziye.yang@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com>	2021-07-05 14:36:24 +00:00
Shuhei Matsumoto	3959e397d4	nvme: Add new detach to a detach context while it is being polled This update makes it easier to use spdk_nvme_detach_async() and spdk_nvme_detach_poll_async() to aggregate multiple detachments. Previously, we could call: spdk_nvme_detach_async() spdk_nvme_detach_async() spdk_nvme_detach_async() and then start calling spdk_nvme_detach_poll_async(). Hence aggregating multiple detachments was already supported. After this patch, the following sequence is possible: spdk_nvme_detach_async() = 0 spdk_nvme_detach_async() = 0 spdk_nvme_detach_async() = 0 spdk_nvme_detach_poll_async() = -EAGAIN spdk_nvme_detach_async() = 0 spdk_nvme_detach_async() = 0 spdk_nvme_detach_poll_async() = -EAGAIN spdk_nvme_detach_poll_async() = -EAGAIN spdk_nvme_detach_poll_async() = -EAGAIN spdk_nvme_detach_poll_async() = 0 The actual change is to remove the variable polling_started from struct spdk_nvme_detach_ctx because it is not necessary anymore. Clarify this change by updating the header file and CHANGELOG. Verify this change by unit test. Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: Iebdf6c27c5304a2097b7084c315ccc99634ffa1e Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/8468 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>	2021-06-30 22:54:19 +00:00
Shuhei Matsumoto	4fe4040a14	nvme: Add spdk_nvme_detach_poll() to simplify a common use case Add a new function spdk_nvme_detach_poll() to simplify the common use case of polling until all detachments complete. Then use the function for that use case throughout. In addition, the usage in the simple_copy application was not correct and is fixed in this patch. Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: Ic14711cd8478bf221c0fe375301e77b395b37f26 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/8509 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2021-06-30 22:54:19 +00:00
John Levon	56e327e795	vfio-user: fix nvmf_vfio_user_poll_group_add() comment The function comment was referring to a non-existent caller; instead, expand with a little more detail on the path taken for new QPs. Signed-off-by: John Levon <john.levon@nutanix.com> Change-Id: I42478194f3cfc18a6ff6c434964630ac42866f1d Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/8534 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com>	2021-06-30 22:53:38 +00:00
Jim Harris	10c7d133be	nvmf: print debug response value after prop size check When the property is 8 bytes but the host only requested 4, we need to mask and only return the bytes requested by the host. Wait to do the DEBUGLOG until after that has happened. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: I8f476a47e9fd07bf652fd64f3b1c17d650374167 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/8506 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Reviewed-by: <dongx.yi@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Paul Luse <paul.e.luse@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com>	2021-06-30 19:23:40 +00:00
Michal Berger	7232c450f9	configure: Build against installed DPDK instance Interpret bare --with-dpdk opt as user's request to find installed (provided by the distro) DPDK's libs\|include files and use them during the build. Signed-off-by: Michal Berger <michalx.berger@intel.com> Change-Id: I9da99671b95af0121194b3a6d53636b0ded71f1b Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/8348 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Reviewed-by: Karol Latecki <karol.latecki@intel.com> Reviewed-by: Tom Nabarro <tom.nabarro@intel.com> Reviewed-by: <tomasz.rochumski@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2021-06-29 18:17:43 +00:00
Changpeng Liu	15beaa20bf	nvme: print NVMe command and response when enable nvme log flag Fix issue #2010. Change-Id: I9ffc77ddfececce1e6bdac49939d616d9e7bb3c0 Signed-off-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/8493 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ziye Yang <ziye.yang@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>	2021-06-29 15:13:24 +00:00
paul luse	3bbfbb5b0f	lib/idxd: update some func params for consistency Was using "dst" in some cases and "crc_dst" in others for crc32c related calls. Update them to always use crc_dst Signed-off-by: paul luse <paul.e.luse@intel.com> Change-Id: Icf200f1734c64c29881f23b02b8d12bad81b3ca0 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/8186 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Ziye Yang <ziye.yang@intel.com>	2021-06-29 00:46:25 +00:00
paul luse	10808e45d4	idxd: refactor flow control for idxd engine Recent work identified race conditions having to do with the dynamic flow control mechanism for the idxd engine. In order to both address the issue and simplify the code a new scheme is now in place. Essentially every DSA device will be allowed to accommodate 8 channels and each channel will get a fixed 1/8 the number of work queue entries regardless of how many channels there are. Assignment of channels to devices is round robin and if/when no more channels can be accommodated the get channel request will fail. The performance tests also revealed another issue that was masked before; it's a one-line fix so it is in this patch for convenience. In the idxd poller we limit the number of completions allowed during one run to keep the poller thread from starving other threads, since as operations complete on this thread they are immediately replaced up to the limit for the channel. Signed-off-by: paul luse <paul.e.luse@intel.com> Change-Id: I913e809a934b562feb495815a9b9c605d622285c Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/8171 Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2021-06-29 00:46:25 +00:00
Zhiqiang Liu	e4746ad40f	idxd: fix memleak problem in spdk_idxd_configure_chan() In spdk_idxd_configure_chan(), if memory allocation fails inside the TAILQ_FOREACH() {} block, we goto the err_user_comp and err_user_desc tags, in which we do not free chan->completions and confuse batch->user_completions with chan->completions. A memory leak and a double free may occur. Signed-off-by: Zhiqiang Liu <liuzhiqiang26@huawei.com> Change-Id: I0e588a35184d97cab0ea6b6c013ca8b3342f940a Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/8432 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ziye Yang <ziye.yang@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Community-CI: Mellanox Build Bot	2021-06-28 16:30:06 +00:00
Shuhei Matsumoto	b503ef4fa0	nvmf: Fix heap-use-after-free when poll_group_remove() is called after ctrlr is freed When a qpair is destroyed and the qpair is the last, _nvmf_ctrlr_free_from_qpair() (in lib/nvmf/nvmf.c) sends two messages, one is for _nvmf_ctrlr_destruct() and another is for _nvmf_transport_qpair_fini(). We do not know which of the two completes earlier. _nvmf_ctrlr_destruct() frees the qpair->ctrlr in the end. On the other hand, _nvmf_ctrlr_free_from_qpair() calls spdk_nvmf_poll_group_remove() in the end, and spdk_nvmf_poll_group_remove() accesses the qpair->ctrlr to free queued requests to the qpair. Before one recent change, spdk_nvmf_poll_group_remove() had been called before _nvmf_ctrlr_free_from_qpair() was called. Hence extract the operation to free queued requests from spdk_nvmf_poll_group_remove() and inline it into _nvmf_qpair_destroy(). Fixes one showstopper error to investigate the issue reported in #1819. Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: I29c43ff7b289fc77a5de9c33e0266301c412e208 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/8438 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: <dongx.yi@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Community-CI: Mellanox Build Bot	2021-06-28 16:25:24 +00:00
Tomasz Zawadzki	127fc0d0c3	scheduler_dynamic: consider any core for the thread Previously core load was only considered for main lcore. Other cores were used based on cpumask only. Once an active thread was placed on core it remained there until idle. If _get_next_target_core() looped around, the core might receive another active thread. This patch makes the core load matter for placement of any thread. As of this patch if no core can fit a thread it will remain there. Later in the series least busy core will be used to balance threads when every core is already busy. Modified the functional test that depended on always selecting consecutive core, even if 'current' one fit the bill. Later in the series the round robin logic for core selection is removed altogether. Fixed typo in test while here. Note: _can_core_fit_thread() intentionally does not check core->interrupt_mode and uses tsc. That flag is only updated at the end of balancing right now. Meanwhile tsc is updated once the first thread is moved to the core, so it is no longer considered in interrupt mode. Signed-off-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Change-Id: I95f58c94e3f5ae8a468723d1dd6e53b0e417dcc3 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/8069 Reviewed-by: Maciej Szwed <maciej.szwed@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot	2021-06-28 16:18:19 +00:00
Tomasz Zawadzki	2d79bf58fb	scheduler_dynamic: balance idle threads in separate pass Idle threads are always moved to main core, there are no other considerations. Doing it as a separate first pass allows the core stats to be up to date for the second pass for active threads. Core load stats will be used later in the series to determine optimal target core for an active thread. Signed-off-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Change-Id: I6a9bc11b86e954e461f7badebf3a6e4d1718f63c Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/8067 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Maciej Szwed <maciej.szwed@intel.com> Reviewed-by: Paul Luse <paul.e.luse@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2021-06-28 16:18:19 +00:00


8466 Commits