numam-spdk

Author	SHA1	Message	Date
Jim Harris	3fe748f728	blob: move bs_release_cluster We will use it earlier in this file in a future patch. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: I554f2073185d466bd0b4e98bdeec721f763c1b44 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3969 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com>	2020-09-01 07:50:33 +00:00
Jim Harris	8ba7450e8c	blob: simplify check for available cluster count Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: I56a9ca114b84ded4ac23a9eaacadaa6df8f031cc Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3968 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com>	2020-09-01 07:50:33 +00:00
Jim Harris	f5b949d5fd	blob: simplify free cluster counting in inflate path Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: I5716290476d4abdf8646a36206f07acb5b2dd4ec Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3967 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com>	2020-09-01 07:50:33 +00:00
Jim Harris	2d87587fe5	blob: claim clusters inline during initialization When claiming clusters as part of blobstore initialization or recovery, just call spdk_bit_array_set directly rather than going through the bs_claim_cluster function. We will be modifying how runtime cluster allocation works so need to separate the two use cases. This code is very small so inlining it has minimal code impact. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: Iaaa1c817e57b4a2eea62eb4683407364bac1fcc0 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3966 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com>	2020-09-01 07:50:33 +00:00
Jim Harris	b7d508f5e5	blob: use spdk_bit_array_load/store_mask functions These functions were added during FTL development and are more efficient than the roll-your-own implementations blobstore had previously. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: Ie09e5c305e6e171af0258e805f2aac3b88822b5e Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3965 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com>	2020-09-01 07:50:33 +00:00
Tomasz Zawadzki	846ca67594	lib/blob: remove requirement for md_page alignment The md_page alignment is not really required for md_page buffers. Allocating 4k aligned buffers all the time, causes memory to be heavily fragmented. Due to DPDK keeping track of the allocation in the same DMA region as the allocation themselves. Removing this alignment requirement will help DPDK when searching for the right part of memory in the heap. Signed-off-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Reported-by: Mike Cui Change-Id: If2f4ca2be38d432d5740f6145b5e0ff46237806b Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3853 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>	2020-08-24 07:38:00 +00:00
Jim Harris	ed7848f2df	blob: handle overlapping open case We only create one spdk_blob object for a given blob, and just increase the ref_count if it is opened multiple times. bs_open_blob would do the lookup for existing opened blobs. But if the blob is opened again, before the previous open operation has completed, we would end up with two spdk_blob objects for the same blob. Solution is to do another lookup when the open operation completes. If we find the blob, free the one we just finished opening and return the existing one instead. Also added unit test that failed on the existing code but passes now with this patch. Signed-off-by: Jim Harris <james.r.harris@intel.com> Reported-by: Mike Cui Change-Id: I00c3a913b413deddf06f0b63f7a669efb2b5658f Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3855 Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2020-08-21 08:24:09 +00:00
Sochin Jiang	db3d1201a4	lib/blob: fix a data corruption bug There is a fatal bug that could easily cause data corruption when using thin-provisioned blobs. In blob_request_submit_rw_iov(), we first get lba by calling blob_calculate_lba_and_lba_count(), blob_calculate_lba_and_lba_count() calculates different lbas according to the return of bs_io_unit_is_allocated(). Later, we call bs_io_unit_is_allocated() again to judge whether the specific cluster is allocated, the problem is it may have be allocated here while not be allocated when calling blob_calculate_lba_and_lba_count() before. To ensure the correctness of lba, we can do lba recalculation when bs_io_unit_is_allocated() returns true, or make blob_calculate_lba_and_lba_count() return the result of bs_io_unit_is_allocated(), use the second solution in this patch. By configuring more than one cpu core, md thread will run in a separate SPDK thread, this data corruption scenario could be easily reproduced by running fio verify in VMs using thin-provisioned Lvols as block devices. Signed-off-by: Sochin Jiang <jiangxiaoqing.sochin@bytedance.com> Change-Id: I099865ff291ea42d5d49b693cc53f64b60881684 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3318 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com>	2020-08-20 09:26:23 +00:00
Ben Walker	30ee8137cf	blob: Add a bitmask for quickly checking which blobs are open This can speed up the check for whether a blob is already open significantly. Signed-off-by: Ben Walker <benjamin.walker@intel.com> Change-Id: If32b0b1f168fcdb58e61df6281d7b7520725a195 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/2781 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Paul Luse <paul.e.luse@intel.com>	2020-07-07 07:30:58 +00:00
Sochin Jiang	7137bde120	lib/blob: fix a potential wrong release of md page blob_insert_cluster_msg() will release the already claimed md page(intended for extent page) if the corresponding extent page is found to be allocated. But later blob_insert_extent() may fails, and this cause blob_insert_cluster_cpl() to release the same md page again, this could be wrong if this specific md page is clamied by others, thus cause data corruption. So, put it to zero after released in blob_insert_cluster_msg(). Signed-off-by: Sochin Jiang <jiangxiaoqing.sochin@bytedance.com> Change-Id: I46eba79b24b1950318002dcb27cb51b01ca566ec Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3152 Community-CI: Mellanox Build Bot Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2020-07-03 07:30:30 +00:00
Yan Wang	abeae98424	lib/blob: check return value after malloc() and strdup() Signed-off-by: Yan Wang <wangyan122@huawei.com> Change-Id: Ieaaecfd12ddf7826c2e4ce6dc5f47411e7e077da Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/2907 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Paul Luse <paul.e.luse@intel.com>	2020-06-29 09:20:12 +00:00
Seth Howell	964463e420	lib/blob: remove spdk_ prefix from bs_batch_read_bs_dev This function was removed from the public API last release when the map file was introduced, but I didn't clean up the name at that time. Signed-off-by: Seth Howell <seth.howell@intel.com> Change-Id: I3101723b504531ce2c51dba2feb063511dd32684 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/2443 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>	2020-06-01 09:21:14 +00:00
Seth Howell	b5d68d5934	lib/blob: remove _spdk prefix from all functions. Signed-off-by: Seth Howell <seth.howell@intel.com> Change-Id: Idb33816e5b66266987845172c27c87667ac0a596 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/2437 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>	2020-05-27 07:35:02 +00:00
Sochin Jiang	e230d49ae2	blob: fix a wrong memset parameter A wrong parameter is passed to memset when loading blob->active.clusters, this leads to an unpredictable wrong lba value using thin provision LVOLs while submitting IO requests, thus causes EIO error using QEMU vms. Signed-off-by: Sochin Jiang <jiangxiaoqing.sochin@bytedance.com> Change-Id: Iecea80cfa58f7a025603430d666fd9cd4d3fea8b Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/2431 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Xiaodong Liu <xiaodong.liu@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com>	2020-05-21 09:20:30 +00:00
Seth Howell	3456377b45	lib: accel, bdev, blob, env_dpdk remove spdk_ prefix. Hitting only the static functions from the above libraries with the spdk_ prefix. Signed-off-by: Seth Howell <seth.howell@intel.com> Change-Id: Ic6df38dfbeb53f0b1c30d350921f7216acba3170 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/2362 Community-CI: Mellanox Build Bot Community-CI: Broadcom CI Reviewed-by: Paul Luse <paul.e.luse@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2020-05-21 09:19:00 +00:00
Seth Howell	d18e63206a	mk/lib: add a check that major and minor version is set for libs. Also, while we are here, consolidate setting SO_SUFFIX to one spot. Previously, it was possible for a library to slip through without an SO version. Signed-off-by: Seth Howell <seth.howell@intel.com> Change-Id: I4db5fa5839502d266c6259892e5719b05134518c Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/2361 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Community-CI: Broadcom CI Reviewed-by: Paul Luse <paul.e.luse@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com>	2020-05-21 09:19:00 +00:00
Tomasz Zawadzki	887531f6a6	lib/blob: provide better log on blob_load failures Added blobid and metadata page number to the log. Previously only number within particular blobs md chain was displayed. Signed-off-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Change-Id: I8e881c5824c9d2eadca9f3ac8ee2ac9ffc0e5cae Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/2058 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2020-05-13 07:46:43 +00:00
Tomasz Zawadzki	7f007b44a9	lib/blob: clear blobid and md_page on blob_create failure Blobid and md_page is claimed as first step of blob creation. If blob creation failed, both should returned to be used by other blobs. This caused multiple reports of: "Metadata page 1 crc mismatch" when loading blobstore due to md_pages not actually containing the written out md pages. Signed-off-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Change-Id: I495452c578d879f749281cebf8975eb2c1c7f79a Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/2057 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2020-05-07 08:30:19 +00:00
Tomasz Zawadzki	b3348624e7	blob: add pages_per_cluster_shift Operation of locating right lba from cluster map is done on I/O path. Instead of division and multiplication, perform bit shift operation. Bit shift is only used when pages per cluster is power of 2. Signed-off-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Change-Id: Ic3ed7ec0a82867a8a4bc6391785b9d40c800aacb Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/1724 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2020-04-24 15:45:21 +00:00
Tomasz Zawadzki	a6587c7e82	blob: inline _spdk_blob_calculate_lba_and_lba_count() This is called on hot path for I/O, inline it. Signed-off-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Change-Id: Iec40033eac19f2c66c2984623acb5e157a5ffe05 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/1723 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Paul Luse <paul.e.luse@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2020-04-22 09:18:41 +00:00
Seth Howell	ba870c2e99	lib/blob: remove some unused internal APIs spdk_bs_sequence_unmap and spdk_bs_batch_to_sequence were unused. Signed-off-by: Seth Howell <seth.howell@intel.com> Change-Id: I4edc45948941ba700bea353c20ddb4e3f2fa15a4 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/1701 Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2020-04-15 22:10:08 +00:00
Seth Howell	ad7fdd12b1	lib/blob: remove spdk_ from non-public APIs We have an unofficial naming convention that the spdk_ namespace is reserved for public API functions only. This patch is attempting to bring the blob library into compliance with that naming convention. Signed-off-by: Seth Howell <seth.howell@intel.com> Change-Id: Ie298e41d1b741dae01744826c208378ee60f9d0a Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/1700 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Community-CI: Broadcom CI	2020-04-15 22:10:08 +00:00
Seth Howell	75365cfa60	lib/blob: add a map file. Signed-off-by: Seth Howell <seth.howell@intel.com> Change-Id: I72da725f82dfa3eeb2c0ec6614ecc9ffb0b852a8 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/1682 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Community-CI: Broadcom CI	2020-04-15 22:10:08 +00:00
Tomasz Zawadzki	c7325e3fad	lib/blob: assert that md_page is not extent page Contidion previous to this should already verify that md page is not an extent page. All extent pages are not part of the chain (sequence_num == 0), and their location (ctx->cur_page) cannot be the root of md chain (page->id). Yet during development it could appen, so adding assert to verify further that the md page is not extent page. Signed-off-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Change-Id: I6d5dc2ae965f8f9a388cd1c8e186145f8ca91db4 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/1667 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2020-04-07 08:37:41 +00:00
Tomasz Zawadzki	955c58fa68	lib/blob: change replay of extent pages to batching Before this patch reading out the extent pages during blobstore replay was serialized. Only issuing reads for next extent page when previous operation finished. This was done by continously calling _spdk_bs_load_replay_extent_page_cpl() and decreasing ctx->num_extent_pages. This patch changes spdk_bs_sequence_* to spdk_bs_batch_*. All the reads are submitted at once, and only when all of them finish we proceed to next valid md chain. Goal of this change is improving efficiency and readability. Signed-off-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Change-Id: I807cdb98166e04706fedb494363f5776e3151827 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/1540 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2020-04-07 08:37:41 +00:00
Tomasz Zawadzki	22dc6254d6	lib/blob: add extent_pages to load ctx This variable currently holds single extent page. Further patch will utilize it to use multiple. Signed-off-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Change-Id: If86ffd57cecf5d3bfd0812a767c784d7bf503fb6 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/1538 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2020-04-07 08:37:41 +00:00
Tomasz Zawadzki	f03586b28a	lib/blob: rename extent_pages to extent_page_num in load ctx This field does not hold actual pages, but just md page numbers which hold the extent pages. Rename as prepartation to adding new one that will hold actual extent pages. Signed-off-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Change-Id: I1fb85a58c92a93b968e1fad22e421252399e9281 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/1537 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2020-04-07 08:37:41 +00:00
Tomasz Zawadzki	e0fb369c26	lib/blob: add more error paths on blob_persist This patch adds more ways to back off when parts of blob persist fails. Otherwise the process would proceed as if nothing happened. Signed-off-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Change-Id: I7cff73e1dc3066d0c822d1e3dac4bd35e27cd54a Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/1263 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2020-04-02 07:03:35 +00:00
Tomasz Zawadzki	597c91abdf	lib/blob: always add back the snapshot if delete fails After opening the blob for deletion, in _spdk_bs_delete_open_cpl(), the blob is removed from list of blobs in blobstore. This is to prevent future _spdk_blob_lookup()s from referencing blob while it is deleted. In usual blob deletion path, next step is proceeding with deletion of the blob by reducing its size to 0 and syncing the blob. Changes from this point forward are persisted. Meanwhile in special case of deleting snapshot which has single clone on it, before above occurs additional steps are performed. Each of the blobs are opened and their attributes changed. Failures on those steps are fully recoverable on any errors, and in such case blob should be added back to the bs list of blobs. Original code had condition on how many references there were to blob being deleted, which is incorrect. Any error on that path should clean up after itself (revert attributes and close blobs) and re-add the blob. This change is tested with blob_delete_snapshot_power_failure() UT, by adding error path in persist - which triggers error in aforementioned blob delete code path. Signed-off-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Change-Id: I926e7cbf3cb86170c69f31231399535859f290dd Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/985 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2020-04-02 07:03:35 +00:00
Tomasz Zawadzki	9dd7c9f18b	lib/blob: Set thin_provision back when sync fails on snapshot creation When creating snapshot, 'original' blob will end up being a thin provisioned clone. Before that first thin_provisioned 'newblob' is created during this process. If the first md sync for 'newblob' fails, it means that only valid references to clusters are still only present in 'original' blob. The 'newblob' can be safely cleaned up. Unfortunetly 'newblob' inherited some of 'original' blob properties before sync. Cluster maps were already swaped in current cleanup code. But during blob close of 'newblob' - persist blob code expects clusters to be 0 only for thin_provisioned blobs. If original blob was thick, then it triggers an assert within persist code. This patch makes sure to set thin_provision to 'newblob', to align with its creation. Added asserts to verify that clusters maps are 0's, which should be the case as I/O to origblob is frozen. Signed-off-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Change-Id: I5420617792aefe8a3ef4e5989b2056504cdd1850 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/1394 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2020-04-02 07:03:35 +00:00
Tomasz Zawadzki	c65b591b7d	lib/blob: move sequential extent page write to the caller This will help with making the _spdk_blob_persist_write_extent_pages() batch all writes of extent pages. No functional change occurs with this patch, this is just refactor for future change. Signed-off-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Change-Id: I8c93b1d6473db660f7ad5e04c8ec9f3331b2055c Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/986 Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2020-03-24 07:39:47 +00:00
Tomasz Zawadzki	28a44891f1	lib/blob: add error path on persisting dirty bs super->clean value signifies if blobstore was unloaded cleanly. If it was not, then during bs_load the _spdk_bs_recover() procedure if called. Meanwhile bs->clean is always set to 1 after load, causing very first blob_persist to also re-write super block with the super->clean set to 0. To signify that md has changed and possibly trigger the recovery if clean bs unload does not occur. When the re-write of super block succeeds the bs->clean is set to 0, because further re-writes of super block are not needed on next blob persist. This patch resolves issue when: 1) reading super block fails - execution should backoff, to prevent writing an empty buffer as super block ! 2) writing super->clean = 0 to the super block fails - execution again should fail, and bs->clean should not be set to 0. It will cause next persist to attempt re-write again. Signed-off-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Change-Id: Ia07cc5c6c107310059b50886edb7283c176b9169 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/1164 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2020-03-24 07:39:47 +00:00
Tomasz Zawadzki	0f5157377f	lib/blob: merge EP of a clone when deleting a snapshot In general it is not possible to delete snapshot when there are clones on top of it. There is special case when there is just a single clone on top that snapshot. In such case the clone is 'merged' with snapshot. Unallocated clusters in clone, are filled with the ones in snapshot (if allocated there). Similar behavior should have occurred for extent pages. This patch adds the implementation for moving EP from snapshot to clone along with UT. The UT exposes the issue by allowing delete_blob to proceed beyond just unrecoverable snapshot blob. Fixes #1291 Signed-off-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Change-Id: Ib2824c5737021f8e8d9b533a4cd245c12e6fe9fa Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/1163 Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2020-03-23 09:37:27 +00:00
Seth Howell	193927830d	make: rev SO versions individually for libraries. This will allow us to keep track of compatibility issues on a per-library basis. Change-Id: Ib0c796adb1efe1570212a503ed660bef6f142b6e Signed-off-by: Seth Howell <seth.howell@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/1067 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2020-03-18 08:02:30 +00:00
Tomasz Zawadzki	2731f52607	lib/blob: add md page parameter to _spdk_bs_load_replay_md_parse_page() Further part of the series will hold array of md pages in the ctx. Callers of _spdk_bs_load_replay_md_parse_page() will make select which page to parse. Signed-off-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Change-Id: I3fb70660672ba74bdb338eb1233409103903b215 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/983 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>	2020-03-06 10:28:43 +00:00
Tomasz Zawadzki	2a51c755db	lib/blob: use common pointer to md page in load replay path This is refactoring change for future patches. struct spdk_bs_load_ctx will contain array of pages instead of single one. Having to change just single line for selection of page will make it easier to read next patches in series. Signed-off-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Change-Id: If3dc1e7da7e61c7b4866307d859e55131a32d38b Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/982 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>	2020-03-06 10:28:43 +00:00
Tomasz Zawadzki	030be573f3	lib/blob: queue up blob persists when one already is ongoing It is possible for multiple blob persists to affect one another. Either by blob->state changes or blob mutable data. Safe way to prevent that is to queue up the persists. Next persist will be executed only after previous one completes. Fixes #1170 Fixes #960 Signed-off-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Change-Id: Iaf95d9238510100b629050bc0d5c2c96c982a60c Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/776 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>	2020-02-21 09:35:27 +00:00
Tomasz Zawadzki	dd80edb2b4	lib/blob: move starting persist to separate function _spdk_blob_persist_check_dirty() function will be called in subsequent patch at the end of persist in _spdk_blob_persist_complete() to proceed with any queued up persists. Please see following patch for this. Signed-off-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Change-Id: Ieeb334e23cde329743647f728e70dd60333c224a Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/872 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>	2020-02-21 09:35:27 +00:00
Tomasz Zawadzki	29bd502046	lib/blob: add invalid flag for extent table With recent changes to extent on-disk metadata format, new format (Extent Pages) is not backwards compatible. Meanwhile old format (Extent RLE) is backwards compatible with older SPDK applications. Summing up: Blobstore created pre SPDK 20.01 can only use Extent RLE. Blobstore created starting with SPDK 20.01 can use both, Extent Pages and Extent RLE specified by use_extent_table opts. When use_extent_table is set to true, invalid flag for it is set. SPDK application pre 20.01, will not load such blob. Signed-off-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Change-Id: If14ebd03f19eb581d71dcb46191e099336655189 Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/483220 Community-CI: SPDK CI Jenkins <sys_sgci@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Paul Luse <paul.e.luse@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>	2020-01-31 09:28:56 +00:00
Tomasz Zawadzki	bba0f040cf	lib/blob: fix uninitialized variable for non-debug builds This was observed after running nighly tests on previous patch. As part of it, autopackage.sh compiles SPDK without debug flag set. Exposing the uninitialized var here. Signed-off-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Change-Id: Iedb1641f3c0d4a21f293c81cd4fcf35c6d1c7ae5 Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/482893 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Paul Luse <paul.e.luse@intel.com>	2020-01-28 09:15:23 +00:00
Tomasz Zawadzki	12d1404125	lib/blob: set default use_extent_table to true Extent table and extent page descriptors are now set to be default way clusters are serialized on disk. With this patch UT are ran with and without extent table. Changed two asserts in test, since amount is dependent on which type of serialization is used. Signed-off-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Change-Id: Ica58fce6a4effd014d7dd40ee26edd0fa3196d0f Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/481901 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Paul Luse <paul.e.luse@intel.com>	2020-01-28 09:15:23 +00:00
Tomasz Zawadzki	5d5053373f	lib/blob: fix inserting extent pages ctx->extent_page signifies if page was allocated for insertion. 1) It is possible for a thread to claim extent page on its own thread, and put it in ctx->extent_page. If conflicting thread allocates another ctx->extent_page, then it should be freed. This does not mean failure to insert cluster. As different threads could have been trying to allocate different clusters, so condition on line 6716 does not cover it. If so then it shouldn't be an issue to release the claimed ctx->extent_page and proceed with updating the extent page which originally won the race. NOTE: if clusters were conflicting, then extent_page is freed in _spdk_blob_insert_cluster_cpl(). 2) At this point of _spdk_blob_insert_cluster_msg() we already verified that there already is extent page allocated at "*extent_page". In such case ctx->extent_page will be 0, and should not be used. Signed-off-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Change-Id: Id5b57c88248890eee60d2e7dbecbd984c98b561b Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/482867 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Paul Luse <paul.e.luse@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2020-01-28 09:15:23 +00:00
Tomasz Zawadzki	1dd8c57ab8	lib/blob: fix clearing cluser array on EP load sz is set to number of clusters that should be have been in particular unallocated EP (remaining_clusters_in_et up to SPDK_EXTENTS_PER_EP). The cluster array should be set to 0 only in region between original size (cluster_array_size) and new total size (active.num_clusters). It was incorrectly using sz in the memset. Signed-off-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Change-Id: Ic43e89c17d53e9529e3ed0349aeb4fb7dc6593f2 Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/482858 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Paul Luse <paul.e.luse@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2020-01-28 09:15:23 +00:00
Tomasz Zawadzki	8b6df6fbab	lib/blob: count clusters starting from 0 when parsing EP Previously part of function assumed that cluster count 1)means number of clusters in EP and another 2) that it is following the active.num_clusters (akin to extent_rle). This was incosistent and showed when using multiple extent pages to serialize metadata. This patch changes it to only go with 1), so it is clear that it means number clusters within particular EP. Signed-off-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Change-Id: I155104cabc127ed47df04434032fb01e08948e13 Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/482848 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Paul Luse <paul.e.luse@intel.com>	2020-01-28 09:15:23 +00:00
Tomasz Zawadzki	dd0a320cf9	lib/blob: Always persist at least single extent table desc Following changes are done in this patch: 1) _spdk_blob_serialize_extent_table now persists at least single extent page. When num_extent_pages == 0. 2) Minimum valid size of ET descriptor is even without extent_pages. This is a case when there are no EP, but we still want to persist num_clusters in ET. 3) Taking above points, redone the loop for serializing extent pages. 4) Make sure to mark blob dirty if any new extent pages were allocted. Signed-off-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Change-Id: I81dc6cf2de2722bb49927ed42f4b9f31292f78c5 Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/482847 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Paul Luse <paul.e.luse@intel.com>	2020-01-28 09:15:23 +00:00
Tomasz Zawadzki	c177a3c841	lib/blob: remove assumption of empty cluster array with multiple EP Changed assert checking if cluster array is allocated when loading extent pages. This is true only for the first extent page being loaded, of course after that the cluster array can be already allocated. Signed-off-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Change-Id: I0f96294ede5a12ffd6bca73cbeadba8d94a35bac Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/482857 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Paul Luse <paul.e.luse@intel.com>	2020-01-28 09:15:23 +00:00
Tomasz Zawadzki	42109157f4	lib/blob: add starting cluster index to extent page Size of a blob (thus size of clusters array in mutable data) is known from extent table descriptor. Extent pages were read sequentially in order they were placed in extent table. This meant that cluster array could have been filled up from beginning to end. Yet reading extent pages in any other order, would result in incorrect placement of clusters. This patch adds first cluster index that is contained within each extent page. This will allow to read/write multiple extent pages in parallel, since we will know where in clusters array to put the cluster idxs. Signed-off-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Change-Id: Ib6b9332111cd93f990d057dc60624152907dd87f Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/482701 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Paul Luse <paul.e.luse@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2020-01-28 09:15:23 +00:00
Tomasz Zawadzki	78257ab613	lib/blob: rename num_clusters_in_et to remaining_clusters_in_et This is more adequate name, since this value if first read from Extent Table descriptor. Then decreased when iterating over entries in extent table and extent pages are read. Signed-off-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Change-Id: Ib188c524b8488b38d4de063a9970dcfdf49c9acd Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/482600 Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Paul Luse <paul.e.luse@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2020-01-27 18:06:43 +00:00
Tomasz Zawadzki	b5380c370d	lib/blob: replay the clusters from Extent Pages When replaying md chain for a blob, extent table descriptor can be read. When it is present, all allocated pages it points to are now being put into extent_pages array in ctx. If multiple extent table descriptors are in single md chain, the array is expanded accordingly. After replaying single md chain is done, replay extent pages starting from last one. Replaying extent pages, is similar to extent_rle in that each allocated cluster is claimed and number of free clusters in blobstore decreased. When all extent pages are read, return to _spdk_bs_load_replay_md_cpl() for continuing replaying next valid md chain. Signed-off-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Change-Id: I4573226aff7d7b1bcdfd188518235c8d4b68a4c3 Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/481621 Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2020-01-27 18:06:43 +00:00
Tomasz Zawadzki	5dc88c6ccb	lib/blob: _spdk_bs_load_replay_md_parse_page() now takes only load ctx _spdk_bs_load_replay_md_parse_page() is only used in replay path during blobstore load. Next patch will expand the load ctx with array of extent pages to be read. It is filled out when reading in-chain metadata of extent table descriptors. Passing the load ctx here will make it simpler to fill out the array when processing extent table. Signed-off-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Change-Id: If96e6670560c8c4a3610f33ece14c354d7d5da39 Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/482412 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Paul Luse <paul.e.luse@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2020-01-27 18:06:43 +00:00

1 2 3 4 5 ...

320 Commits