numam-spdk

Author	SHA1	Message	Date
Tomasz Zawadzki	42ad9d0379	lib/blob: resize open_blobids on bs load open_blobids holds bit array of currently open blobs, this is a way for quicker determination than iterating over all blobs. See patch introducing it: (`30ee8137`)blob: Add a bitmask for quickly checking which blobs are open That patch added resizes of this bit array to bs init and bs recovery path (not shut down cleanly). But that patch skipped over bs load from a clean shutdown. This resulted in blob open having multiple blob pointers that target the same blob id. Fixes #1937 Signed-off-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Change-Id: I3c42a63d168d1f5b013b449f010c5b207936045b Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/7998 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Ziye Yang <ziye.yang@intel.com> Community-CI: Mellanox Build Bot	2021-05-25 07:12:07 +00:00
Tomasz Zawadzki	ceaa0c7fa9	lib/blob: complete multiple persists When blob persist starts, there can already be multiple of such requests pending. It is possible to complete a set of persists at once, if blob state after their execution would be the same. This is the case when persists are already pending when a particular persist request is started. This patch implements such mechanism by introducing persists_to_complete queue, containing entries that were previously queued up before starting the current persist request. If there are any entries in this queue, further requests are put into pending_persists. When first request from persists_to_complete is persisted, completions are issued for all requests on that queue at once. If at that point there are any new entries on pending_persists, all of them are put into persists_to_complete. Persist process is started again with the first request from that queue. Signed-off-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Change-Id: I10063e55d6f821b1863de016d3148da6a719a422 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/7643 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>	2021-05-24 10:08:00 +00:00
Tomasz Zawadzki	a8fe601853	lib/blob: delay executing blob persist completions This helps in next patch in series where multiple completions will be executing. UT is adjusted since one additional poll is required. Signed-off-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Change-Id: Id72377ddef91e40cdbc2bdea6f33c23309b0ca3d Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/7642 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>	2021-05-11 12:03:57 +00:00
Tomasz Zawadzki	958364b4a0	lib/blob: only verify cluster allocation when I/O is not frozen During snapshot creation the original blob becomes a thin provisioned blob that will only the diff of data after snapshot creation. Despite the comment in the UT the number of polls before issuing blob write was hitting blob BEFORE it swapped with new one. Issuing I/O during this period shall check for io freeze before checking cluster allocation. Otherwise bs_io_unit_is_allocated() hits assert for thin provisioned blob. This is because cluster map of blob is empty, but properties have not been updated yet. Signed-off-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Change-Id: I742e1a50b14d456ae1e6de13b5111caec3e8322c Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/7641 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>	2021-05-11 12:03:57 +00:00
Tomasz Zawadzki	50935184c8	lib/blob: force execution of queued persists When performing snapshot creation the I/O is frozen during the process. The blob persists for extent page allocation is delayed until snapshot creation is finished. This results in multiple blob persists executing one after the other, with only intent of writing out updated extent table pointing to new extent pages. Since blob->state is marked DIRTY before issuing each persist, but a single persist completion marks state CLEAR. Blob serialize correctly expects each persist to contain dirtied metadata, in order to avoid unnecessary md writes. Since all other instances of marking blob DIRTY is explicit, assert in blob serialize is left as is. Instead when running the queued up blob persists, the blob state is marked DIRTY. Side effect is that it will write out same md in some cases. Fixes #1909 Signed-off-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Change-Id: I39f37299f3f0ebfccbdd4063781b5ecce286e993 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/7640 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com>	2021-04-29 07:10:13 +00:00
Tomasz Zawadzki	ae10cf633c	lib/blob: do not write extent pages during md sync for thin blobs Before this patch blob persist wrote out all allocated extent pages. Intended design was to write out extent pages in two cases: 1) Thin provisioned blobs to write out extent pages when necessary during cluster allocation. 2) Thick provisioned blobs to write extent pages during blob persist when the blob was resized This patch implements 1) by inserting extent before issuing blob persist in cluster allocation path. See blob_persist_extent_page_cpl() and blob_insert_new_ep_cb(). Blob persist might have to rewrite the last extent page after blob resize. See blob_persist_start(). Meanwhile 2) was incorrecly implemented since it always re-wrote all extent pages starting from 0. This was addressed by limiting number of extent pages written, only to ones that were resized. Some considerations were needed: a) blob resize happen on cluster granularity, it might be needed to re-write last extent page if resize was not large enough to change number of extent pages b) first extent page to write should be based on the num_extent_pages from active or clean, depending on resize direction See blob_persist_start(). Signed-off-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Change-Id: Ibba9e3de3aadb64c1844a462eb0246e4ef65d37f Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/7202 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Community-CI: Mellanox Build Bot	2021-04-23 07:52:25 +00:00
Tomasz Zawadzki	a512214517	lib/blob: update extent pages during snapshot deletion When both clone and snapshot had already extent pages corresponding to the same region in cluster map, the clone extent page was replaced with one from snapshot. This was incorrect and would result in loss of clusters from clones extent page. It did not occur in practice because all extent pages were rewritten anyway during md sync. Cluster map was correct so updated extent pages were too. Cluster map correctness is verified in UT _blob_inflate_rw(true), at the very end when checking data consistency of inflated blob. This patch writes out the updated extent page explicitly. So it would be possible to skip wirting out extent pages during md sync later in the series. Note 1) At this point in series the extent page is written here, and in blob persists. The later will be removed later in series. Note 2) Errors during updating extent pages are not accounted for, but neither does syncing them in blob persist. Signed-off-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Change-Id: I7deac3c64299f33f8df49e860af1a16295c074e6 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/7438 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Community-CI: Mellanox Build Bot	2021-04-23 07:52:25 +00:00
Tomasz Zawadzki	bc9da1c66c	lib/blob: rename blob_insert_extent to blob_write_extent_page The blob_insert_extent() name was confusing, since the function was actually responsible for writting out the extent page to disk. Changed to a more fitting name. Signed-off-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Change-Id: Ia312b0ef152100f30d5a1bfe123e55135c8afa6e Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/7561 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Community-CI: Mellanox Build Bot	2021-04-23 07:52:25 +00:00
Tomasz Zawadzki	9c3895b6bc	lib/blob: refactor extent page updates during snapshot deletion This patch does not change functionality. It separates three stages of updating clone during snapshot deletion: - updating cluster map - updating extent pages - removing backing device from clone Signed-off-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Change-Id: I44869f3be596d9d0f06db4acedfdd7e1500516ff Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/7437 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2021-04-23 07:52:25 +00:00
Ziye Yang	9472eecbdb	blob: Make the ABI compatibility of spdk_blob_open_opts structure. Signed-off-by: Ziye Yang <ziye.yang@intel.com> Change-Id: I8627347f44f0a9cb4b50f41ac5effae18b427c6d Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/5701 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>	2020-12-29 07:55:22 +00:00
Ziye Yang	b47f0f20b7	blob: Make the ABI compatibility of spdk_blob_opts structure. Change-Id: I1b1806864783e944d8f55c9393228a1954051236 Signed-off-by: Ziye Yang <ziye.yang@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/5687 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>	2020-12-29 07:55:22 +00:00
Ziye Yang	3de9887d6d	blob: Make the ABI compatibility for spdk_bs_opts The purpose of this patch is to make spdk_bs_opts for compatiblity issue. Signed-off-by: Ziye Yang <ziye.yang@intel.com> Change-Id: I26d2a6bc644feede64d48890c7903f224b1fc306 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/5681 Community-CI: Broadcom CI Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>	2020-12-29 07:55:22 +00:00
Sochin Jiang	3790a412b8	lib/blob: try best to merge the contiguous clusters blob_persist_clear_clusters() issues 'unmap' or 'write_zeroes' IOs in one batch, this can consumes lots of 'bdev_io's. we need to try best to merge the contiguous clusters especially in the following case: cluster 0: LBA 128 (count = 128) cluster 1: LBA 0 (meaning thin provisioned, no cluster allocated) cluster 2: LBA 256 (count = 128) Besides, in the long run, we may need to do this batch by sequence. Signed-off-by: Sochin Jiang <jiangxiaoqing.sochin@bytedance.com> Change-Id: I436c8b46a1ee33ec127d0f43fd85b2c443689b59 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/4082 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>	2020-12-28 13:39:56 +00:00
Jim Harris	894175dece	blob: add more detail to bs_load_use_pages_cpl assert We have an intermittent CI failure that hits this assert. Adding some more detail to help analyze and root cause. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: I2174dda58d0ae4d98359a914e5975f8be3e45a89 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/5550 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com>	2020-12-14 14:29:09 +00:00
guozhihong	750cfa17ab	fix a bug of clear extents during blob persist md Signed-off-by: guozhihong <guozhihong@bytedance.com> Change-Id: Ief3fbc1754dedf424419dc98bebac8d73196377e Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/5106 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com>	2020-12-09 08:26:43 +00:00
Nick Connolly	b8c99a3aad	lib: Use PRId64 for portability POSIX defines PRId64/PRIu64/PRIx64 for printing 64-bit values in a portable way. Replace a few references to %ld to remove the assumption about the size of a long. Similarly, use %z with size_t arguments. Where the value being printed is an unsigned 64-bit value, use PRIu64 instead of %ld. Explicitly test for not __linux__ where that is the intent, rather than testing for __FreeBSD__. Cast pointer to uintptr_t before aligning it, rather than using a specific integer size which may not be large enough to store a pointer. Signed-off-by: Nick Connolly <nick.connolly@mayadata.io> Change-Id: Icfe219e1bbb2d06b3ef05710fac5b7091d340251 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/5142 Community-CI: Broadcom CI Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com>	2020-11-20 11:01:37 +00:00
Tomasz Zawadzki	7f139e549b	lib/blob: extent_page shall never occupy md page 0 Search for md_page to be used as extent page started from 0, which is completely valid md_page. This page can be free when for example blob with id 0 was deleted and some other requested a new page for extent. There are already existing blobs that have extents pointing to 0, which means unallocated. Unfortunetly it means 0 can never mean md page 0. If that already occured for someone, this extent page was already lost during blob/bs reload and nothing can be done. With this in mind following assumptions are made for extent pages: - 0 means unallocated extent page - UINT32_MAX means we ran out of md pages, and should not be persisted - [NEW] extent page can never occupy md page 0 That last one is new addition in this patch. bs_allocate_cluster will now always try to find md page from 1 or higher. Signed-off-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Change-Id: Ia17ce5bbca2fab4fb4487e4e263f3a0aa120bf17 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/4314 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>	2020-10-16 08:15:16 +00:00
Tomasz Zawadzki	63e257ed34	lib/blob: update pages array during blob load When loading a blob and parsing its metadata, the array of pages was not updated. Serialization was unaffected, since the current pages array is unused there. Behavior was working correctly for first page, but did not for any blob with more than one page. Unfortunetly blob_persist_zero_pages() never zeroed out the pages, neither blob_persist_zero_pages_cpl() released the md pages. Resulting in md pages being claimed even after blobs deletion. This patch now fills out the active pages array with the appropriate page numbers from metadata. Signed-off-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Change-Id: I6ff1f4fe95684119d283c2471fdbbea464da8151 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/4504 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>	2020-10-16 08:15:16 +00:00
Tomasz Zawadzki	4e8e97c886	log: remove internal log.h header There is nothing left here, so remove it. Signed-off-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Change-Id: Ib947d42bc577dbebb4650b1be885e05a80f8f8cf Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/4541 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Anil Veerabhadrappa <anil.veerabhadrappa@broadcom.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Community-CI: Broadcom CI	2020-10-15 08:23:39 +00:00
Tomasz Zawadzki	2172c432cf	log: simplify SPDK_LOG_REGISTER_COMPONENT This patch removes the string from register component. Removed are all instances in libs or hardcoded in apps. Starting with this patch literal passed to register, serves as name for the flag. All instances of SPDK_LOG_* were replaced with just * in lowercase. No actual name change for flags occur in this patch. Affected are SPDK_LOG_REGISTER_COMPONENT() and SPDK_*LOG() macros. Signed-off-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Change-Id: I002b232fde57ecf9c6777726b181fc0341f1bb17 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/4495 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Mellanox Build Bot Reviewed-by: Anil Veerabhadrappa <anil.veerabhadrappa@broadcom.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Community-CI: Broadcom CI	2020-10-14 08:00:35 +00:00
Sochin Jiang	5363eb3c4c	lib/blob: fix an uint32_t overflow in blobstore init uint32_t supports at most 2TB at most, we need to handle the larger blobstores, fix this overflow problem. Signed-off-by: Sochin Jiang <jiangxiaoqing.sochin@bytedance.com> Change-Id: I27950eb759e9cb9ad48fa4aa8dd1976b4e852832 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/4075 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2020-09-16 07:57:42 +00:00
Jim Harris	bd16f57472	blob: switch to bit_pool for tracking used_clusters We still need to be able to explicitly set specific bits in the cluster array during initialization and loading (especially recovery), so we use a bit_array during load, and then convert it to a bit_pool just before calling the user's cmopletion callback. This gives a roughly 300% improvement over baseline on a benchmark which does continuous resize operations. The benefit is primarily from saving the lowest free bit rather than having to always start at bit 0. We may be able to further improve this by saving extents in the bit pool as well, although after this patch, the benchmark shows other hot spots different from the bit search. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: Idb1d75d8348bc50560b1f42d49dbe4d79d024619 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3975 Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Paul Luse <paul.e.luse@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2020-09-15 07:12:44 +00:00
Jim Harris	04eb8e05d6	blob: allocate ctx in bs_alloc The ctx allocation was duplicated after both bs_alloc calling sites, so this reduces the code a bit. This change also enables some future changes involving the used_clusters bit array. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: I4ea98f079dbe385654e9cb9c0c58a1926a990c9e Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3973 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Paul Luse <paul.e.luse@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com>	2020-09-15 07:12:44 +00:00
Jim Harris	39caf843a1	blob: consolidate init, load and dump ctx objects This will allow for some additional simplifications in future patches. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: Ie421ad35f8c0efbb775fbe6bf85799af515264ef Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3972 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com>	2020-09-15 07:12:44 +00:00
Jim Harris	0a19f43e05	blob: move spdk_bs_load_ctx earlier in the file This prepares for some future patches. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: If63c83f76e839b796c58200ddb0ca2137fbc4288 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3971 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com>	2020-09-15 07:12:44 +00:00
Jim Harris	835fbef712	blob: simplify bs_allocate_cluster Have it both find and set the lowest available cluster bit index. This will temporarily hurt the performance for cluster allocation, since it will always search starting at bit index 0. But upcoming changes in this patch set will fix that again by using a new spdk_bit_pool object that will do allocations much more efficiently than the current implementation here. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: Iad199c9166b82cb9a31597a080f5a28823849e60 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3970 Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2020-09-15 07:12:44 +00:00
Jim Harris	ded02d0929	blob: batch used_cluster_mutex operations Currently if we create a blob of 128 clusters, we lock/unlock the used_cluster_mutex 128 times - once for each cluster. Same when those clusters are released when the blob is deleted. Batching these lock/unlock operations is very easy and gives a noticeable efficiency improvement. My local benchmark (1GiB ramdisk, 4KB cluster size, 128 clusters/blob) creates enough blobs to fill the blobstore and then continuously deletes and recreates them. Performance increases 20% on that benchmark with this patch. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: Ic503accf1ca1ab1af7254b4067771d956f52014d Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/4069 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com>	2020-09-15 07:12:44 +00:00
Tomasz Zawadzki	bc0394250f	lib/blob: explicitly pass ctx where possible Some of the functions were only referenced directly. There is no need to use void* or pass any bserrno, in some cases. Let's be explicit. Signed-off-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Change-Id: Ib26dda7068965838f38dad856ea1e456fd87a655 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/4061 Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2020-09-09 07:19:57 +00:00
Tomasz Zawadzki	c8c73af940	lib/blob: add clearing extents to persist path This looks like a major omission on persist path. Especially visible for cases where blobstore was not reloaded between blob creations/deletion. Added writing out zeroes to md_pages that contained truncated extents (resized down). After zeroes are writen out, md_pages for those extents are released. In case of blob deletion, extents are resized down to 0 so all extent pages are released. Fixes #1590 Signed-off-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Change-Id: I9a2a1190e3f1f3b5d1bb806191c1fe4d27df7780 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/4051 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2020-09-09 07:19:57 +00:00
Jim Harris	3fe748f728	blob: move bs_release_cluster We will use it earlier in this file in a future patch. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: I554f2073185d466bd0b4e98bdeec721f763c1b44 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3969 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com>	2020-09-01 07:50:33 +00:00
Jim Harris	8ba7450e8c	blob: simplify check for available cluster count Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: I56a9ca114b84ded4ac23a9eaacadaa6df8f031cc Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3968 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com>	2020-09-01 07:50:33 +00:00
Jim Harris	f5b949d5fd	blob: simplify free cluster counting in inflate path Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: I5716290476d4abdf8646a36206f07acb5b2dd4ec Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3967 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com>	2020-09-01 07:50:33 +00:00
Jim Harris	2d87587fe5	blob: claim clusters inline during initialization When claiming clusters as part of blobstore initialization or recovery, just call spdk_bit_array_set directly rather than going through the bs_claim_cluster function. We will be modifying how runtime cluster allocation works so need to separate the two use cases. This code is very small so inlining it has minimal code impact. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: Iaaa1c817e57b4a2eea62eb4683407364bac1fcc0 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3966 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com>	2020-09-01 07:50:33 +00:00
Jim Harris	b7d508f5e5	blob: use spdk_bit_array_load/store_mask functions These functions were added during FTL development and are more efficient than the roll-your-own implementations blobstore had previously. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: Ie09e5c305e6e171af0258e805f2aac3b88822b5e Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3965 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com>	2020-09-01 07:50:33 +00:00
Tomasz Zawadzki	846ca67594	lib/blob: remove requirement for md_page alignment The md_page alignment is not really required for md_page buffers. Allocating 4k aligned buffers all the time, causes memory to be heavily fragmented. Due to DPDK keeping track of the allocation in the same DMA region as the allocation themselves. Removing this alignment requirement will help DPDK when searching for the right part of memory in the heap. Signed-off-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Reported-by: Mike Cui Change-Id: If2f4ca2be38d432d5740f6145b5e0ff46237806b Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3853 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>	2020-08-24 07:38:00 +00:00
Jim Harris	ed7848f2df	blob: handle overlapping open case We only create one spdk_blob object for a given blob, and just increase the ref_count if it is opened multiple times. bs_open_blob would do the lookup for existing opened blobs. But if the blob is opened again, before the previous open operation has completed, we would end up with two spdk_blob objects for the same blob. Solution is to do another lookup when the open operation completes. If we find the blob, free the one we just finished opening and return the existing one instead. Also added unit test that failed on the existing code but passes now with this patch. Signed-off-by: Jim Harris <james.r.harris@intel.com> Reported-by: Mike Cui Change-Id: I00c3a913b413deddf06f0b63f7a669efb2b5658f Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3855 Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2020-08-21 08:24:09 +00:00
Sochin Jiang	db3d1201a4	lib/blob: fix a data corruption bug There is a fatal bug that could easily cause data corruption when using thin-provisioned blobs. In blob_request_submit_rw_iov(), we first get lba by calling blob_calculate_lba_and_lba_count(), blob_calculate_lba_and_lba_count() calculates different lbas according to the return of bs_io_unit_is_allocated(). Later, we call bs_io_unit_is_allocated() again to judge whether the specific cluster is allocated, the problem is it may have be allocated here while not be allocated when calling blob_calculate_lba_and_lba_count() before. To ensure the correctness of lba, we can do lba recalculation when bs_io_unit_is_allocated() returns true, or make blob_calculate_lba_and_lba_count() return the result of bs_io_unit_is_allocated(), use the second solution in this patch. By configuring more than one cpu core, md thread will run in a separate SPDK thread, this data corruption scenario could be easily reproduced by running fio verify in VMs using thin-provisioned Lvols as block devices. Signed-off-by: Sochin Jiang <jiangxiaoqing.sochin@bytedance.com> Change-Id: I099865ff291ea42d5d49b693cc53f64b60881684 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3318 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com>	2020-08-20 09:26:23 +00:00
Ben Walker	30ee8137cf	blob: Add a bitmask for quickly checking which blobs are open This can speed up the check for whether a blob is already open significantly. Signed-off-by: Ben Walker <benjamin.walker@intel.com> Change-Id: If32b0b1f168fcdb58e61df6281d7b7520725a195 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/2781 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Paul Luse <paul.e.luse@intel.com>	2020-07-07 07:30:58 +00:00
Sochin Jiang	7137bde120	lib/blob: fix a potential wrong release of md page blob_insert_cluster_msg() will release the already claimed md page(intended for extent page) if the corresponding extent page is found to be allocated. But later blob_insert_extent() may fails, and this cause blob_insert_cluster_cpl() to release the same md page again, this could be wrong if this specific md page is clamied by others, thus cause data corruption. So, put it to zero after released in blob_insert_cluster_msg(). Signed-off-by: Sochin Jiang <jiangxiaoqing.sochin@bytedance.com> Change-Id: I46eba79b24b1950318002dcb27cb51b01ca566ec Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3152 Community-CI: Mellanox Build Bot Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2020-07-03 07:30:30 +00:00
Yan Wang	abeae98424	lib/blob: check return value after malloc() and strdup() Signed-off-by: Yan Wang <wangyan122@huawei.com> Change-Id: Ieaaecfd12ddf7826c2e4ce6dc5f47411e7e077da Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/2907 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Paul Luse <paul.e.luse@intel.com>	2020-06-29 09:20:12 +00:00
Seth Howell	964463e420	lib/blob: remove spdk_ prefix from bs_batch_read_bs_dev This function was removed from the public API last release when the map file was introduced, but I didn't clean up the name at that time. Signed-off-by: Seth Howell <seth.howell@intel.com> Change-Id: I3101723b504531ce2c51dba2feb063511dd32684 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/2443 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>	2020-06-01 09:21:14 +00:00
Seth Howell	b5d68d5934	lib/blob: remove _spdk prefix from all functions. Signed-off-by: Seth Howell <seth.howell@intel.com> Change-Id: Idb33816e5b66266987845172c27c87667ac0a596 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/2437 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>	2020-05-27 07:35:02 +00:00
Sochin Jiang	e230d49ae2	blob: fix a wrong memset parameter A wrong parameter is passed to memset when loading blob->active.clusters, this leads to an unpredictable wrong lba value using thin provision LVOLs while submitting IO requests, thus causes EIO error using QEMU vms. Signed-off-by: Sochin Jiang <jiangxiaoqing.sochin@bytedance.com> Change-Id: Iecea80cfa58f7a025603430d666fd9cd4d3fea8b Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/2431 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Xiaodong Liu <xiaodong.liu@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com>	2020-05-21 09:20:30 +00:00
Seth Howell	3456377b45	lib: accel, bdev, blob, env_dpdk remove spdk_ prefix. Hitting only the static functions from the above libraries with the spdk_ prefix. Signed-off-by: Seth Howell <seth.howell@intel.com> Change-Id: Ic6df38dfbeb53f0b1c30d350921f7216acba3170 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/2362 Community-CI: Mellanox Build Bot Community-CI: Broadcom CI Reviewed-by: Paul Luse <paul.e.luse@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2020-05-21 09:19:00 +00:00
Tomasz Zawadzki	887531f6a6	lib/blob: provide better log on blob_load failures Added blobid and metadata page number to the log. Previously only number within particular blobs md chain was displayed. Signed-off-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Change-Id: I8e881c5824c9d2eadca9f3ac8ee2ac9ffc0e5cae Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/2058 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2020-05-13 07:46:43 +00:00
Tomasz Zawadzki	7f007b44a9	lib/blob: clear blobid and md_page on blob_create failure Blobid and md_page is claimed as first step of blob creation. If blob creation failed, both should returned to be used by other blobs. This caused multiple reports of: "Metadata page 1 crc mismatch" when loading blobstore due to md_pages not actually containing the written out md pages. Signed-off-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Change-Id: I495452c578d879f749281cebf8975eb2c1c7f79a Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/2057 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2020-05-07 08:30:19 +00:00
Tomasz Zawadzki	b3348624e7	blob: add pages_per_cluster_shift Operation of locating right lba from cluster map is done on I/O path. Instead of division and multiplication, perform bit shift operation. Bit shift is only used when pages per cluster is power of 2. Signed-off-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Change-Id: Ic3ed7ec0a82867a8a4bc6391785b9d40c800aacb Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/1724 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2020-04-24 15:45:21 +00:00
Tomasz Zawadzki	a6587c7e82	blob: inline _spdk_blob_calculate_lba_and_lba_count() This is called on hot path for I/O, inline it. Signed-off-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Change-Id: Iec40033eac19f2c66c2984623acb5e157a5ffe05 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/1723 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Paul Luse <paul.e.luse@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2020-04-22 09:18:41 +00:00
Seth Howell	ad7fdd12b1	lib/blob: remove spdk_ from non-public APIs We have an unofficial naming convention that the spdk_ namespace is reserved for public API functions only. This patch is attempting to bring the blob library into compliance with that naming convention. Signed-off-by: Seth Howell <seth.howell@intel.com> Change-Id: Ie298e41d1b741dae01744826c208378ee60f9d0a Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/1700 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Community-CI: Broadcom CI	2020-04-15 22:10:08 +00:00
Tomasz Zawadzki	c7325e3fad	lib/blob: assert that md_page is not extent page Contidion previous to this should already verify that md page is not an extent page. All extent pages are not part of the chain (sequence_num == 0), and their location (ctx->cur_page) cannot be the root of md chain (page->id). Yet during development it could appen, so adding assert to verify further that the md page is not extent page. Signed-off-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Change-Id: I6d5dc2ae965f8f9a388cd1c8e186145f8ca91db4 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/1667 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2020-04-07 08:37:41 +00:00

1 2 3 4 5 ...

308 Commits