numam-spdk

Author	SHA1	Message	Date
Jim Harris	bd16f57472	blob: switch to bit_pool for tracking used_clusters We still need to be able to explicitly set specific bits in the cluster array during initialization and loading (especially recovery), so we use a bit_array during load, and then convert it to a bit_pool just before calling the user's completion callback. This gives a roughly 300% improvement over baseline on a benchmark which does continuous resize operations. The benefit is primarily from saving the lowest free bit rather than having to always start at bit 0. We may be able to further improve this by saving extents in the bit pool as well, although after this patch, the benchmark shows other hot spots different from the bit search. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: Idb1d75d8348bc50560b1f42d49dbe4d79d024619 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3975 Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Paul Luse <paul.e.luse@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2020-09-15 07:12:44 +00:00
Ben Walker	30ee8137cf	blob: Add a bitmask for quickly checking which blobs are open This can speed up the check for whether a blob is already open significantly. Signed-off-by: Ben Walker <benjamin.walker@intel.com> Change-Id: If32b0b1f168fcdb58e61df6281d7b7520725a195 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/2781 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Paul Luse <paul.e.luse@intel.com>	2020-07-07 07:30:58 +00:00
Seth Howell	b5d68d5934	lib/blob: remove _spdk prefix from all functions. Signed-off-by: Seth Howell <seth.howell@intel.com> Change-Id: Idb33816e5b66266987845172c27c87667ac0a596 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/2437 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>	2020-05-27 07:35:02 +00:00
Tomasz Zawadzki	b3348624e7	blob: add pages_per_cluster_shift Operation of locating right lba from cluster map is done on I/O path. Instead of division and multiplication, perform bit shift operation. Bit shift is only used when pages per cluster is power of 2. Signed-off-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Change-Id: Ic3ed7ec0a82867a8a4bc6391785b9d40c800aacb Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/1724 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2020-04-24 15:45:21 +00:00
Seth Howell	ad7fdd12b1	lib/blob: remove spdk_ from non-public APIs We have an unofficial naming convention that the spdk_ namespace is reserved for public API functions only. This patch is attempting to bring the blob library into compliance with that naming convention. Signed-off-by: Seth Howell <seth.howell@intel.com> Change-Id: Ie298e41d1b741dae01744826c208378ee60f9d0a Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/1700 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Community-CI: Broadcom CI	2020-04-15 22:10:08 +00:00
Tomasz Zawadzki	030be573f3	lib/blob: queue up blob persists when one already is ongoing It is possible for multiple blob persists to affect one another. Either by blob->state changes or blob mutable data. Safe way to prevent that is to queue up the persists. Next persist will be executed only after previous one completes. Fixes #1170 Fixes #960 Signed-off-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Change-Id: Iaf95d9238510100b629050bc0d5c2c96c982a60c Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/776 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>	2020-02-21 09:35:27 +00:00
Tomasz Zawadzki	29bd502046	lib/blob: add invalid flag for extent table With recent changes to extent on-disk metadata format, new format (Extent Pages) is not backwards compatible. Meanwhile old format (Extent RLE) is backwards compatible with older SPDK applications. Summing up: Blobstore created pre SPDK 20.01 can only use Extent RLE. Blobstore created starting with SPDK 20.01 can use both, Extent Pages and Extent RLE specified by use_extent_table opts. When use_extent_table is set to true, invalid flag for it is set. SPDK application pre 20.01, will not load such blob. Signed-off-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Change-Id: If14ebd03f19eb581d71dcb46191e099336655189 Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/483220 Community-CI: SPDK CI Jenkins <sys_sgci@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Paul Luse <paul.e.luse@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>	2020-01-31 09:28:56 +00:00
Tomasz Zawadzki	42109157f4	lib/blob: add starting cluster index to extent page Size of a blob (thus size of clusters array in mutable data) is known from extent table descriptor. Extent pages were read sequentially in order they were placed in extent table. This meant that cluster array could have been filled up from beginning to end. Yet reading extent pages in any other order, would result in incorrect placement of clusters. This patch adds first cluster index that is contained within each extent page. This will allow to read/write multiple extent pages in parallel, since we will know where in clusters array to put the cluster idxs. Signed-off-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Change-Id: Ib6b9332111cd93f990d057dc60624152907dd87f Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/482701 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Paul Luse <paul.e.luse@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2020-01-28 09:15:23 +00:00
Tomasz Zawadzki	78257ab613	lib/blob: rename num_clusters_in_et to remaining_clusters_in_et This is a more adequate name, since this value is first read from the Extent Table descriptor. It is then decreased when iterating over entries in the extent table as extent pages are read. Signed-off-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Change-Id: Ib188c524b8488b38d4de063a9970dcfdf49c9acd Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/482600 Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Paul Luse <paul.e.luse@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2020-01-27 18:06:43 +00:00
Tomasz Zawadzki	2bccb7c9b4	lib/blob: use use_extent_table instead of NULL from extent_page Right now output from _spdk_bs_cluster_to_extent_page() is used to determine whether the extent_table is used at all. If NULL pointer was returned this meant that extent table was not allocated, even if the code might suggest just checking if we overran the array. To make it more obvious, the _spdk_bs_cluster_to_extent_page() now only asserts the extent_table_id. blob->use_extent_table is now always used to determine the serialization path. Signed-off-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Change-Id: I9d2630645213539bae5cd1d72e5f9b878f53c2bc Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/482599 Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Paul Luse <paul.e.luse@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2020-01-27 18:06:43 +00:00
Tomasz Zawadzki	e1ce55158a	lib/blob: require SPDK_EXTENTS_PER_EP to be power of 2 Force number of Extents to fit into Extent Page to be power of 2, in order to simplify calculations on cluster allocations. At this time SPDK_BS_PAGE_SIZE is 4k, which would result in SPDK_EXTENTS_PER_EP being 512. Signed-off-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Change-Id: I7e09d92b00dfe5c12d7dd10ac0fc5a9a10d526ac Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/472041 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Paul Luse <paul.e.luse@intel.com>	2020-01-27 18:06:43 +00:00
Tomasz Zawadzki	f4e58993f7	lib/blob: add EXTENT descriptor to blobs Similar to EXTENT_RLE, this descriptor holds LBA of clusters. Difference is that EXTENT is kept in separate md pages, and only single EXTENT will be updated on cluster allocation. This patch adds the EXTENT processing, which is not used until following patch. Signed-off-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Change-Id: Ifbac23db7ca3e7c8c91cee01018f20071f0d5160 Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/470014 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Paul Luse <paul.e.luse@intel.com>	2020-01-27 18:06:43 +00:00
Tomasz Zawadzki	1b23560fcd	lib/blob: add _spdk_bs_cluster_to_extent_page() for easy conversion Signed-off-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Change-Id: I3e49c398d9bdf9f4eacba65061cc7fe4b300fb56 Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/479963 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Paul Luse <paul.e.luse@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2020-01-27 18:06:43 +00:00
Tomasz Zawadzki	59f7f3f736	lib/blob: change extent pages array size on blob resize With this patch the extent pages array will change its size according to the size of the blob. Similar to clusters, only resizing up is done on blob resize. Shrinking is done on persisting the blob. Signed-off-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Change-Id: Id7f7c81efbd96af414fce9fc4045cbb476cc93a6 Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/479962 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Paul Luse <paul.e.luse@intel.com>	2020-01-27 18:06:43 +00:00
Tomasz Zawadzki	f60b4a7e28	lib/blob: add EXTENT_TABLE descriptor to blobs Added new descriptor SPDK_MD_DESCRIPTOR_TYPE_EXTENT_TABLE. Extent Table will hold md page offsets for new Extent Page descriptor. Entries in Extent Table are run-length encoded 0's as unallocated Extent Page descriptors. Additionally total number of clusters is persisted in each Extent Table descriptor. This is because there is no guarantee that last Extent Page of a blob will be allocated. Even if number of Extents per Extent Page is always the same, Extent Page can hold less Extents than that. This patch does not add more metadata on disk right now. Only added descriptor parsing/serialization and applicable fields to store it in run time. Following patches are going to implement TODO's added in this patch. Signed-off-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Change-Id: Iac5d8f00ddfc655c507bc26d69d7adf8495074e9 Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/466920 Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Paul Luse <paul.e.luse@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2020-01-27 18:06:43 +00:00
Tomasz Zawadzki	3dadb79e37	lib/blob: add EXTENT_RLE descriptor description Since further patches will be adding new descriptors that are related to cluster layout throughout the blobstore, add description for existing descriptor too. Signed-off-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Change-Id: I722eb633445685789d5185ed59dfc910f76b109f Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/481724 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Paul Luse <paul.e.luse@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2020-01-27 18:06:43 +00:00
Tomasz Zawadzki	c33840b7e6	lib/blob: add option to enable extent pages This is an additional option that can be passed when creating a blob. When opts->enable_extent_pages is set to false (current default), only EXTENT_RLE should be persisted on sync. During blob load, when EXTENT_RLE is present in md, blob->extent_rle_found is set to true. When opts->enable_extent_pages is set to true, only EXTENT_TABLE and EXTENT_PAGES should be persisted on sync. During blob load, when EXTENT_TABLE is present in md, blob->extent_table_found is set to true. It is possible to find neither EXTENT_* descriptor when loading a blob. This means that blob length is 0 and EXTENT_RLE was supposed to be used. Yet none were persisted due to lack of clusters. In such case blob->use_extent_table is set to true after finishing blob load. When parsing metadata ends, if extent_table_found is set - then support for extent_table is enabled. All other cases disable it. At this time path for Extent Pages is not implemented, so it should not be used. Later in the series, it will become the default path for serialization. Signed-off-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Change-Id: I2146da6130a0645e686ab02a3b5d2d86a7d35a1f Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/479853 Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Paul Luse <paul.e.luse@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2020-01-27 18:06:43 +00:00
paul luse	ea69d6d6cc	lib/blob: store clear_method in per blob metadata Accept a clear method option on blob create by adding clear_method to the opts structure passed in to _spdk_bs_create_blob(). Store these 2 bits in md_ro_flags so that earlier versions without an understanding of these bits can not alter metadata. The new metadata values will be used later in the series. Signed-off-by: paul luse <paul.e.luse@intel.com> Change-Id: I5440645ca20b426778d13b2e544b65dc2b3b83c7 Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/472204 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2020-01-20 09:57:16 +00:00
Tomasz Zawadzki	4b8db27b2a	lib/blob: add _spdk_bs_md_page_to_lba() function internal to blobstore The _spdk_bs_page_to_lba() [without 'md'] is only for translating the pages on the blobstore to lba they are at. Those pages start at the beginning of the device and cover all of it. Thus simple math is enough to translate those. It is used to calculate lba_count for set of pages as well. Meanwhile there are 'md_pages' which are the same pages as for the above, but their count start at bs->md_start. Which is right after super_block and couple pages for bit masks. This patch creates new _spdk_bs_md_page_to_lba() that is more explicit in what page number is passed. Hopefully avoiding confusion when reading which page number refers to which 'type' of page. Exception to that is _spdk_bs_dump_read_md_page(), where blobstore is not actually loaded (md_start from super block is not copied to bs structure). Additionally providing assert to catch errors on debug builds. Making the check in _spdk_blob_load_cpl() for max_md_lba obsolete. Signed-off-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Change-Id: I66bbca55b5ca3d6794c462d50177e6037ddbefa6 Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/479017 Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Paul Luse <paul.e.luse@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2020-01-14 17:13:15 +00:00
Tomasz Zawadzki	3e372f35c3	lib/blob: rename extents to extents_rle In future patches new type of extents will be added, for compatibility the current extent type will be still handled in the code. To signify the difference between those two types, current type is renamed to SPDK_MD_DESCRIPTOR_TYPE_EXTENT_RLE. Along with any variables throughout the code, to make it clear which ones are used. There are no functional changes in this patch. Signed-off-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Change-Id: I7186ccc452d200036188abf1dcea9660dcedee72 Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/468230 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2019-10-07 15:07:12 +00:00
Tomasz Zawadzki	69a8877e82	lib/blob: do not allow xattr to exceed maximum descriptor length Length of xattr descriptor is equal to length of xattr struct, xattr name and the len of stored value. There is no limit to how much can be stored in memory for xattr. On disk xattr size is limited to single page and within that to max descriptors that can fit in it. This size is known at compile time. Before this patch it was possible to add xattr exceeding what was possible to be written to disk. This caused issues when serializing the metadata during spdk_blob_sync_md() or spdk_blob_close(). Making those fail without specific info to the user and not actually writing such descriptor. Since maximum length of xattr descriptor is known at compile time, this patch compares against this value when setting the xattr. It will immediately report back to user with error, and will not store xattr in memory (thus not serialize it). This patch should not affect any backward compatibility for blobs. Too large xattrs weren't written to disk before, API for blobstore stays the same - only reporting ENOMEM when it should. Signed-off-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Change-Id: I6f4af4d079e47f084e20d7a4969d9a78ec1f8610 Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/460450 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Maciej Szwed <maciej.szwed@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com>	2019-07-11 10:05:41 +00:00
Maciej Szwed	92cafd1586	blobstore: Remove blob on blobstore load when required In some cases user may want to flag blob for removal then do some operations (before removing it) and while it happens there might be power failure. In such cases we should remove this blob on next blobstore load. Example of such usage is delete snapshot functionality that will be introduced in upcoming patch. Signed-off-by: Maciej Szwed <maciej.szwed@intel.com> Change-Id: I85f396b73762d2665ba8aec62528bb224acace74 Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/453835 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2019-05-24 23:09:56 +00:00
Maciej Szwed	8256cecf39	blobstore: rename resize_in_progress to locked_operation_in_progress This is a part of future changes to block blob operations that may cause race conditions between each other. Signed-off-by: Maciej Szwed <maciej.szwed@intel.com> Change-Id: Ia728d1fc207375ddcb3b70b5081ddcffa9f99027 Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/449789 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2019-04-08 21:39:08 +00:00
Maciej Szwed	adb39585ef	lvol: add option to change default data erase method Some users require to do write zeroes operation when erasing data on lvol. Currently the default method is unmap. This patch adds flag to spdk_rpc_construct_lvol_bdev call that changes default erase method. This is also a base implementation for possible future function for erasing data on lvol bdev. Signed-off-by: Maciej Szwed <maciej.szwed@intel.com> Change-Id: I8964f170b13c2268fe3c18104f7956c32be96040 Reviewed-on: https://review.gerrithub.io/c/441527 Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Chandler-Test-Pool: SPDK Automated Test System <sys_sgsw@intel.com>	2019-01-23 22:25:37 +00:00
Piotr Pelplinski	6609b776e4	blobstore: allow I/O operations to use io unit size smaller than page size. Signed-off-by: Piotr Pelplinski <piotr.pelplinski@intel.com> Change-Id: I994b5d46faffd34430cb39e66225929c4cba90ba Reviewed-on: https://review.gerrithub.io/414935 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Chandler-Test-Pool: SPDK Automated Test System <sys_sgsw@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2018-10-04 21:35:24 +00:00
Chen Wang	6fa48bbf62	lib: fix typos in the lib directory Change-Id: Idcb60b79d2902bb316facc6f60e0a81e5cf847ed Signed-off-by: Chen Wang <chenx.wang@intel.com> Reviewed-on: https://review.gerrithub.io/423372 Reviewed-by: GangCao <gang.cao@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Chandler-Test-Pool: SPDK Automated Test System <sys_sgsw@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2018-08-24 17:15:12 +00:00
Ziye Yang	ee9db7dac0	blobstore: adjust order in spdk_xattr It will save the space of spdk_xattr when put uint16_t after uint32_t Change-Id: Ie0712d8c3b16d90fc354847509fd87e1ffd93916 Signed-off-by: Ziye Yang <optimistyzy@gmail.com> Reviewed-on: https://review.gerrithub.io/419453 Chandler-Test-Pool: SPDK Automated Test System <sys_sgsw@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com>	2018-07-19 01:45:19 +00:00
Piotr Pelplinski	2c91e91907	blobstore: Save the original size of the disk. Save the original size of the disk to metadata when it is first created. On load verify that the disk did not change size. Signed-off-by: Piotr Pelplinski <piotr.pelplinski@intel.com> Change-Id: I535940ee188425ee3b394effd99653cc073d541e Reviewed-on: https://review.gerrithub.io/410896 Tested-by: SPDK Automated Test System <sys_sgsw@intel.com> Reviewed-by: Daniel Verkamp <daniel.verkamp@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2018-06-28 17:58:31 +00:00
Jim Harris	f300130872	blob: always use uint64_t to represent page_idx 4KiB page size * UINT32_MAX = 16TiB - so we must use a uint64_t for any blobstores on backing devices of 16TiB or greater. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: Ief13cf06d413477dc8ab4f9fe0ff4c0631566c00 Reviewed-on: https://review.gerrithub.io/416448 Tested-by: SPDK Automated Test System <sys_sgsw@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Daniel Verkamp <daniel.verkamp@intel.com>	2018-06-21 22:46:30 +00:00
Daniel Verkamp	89426e9bb5	blob: change lba to uint64_t in serialize_extent Make sure we don't truncate the LBA when using it to serialize the cluster array into an extent list. We also need to add an explicit cast in _spdk_bs_cluster_to_lba to ensure the conversion doesn't get truncated. While here, do the same cast for _spdk_bs_cluster_to_page. Change-Id: If4e65ed86550e39dfa39826930dfafac158d519c Signed-off-by: Daniel Verkamp <daniel.verkamp@intel.com> Signed-off-by: Jim Harris <james.r.harris@intel.com> Reviewed-on: https://review.gerrithub.io/416231 Tested-by: SPDK Automated Test System <sys_sgsw@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2018-06-21 22:46:30 +00:00
Piotr Pelplinski	69fa57cdf0	blobstore: freeze I/O during resize Signed-off-by: Piotr Pelplinski <piotr.pelplinski@intel.com> Change-Id: I23c34d4dcb542aa9ab3fa8cb734cf9cc0e0fc5da Reviewed-on: https://review.gerrithub.io/409144 Tested-by: SPDK Automated Test System <sys_sgsw@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2018-06-08 19:32:25 +00:00
Piotr Pelplinski	8c45ed3822	blobstore: freeze I/O during snapshotting. Signed-off-by: Piotr Pelplinski <piotr.pelplinski@intel.com> Change-Id: I6182eb3a77d23db7088703492d71349e3a4b6460 Reviewed-on: https://review.gerrithub.io/399366 Tested-by: SPDK Automated Test System <sys_sgsw@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2018-06-06 22:26:04 +00:00
Piotr Pelplinski	bc8f2cd90f	blobstore: Change behaviour of dirty bit The patch disables writing dirty bit during blobstore loading. Instead, dirty bit is written prior to the first metadata update. Signed-off-by: Piotr Pelplinski <piotr.pelplinski@intel.com> Change-Id: I7be81009a99f09048bf23749c8f6ef5e9f7b3751 Reviewed-on: https://review.gerrithub.io/410884 Tested-by: SPDK Automated Test System <sys_sgsw@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Daniel Verkamp <daniel.verkamp@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Maciej Szwed <maciej.szwed@intel.com>	2018-05-30 00:37:54 +00:00
Tomasz Kulasek	d7e065be93	blobstore: clone-snapshot blobstore relations This commit provides an API to obtain information about snapshot and clone relations. The main objective is: 1) Determine if we can delete a snapshot (i.e. whether it has any clones), 2) Provide information about parent/children nodes to the upper layer (e.g. lvol) Realization: 1) Structure parent-children is stored in the blob store object and updated on: a) blob store load, b) blob create/delete, 2) Full information about parent-children is provided via new API: spdk_blob_get_parent() and spdk_blob_get_children(), Note: While we don't store information about these relations in the blob store, we need to open all blobs on blob store load to create it. It should be considered that it has an impact on the blobstore loading performance. Change-Id: Ie0237fa5b93af01aa73d1f68ac1694e653fb75e5 Signed-off-by: Tomasz Kulasek <tomaszx.kulasek@intel.com> Reviewed-on: https://review.gerrithub.io/405025 Tested-by: SPDK Automated Test System <sys_sgsw@intel.com> Reviewed-by: Maciej Szwed <maciej.szwed@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Daniel Verkamp <daniel.verkamp@intel.com>	2018-04-20 15:22:53 -04:00
Tomasz Kulasek	0d1c3aefc3	blobstore: clone-snapshot classification This patch introduces API to get some blobs capabilities: bool spdk_blob_is_read_only(struct spdk_blob *blob); bool spdk_blob_is_thin_provisioned(struct spdk_blob *blob); to be used in upper level in the unified way. Change-Id: I4411bb3f4dd0c64826ae16a66141b2911cbaab79 Signed-off-by: Tomasz Kulasek <tomaszx.kulasek@intel.com> Reviewed-on: https://review.gerrithub.io/405022 Reviewed-by: Daniel Verkamp <daniel.verkamp@intel.com> Tested-by: SPDK Automated Test System <sys_sgsw@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com>	2018-04-17 17:05:53 -04:00
Piotr Pelplinski	777627e024	blobstore: add snapshot functionality This patch adds new feature of blobstore. New call creates a read-only snapshot of specified blob with provided options. NOTE: This patch doesn't cover recovery operation if snapshotting fails. This operation will be implemented and added later. Signed-off-by: Piotr Pelplinski <piotr.pelplinski@intel.com> Signed-off-by: Tomasz Kulasek <tomaszx.kulasek@intel.com> Change-Id: I470ca13525638fa6df485d508b3adf71b6b69c0b Reviewed-on: https://review.gerrithub.io/393935 Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Maciej Szwed <maciej.szwed@intel.com> Tested-by: SPDK Automated Test System <sys_sgsw@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2018-04-06 16:30:24 -04:00
Jim Harris	b24fdae1a8	Revert "blob: queue sync requests if one already in progress" BlobFS shutdown path needs to be investigated more with these changes. This reverts commit `a137b9afd0`. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: I8b04b24e178945d62db20668b9e500f278ae955b Reviewed-on: https://review.gerrithub.io/403600 Reviewed-by: Daniel Verkamp <daniel.verkamp@intel.com> Tested-by: SPDK Automated Test System <sys_sgsw@intel.com>	2018-03-12 20:37:49 -04:00
Jim Harris	a137b9afd0	blob: queue sync requests if one already in progress For any given blob, if an spdk_blob_sync_md() operation is already in progress, queue additional spdk_blob_sync_md() operations until the previous one completes. This ensures proper ordering of writing metadata to disk. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: I2051e8cb5b8d1a033ec1238cb4811232110aa0f4 Reviewed-on: https://review.gerrithub.io/401257 Tested-by: SPDK Automated Test System <sys_sgsw@intel.com> Reviewed-by: Daniel Verkamp <daniel.verkamp@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>	2018-03-12 16:24:40 -04:00
Piotr Pelplinski	c26c4e9fb4	blobstore: Add a blob_bs_dev that provides back_bs_dev for clones Unit tests implemented in following patches. This is rebased patch from https://review.gerrithub.io/#/c/396648 merged as commit `c1174e6895` and reverted in `0847f27b54`. Change-Id: I3d152bf7847c83bf75149edd61564c1f393927d8 Signed-off-by: Piotr Pelplinski <piotr.pelplinski@intel.com> Reviewed-on: https://review.gerrithub.io/402529 Tested-by: SPDK Automated Test System <sys_sgsw@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Daniel Verkamp <daniel.verkamp@intel.com>	2018-03-08 11:34:15 -05:00
Daniel Verkamp	8a6ba58cb4	scripts/check_format: check for spaces before tabs Automatically detect more whitespace errors. All existing cases are fixed; only whitespace change (verify with diff -w) except for one comment style fixup in include/spdk/nvme.h. Change-Id: If750e54b9c8e3421ea6feda5f20184a31431631e Signed-off-by: Daniel Verkamp <daniel.verkamp@intel.com> Reviewed-on: https://review.gerrithub.io/402360 Tested-by: SPDK Automated Test System <sys_sgsw@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com>	2018-03-05 11:09:13 -05:00
Daniel Verkamp	0847f27b54	Revert "blobstore: Add a blob_bs_dev that provides back_bs_dev for clones" This change wasn't correctly rebased and needs to be updated to compile against the current blobstore. This reverts commit `c1174e6895`. Change-Id: I529608bee7323cb626d8c36dff15adc9ba24ad26 Signed-off-by: Daniel Verkamp <daniel.verkamp@intel.com> Reviewed-on: https://review.gerrithub.io/402352 Tested-by: SPDK Automated Test System <sys_sgsw@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2018-03-02 15:05:45 -05:00
Piotr Pelplinski	c1174e6895	blobstore: Add a blob_bs_dev that provides back_bs_dev for clones Unit tests implemented in following patches. Signed-off-by: Piotr Pelplinski <piotr.pelplinski@intel.com> Change-Id: Ib18c9060f527bd22bfdbed74e96871a6e0551ead Reviewed-on: https://review.gerrithub.io/396648 Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Tested-by: SPDK Automated Test System <sys_sgsw@intel.com> Reviewed-by: Maciej Szwed <maciej.szwed@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Daniel Verkamp <daniel.verkamp@intel.com>	2018-03-02 14:09:09 -05:00
Jim Harris	7d4705a257	blob: remove SPDK_BLOB_STATE_SYNCING All metadata operations are now done on the metadata thread, so we no longer have to worry about one thread updating in-memory metadata structures while another thread is transferring the in-memory structures to on-disk structures. This does not protect against multiple sync operations outstanding at once - that will be coming in an upcoming patch. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: Ibf33edf4d41d867c96a38df017737e9ceb87fa58 Reviewed-on: https://review.gerrithub.io/401056 Tested-by: SPDK Automated Test System <sys_sgsw@intel.com> Reviewed-by: Daniel Verkamp <daniel.verkamp@intel.com> Reviewed-by: Maciej Szwed <maciej.szwed@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>	2018-02-27 10:58:29 -05:00
Jim Harris	c8efd8a8b2	blob: revert spdk_blob_data changes There was some thinking that we would need to allocate I/O channels on a per-blob basis to handle dynamic resizing during I/O. Making spdk_blob an opaque handle, with the existing spdk_blob structure renamed to spdk_blob_data was a first step towards making that happen. But more recent work on blobstore has simplified the resizing approach, so this spdk_blob_data is no longer needed. So revert it. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: I22e07008faceb70649ee560176ebe5e014d5f1a3 Reviewed-on: https://review.gerrithub.io/400881 Tested-by: SPDK Automated Test System <sys_sgsw@intel.com> Reviewed-by: Daniel Verkamp <daniel.verkamp@intel.com>	2018-02-23 15:54:12 -05:00
Piotr Pelplinski	7ba8c006c5	blobstore: allow xattrs to be set internally only for blobstore Patch adds internal version of xattr functions to allow operations on internal xattrs, which are not visible to upper layers. When there is at least one internal xattr set, also SPDK_BLOB_INTERNAL_XATTR flag is set in invalid_flags to prevent loading this blob in previous spdk versions. Signed-off-by: Piotr Pelplinski <piotr.pelplinski@intel.com> Change-Id: Iec918ec858f069f7cd9f36d5e8f0495ffa4a42d8 Reviewed-on: https://review.gerrithub.io/395122 Tested-by: SPDK Automated Test System <sys_sgsw@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Maciej Szwed <maciej.szwed@intel.com> Reviewed-by: Daniel Verkamp <daniel.verkamp@intel.com>	2018-02-12 19:12:14 -05:00
Piotr Pelplinski	69c9bb0153	blobstore: move xattr serialization to separate function Signed-off-by: Piotr Pelplinski <piotr.pelplinski@intel.com> Change-Id: I277f7288427788e7a107b143331753fd5b23f16f Reviewed-on: https://review.gerrithub.io/396571 Tested-by: SPDK Automated Test System <sys_sgsw@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Reviewed-by: Daniel Verkamp <daniel.verkamp@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2018-02-02 16:58:06 -05:00
Maciej Szwed	9103821d3e	blob: make _spdk_bs_allocate_cluster thread safe Signed-off-by: Maciej Szwed <maciej.szwed@intel.com> Change-Id: I3c7d2096f549a88b4a9884c0026d15d3bcd8dc67 Reviewed-on: https://review.gerrithub.io/396387 Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Tested-by: SPDK Automated Test System <sys_sgsw@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Daniel Verkamp <daniel.verkamp@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2018-01-30 17:29:53 -05:00
Maciej Szwed	4132ac52da	blob: support for thin provisioned reads and writes Signed-off-by: Maciej Szwed <maciej.szwed@intel.com> Change-Id: Ibc9609ad36188006e9454e5c799bccd8a92d7991 Reviewed-on: https://review.gerrithub.io/391422 Tested-by: SPDK Automated Test System <sys_sgsw@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Daniel Verkamp <daniel.verkamp@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2018-01-30 15:46:18 -05:00
Ben Walker	8970f8682a	blob: Add a bs_dev that always returns 0 This will be useful for backing thin provisioned blobs in the future. Change-Id: I78cf8cda39e8dff42da69b79ed460797d7494af1 Signed-off-by: Ben Walker <benjamin.walker@intel.com> Reviewed-on: https://review.gerrithub.io/397043 Reviewed-by: Daniel Verkamp <daniel.verkamp@intel.com> Tested-by: SPDK Automated Test System <sys_sgsw@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2018-01-30 15:46:18 -05:00
Jim Harris	dfb102b79a	blob: add md_thread to struct spdk_blob_store For now, use this to add some assert() calls to ensure per-blob metadata operations are only called from the thread that initialized/loaded the blobstore. Upcoming patches will utilize this for metadata updates required due to cluster allocations on thin provisioned blobs. In that case, the cluster allocations may not always be done on the metadata thread - but we want the metadata thread to actually do the metadata sync operation to guard against races from allocations on multiple threads in parallel. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: Ifa0adfe8b7e61ba770449d1e076126ecb9d7a556 Reviewed-on: https://review.gerrithub.io/396712 Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Daniel Verkamp <daniel.verkamp@intel.com> Tested-by: SPDK Automated Test System <sys_sgsw@intel.com>	2018-01-29 12:33:05 -05:00

73 Commits