btrfs-progs

mirror of https://github.com/kdave/btrfs-progs synced 2025-02-17 02:06:51 +00:00

Author	SHA1	Message	Date
Josef Bacik	888a2b6a0c	btrfs-progs: remove root argument from free_extent and inc_extent_ref Neither of these actually need the root argument, we provide all the information for the ref through the arguments we pass through. Remove the root argument from both of them. These needed to be done in the same patch because of the __btrfs_mod_ref helper which will pick one or the other function for processing reference updates. Signed-off-by: Josef Bacik <josef@toxicpanda.com> Signed-off-by: David Sterba <dsterba@suse.com>	2023-05-26 18:02:30 +02:00
Josef Bacik	30347fde73	btrfs-progs: add btrfs_root_id helper This exists in the kernel and is used throughout ctree.c, sync this helper to make sync'ing ctree.c easier. Signed-off-by: Josef Bacik <josef@toxicpanda.com> Signed-off-by: David Sterba <dsterba@suse.com>	2023-05-26 18:02:29 +02:00
Josef Bacik	eea6308658	btrfs-progs: sync and stub-out tree-mod-log.h In order to sync ctree.c we're going to have to have definitions for the tree-mod-log stuff. However we don't need any of the code, we don't do live backref lookups in btrfs-progs, so simply sync the header file and stub all the helpers out. Signed-off-by: Josef Bacik <josef@toxicpanda.com> Signed-off-by: David Sterba <dsterba@suse.com>	2023-05-26 18:02:29 +02:00
Josef Bacik	4e790a5b5c	btrfs-progs: make __btrfs_cow_block static This isn't used anywhere other than ctree.c, make it static. Signed-off-by: Josef Bacik <josef@toxicpanda.com> Signed-off-by: David Sterba <dsterba@suse.com>	2023-05-26 18:02:29 +02:00
Josef Bacik	656938b665	btrfs-progs: rename clear_extent_buffer_dirty to btrfs_clear_buffer_dirty This is a mirror of the change I've done in the kernel, but in progs it's even more simply because clean_tree_block was just a wrapper around clear_extent_buffer_dirty. Change this to btrfs_clear_buffer_dirty, and then update all the callers to use this helper instead of clean_tree_block and clear_extent_buffer_dirty. Signed-off-by: Josef Bacik <josef@toxicpanda.com> Signed-off-by: David Sterba <dsterba@suse.com>	2023-05-26 18:02:29 +02:00
Josef Bacik	d65c4884cd	btrfs-progs: cleanup parameters of btrfs_set_disk_extent_flags In the kernel we just pass in the extent_buffer and get the fields we need from that to update the extent ref flags. Update this helper to match the kernel to make syncing ctree.c easier. Signed-off-by: Josef Bacik <josef@toxicpanda.com> Signed-off-by: David Sterba <dsterba@suse.com>	2023-05-26 18:02:29 +02:00
Josef Bacik	25432a6c2a	btrfs-progs: rename btrfs_set_block_flags to btrfs_set_disk_extent_flags This helper in the kernel is named btrfs_set_disk_extent_flags, which is a more accurate description than btrfs_set_block_flags. Rename to the in kernel name to make syncing ctree.c easier. Signed-off-by: Josef Bacik <josef@toxicpanda.com> Signed-off-by: David Sterba <dsterba@suse.com>	2023-05-26 18:02:29 +02:00
Josef Bacik	216f442e5e	btrfs-progs: add some missing extent buffer helpers The following are some extent buffer helpers we have in the kernel but not in btrfs-progs. Sync these in to make syncing ctree.c easier. Signed-off-by: Josef Bacik <josef@toxicpanda.com> Signed-off-by: David Sterba <dsterba@suse.com>	2023-05-26 18:02:29 +02:00
Josef Bacik	5780714b58	btrfs-progs: add btrfs_locking_nest to btrfs_alloc_tree_block This is how btrfs_alloc_tree_block is defined in the kernel, so when we go to sync this code in it'll be easier if we're already setup to accept this argument. Since we're in progs we don't care about nesting so just use BTRFS_NORMAL_NESTING everywhere, as we sync in the kernel code it'll get updated to whatever is appropriate. Signed-off-by: Josef Bacik <josef@toxicpanda.com> Signed-off-by: David Sterba <dsterba@suse.com>	2023-05-26 18:02:29 +02:00
Josef Bacik	311d990fff	btrfs-progs: sync locking.h and stub out all the helpers We want locking.h to have all the definitions that get used throughout the codebase, however we don't want to actually use any of the actual locking. This sync's the bulk of locking.h, and then stubs out all of the definitions. We need a locking.c for the root lock helpers that return the extent buffer, but everything else can simply be inlined out. Signed-off-by: Josef Bacik <josef@toxicpanda.com> Signed-off-by: David Sterba <dsterba@suse.com>	2023-05-26 18:02:29 +02:00
Josef Bacik	006baaecdd	btrfs-progs: rename btrfs_alloc_free_block to btrfs_alloc_tree_block This is in keeping with what the function actually does, and is named this way in the kernel. Signed-off-by: Josef Bacik <josef@toxicpanda.com> Signed-off-by: David Sterba <dsterba@suse.com>	2023-05-26 18:02:29 +02:00
Josef Bacik	8e427ada49	btrfs-progs: copy btrfs_root::state from kernel We changed from members in the root for all the different flags to a bit based flag system. In order to make syncing the kernel code into btrfs-progs easier go ahead and sync in the bits we use and update all the users of the old ->track_dirty and ->ref_cows to use the state bits. Signed-off-by: Josef Bacik <josef@toxicpanda.com> Signed-off-by: David Sterba <dsterba@suse.com>	2023-05-26 18:02:29 +02:00
Josef Bacik	c6b160c4e4	btrfs-progs: constify the extent buffer helpers These helpers are all take const struct extent_buffer in the kernel, do the same in btrfs-progs in order to enable us to more easily sync ctree.c. Signed-off-by: Josef Bacik <josef@toxicpanda.com> Signed-off-by: David Sterba <dsterba@suse.com>	2023-05-26 18:02:29 +02:00
Josef Bacik	4a9a8f2a8a	btrfs-progs: sync extent-io-tree.[ch] and misc.h from the kernel This is a bit larger than the previous syncs, because we use extent_io_tree's everywhere. There's a lot of stuff added to kerncompat.h, and then I went through and cleaned up all the API changes, which were - extent_io_tree_init takes an fs_info and an owner now. - extent_io_tree_cleanup is now extent_io_tree_release. - set_extent_dirty takes a gfpmask. - clear_extent_dirty takes a cached_state. - find_first_extent_bit takes a cached_state. The diffstat looks insane for this, but keep in mind extent-io-tree.c and extent-io-tree.h are ~2000 loc just by themselves. Signed-off-by: Josef Bacik <josef@toxicpanda.com> Signed-off-by: David Sterba <dsterba@suse.com>	2023-05-26 18:02:29 +02:00
Josef Bacik	bf743c4cf8	btrfs-progs: sync async-thread.[ch] from the kernel We won't actually use the async code in progs, however we call the helpers and such all over the normal code, so sync this into btrfs-progs to make syncing other parts of the kernel easier. Signed-off-by: Josef Bacik <josef@toxicpanda.com> Signed-off-by: David Sterba <dsterba@suse.com>	2023-05-26 18:02:29 +02:00
Josef Bacik	f8efe9f724	btrfs-progs: sync file-item.h into progs This patch syncs file-item.h into btrfs-progs. This carries with it an API change for btrfs_del_csums, which takes a root argument in the kernel, so all callsites have been updated accordingly. I didn't sync file-item.c because it carries with it a bunch of bio related helpers which are difficult to adapt to the kernel. Additionally there's a few helpers in the local copy of file-item.c that aren't in the kernel that are required for different tools. This requires more cleanups in both the kernel and progs in order to sync file-item.c, so for now just do file-item.h in order to pull things out of ctree.h. Signed-off-by: Josef Bacik <josef@toxicpanda.com> Signed-off-by: David Sterba <dsterba@suse.com>	2023-05-26 18:02:29 +02:00
Josef Bacik	c979ffd787	btrfs-progs: sync accessors.[ch] from the kernel This syncs accessors.[ch] from the kernel. For the most part accessors.h will remain the same, there's just some helpers that need to be adjusted for eb->data instead of eb->pages. Additionally accessors.c needed to be completely updated to deal with this as well. This is a set of files where we will likely only sync the header going forward, and leave the C file in place as it needs to be specific to btrfs-progs. This forced a few "unrelated" changes - Using btrfs_dir_item_ftype() instead of btrfs_dir_item_type(). This is due to the encryption changes, and was simpler to just do in this patch. - Adjusting some of the print tree code to use the actual helpers and not the btrfs-progs ones. A local definition of static_assert is used to avoid compilation failures on older gcc (< 9) where the 2nd parameter is mandatory. Signed-off-by: Josef Bacik <josef@toxicpanda.com> Signed-off-by: David Sterba <dsterba@suse.com>	2023-05-26 18:02:28 +02:00
David Sterba	164bc10dfc	btrfs-progs: add musl compatibility for printf format %pV Glibc provides an interface to extend the printf formats but this is not standardized and does not work on musl. The code brought from kernel uses %pV for varargs and also has own implementation of printk. As a workaround for musl expand the pV value to a string and then simply print it. The details are hidden behind macros: - DECLARE_PV(vaf) - PV_ASSIGN(vaf, format, args) - PV_FMT in printf string - PV_VAL in arguments Signed-off-by: David Sterba <dsterba@suse.com>	2023-05-26 18:02:28 +02:00
Josef Bacik	228aa34f10	btrfs-progs: sync messages.[ch] from the kernel These are the printk helpers from the kernel. There were a few modifications, the hi-lights are - We do not have fs_info::fs_state, so that needed to be removed. - We do not have discard.h sync'ed yet, so that dependency was dropped. - Anything related to struct super_block was commented out. - The transaction abort had to be modified to fit with the current btrfs-progs code. - Added a btrfs_no_printk() helper to common/messages.* so that the print statements still worked. - The 32bit limit checkers are not needed so are behind __KERNEL__ Additionally there were kerncompat.h changes that needed to be made to handle the dependencies properly. Those are easier to spot. Any function that needed to be modified has a MODIFIED tag in the comment section with a list of things that were changed. Signed-off-by: Josef Bacik <josef@toxicpanda.com> Signed-off-by: David Sterba <dsterba@suse.com>	2023-05-26 18:02:28 +02:00
Josef Bacik	89697b69f7	btrfs-progs: sync on-disk definitions from the kernel header This pulls in the kernel's uapi/btrfs_tree.h, which now has all of the on-disk definitions. Include this into ctree.h, and then yank out all the duplicate code from ctree.h. Signed-off-by: Josef Bacik <josef@toxicpanda.com> Signed-off-by: David Sterba <dsterba@suse.com>	2023-05-26 18:02:28 +02:00
Josef Bacik	a754fe29d9	btrfs-progs: sync uapi/btrfs.h into btrfs-progs We want to keep this file locally as we want to be uptodate with upstream, so we can build btrfs-progs regardless of which kernel is currently installed. Sync this with the upstream version and put it in kernel-shared/uapi to maintain some semblance of where this file comes from. There are some changes that need to be synced back to kernel. A local definition of static_assert is used to avoid compilation problems on gcc (< 9) due to mandatory 2nd parameter. Signed-off-by: Josef Bacik <josef@toxicpanda.com> Signed-off-by: David Sterba <dsterba@suse.com>	2023-05-26 18:02:28 +02:00
Josef Bacik	b0a4eab561	btrfs-progs: remove parent_key arg from btrfs_check_* helpers Now that this is unused by these helpers and only used by the repair related code we can remove this argument from the main helpers. Signed-off-by: Josef Bacik <josef@toxicpanda.com> Signed-off-by: David Sterba <dsterba@suse.com>	2023-05-26 18:02:28 +02:00
Josef Bacik	90e687d90c	btrfs-progs: add a btrfs check helper for checking blocks btrfs check wants to be able to record corrupted extents if it finds any bad blocks. This has been done directly inside of the btrfs_check_leaf/btrfs_check_node helpers, but these are going to be sync'ed from the kernel in the future. Add another helper and move the corrupt block handling into this helper and keep it inside of the check code. Signed-off-by: Josef Bacik <josef@toxicpanda.com> Signed-off-by: David Sterba <dsterba@suse.com>	2023-05-26 18:02:28 +02:00
Josef Bacik	d36571ed9a	btrfs-progs: remove fs_info argument from btrfs_check_* helpers This can be pulled out of the extent buffer that is passed in, drop the fs_info argument from the function. Signed-off-by: Josef Bacik <josef@toxicpanda.com> Signed-off-by: David Sterba <dsterba@suse.com>	2023-05-26 18:02:28 +02:00
Josef Bacik	bee10e7fc9	btrfs-progs: rename the qgroup structs to match the kernel Now that the libbtrfs stuff has it's own local copy of ctree.h and ioctl.h, let's rename these qgroup struct members to match the kernel names, this way it'll make it easier to sync the kernel code into btrfs-progs. Signed-off-by: Josef Bacik <josef@toxicpanda.com> Signed-off-by: David Sterba <dsterba@suse.com>	2023-05-26 18:02:28 +02:00
psykose	c9abbf6264	btrfs-progs: stop using legacy 64 interfaces The 64 interfaces, such as fstat64, off64_t, etc, are legacy interfaces created at a time when 64-bit file support was still new. They are generally exposed when defining a macro named _LARGEFILE64_SOURCE, as e.g. the glibc docs[0] say. The modern way to utilise largefile support, is to continue to use the regular interfaces (off_t, fstat, ..), and define _FILE_OFFSET_BITS=64. We already use the autoconf macro AC_SYS_LARGEFILE[1] which arranges this and sets this macro for us. Therefore, we can utilise the non-64 names without fear of breaking on 32-bit systems. This fixes the build against musl libc, ever since musl dropped the 64 compat from interfaces by default[2] just for _GNU_SOURCE, unless _LARGEFILE64_SOURCE is defined. However, there are plans for a future removal of the whole 64 header API, and that workaround (adding another define) might cease to exist. So, rename all 64 API use to the regular non-suffixed names. For consistency, rename the internal functions that were 64 named (lstat64_path, ..) too. This should have no regressions on any platform. [0]: https://www.gnu.org/software/libc/manual/html_node/Feature-Test-Macros.html#index-_005fLARGEFILE64_005fSOURCE [1]: https://www.gnu.org/software/autoconf/manual/autoconf-2.67/html_node/System-Services.html [2]: `25e6fee27f` Pull-request: #615 Signed-off-by: psykose <alice@ayaya.dev> Signed-off-by: David Sterba <dsterba@suse.com>	2023-04-25 16:59:42 +02:00
Qu Wenruo	68a04bc710	btrfs-progs: tune: add new option to convert back to extent tree With previous btrfstune support to convert to block-group-tree, it has implemented most of the infrastructure for bi-directional conversion. This patch will implement the remaining conversion support to go back to extent tree. The modification includes: - New convert_to_extent_tree() function in btrfstune.c It's almost the same as convert_to_bg_tree(), but with small changes: * No need to set extra features like NO_HOLES/FST. * Need to delete the block group tree when everything finished. - Update btrfs_delete_and_free_root() to handle non-global roots Currently the function can only accepts global roots (extent/csum/free space trees) If we pass a non-global root into the function, we will screw up global_roots_tree and crash. Since we're going to use btrfs_delete_and_free_root() to free block group tree which is not a global tree, this is needed. - New handling for half converted fs in get_last_converted_bg() There are two cases need to be handled: * The bg tree is already empty We need to grab the first bg in extent tree. Or at conversion function we will fail at grabbing the first bg. * The bg tree is not empty Then we need to grab the last bg in extent tree. - Extra root switching in involved functions. This involves: * read_converting_block_groups() * insert_block_group_item() * update_block_group_item() We just need to update our target root according to the current compat_ro and super flags. Signed-off-by: Qu Wenruo <wqu@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2023-04-19 01:10:24 +02:00
Qu Wenruo	6c7a6dba95	btrfs-progs: use alloc_dummy_extent_buffer() for temporary super block [FALSE ALERT] There is a false alert when compiling btrfs-progs using gcc 12.2.1: $ make D=1 kernel-shared/print-tree.c: In function 'print_sys_chunk_array': kernel-shared/print-tree.c:1797:9: warning: 'buf' may be used uninitialized [-Wmaybe-uninitialized] 1797 \| write_extent_buffer(buf, sb, 0, sizeof(sb)); \| ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ In file included from ./kernel-shared/ctree.h:27, from kernel-shared/print-tree.c:24: ./kernel-shared/extent_io.h:148:6: note: by argument 1 of type 'const struct extent_buffer ' to 'write_extent_buffer' declared here 148 \| void write_extent_buffer(const struct extent_buffer eb, const void src, \| ^~~~~~~~~~~~~~~~~~~ [CAUSE] This is a false alert, the uninitialized part of buf will not be utilized at all during write_extent_buffer(). [FIX] Instead of allocating such ad-hoc buffer, go a more formal way by calling alloc_dummy_extent_buffer(), which would properly set all the members. Reviewed-by: Anand Jain <anand.jain@oracle.com> Signed-off-by: Qu Wenruo <wqu@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2023-03-21 03:10:42 +01:00
David Sterba	9be33f558c	btrfs-progs: tune: update checksum conversion The checksum conversion is still experimental and still does not convert all filesystems correctly. Do not use on valuable data. Previous implementation copied the UUID conversion which was not a good base for the checksum conversion so it left out basically all trees except extent and checksum. This update adds the base for the required safety features: - let the old csum tree intact until the full conversion is done (ie. all data are still verifiable) - add on-disk status tracking item, this should keep the from/to checksum conversion, last generation to catch potential updates of the underlying filesystem if conversion is interrupted and the filesystem mounted - convert most of the fundamental trees, the subvolumes, tree log and relocation trees are not converted - trees are converted in-place to avoid potentially running out of space but this might be better done by transaction protection with a temporary tree Known issues: - not all trees are converted - not all checksums are correctly inserted into the new tree and reading the files leads to EIO Issue: #438 Signed-off-by: David Sterba <dsterba@suse.com>	2023-02-28 20:11:22 +01:00
Qu Wenruo	f914949b1a	btrfs-progs: fix set but not used variables [WARNING] Clang 15.0.7 warns about several unused variables: kernel-shared/zoned.c:829:6: warning: variable 'num_sequential' set but not used [-Wunused-but-set-variable] u32 num_sequential = 0, num_conventional = 0; ^ cmds/scrub.c:1174:6: warning: variable 'n_skip' set but not used [-Wunused-but-set-variable] int n_skip = 0; ^ mkfs/main.c:493:6: warning: variable 'total_block_count' set but not used [-Wunused-but-set-variable] u64 total_block_count = 0; ^ image/main.c:2246:6: warning: variable 'bytenr' set but not used [-Wunused-but-set-variable] u64 bytenr = 0; ^ [CAUSE] Most of them are just straightforward set but not used variables. The only exception is total_block_count, which has commented out code relying on it. [FIX] Just remove those variables, and for @total_block_count, also remove the comments. Signed-off-by: Qu Wenruo <wqu@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2023-02-18 17:44:03 +01:00
Qu Wenruo	3a1d4aa089	btrfs-progs: fix fallthrough cases with proper attributes [FALSE ALERT] Unlike gcc, clang doesn't really understand the comments, thus it's reportings tons of fall through related errors: cmds/reflink.c:124:3: warning: unannotated fall-through between switch labels [-Wimplicit-fallthrough] case 'r': ^ cmds/reflink.c:124:3: note: insert '__attribute__((fallthrough));' to silence this warning case 'r': ^ __attribute__((fallthrough)); cmds/reflink.c:124:3: note: insert 'break;' to avoid fall-through case 'r': ^ break; [CAUSE] Although gcc is fine with /* fallthrough */ comments, clang is not. [FIX] So just introduce a fallthrough macro to handle the situation properly, and use that macro instead. Signed-off-by: Qu Wenruo <wqu@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2023-02-18 17:44:02 +01:00
Qu Wenruo	66dad3a8f5	btrfs-progs: fix a false alert on an uninitialized variable when BUG_ON() is involved [FALSE ALERT] Clang 15.0.7 gives the following false alert in get_dev_extent_len(): kernel-shared/extent-tree.c:3328:2: warning: variable 'div' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] default: ^~~~~~~ kernel-shared/extent-tree.c:3332:24: note: uninitialized use occurs here return map->ce.size / div; ^~~ kernel-shared/extent-tree.c:3311:9: note: initialize the variable 'div' to silence this warning int div; ^ = 0 And one in btrfs_stripe_length() too: kernel-shared/volumes.c:2781:2: warning: variable 'stripe_len' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] default: ^~~~~~~ kernel-shared/volumes.c:2785:9: note: uninitialized use occurs here return stripe_len; ^~~~~~~~~~ kernel-shared/volumes.c:2754:16: note: initialize the variable 'stripe_len' to silence this warning u64 stripe_len; ^ = 0 [CAUSE] Clang doesn't really understand what BUG_ON() means, thus in that default case, we won't get uninitialized value but crash directly. [FIX] Silent the errors by assigning the default value properly using the value of SINGLE profile. Signed-off-by: Qu Wenruo <wqu@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2023-02-18 17:44:02 +01:00
Qu Wenruo	e2806fd624	btrfs-progs: remove an unnecessary branch to silent the clang warning [FALSE ALERT] With clang 15.0.7, there is a false alert on uninitialized value in ctree.c: kernel-shared/ctree.c:3418:13: warning: variable 'offset' is used uninitialized whenever 'if' condition is false [-Wsometimes-uninitialized] } else if (ret < 0) { ^~~~~~~ kernel-shared/ctree.c:3428:41: note: uninitialized use occurs here write_extent_buffer(eb, &subvol_id_le, offset, sizeof(subvol_id_le)); ^~~~~~ kernel-shared/ctree.c:3418:9: note: remove the 'if' if its condition is always true } else if (ret < 0) { ^~~~~~~~~~~~~ kernel-shared/ctree.c:3380:22: note: initialize the variable 'offset' to silence this warning unsigned long offset; ^ = 0 kernel-shared/ctree.c:3418:13: warning: variable 'eb' is used uninitialized whenever 'if' condition is false [-Wsometimes-uninitialized] } else if (ret < 0) { ^~~~~~~ kernel-shared/ctree.c:3429:26: note: uninitialized use occurs here btrfs_mark_buffer_dirty(eb); ^~ kernel-shared/ctree.c:3418:9: note: remove the 'if' if its condition is always true } else if (ret < 0) { ^~~~~~~~~~~~~ kernel-shared/ctree.c:3378:26: note: initialize the variable 'eb' to silence this warning struct extent_buffer eb; ^ = NULL [CAUSE] The original code is handling the return value from btrfs_insert_empty_item() like this: ret = btrfs_insert_empty_item(); if (ret >= 0) { / Do something for it. / } else if (ret == -EEXIST) { / Do something else. / } else if (ret < 0) { / Error handling. */ } But the problem is, the last one check is always true if we can reach there. Thus clang is providing the hint to remove the if () check. [FIX] Normally we prefer to do error handling first, so move the error handling first so we don't need the if () else if () chain. Signed-off-by: Qu Wenruo <wqu@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2023-02-18 17:44:02 +01:00
David Sterba	73efc566d5	btrfs-progs: kerncompat: hide definition of __init As kerncompat.h is included from all libbtrfs headers we must be careful about generic names like __init, in this case it breaks build of snapper. Signed-off-by: David Sterba <dsterba@suse.com>	2023-01-03 17:24:36 +01:00
David Sterba	e78fe2d92e	Revert "btrfs-progs: rename qgroup items to match the kernel naming scheme" This reverts commit `03451430de`. (It's not 1:1, there are some additional trivial fixups in cmds/qgroup.c) This breaks a lot of 3rd party tools that depend on it as Neal reports: * btrfs-assistant * buildah * cri-o * podman * skopeo * containerd * moby/docker * snapper * source-to-image Link: https://lore.kernel.org/linux-btrfs/CAEg-Je8L7jieKdoWoZBuBZ6RdXwvwrx04AB0fOZF1fr5Pb-o1g@mail.gmail.com/ Reported-by: Neal Gompa <ngompa@fedoraproject.org> Signed-off-by: David Sterba <dsterba@suse.com>	2023-01-03 13:10:54 +01:00
Josef Bacik	e380421ff2	btrfs-progs: make write_extent_buffer take a const eb This is what we do in the kernel, and while we're syncing individual files we're going to have state where some callers are using a const, but progs isn't. So adjust write_extent_buffer to take a const eb in order to make this less painful. Signed-off-by: Josef Bacik <josef@toxicpanda.com> Signed-off-by: David Sterba <dsterba@suse.com>	2022-11-30 19:14:29 +01:00
Josef Bacik	405b5cadd3	btrfs-progs: replace btrfs_leaf_data with btrfs_item_nr_offset We're using btrfs_item_nr_offset(leaf, 0) to get the start of the leaf data in the kernel, we don't have btrfs_leaf_data. Replace all occurrences of btrfs_leaf_data() with btrfs_item_nr_offset(leaf, 0) in order to make syncing accessors.[ch] easier. ctree.c will be synced later, so this is simply an intermediate step. Signed-off-by: Josef Bacik <josef@toxicpanda.com> Signed-off-by: David Sterba <dsterba@suse.com>	2022-11-30 19:14:29 +01:00
Josef Bacik	788a71c16a	btrfs-progs: sync compression.h from the kernel This patch copies in compression.h from the kernel. This is relatively straightforward, we just have to drop the compression types definition from ctree.h, and update the image to use BTRFS_NR_COMPRESS_TYPES instead of BTRFS_COMPRESS_LAST, and add a few things to kerncompat.h to make everything build smoothly. Signed-off-by: Josef Bacik <josef@toxicpanda.com> Signed-off-by: David Sterba <dsterba@suse.com>	2022-11-30 19:14:29 +01:00
Josef Bacik	fac1fae3ef	btrfs-progs: rename extent buffer flags to EXTENT_BUFFER_* We have been overloading the extent_state flags for use on the extent buffers as well. When we sync extent-io-tree.[ch] this will become impossible, so rename these flags to EXTENT_BUFFER_* and use those definitions instead of the extent_state definitions. Signed-off-by: Josef Bacik <josef@toxicpanda.com> Signed-off-by: David Sterba <dsterba@suse.com>	2022-11-28 18:57:44 +01:00
Josef Bacik	83cc5a5489	btrfs-progs: delete state_private code We used to store random private things into extent_states, but we haven't done this for a while and there are no users of this code, simply delete it. Signed-off-by: Josef Bacik <josef@toxicpanda.com> Signed-off-by: David Sterba <dsterba@suse.com>	2022-11-28 18:57:44 +01:00
Josef Bacik	20d88c17e7	btrfs-progs: move extent cache code directly into btrfs_fs_info We have some extra features in the btrfs-progs copy of the extent_io_tree that don't exist in the kernel. In order to make syncing easier simply move this functionality into btrfs_fs_info, that way we can sync in the new extent_io_tree code and not have to worry about breaking anything. Signed-off-by: Josef Bacik <josef@toxicpanda.com> Signed-off-by: David Sterba <dsterba@suse.com>	2022-11-28 18:57:44 +01:00
Josef Bacik	412eea9e97	btrfs-progs: do not pass io_tree into verify_parent_transid We do not use the io_tree, don't bother passing it into verify_parent_transid. Signed-off-by: Josef Bacik <josef@toxicpanda.com> Signed-off-by: David Sterba <dsterba@suse.com>	2022-11-28 18:57:44 +01:00
Josef Bacik	ccee633f3a	btrfs-progs: move dirty eb tracking to it's own io_tree btrfs-progs has a cache tree embedded in the extent_io_tree in order to track extent buffers. We use the extent_io_tree part to track dirty, and the cache tree to keep the extent buffers in. When we sync extent-io-tree.[ch] we'll lose this ability, so separate out the dirty tracking into its own extent_io_tree. Subsequent patches will adjust the extent buffer lookup so it doesn't use the custom extent_io_tree thing. Signed-off-by: Josef Bacik <josef@toxicpanda.com> Signed-off-by: David Sterba <dsterba@suse.com>	2022-11-28 18:57:43 +01:00
Josef Bacik	af30cf2e3e	btrfs-progs: make the find extent buffer helpers take fs_info This is a cleanup patch to make syncing the btrfs kernel code into btrfs-progs easier. In btrfs-progs we have an extra cache in the extent_io_tree that's exclusively used for the extent buffer tracking. In order to untangle this dependency start passing around the fs_info to search for extent_buffers, and then have the helpers use the appropriate structure to find the extent buffer. Signed-off-by: Josef Bacik <josef@toxicpanda.com> Signed-off-by: David Sterba <dsterba@suse.com>	2022-11-28 18:57:43 +01:00
Josef Bacik	2b9fba7974	btrfs-progs: rename btrfs_item_end to btrfs_item_data_end This matches what we did in the kernel, btrfs_item_data_end is more inline with what the helper does, which is give us the offset of the end of the data portion of the item, not the offset of the end of the item itself. Signed-off-by: Josef Bacik <josef@toxicpanda.com> Signed-off-by: David Sterba <dsterba@suse.com>	2022-11-28 18:57:43 +01:00
Josef Bacik	53f8b8fd01	btrfs-progs: make btrfs_qgroup_level helper match the kernel We return __u16 in the kernel, as this is actually the size of btrfs_qgroup_level. Adjust the existing helpers and update all the callers to deal with the new size appropriately. This will make syncing the kernel code cleaner. Signed-off-by: Josef Bacik <josef@toxicpanda.com> Signed-off-by: David Sterba <dsterba@suse.com>	2022-11-28 18:57:43 +01:00
Josef Bacik	03451430de	btrfs-progs: rename qgroup items to match the kernel naming scheme We're going to sync the kernel source into btrfs-progs, and in the kernel we have all these qgroup fields named with short names instead of the full name, so rename referenced -> rfer compressed -> cmpr exclusive -> excl to match the kernel and update all the users, this will make the sync cleaner. ioctl.h is a public header but there are no users of the btrfs_qgroup_limit structure. Signed-off-by: Josef Bacik <josef@toxicpanda.com> Signed-off-by: David Sterba <dsterba@suse.com>	2022-11-28 18:57:43 +01:00
Qu Wenruo	2aa4085bf7	btrfs-progs: properly handle degraded raid56 reads [BUG] For a degraded RAID5, btrfs check will fail to even read the chunk root: # mkfs.btrfs -f -m raid5 -d raid5 $dev1 $dev2 $dev3 # wipefs -fa $dev1 # btrfs check $dev2 Opening filesystem to check... warning, device 1 is missing bad tree block 22036480, bytenr mismatch, want=22036480, have=0 ERROR: cannot read chunk root ERROR: cannot open file system [CAUSE] Although read_tree_block() function from btrfs-progs is properly iterating the mirrors (mirror 1 is reading from the disk directly, mirror 2 will be rebuild from parity), the raid56 recovery path is not handling the read error correctly. The existing code will try to read the full stripe, but any read failure (including missing device) will immediately cause an error: for (i = 0; i < num_stripes; i++) { ret = btrfs_pread(multi->stripes[i].dev->fd, pointers[i], BTRFS_STRIPE_LEN, multi->stripes[i].physical, fs_info->zoned); if (ret < BTRFS_STRIPE_LEN) { ret = -EIO; goto out; } } [FIX] To make failed_a/failed_b calculation much easier, and properly handle too many missing devices, here this patch will introduce a new bitmap based solution. The new @failed_stripe_bitmap will represent all the failed stripes. So the initial read will mark all the missing devices in the @failed_stripe_bitmap, and later operations will all operate on that bitmap. Only before we call raid56_recov(), we convert the bitmap to the old failed_a/failed_b interface and continue. Now btrfs check can handle above case properly: # btrfs check $dev2 Opening filesystem to check... warning, device 1 is missing Checking filesystem on /dev/test/scratch2 UUID: 8b2e1cb4-f35b-4856-9b11-262d39d8458b [1/7] checking root items [2/7] checking extents [3/7] checking free space tree [4/7] checking fs roots [5/7] checking only csums items (without verifying data) [6/7] checking root refs [7/7] checking quota groups skipped (not enabled on this FS) found 147456 bytes used, no error found total csum bytes: 0 total tree bytes: 147456 total fs tree bytes: 32768 total extent tree bytes: 16384 btree space waste bytes: 139871 file data blocks allocated: 0 referenced 0 Signed-off-by: Qu Wenruo <wqu@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2022-11-24 17:29:12 +01:00
David Sterba	1414ecbb6f	btrfs-progs: replace strerror(errno) with %m in printf formats We're using the %m format where possible. Signed-off-by: David Sterba <dsterba@suse.com>	2022-10-26 09:46:22 +02:00
David Sterba	44efde7fa0	btrfs-progs: unify naming of qgroup subvolid helpers Kernel function name is btrfs_qgroup_subvolid so rename it in progs. The libbtrfs can't API be changed without versioning so at least add the new helper. Signed-off-by: David Sterba <dsterba@suse.com>	2022-10-26 09:36:44 +02:00
Qu Wenruo	720819745c	btrfs-progs: print-tree: follow the supported flags when printing flags Currently we put EXTENT_TREE_V2 incompat flag entry under EXPERIMENTAL features, thus at compile time, incompat_flags_array[] is determined at compile time. But the truth is, we have @supported_flags for __print_readable_flag(), which is already defined based on EXPERIMENTAL flag. Thus for __print_readable_flag(), we can always include the entry for EXTENT_TREE_V2, and only print the flag if it's in the @supported_flags By this, we can remove one EXPERIMENTAL ifdef. Signed-off-by: Qu Wenruo <wqu@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2022-10-18 16:03:43 +02:00
Anand Jain	c802b02988	btrfs-progs: dump-super: add extent-tree-v2 The extent-tree-v2 is still experimental but it should be printed among the other incompat flags if enabled by the build. Signed-off-by: Anand Jain <anand.jain@oracle.com> Signed-off-by: David Sterba <dsterba@suse.com>	2022-10-11 09:08:12 +02:00
Qu Wenruo	2cdc8dddbf	btrfs-progs: mkfs: offset inode numbers of the source filesystem [BUG] When running mkfs tests on a newly rebooted minimal system, it can cause mkfs/009 to fail. The reproduce steps requires /tmp to has minimal files in the first place. # mkdir /tmp/rootdir # xfs_io -f -c "pwrite 0 16k" /tmp/rootdir # mkfs.btrfs --rootdir /tmp/rootdir -f $dev # btrfs check $dev Opening filesystem to check... Checking filesystem on /dev/test/scratch1 UUID: 6821b3db-f056-4c18-b797-32679dcd4272 [1/7] checking root items [2/7] checking extents data backref 13631488 root 5 owner 170 offset 0 num_refs 0 not found in extent tree incorrect local backref count on 13631488 root 5 owner 170 offset 0 found 1 wanted 0 back 0x55ff6cd72260 backref 13631488 root 5 not referenced back 0x55ff6cd4c1f0 incorrect global backref count on 13631488 found 2 wanted 1 backpointer mismatch on [13631488 16384] ERROR: errors found in extent allocation tree or chunk allocation [CAUSE] The extent tree has the following weird item: item 0 key (13631488 EXTENT_ITEM 16384) itemoff 16250 itemsize 33 refs 1 gen 0 flags DATA tree block backref root FS_TREE This is an extent item for data, thus it should not have an inline tree backref. Then checking the fs tree: item 0 key (170 INODE_ITEM 0) itemoff 16123 itemsize 160 generation 7 transid 0 size 16384 nbytes 16384 block group 0 mode 100600 links 1 uid 1000 gid 1000 rdev 0 sequence 0 flags 0x0(none) atime 1664866393.0 (2022-10-04 14:53:13) ctime 1664863510.0 (2022-10-04 14:05:10) mtime 1664863455.0 (2022-10-04 14:04:15) otime 0.0 (1970-01-01 08:00:00) There is an inode item before the root dir inode. And that inode number 170 is causing the problem. In traverse_directory(), we use the inode number reported from stat() directly as btrfs inode number, and pass it to btrfs_record_file_extent(), which finally calls btrfs_inc_extent_ref(), with above 170 passed as @owner parameter. But inside btrfs_inc_extent_ref() we use that @owner value to determine if it's a data backref. Since we got a smaller than BTRFS_FIRST_FREE_OBJECTID, btrfs treats it as tree block, and cause the above problem. [FIX] As a quick fix, always add BTRFS_FIRST_FREE_OBJECTID to all inode number directly grabbed from stat(). And add an ASSERT() in __btrfs_record_file_extent() to catch unexpected objectid. This is not a perfect solution, as the resulted fs will has a huge gap in its inodes: item 0 key (256 INODE_ITEM 0) itemoff 16123 itemsize 160 item 4 key (426 INODE_ITEM 0) itemoff 15883 itemsize 160 For a proper fix, we should allocate new btrfs inode numbers in a sequential order, but that would be another series of patches. Signed-off-by: Qu Wenruo <wqu@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2022-10-11 09:08:10 +02:00
Qu Wenruo	dad9db45bb	btrfs-progs: properly initialize extent generation in __btrfs_record_file_extent() [BUG] When using mkfs.btrfs --rootdir option, the data extents generated will have 0 as their generation in extent tree: # mkdir /tmp/rootdir # xfs_io -f -c "pwrite 0 16k" /tmp/rootdir/foobar # mkfs.btrfs -f --rootdir /tmp/rootdir $dev # btrfs ins dump-tree -t extent $dev btrfs-progs v5.19.1 extent tree key (EXTENT_TREE ROOT_ITEM 0) leaf 30474240 items 13 free space 15536 generation 7 owner EXTENT_TREE leaf 30474240 flags 0x1(WRITTEN) backref revision 1 fs uuid c1f05988-49f9-4dd4-8489-b90d60f522ee chunk uuid 40f81603-fe75-4f58-aa9e-e74e28df8523 item 0 key (13631488 EXTENT_ITEM 16384) itemoff 16230 itemsize 53 refs 1 gen 0 flags DATA <<< Generation is 0 ... [CAUSE] In __btrfs_record_file_extent() we just set the extent generation to 0. [FIX] Use trans->transid to properly fill extent generation. Now after mkfs, the first data extent backref looks like this: item 0 key (13631488 EXTENT_ITEM 16384) itemoff 16230 itemsize 53 refs 1 gen 7 flags DATA ... Reviewed-by: Anand Jain <anand.jain@oracle.com> Signed-off-by: Qu Wenruo <wqu@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2022-10-11 09:08:10 +02:00
David Sterba	ccb2d4aa45	btrfs-progs: device-utils: rename btrfs_device_size There's a group of helpers to read device size, the btrfs_device_size should be one of them. Rename it and so minor cleanup. Signed-off-by: David Sterba <dsterba@suse.com>	2022-10-11 09:08:10 +02:00
David Sterba	a827bb2db8	btrfs-progs: use template for transaction commit error messages Signed-off-by: David Sterba <dsterba@suse.com>	2022-10-11 09:08:10 +02:00
David Sterba	8fcafae04a	btrfs-progs: use template for transaction start error messages Signed-off-by: David Sterba <dsterba@suse.com>	2022-10-11 09:08:10 +02:00
David Sterba	c2be0e2ce0	btrfs-progs: use template for out of memory error messages Signed-off-by: David Sterba <dsterba@suse.com>	2022-10-11 09:08:09 +02:00
David Sterba	2267708bfe	btrfs-progs: move repair.c from common/ to check/ Signed-off-by: David Sterba <dsterba@suse.com>	2022-10-11 09:08:09 +02:00
Qu Wenruo	08bb354a1c	btrfs-progs: properly handle write error when writing back tree blocks [BUG] If we emulate a write error during commit transaction, by setting the block device read-only, then we can easily have the following crash using "btrfs check --clear-space-cache v2": Opening filesystem to check... Checking filesystem on /dev/test/scratch1 UUID: 5945915b-37f1-4bfa-9f64-684b318b8f73 Clear free space cache v2 Error writing to device 1 kernel-shared/transaction.c:156: __commit_transaction: BUG_ON `ret` triggered, value 1 ./btrfs(+0x570c9)[0x562ec894f0c9] ./btrfs(+0x57167)[0x562ec894f167] ./btrfs(__commit_transaction+0x13b)[0x562ec894f7f2] ./btrfs(btrfs_commit_transaction+0x214)[0x562ec894fa64] ./btrfs(btrfs_clear_free_space_tree+0x177)[0x562ec8941ae6] ./btrfs(+0xc8958)[0x562ec89c0958] ./btrfs(+0xc9d53)[0x562ec89c1d53] ./btrfs(+0x17ec7)[0x562ec890fec7] ./btrfs(main+0x12f)[0x562ec8910908] /usr/lib/libc.so.6(+0x232d0)[0x7ff917ee82d0] /usr/lib/libc.so.6(__libc_start_main+0x8a)[0x7ff917ee838a] ./btrfs(_start+0x25)[0x562ec890fdc5] Aborted (core dumped) [CAUSE] The call trace has shown it's a BUG_ON(), and it's from __commit_transaction(), which is writing tree blocks back. [FIX] The fix is pretty simple, just return error. In fact we even have an error value check in btrfs_commit_transaction() just after __commit_transaction() call (although not catching the return value from it). And since we're here, also call btrfs_abort_transaction() to prevent newer transactions from being started. Now we won't have a full crash: Opening filesystem to check... Checking filesystem on /dev/test/scratch1 UUID: 5945915b-37f1-4bfa-9f64-684b318b8f73 Clear free space cache v2 Error writing to device 1 ERROR: failed to write bytenr 30425088 length 16384: Operation not permitted ERROR: failed to write tree block 30425088: Operation not permitted ERROR: failed to clear free space cache v2: -1 extent buffer leak: start 30720000 len 16384 Reported-by: Christoph Anton Mitterer <calestyo@scientia.org> Signed-off-by: Qu Wenruo <wqu@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2022-10-11 09:08:08 +02:00
Qu Wenruo	75800c2fee	btrfs-progs: remove duplicated leaked extent buffer report [BUG] When transaction is aborted halfway, we can have extent buffer leaked, and in that case, the same leaked extent buffer can be reported for multiple times: ERROR: failed to clear free space cache v2: -1 extent buffer leak: start 30441472 len 16384 WARNING: dirty eb leak (aborted trans): start 30441472 len 16384 extent buffer leak: start 30720000 len 16384 extent buffer leak: start 30425088 len 16384 extent buffer leak: start 30425088 len 16384 << Duplicated WARNING: dirty eb leak (aborted trans): start 30425088 len 16384 Note that 30425088 line is reported twice (not accounting the "dirty eb leak" line). [CAUSE] When we detected a leaked eb, we call free_extent_buffer_nocache(), but free_extent_buffer_nocache() can only remove the eb when its reduced refs is 0. If the eb has refs 2, it will need two free_extent_buffer_nocache() calls to remove it from the cache. [FIX] Just reset the eb->refs to 1 so that free_extent_buffer_nocache() can remove it from cache for sure. Signed-off-by: Qu Wenruo <wqu@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2022-10-11 09:08:08 +02:00
Qu Wenruo	811ae819e3	btrfs-progs: remove unused function extent_io_tree_init_cache_max() The function was introduced by commit `a5ce5d2198` ("btrfs-progs: extent-cache: actually cache extent buffers") but never got utilized. Thus we can just remove it. Signed-off-by: Qu Wenruo <wqu@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2022-10-11 09:08:08 +02:00
Boris Burkov	980ba4e842	btrfs-progs: receive: add support for fs-verity Process an enable_verity cmd by running the enable verity ioctl on the file. Since enabling verity denies write access to the file, it is important that we don't have any open write file descriptors. This also revs the send stream format to version 3 with no format changes besides the new commands and attributes. This version is not finalized and commands may change, also this needs to be synchronized with any kernel changes. Note: the build is conditional on the header linux/fsverity.h Signed-off-by: Boris Burkov <boris@bur.io> Signed-off-by: David Sterba <dsterba@suse.com>	2022-10-11 09:08:08 +02:00
David Sterba	feef6aaaf6	btrfs-progs: kernel-lib: remove radix-tree The radix-tree is not used in userspace code. In kernel it's for tracking unpersisted and in-memory structures and has been replaced by the xarray. Signed-off-by: David Sterba <dsterba@suse.com>	2022-10-11 09:08:07 +02:00
Qu Wenruo	d8f3355734	btrfs-progs: unexport csum_tree_block() The function csum_tree_block() is not really utilized by anyone, all current callers just use csum_tree_block_size(). Furthermore there is a stale definition in common/utils.h which is using the old "struct btrfs_root" as the first argument, while we have already migrated to "struct btrfs_fs_info". So just unexport csum_tree_block() and remove the stale definition. Signed-off-by: Qu Wenruo <wqu@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2022-10-11 09:06:11 +02:00
David Sterba	d5e15ba825	btrfs-progs: fix may be unused warning in load_free_space_extents Some compilers warn about potentially unused variable, however the value validity is guarded by have_prev so this can't happen and it's probably insufficient analysis on the compiler side. Let's initialize the prev_key to zeros that would also work as the condition. In file included from /usr/include/stdio.h:894, from ./kerncompat.h:27, from ./kernel-lib/list.h:23, from ./kernel-shared/ctree.h:24, from kernel-shared/free-space-tree.c:19: In function ‘fprintf’, inlined from ‘load_free_space_extents’ at kernel-shared/free-space-tree.c:1446:5, inlined from ‘load_free_space_tree’ at kernel-shared/free-space-tree.c:1577:9: /usr/include/bits/stdio2.h:105:10: warning: ‘prev_key.objectid’ may be used uninitialized [-Wmaybe-uninitialized] 105 \| return __fprintf_chk (__stream, __USE_FORTIFY_LEVEL - 1, __fmt, \| ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 106 \| __va_arg_pack ()); \| ~~~~~~~~~~~~~~~~~ kernel-shared/free-space-tree.c: In function ‘load_free_space_tree’: kernel-shared/free-space-tree.c:1398:31: note: ‘prev_key.objectid’ was declared here 1398 \| struct btrfs_key key, prev_key; Signed-off-by: David Sterba <dsterba@suse.com>	2022-10-11 09:06:11 +02:00
Qu Wenruo	2f2f6bfe17	btrfs-progs: btrfstune: add the ability to convert to block group tree feature The new '-b' option will be responsible for converting to block group tree compat ro feature. The workflow looks like this for new convert: - Setting CHANGING_BG_TREE flag And initialize fs_info->last_converted_bg_bytenr value to (u64)-1. Any bg with bytenr >= last_converted_bg_bytenr will have its bg item update go to the new root (bg tree). - Iterate each block group by their bytenr in descending order This involves: * Delete the old bg item from the old tree (extent tree) * Update last_converted_bg_bytenr to the bytenr of the bg * Add the new bg item into the new tree (bg tree) * If we have converted a bunch of bgs, commit current transaction - Clear CHANGING_BG_TREE flag And set the new BLOCK_GROUP_TREE compat ro flag and commit. And since we're doing the convert in multiple transactions, we also need to resume from last interrupted convert. In that case, we just grab the last unconverted bg, and start from it. And to co-operate with the new kernel requirement for both no-holes and free-space-tree features, the convert tool will check for free-space-tree feature. If not enabled, will error out with an error message to how to continue (by mounting with "-o space_cache=v2"). For missing no-holes feature, we just need to set the flag during convert. Signed-off-by: Qu Wenruo <wqu@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2022-09-12 18:25:32 +02:00
Qu Wenruo	1430b41427	btrfs-progs: separate block group tree from extent tree v2 Block group tree feature is completely a standalone feature, and it has been over 5 years before the initial introduction to solve the long mount time. I don't really want to waste another 5 years waiting for a feature which may or may not work, but definitely not properly reviewed for its preparation patches. So this patch will separate the block group tree feature into a standalone compat RO feature. There is a catch, in mkfs create_block_group_tree(), current tree-checker only accepts block group item with valid chunk_objectid, but the existing code from extent-tree-v2 didn't properly initialize it. This patch will also fix above mentioned problem so kernel can mount it correctly. Now mkfs/fsck should be able to handle the fs with block group tree. Signed-off-by: Qu Wenruo <wqu@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2022-09-12 18:25:32 +02:00
Qu Wenruo	c5a21a7814	btrfs-progs: don't save block group root into super block The extent tree v2 (thankfully not yet fully materialized) needs a new root for storing all block group items. My initial proposal years ago just added a new tree rootid, and load it from tree root, just like what we did for quota/free space tree/uuid/extent roots. But the extent tree v2 patches introduced a completely new (and to me, wasteful) way to store block group tree root into super block. Currently there are only 3 trees stored in super blocks, and they all have their valid reasons: - Chunk root Needed for bootstrap. - Tree root Really the entrance of all trees. - Log root This is special as log root has to be updated out of existing transaction mechanism. There is not even any reason to put block group root into super blocks, the block group tree is updated at the same timing as old extent tree, no need for extra bootstrap/out-of-transaction update. So just move block group root from super block into tree root. Signed-off-by: Qu Wenruo <wqu@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2022-09-12 15:31:27 +02:00
Qu Wenruo	e47c34821f	btrfs-progs: rescue: allow fix-device-size to shrink device item If we found that the underlying block device size is smaller than total_bytes in dev item, kernel will reject the mount, and there is no progs tool to fix it. Under most case it's just a small mismatch, and there is no dev extent in the shrunk range. In that case, we can let "btrfs rescue fix-device-size" to reset the total_bytes in dev items to fix. We add some extra checks, like to make sure there is no dev extent in the shrunk device range, to make sure we won't lose data during the device item shrink. And also update the test case to verify the repaired fs can pass the check. Issue: #504 Signed-off-by: Qu Wenruo <wqu@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2022-09-12 15:31:21 +02:00
Qu Wenruo	75fea7496c	btrfs-progs: use write_data_to_disk() to handle RAID56 in write_and_map_eb() Function write_data_to_disk() can handle RAID56 writes without any problem. So just call write_data_to_disk() inside write_and_map_eb() instead of manually doing the RAID56 write. Tested-by: Shin'ichiro Kawasaki <shinichiro.kawasaki@wdc.com> Reviewed-by: Johannes Thumshirn <johannes.thumshirn@wdc.com> Signed-off-by: Qu Wenruo <wqu@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2022-08-16 15:18:12 +02:00
Qu Wenruo	2060120201	btrfs-progs: fix a BUG_ON() condition for write_data_to_disk() The BUG_ON() condition in write_data_to_disk() is no longer correct. Now write_raid56_with_parity() will return the bytes written of last stripe. Thus a success writeback can trigger the BUG_ON(ret). Fix the condition to (ret < 0). Tested-by: Shin'ichiro Kawasaki <shinichiro.kawasaki@wdc.com> Reviewed-by: Johannes Thumshirn <johannes.thumshirn@wdc.com> Signed-off-by: Qu Wenruo <wqu@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2022-08-16 15:18:12 +02:00
Qu Wenruo	fc6925bfd3	btrfs-progs: avoid repeated data write for metadata [BUG] Shinichiro reported that "mkfs.btrfs -m DUP" is doing repeated write into the device. For non-zoned device this is not a big deal, but for zoned device this is critical, as zoned device doesn't support overwrite at all. [CAUSE] The problem is related to write_and_map_eb() call, since commit `2a93728391` ("btrfs-progs: use write_data_to_disk() to replace write_extent_to_disk()"), we call write_data_to_disk() for metadata write back. But the problem is, write_data_to_disk() will call btrfs_map_block() with rw = WRITE. By that btrfs_map_block() will always return all stripes, while in write_data_to_disk() we also iterate through each mirror of the range. This results above repeated writeback. [FIX] Fix this problem by completely remove @mirror argument from write_data_to_disk(). With extra comments to explicitly show that function will write to all mirrors. Reported-by: Shinichiro Kawasaki <shinichiro.kawasaki@wdc.com> Fixes: `2a93728391` ("btrfs-progs: use write_data_to_disk() to replace write_extent_to_disk()") Tested-by: Shin'ichiro Kawasaki <shinichiro.kawasaki@wdc.com> Reviewed-by: Johannes Thumshirn <johannes.thumshirn@wdc.com> Signed-off-by: Qu Wenruo <wqu@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2022-08-16 15:18:12 +02:00
Boris Burkov	ba7b281049	btrfs-progs: add VERITY ro compat flag This compat flag is missing, but is being checked by mount, and could well be present legitimately. Reviewed-by: Sweet Tea Dorminy <sweettea-kernel@dorminy.me> Reviewed-by: Josef Bacik <josef@toxicpanda.com> Signed-off-by: Boris Burkov <boris@bur.io> Signed-off-by: David Sterba <dsterba@suse.com>	2022-08-16 15:18:11 +02:00
Su Yue	d0a99313e5	btrfs-progs: save item data end in u64 to avoid overflow in btrfs_check_leaf() Similar to kernel check_leaf(), calling btrfs_item_end_nr() may get a reasonable value even an item has invalid offset/size due to u32 overflow. Fix it by use u64 variable to store item data end in btrfs_check_leaf() to avoid u32 overflow. Bugzilla: https://bugzilla.kernel.org/show_bug.cgi?id=215299 Reported-by: Wenqing Liu <wenqingliu0120@gmail.com> Signed-off-by: Su Yue <l@damenly.su> Signed-off-by: David Sterba <dsterba@suse.com>	2022-08-16 15:18:11 +02:00
Qu Wenruo	963188943f	btrfs-progs: make btrfs_super_block::log_root_transid deprecated This is the same on-disk format update synchronized from the kernel code. Unlike kernel, there are two callers reading this member: - btrfs inspect dump-super It's just printing the value, add a notice about deprecation. - btrfs-find-root In that case, since we always got 0, the root search for log root should never find a perfect match. Use btrfs_super_geneartion() + 1 to provide a better result. Signed-off-by: Qu Wenruo <wqu@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2022-08-16 15:18:11 +02:00
David Sterba	8356c423e6	btrfs-progs: receive: implement FILEATTR command The initial proposal for file attributes was built on simply doing SETFLAGS but this builds on an old and non-extensible interface that has no direct mapping for all inode flags. There's a unified interface fileattr that covers file attributes and xflags, it should be possible to add new bits. On the protocol level the value is copied as-is in the original inode but this does not provide enough information how to apply the bits on the receiving side. Eg. IMMUTABLE flag prevents any changes to the file and has to be handled manually. The receiving side does not apply the bits yet, only parses it from the stream. Signed-off-by: David Sterba <dsterba@suse.com>	2022-08-16 15:18:11 +02:00
Omar Sandoval	0ee5b22345	btrfs-progs: send: stream v2 ioctl flags First, add a --proto option to allow specifying the desired send protocol version. It defaults to one, the original version. In a couple of releases once people are aware that protocol revisions are happening, we can change it to default to zero, which means the latest version supported by the kernel. This is based on Dave Sterba's patch. Also add a --compressed-data flag to instruct the kernel to use encoded_write commands for compressed extents. This requires an explicit opt in separate from the protocol version because: 1. The user may not want compression on the receiving side, or may want a different compression algorithm/level on the receiving side. 2. It has a soft requirement for kernel support on the receiving side (btrfs-progs can fall back to decompressing and writing if the kernel doesn't support BTRFS_IOC_ENCODED_WRITE, but the user may not be prepared to pay that CPU cost). Going forward, since it's easier to update progs than the kernel, I think we'll want to make new send features that require kernel support opt-in, whereas anything that only requires a progs update can happen automatically. Signed-off-by: Boris Burkov <boris@bur.io> Signed-off-by: Omar Sandoval <osandov@fb.com> Signed-off-by: David Sterba <dsterba@suse.com>	2022-06-07 13:59:33 +02:00
Omar Sandoval	1c05b10008	btrfs-progs: receive: add send stream v2 commands and attributes Update our copy of send.h from the kernel. This adds the new commands and attributes for v2 as well as explicit enum numbering. Reviewed-by: Nikolay Borisov <nborisov@suse.com> Signed-off-by: Boris Burkov <boris@bur.io> Signed-off-by: Omar Sandoval <osandov@fb.com> Signed-off-by: David Sterba <dsterba@suse.com>	2022-06-07 13:59:32 +02:00
Boris Burkov	a82996e1b6	btrfs-progs: receive: dynamically allocate sctx->read_buf In send stream v2, write commands can now be an arbitrary size. For that reason, we can no longer allocate a fixed array in sctx for read_cmd. Instead, read_cmd dynamically allocates sctx->read_buf. To avoid needless reallocations, we reuse read_buf between read_cmd calls by also keeping track of the size of the allocated buffer in sctx->read_buf_sz. We do the first allocation of the old default size at the start of processing the stream, and we only reallocate if we encounter a command that needs a larger buffer. Signed-off-by: Boris Burkov <boris@bur.io> Signed-off-by: David Sterba <dsterba@suse.com>	2022-06-07 13:59:31 +02:00
David Sterba	0f65bf66be	btrfs-progs: libbtrfs: drop ifdef BTRFS_FLAT_INCLUDES where not necessary Headers that are only exported and not used for build do not need the BTRFS_FLAT_INCLUDES switch (between local and installed headers). Now that there are local copies of the shared headers drop the respective part from local headers. Signed-off-by: David Sterba <dsterba@suse.com>	2022-06-06 15:48:52 +02:00
Johannes Thumshirn	a7ae6d5948	btrfs-progs: zoned: add upper and lower zone size boundaries As we're not supporting arbitrarily big or small zone sizes in the kernel, reject devices that don't fit in progs as well. Signed-off-by: Johannes Thumshirn <johannes.thumshirn@wdc.com> Signed-off-by: David Sterba <dsterba@suse.com>	2022-06-06 15:47:50 +02:00
Qu Wenruo	38f90e906e	btrfs-progs: properly initialize block group thresholds [BUG] When creating btrfs with new v2 cache (the default behavior), mkfs.btrfs always create the free space tree using bitmap. It's fine for small fs, but will be a disaster if the device is large and the data profile is something like RAID0: $ mkfs.btrfs -f -m raid1 -d raid0 /dev/test/scratch[1234] btrfs-progs v5.17 [...] Block group profiles: Data: RAID0 4.00GiB Metadata: RAID1 256.00MiB System: RAID1 8.00MiB [..] $ btrfs ins dump-tree -t free-space /dev/test/scratch1 btrfs-progs v5.17 free space tree key (FREE_SPACE_TREE ROOT_ITEM 0) node 30441472 level 1 items 10 free space 483 generation 6 owner FREE_SPACE_TREE node 30441472 flags 0x1(WRITTEN) backref revision 1 fs uuid deddccae-afd0-4160-9a12-48fe7b526fb1 chunk uuid 68f6cf98-afe3-4f47-9797-37fd9c610219 key (1048576 FREE_SPACE_INFO 4194304) block 30457856 gen 6 key (475004928 FREE_SPACE_BITMAP 8388608) block 30703616 gen 5 key (953155584 FREE_SPACE_BITMAP 8388608) block 30720000 gen 5 key (1431306240 FREE_SPACE_BITMAP 8388608) block 30736384 gen 5 key (1909456896 FREE_SPACE_BITMAP 8388608) block 30752768 gen 5 key (2387607552 FREE_SPACE_BITMAP 8388608) block 30769152 gen 5 key (2865758208 FREE_SPACE_BITMAP 8388608) block 30785536 gen 5 key (3343908864 FREE_SPACE_BITMAP 8388608) block 30801920 gen 5 key (3822059520 FREE_SPACE_BITMAP 8388608) block 30818304 gen 5 key (4300210176 FREE_SPACE_BITMAP 8388608) block 30834688 gen 5 [...] ^^^ So many bitmaps that an empty fs will have two levels for free space tree already [CAUSE] Member btrfs_block_group::bitmap_high_thresh is never properly set to any value other than 0, thus in function update_free_space_extent_count(), the following check is always true: if (!(flags & BTRFS_FREE_SPACE_USING_BITMAPS) && extent_count > block_group->bitmap_high_thresh) { ret = convert_free_space_to_bitmaps(trans, block_group, path); Thus we always got converted to bitmaps. [FIX] Cross-port the function set_free_space_tree_thresholds() from kernel, and call that function in btrfs_make_block_group() and read_one_block_group() so that every block group has bitmap_high_thresh properly set. Now even for that 4GiB large data chunk, we still only have one free extent: btrfs-progs v5.17 free space tree key (FREE_SPACE_TREE ROOT_ITEM 0) leaf 30572544 items 15 free space 15860 generation 6 owner FREE_SPACE_TREE leaf 30572544 flags 0x1(WRITTEN) backref revision 1 fs uuid b24e52ea-6580-4a88-aa70-cb173090bfe3 chunk uuid d85f3905-fc61-4084-b335-2b6b97814b8e [...] item 13 key (298844160 FREE_SPACE_INFO 4294967296) itemoff 16235 itemsize 8 free space info extent count 1 flags 0 item 14 key (298844160 FREE_SPACE_EXTENT 4294967296) itemoff 16235 itemsize 0 free space extent Signed-off-by: Qu Wenruo <wqu@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2022-05-20 15:54:20 +02:00
Qu Wenruo	9bded24a46	btrfs-progs: do not use btrfs_commit_transaction() just to update super blocks There are several call sites utilizing btrfs_commit_transaction() just to update members in super blocks, without any metadata update. This can be problematic for some simple call sites, like zero_log_tree() or check_and_repair_super_num_devs(). If we have big problems preventing the fs to be mounted in the first place, and need to clear the log or super block size, but by some other problems in extent tree, we're unable to allocate new blocks. Then we fall into a deadlock that, we need to mount (even ro,rescue=all) to collect extra info, but btrfs-progs can not do any super block updates. Fix the problem by allowing the following super blocks only operations to be done without using btrfs_commit_transaction(): - btrfs_fix_super_size() - check_and_repair_super_num_devs() - zero_log_tree(). There are some exceptions in btrfstune.c, related to the csum type conversion and seed flags. In those btrfstune cases, we in fact wants to proper error report in btrfs_commit_transaction(), as those operations are not mount critical, and any early error can be helpful to expose any problems in the fs. Signed-off-by: Qu Wenruo <wqu@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2022-05-20 15:54:16 +02:00
David Sterba	f1178950d3	btrfs-progs: btrfstune: fix build-time detection of experimental features Qu noticed that the full checksums are still printed even if the experimental build is not enabled. This is caused by wrong use of #ifdef (as the macro is always defined), this must be "#if". Fixes: `1bb6fb896d` ("btrfs-progs: btrfstune: experimental, new option to switch csums") Reported-by: Qu Wenruo <wqu@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2022-05-10 15:42:13 +02:00
Qu Wenruo	50a5dfde6d	btrfs-progs: print-tree: print the checksum of header without tailing zeros For the default CRC32C checksum, print-tree now prints tons of unnecessary padding zeros: btrfs-progs v5.17 chunk tree leaf 22036480 items 7 free space 15430 generation 6 owner CHUNK_TREE leaf 22036480 flags 0x1(WRITTEN) backref revision 1 checksum stored 0ac1b9fa00000000000000000000000000000000000000000000000000000000 checksum calced 0ac1b9fa00000000000000000000000000000000000000000000000000000000 fs uuid 3d95b7e3-3ab6-4927-af56-c58aa634342e This is caused by commit `1bb6fb896d` ("btrfs-progs: btrfstune: experimental, new option to switch csums"), and it looks like most distros just enable EXPERIMENTAL features by default. (Which is a good thing to provide much better coverage). So here we just limit the csum print to the utilized csum size. Now the output looks like: btrfs-progs v5.17 chunk tree leaf 22036480 items 4 free space 15781 generation 6 owner CHUNK_TREE leaf 22036480 flags 0x1(WRITTEN) backref revision 1 checksum stored 676b812f checksum calced 676b812f fs uuid d11f8799-b6dc-415d-b1ed-cebe6da5f0b7 Fixes: `1bb6fb896d` ("btrfs-progs: btrfstune: experimental, new option to switch csums") Reviewed-by: Johannes Thumshirn <johannes.thumshirn@wdc.com> Signed-off-by: Qu Wenruo <wqu@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2022-05-10 13:44:37 +02:00
Qu Wenruo	851ef59b2c	btrfs-progs: remove the unused btrfs_fs_info::seeding member This member is not used by anyone, just remove it. Reviewed-by: Nikolay Borisov <nborisov@suse.com> Signed-off-by: Qu Wenruo <wqu@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2022-04-29 22:13:22 +02:00
Qu Wenruo	4e9e978783	btrfs-progs: allow read_data_from_disk() to rebuild RAID56 using P/Q This new ability is added by: - Allow btrfs_map_block() to return the chunk type This makes later work much easier - Only reset stripe offset inside btrfs_map_block() when needed Currently if @raid_map is not NULL, btrfs_map_block() will consider this call is for WRITE and will reset stripe offset. This is no longer the case, as for RAID56 read with mirror_num 1/0, we will still call btrfs_map_block() with non-NULL raid_map. Add a small check to make sure we won't reset stripe offset for mirror 1/0 read. - Add new helper read_raid56() to handle rebuild We will read the full stripe (including all data and P/Q stripes) do the rebuild, then only copy the refered part to the caller. There is a catch for RAID6, we have no way to exhaust all combination, so the current repair will assume the mirror = 0 data is corrupted, then try to find a missing device. But if no missing device can be found, it will assume P is corrupted. This is just a guess, and can to totally wrong, but we have no better idea. Now btrfs-progs have full read ability for RAID56. Signed-off-by: Qu Wenruo <wqu@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2022-04-25 19:08:30 +02:00
Qu Wenruo	a99bece1cd	btrfs-progs: remove extent_buffer::fd and extent_buffer::dev_bytes Those two members are a shortcut for non-RAID56 profiles. But we should not use such shortcut, and move all our logical address read/write to the unified read_data_from_disk()/write_data_to_disk(). With previous refactors, now we're safe to remove them. Signed-off-by: Qu Wenruo <wqu@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2022-04-25 19:08:30 +02:00
Qu Wenruo	3ff9d35257	btrfs-progs: use read_data_from_disk() to replace read_extent_from_disk() and replace read_extent_data() The function read_extent_from_disk() is only a wrapper to read tree block. And read_extent_data() is just a while loop to eliminate short read caused by stripe boundary. In fact, a lot of call sites of read_extent_data() are either reading metadata (thus no possible short read) or doing extra loop by themselves. This patch will replace those two functions with read_data_from_disk(), making it the only entrance for data/metadata read. And update read_data_from_disk() to return the read bytes, so caller can do a simple while loop. For the few callers of read_extent_data(), open-code a small while loop for them. This will allow later RAID56 read repair using P/Q much easier. Signed-off-by: Qu Wenruo <wqu@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2022-04-25 19:08:30 +02:00
Qu Wenruo	2a93728391	btrfs-progs: use write_data_to_disk() to replace write_extent_to_disk() Function write_extent_to_disk() is just writing the content of a tree block to disk. It can not handle RAID56, and its work is the same as write_data_to_disk(). Thus we can replace write_extent_to_disk() with write_data_to_disk() easily. There is only one special call site in write_raid56_with_parity(), which can easily be replace with btrfs_pwrite() directly. This reduce the write entrance, and make later eb::fd removal easier. Signed-off-by: Qu Wenruo <wqu@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2022-04-25 19:08:29 +02:00
Qu Wenruo	01c25d73f1	btrfs-progs: extract metadata restore read code into its own helper For metadata restore, our logical address is mapped to a single device with logical address 1:1 mapped to device physical address. Move this part of code into a helper, this will make later extent buffer read path refactoer much easier. Signed-off-by: Qu Wenruo <wqu@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2022-04-25 19:07:09 +02:00
Qu Wenruo	7a0c4b5dc1	btrfs-progs: remove the unnecessary BTRFS_SUPER_INFO_OFFSET path for tree block read We used to use read_whole_eb() to read super block, but it's no longer the case (so long that I can not even find out which commit did the conversion). Thus there is no need for read_whole_eb() to handle super block read anymore. Signed-off-by: Qu Wenruo <wqu@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2022-04-25 19:07:08 +02:00
Qu Wenruo	f9659c7235	btrfs-progs: fix an error path which can lead to empty device list [BUG] With the incoming delayed chunk item insertion feature, there is a super weird failure at mkfs/022: ====== RUN CHECK ./mkfs.btrfs -f --rootdir tmp.KnKpP5 -d dup -b 350M tests/test.img ... Checksum: crc32c Number of devices: 0 Devices: ID SIZE PATH Note the "Number of devices: 0" line, this means our fs_info->fs_devices->devices list is empty. And since our rw device list is empty, we won't finish the mkfs with proper superblock magic, and cause later btrfs check to fail. [CAUSE] Although the failure is only triggered by the incoming delayed chunk item insertion feature, the bug itself is here for a while. In btrfs_alloc_chunk(), we move rw devices to our @private_devs list first, then in create_chunk(), we move it back to our rw devices list. This dance is pretty dangerous, especially if btrfs_alloc_dev_extent() failed inside create_chunk(), and current profile have multiple stripes (including DUP), we will exit create_chunk() directly, without moving the remaining devices in @private_devs list back to @dev_list. Furthermore, btrfs_alloc_chunk() is expected to return -ENOSPC, as we call btrfs_alloc_chunk() to pre-allocate chunks, and ignore the -ENOSPC error if it's just a pre-allocation failure. This existing error path can lead to the empty rw list seen above. [FIX] After create_chunk(), unconditionally move all devices in @private_devs back to rw device list. And add extra check to make sure our rw device list is never empty after a chunk allocation attempt. Reviewed-by: Josef Bacik <josef@toxicpanda.com> Reviewed-by: Johannes Thumshirn <johannes.thumshirn@wdc.com> Signed-off-by: Qu Wenruo <wqu@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2022-04-25 18:33:29 +02:00
Qu Wenruo	4a940ab2c0	btrfs-progs: fix a memory leak when starting a transaction on fs with error Function btrfs_start_transaction() will allocate the memory unconditionally, but if the fs has an aborted transaction we don't free the allocated memory but return error directly. Fix it by only allocate the new memory after all the checks. Reviewed-by: Johannes Thumshirn <johannes.thumshirn@wdc.com> Signed-off-by: Qu Wenruo <wqu@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2022-04-25 18:32:17 +02:00
Naohiro Aota	fd4bab06a4	btrfs-progs: zoned: fix and simplify dev_extent_hole_check_zoned() The previous patch revealed a bug in dev_extent_hole_check_zoned(). If the given hole is OK to use as is, it should have just returned the hole. But on the contrary, it shifts the hole start position by one zone. That results in refusing any hole. We don't use btrfs_ensure_empty_zones() in the btrfs-progs version of dev_extent_hole_check_zoned() unlike the kernel side, because btrfs_find_allocatable_zones() itself is doing the necessary checks. So, we can just "return changed" if the "pos" is unchanged. That also makes the loop and "changed" variable unnecessary. So, fix and simplify the code in one shot. Signed-off-by: Naohiro Aota <naohiro.aota@wdc.com> Signed-off-by: David Sterba <dsterba@suse.com>	2022-04-08 23:17:35 +02:00
Naohiro Aota	38670212dd	btrfs-progs: fix ordering of hole_size setting and dev_extent_hole_check() The hole_size is used by dev_extent_hole_check() to check the hole is OK as a device extent. However, commit `b031fe84fd` ("btrfs-progs: zoned: implement zoned chunk allocator") mis-ported the kernel code and placed dev_extent_hole_check() before setting hole_check. That made the dev_extent_hole_check() call here essentially pass through as we have hole_size == 0 on mkfs time. As a result, mkfs.btrfs creates data BG at 64 MB where the regular superblock exists, when zone size is 16 MB. Fix the ordering of hole_size setting and calling dev_extent_hole_check(). Signed-off-by: Naohiro Aota <naohiro.aota@wdc.com> Signed-off-by: David Sterba <dsterba@suse.com>	2022-04-08 23:17:35 +02:00
Naohiro Aota	32c43d0c68	btrfs-progs: zoned: export sb_zone_number() and related constants Move sb_zone_number() and related constants from zoned.c to the corresponding header for later use. Signed-off-by: Naohiro Aota <naohiro.aota@wdc.com> Signed-off-by: David Sterba <dsterba@suse.com>	2022-04-08 23:17:35 +02:00
Sweet Tea Dorminy	c494724858	btrfs-progs: dump-tree: add print support for verity items 'btrfs inspect-internals dump-tree' doesn't currently know about the two types of verity items and prints them as 'UNKNOWN.36' or 'UNKNOWN.37'. So add them to the known item types. Suggested-by: Boris Burkov <boris@bur.io> Reviewed-by: Josef Bacik <josef@toxicpanda.com> Signed-off-by: Sweet Tea Dorminy <sweettea-kernel@dorminy.me> Signed-off-by: David Sterba <dsterba@suse.com>	2022-03-24 00:49:19 +01:00
Josef Bacik	02fb308bdc	btrfs-progs: make btrfs_create_tree take a key for the root key We're going to start create global roots from mkfs, and we need to have a offset set for the root key. Make the btrfs_create_tree() take a key for the root_key instead of just the objectid so we can setup these new style roots properly. Signed-off-by: Josef Bacik <josef@toxicpanda.com> Signed-off-by: David Sterba <dsterba@suse.com>	2022-03-09 18:07:22 +01:00
Josef Bacik	5fb27deaf1	btrfs-progs: make btrfs_clear_free_space_tree extent tree v2 aware With extent tree v2 we'll have multiple free space trees, and we can't just unset the feature flags for the free space tree. Fix this to loop through all of the free space trees and clear them out properly. Signed-off-by: Josef Bacik <josef@toxicpanda.com> Signed-off-by: David Sterba <dsterba@suse.com>	2022-03-09 18:07:21 +01:00
Josef Bacik	c4164edeb5	btrfs-progs: add a btrfs_delete_and_free_root helper The free space tree code already does this, but we need it for cleaning up per block group roots. Abstract this code out into a helper so that we can use it in multiple places in the future. Signed-off-by: Josef Bacik <josef@toxicpanda.com> Signed-off-by: David Sterba <dsterba@suse.com>	2022-03-09 18:07:19 +01:00
Josef Bacik	e33738306c	btrfs-progs: handle the per-block group global root id We will now be using block_group->chunk_objectid to point at the global root id for this particular block group. For now we'll assign this based on mod'ing the offset of the block group against the number of global root id's and handle the block_group_item updating appropriately. Signed-off-by: Josef Bacik <josef@toxicpanda.com> Signed-off-by: David Sterba <dsterba@suse.com>	2022-03-09 18:07:17 +01:00
Josef Bacik	27eaa3b514	btrfs-progs: set the number of global roots in the super block In order to make sure the file system is consistent we need to record the number of global roots we should have in the super block. We could infer this from the number of global roots we find, however this could lead to interesting fuzzing problems, so add a source of truth to the super block in order to make it easier to verify the file system is consistent. Signed-off-by: Josef Bacik <josef@toxicpanda.com> Signed-off-by: David Sterba <dsterba@suse.com>	2022-03-09 18:07:14 +01:00
Josef Bacik	c7a8363276	btrfs-progs: handle no bg item in extent tree for free space tree We have an ASSERT(ret == 0) when populating the free space tree as we should at least find the block group item with extent tree v1. However with v2 we no longer have the block group item in the extent tree, so fix the population logic to handle an empty block group (which occurs during mkfs) and only assert if ret != 0 and we don't have extent tree v2 turned on. Signed-off-by: Josef Bacik <josef@toxicpanda.com> Signed-off-by: David Sterba <dsterba@suse.com>	2022-03-09 18:07:00 +01:00
Josef Bacik	cab7570ccc	btrfs-progs: add print support for the block group tree Add the appropriate support to the print tree and dump tree code to spit out the block group tree. Signed-off-by: Josef Bacik <josef@toxicpanda.com> Signed-off-by: David Sterba <dsterba@suse.com>	2022-03-09 18:06:54 +01:00
Josef Bacik	9ee6cc78a8	btrfs-progs: add support for loading the block group root This adds the ability to load the block group root, as well as make sure the various backup super block and super block updates are made appropriately. Signed-off-by: Josef Bacik <josef@toxicpanda.com> Signed-off-by: David Sterba <dsterba@suse.com>	2022-03-09 18:06:51 +01:00
Josef Bacik	4f184cc911	btrfs-progs: make all of the item/key_ptr offset helpers take an eb When we change the size of the btrfs_header we're going to need to change how these helpers calculate where to find the start of items or block ptrs. To prepare for that make these helpers take the extent_buffer as an argument so we can do the appropriate math based on the version type. Reviewed-by: Nikolay Borisov <nborisov@suse.com> Signed-off-by: Josef Bacik <josef@toxicpanda.com> Signed-off-by: David Sterba <dsterba@suse.com>	2022-03-09 15:13:14 +01:00
Josef Bacik	aba83381a5	btrfs-progs: rework the btrfs_node accessors to match the item accessors We are duplicating the offsetof(btrfs_node, key_ptr) logic everywhere, instead use the helper to do this work for us, and make all the node accessors use the helper. Reviewed-by: Nikolay Borisov <nborisov@suse.com> Signed-off-by: Josef Bacik <josef@toxicpanda.com> Signed-off-by: David Sterba <dsterba@suse.com>	2022-03-09 15:13:14 +01:00
Josef Bacik	53654db311	btrfs-progs: replace btrfs_item_nr_offset(0) btrfs_item_nr_offset(0) is simply offsetof(struct btrfs_leaf, items), which is the same thing as btrfs_leaf_data(), so replace all calls of btrfs_item_nr_offset(0) with btrfs_leaf_data(). Reviewed-by: Nikolay Borisov <nborisov@suse.com> Signed-off-by: Josef Bacik <josef@toxicpanda.com> Signed-off-by: David Sterba <dsterba@suse.com>	2022-03-09 15:13:14 +01:00
Josef Bacik	5dc3964aaa	btrfs-progs: remove the _nr from the item helpers Now that all callers are using the _nr variations we can simply rename these helpers to btrfs_item_##member/btrfs_set_item_##member and change the actual item SETGET funcs to raw_item_##member/set_raw_item_##member and then change all callers to drop the _nr part. Reviewed-by: Nikolay Borisov <nborisov@suse.com> Signed-off-by: Josef Bacik <josef@toxicpanda.com> Signed-off-by: David Sterba <dsterba@suse.com>	2022-03-09 15:13:13 +01:00
Josef Bacik	f3be0ff01a	btrfs-progs: rename btrfs_item_end_nr to btrfs_item_end All callers use the btrfs_item_end_nr() variation, simply drop btrfs_item_end() and make btrfs_item_end_nr() use the _nr() variations of the item get helpers. Reviewed-by: Nikolay Borisov <nborisov@suse.com> Signed-off-by: Josef Bacik <josef@toxicpanda.com> Signed-off-by: David Sterba <dsterba@suse.com>	2022-03-09 15:13:13 +01:00
Josef Bacik	49539423fa	btrfs-progs: change btrfs_file_extent_inline_item_len to take a slot This matches how the kernel does it, simply pass in the slot and fix up btrfs_file_extent_inline_item_len to use the btrfs_item_nr() helper and the correct define. Fixup all the callers to use the slot now instead of passing in the btrfs_item. Reviewed-by: Nikolay Borisov <nborisov@suse.com> Signed-off-by: Josef Bacik <josef@toxicpanda.com> Signed-off-by: David Sterba <dsterba@suse.com>	2022-03-09 15:13:13 +01:00
Josef Bacik	04ffea07e4	btrfs-progs: add btrfs_set_item__nr() helpers We have a lot of the following patterns item = btrfs_item_nr(nr); btrfs_set_item_(eb, item, val); btrfs_set_item_*(eb, btrfs_item_nr(nr), val); in a lot of places in our code. Instead add _nr variations of these helpers and convert all of the users to this new helper. Reviewed-by: Nikolay Borisov <nborisov@suse.com> Signed-off-by: Josef Bacik <josef@toxicpanda.com> Signed-off-by: David Sterba <dsterba@suse.com>	2022-03-09 15:13:13 +01:00
Josef Bacik	80ff3f2c8e	btrfs-progs: btrfs_item_size_nr/btrfs_item_offset_nr everywhere We have this pattern in a lot of places item = btrfs_item_nr(slot); btrfs_item_size(leaf, item); btrfs_item_offset(leaf, item); when we could simply use btrfs_item_size_nr(leaf, slot); btrfs_item_offset_nr(leaf, slot); Fix all callers of btrfs_item_size() and btrfs_item_offset() to use the _nr variation of the helpers. Reviewed-by: Nikolay Borisov <nborisov@suse.com> Signed-off-by: Josef Bacik <josef@toxicpanda.com> Signed-off-by: David Sterba <dsterba@suse.com>	2022-03-09 15:13:13 +01:00
Josef Bacik	ed098523dc	btrfs-progs: reduce usage of __BTRFS_LEAF_DATA_SIZE This helper only takes the nodesize, but in the future it'll take a bool to indicate if we're extent tree v2. The remaining users are all where we only have extent_buffer, but we should always have a valid eb->fs_info in these cases, so add BUG_ON()'s for the !eb->fs_info case and then convert these callers to use BTRFS_LEAF_DATA_SIZE which takes the fs_info. Reviewed-by: Nikolay Borisov <nborisov@suse.com> Signed-off-by: Josef Bacik <josef@toxicpanda.com> Signed-off-by: David Sterba <dsterba@suse.com>	2022-03-09 15:13:13 +01:00
Josef Bacik	6cf2da1f6f	btrfs-progs: store BTRFS_LEAF_DATA_SIZE in the fs_info This is going to be a different value based on the incompat settings of the file system, just store this in the fs_info instead of calculating it every time. Reviewed-by: Nikolay Borisov <nborisov@suse.com> Signed-off-by: Josef Bacik <josef@toxicpanda.com> Signed-off-by: David Sterba <dsterba@suse.com>	2022-03-09 15:13:12 +01:00
Josef Bacik	ee21714e21	btrfs-progs: don't check skip_csum_check if there's no fs_info The chunk recover code passes in a buffer it allocates with metadata but no fs_info, causing fuzz-test 008 to segfault. Fix this test to only check the flag if we have buf->fs_info set. Signed-off-by: Josef Bacik <josef@toxicpanda.com> Signed-off-by: David Sterba <dsterba@suse.com>	2022-03-08 18:18:01 +01:00
Josef Bacik	401690ff68	btrfs-progs: properly populate missing trees With my global roots prep patches I regressed us on handling the case where we didn't find a root at all. In this case we need to return an error (prior we returned -ENOENT) or we need to populate a dummy tree if we have OPEN_CTREE_PARTIAL set. This fixes a segfault of fuzz test 006. Signed-off-by: Josef Bacik <josef@toxicpanda.com> Signed-off-by: David Sterba <dsterba@suse.com>	2022-03-08 18:18:01 +01:00
Josef Bacik	29beeb1a30	btrfs-progs: do not try to load the free space tree if it's not enabled We were previously getting away with this because the load_global_roots() treated ENOENT like everything was a-ok. However that was a bug and fixing that bug uncovered a problem where we were unconditionally trying to load the free space tree. Fix that by skipping the load if we do not have the compat bit set. Signed-off-by: Josef Bacik <josef@toxicpanda.com> Signed-off-by: David Sterba <dsterba@suse.com>	2022-03-08 18:18:01 +01:00
David Sterba	1bb6fb896d	btrfs-progs: btrfstune: experimental, new option to switch csums This is still work in progress but can survive some stress testing. There are still some sanity checks missing, do not user this on valuable data. To enables this, configure must be run with the experimental features enabled. $ mkfs.btrfs --csum crc32c /dev/sdx $ <mount, fill with data, unmount> $ btrfstune --csum sha256 Will change the checksum to sha256. Implementation: - set bit on superblock when the checksums are being changed (similar to the uuid rewrite) - metadata checksums are overwritten in place - data checksums: - the checksum tree is completely deleted and no checksums are verified - data blocks are enumerated and all checksums generated (same as check --init-csum-tree) To make it usable, it should be restartable and track the current progress somehow. Also the previous data checksums should be verified any time they're available. Signed-off-by: David Sterba <dsterba@suse.com>	2022-03-08 18:10:03 +01:00
David Sterba	471ca4a580	btrfs-progs: build: add stub definition for non-zoned build In commit `88895a920f` ("btrfs-progs: use profile_supported in mkfs as well") there's a wrapper but not available on non-zoned builds. Add it. Issue: #445 Signed-off-by: David Sterba <dsterba@suse.com>	2022-02-16 22:48:01 +01:00
Johannes Thumshirn	89191f8c12	btrfs-progs: pass in block-group type to zoned_profile_supported Pass BTRFS_BLOCK_GROUP_DATA and BTRFS_BLOCK_GROUP_METADATA to zoned_profile_supported(), so we can actually distinguish if it is a data or a meta-data block group. Fixes: 8f914d518a46 ("btrfs-progs: zoned support DUP on metadata block groups") Signed-off-by: Johannes Thumshirn <johannes.thumshirn@wdc.com> Signed-off-by: David Sterba <dsterba@suse.com>	2022-02-16 22:48:01 +01:00
Josef Bacik	532bf58b5b	btrfs-progs: sanity check global roots key.offset For !extent tree v2 we should validate the key.offset == 0, and for extent tree v2 we should validate that key.offset < nr_global_roots. If this fails we need to fail to load the global root so that the appropriate action is taken. Signed-off-by: Josef Bacik <josef@toxicpanda.com> Signed-off-by: David Sterba <dsterba@suse.com>	2022-02-16 22:48:01 +01:00
Johannes Thumshirn	f77da2b173	btrfs-progs: zoned support DUP on metadata block groups Support using BTRFS_BLOCK_GROUP_DUP on metadata (and system) block groups. Signed-off-by: Johannes Thumshirn <johannes.thumshirn@wdc.com> Signed-off-by: David Sterba <dsterba@suse.com>	2022-02-01 18:41:53 +01:00
Johannes Thumshirn	88895a920f	btrfs-progs: use profile_supported in mkfs as well Currently we have two places checking if a block-group profile is supported on a zoned device, one in mkfs/main.c and one in kernel-shared/zoned.c. Use the one from kernel-shared/zoned.c in mkfs as well, unifying all checks. Signed-off-by: Johannes Thumshirn <johannes.thumshirn@wdc.com> Signed-off-by: David Sterba <dsterba@suse.com>	2022-02-01 18:41:51 +01:00
Su Yue	e40f476bbb	btrfs-progs: make generic_err print physical address of extent buffer Unlike kernel, we have cached physical address of extent_buffer in dev_bytenr. Print it for better debug experience. Signed-off-by: Su Yue <l@damenly.su> Signed-off-by: David Sterba <dsterba@suse.com>	2022-02-01 18:41:49 +01:00
Qu Wenruo	08823f7512	btrfs-progs: backref: properly queue indirect refs [BUG] When calling iterate_extent_inodes() on data extents with indirect ref (with inline or keyed EXTENT_DATA_REF_KEY), it will fail to execute the call back function at all. [CAUSE] In function find_parent_nodes(), we only add the target tree block if a backref has @parent populated. For indirect backref like EXTENT_DATA_REF_KEY, we rely on __resolve_indirect_ref() to get the parent leaves. However __resolve_indirect_ref() only grabs backrefs from &prefstate->pending_indirect_refs. Meaning callers should queue any indirect backref to pending_indirect_refs. But unfortunately in __add_prelim_ref() and __add_missing_keys(), none of them properly queue the indirect backrefs to pending_indirect_refs, but directly to pending. Making all indirect backrefs never got resolved, thus no callback function executed [FIX] Fix __add_prelim_ref() and __add_missing_keys() to properly queue indirect backrefs to the correct list. Currently there is no such direct user in btrfs-progs, but later csum tree re-initialization code will rely this to do proper csum re-calculate (to avoid preallocated/nodatasum extents). Signed-off-by: Qu Wenruo <wqu@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2022-02-01 18:41:09 +01:00
Josef Bacik	5e8a779f5c	btrfs-progs: add on disk pointers to global tree ids We are going to start creating multiple sets of global trees, which at the moment are the free space tree, csum tree, and extent tree. Generally we will assign these at block group creation time, but Dave would like to be able to have them per-subvolume at some point, so reserve a slot for that as well. Signed-off-by: Josef Bacik <josef@toxicpanda.com> Signed-off-by: David Sterba <dsterba@suse.com>	2021-11-30 19:09:47 +01:00
Josef Bacik	ec0eaae673	btrfs-progs: add definitions for the block group tree Add the on disk definitions for the block group tree. This will be part of the super block so we need to add the appropriate helpers to the super block, as well as adding it to the backup roots. Signed-off-by: Josef Bacik <josef@toxicpanda.com> Signed-off-by: David Sterba <dsterba@suse.com>	2021-11-30 19:08:39 +01:00
Josef Bacik	3337b7993b	btrfs-progs: common: allow users to select extent-tree-v2 option We want to enable developers to test the extent tree v2 features as they are added, add the ability to mkfs an extent tree v2 fs if we have experimental enabled. Signed-off-by: Josef Bacik <josef@toxicpanda.com> Signed-off-by: David Sterba <dsterba@suse.com>	2021-11-30 19:07:34 +01:00
Josef Bacik	b057607325	btrfs-progs: track csum, extent, and free space trees in a rb tree We are going to have multiples of these trees with extent tree v2, so add a rb tree to track them based on their root key value. This works for both v1 and v2, so we can remove the direct pointers to these roots in our fs_info. Signed-off-by: Josef Bacik <josef@toxicpanda.com> Signed-off-by: David Sterba <dsterba@suse.com>	2021-11-30 18:57:25 +01:00
Josef Bacik	0b23744de5	btrfs-progs: stop accessing ->free_space_root directly We're going to have multiple free space roots in the future, so access it via a helper in most cases. We will address the remaining direct accesses in future patches. Signed-off-by: Josef Bacik <josef@toxicpanda.com> Signed-off-by: David Sterba <dsterba@suse.com>	2021-11-30 18:57:19 +01:00
Josef Bacik	db2ab47823	btrfs-progs: stop accessing ->extent_root directly When we switch to multiple global trees we'll need to access the appropriate extent root depending on the block group or possibly root. To handle this, use a helper in most places and then the actual root in places where it is required. We will whittle down the direct accessors with future patches, but this does the bulk of the preparatory work. Signed-off-by: Josef Bacik <josef@toxicpanda.com> Signed-off-by: David Sterba <dsterba@suse.com>	2021-11-30 18:56:54 +01:00
Josef Bacik	550fd48136	btrfs-progs: move btrfs_fix_block_accounting to repair.c We have this helper sitting in extent-tree.c, but it's a repair function. I'm going to need to make changes to this for extent-tree-v2 and would rather this live outside of the code we need to share with the kernel. Signed-off-by: Josef Bacik <josef@toxicpanda.com> Signed-off-by: David Sterba <dsterba@suse.com>	2021-11-22 21:45:37 +01:00
Josef Bacik	639b1fc2e7	btrfs-progs: stop accessing ->csum_root directly With extent tree v2 we will have per-block group checksums, so add a helper to access the csum root and rename the fs_info csum_root to _csum_root to catch all the places that are accessing it directly. Convert everybody to use the helper except for internal things. Signed-off-by: Josef Bacik <josef@toxicpanda.com> Signed-off-by: David Sterba <dsterba@suse.com>	2021-11-22 21:45:37 +01:00
Josef Bacik	08b63c0fc5	btrfs-progs: stop passing root to csum related functions We are going to need to start looking up the csum root based on the bytenr with extent tree v2. To that end stop passing the root to the csum related functions so that can be done in the helper functions themselves. There's an unrelated deletion of a function prototype that no longer exists. Reviewed-by: Qu Wenruo <wqu@suse.com> Signed-off-by: Josef Bacik <josef@toxicpanda.com> Signed-off-by: David Sterba <dsterba@suse.com>	2021-11-22 21:45:37 +01:00
Josef Bacik	e968675f0a	btrfs-progs: add a helper for setting up a root node We use this pattern in a few places, and will use it more with different roots in the future. Extract out this helper to read the root nodes. There is a behavior change here in that we're now checking the root levels, whereas before we were not. Signed-off-by: Josef Bacik <josef@toxicpanda.com> Signed-off-by: David Sterba <dsterba@suse.com>	2021-11-22 21:45:37 +01:00
Josef Bacik	7119dc3d79	btrfs-progs: simplify btrfs_make_block_group This is doing the same work as insert_block_group_item, rework it to call the helper instead. Reviewed-by: Qu Wenruo <wqu@suse.com> Reviewed-by: Anand Jain <anand.jain@oracle.com> Signed-off-by: Josef Bacik <josef@toxicpanda.com> Signed-off-by: David Sterba <dsterba@suse.com>	2021-11-22 21:45:37 +01:00
Qu Wenruo	c4ff87c3d1	btrfs-progs: cache csum_size and csum_type in btrfs_fs_info Just like kernel commit 22b6331d9617 ("btrfs: store precalculated csum_size in fs_info"), we can cache csum_size and csum_type in btrfs_fs_info. Furthermore, there is already a 32 bits hole in btrfs_fs_info, and we can fit csum_type and csum_size into the hole without increase the size of btrfs_fs_info. Signed-off-by: Qu Wenruo <wqu@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2021-11-05 12:50:03 +01:00
Qu Wenruo	636b2e6027	btrfs-progs: remove temporary buffer for super block There are a lot of call sites where we use the following code snippet: u8 super_block_data[BTRFS_SUPER_INFO_SIZE]; struct btrfs_super_block sb; u64 ret; sb = (struct btrfs_super_block )super_block_data; The reason for this is, structure btrfs_super_block was smaller than BTRFS_SUPER_INFO_SIZE. Thus for anything with csum involved, we have to use a proper 4K buffer. Since the recent unification of sizeof(struct btrfs_super_block), we no longer need such workaround, and can use struct btrfs_super_block directly to do any operation. Signed-off-by: Qu Wenruo <wqu@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2021-11-05 12:50:03 +01:00
Qu Wenruo	76f1a2ed57	btrfs-progs: unify size of btrfs_super_block and BTRFS_SUPER_INFO_SIZE Just like kernel change, pad struct btrfs_super_block to 4096 bytes. As ctree.h is part of public headers, use raw number for the superblock offset. Signed-off-by: Qu Wenruo <wqu@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2021-11-05 12:50:03 +01:00
Wang Yugui	e9a7efb1b6	btrfs-progs: fix XX_flags_to_str() to always end with '\0' [BUG] We noticed 'btrfs check' outputs something like leaf 30408704 flags 0x0(P1逅?) backref revision 1 but we expected: leaf 30408704 flags 0x0() backref revision 1 [CAUSE] Some XX_flags_to_str() failed to make sure the result string always ends with '\0' in some case. [FIX] Reset the buffer at the beginnig. Reviewed-by: Qu Wenruo <wqu@suse.com> Signed-off-by: Wang Yugui (wangyugui@e16-tech.com) Signed-off-by: David Sterba <dsterba@suse.com>	2021-11-05 12:50:03 +01:00
David Sterba	4882a4f5fa	btrfs-porgs: add exception for upper case single profile name For consistency with older versions switch the case of 'single' to be lower case again even if it's inconsistent. This could be revisited in the future. Signed-off-by: David Sterba <dsterba@suse.com>	2021-11-05 12:50:03 +01:00
Qu Wenruo	b1c944657c	btrfs-progs: make "btrfs filesystem df" command show upper case profile [BUG] Since commit `dad03fac3b` ("btrfs-progs: switch btrfs_group_profile_str to use raid table"), fstests/btrfs/023 and btrfs/151 will always fail. The failure of btrfs/151 explains the reason pretty well: btrfs/151 1s ... - output mismatch --- tests/btrfs/151.out 2019-10-22 15:18:14.068965341 +0800 +++ ~/xfstests-dev/results//btrfs/151.out.bad 2021-11-02 17:13:43.879999994 +0800 @@ -1,2 +1,2 @@ QA output created by 151 -Data, RAID1 +Data, raid1 ... (Run 'diff -u ~/xfstests-dev/tests/btrfs/151.out ~/xfstests-dev/results//btrfs/151.out.bad' to see the entire diff) [CAUSE] Commit `dad03fac3b` ("btrfs-progs: switch btrfs_group_profile_str to use raid table") will use btrfs_raid_array[index].raid_name, which is all lower case. [FIX] There is no need to bring such output format change. So here we split the btrfs_raid_attr::raid_name[] into upper_name[] and lower_name[], and make upper and lower case helpers for callers to use. Now there are several types of callers referring to lower_name and upper_name: - parse_bg_profile() It uses strcasecmp(), either case would be fine. - btrfs_group_profile_str() Originally it uses upper case for all profiles except "single". Now unified to upper case. - sprint_profiles() It uses lower case. - bg_flags_to_str() It uses upper case. Reviewed-by: Nikolay Borisov <nborisov@suse.com> Signed-off-by: Qu Wenruo <wqu@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2021-11-05 12:50:03 +01:00
Qu Wenruo	5bee5c99bf	btrfs-progs: fix printf formats on 32bit x86 When compiling btrfs-progs on 32bit x86 using GCC 11.1.0, there are several warnings: In file included from ./common/utils.h:30, from check/main.c:36: check/main.c: In function 'run_next_block': ./common/messages.h:42:31: warning: format '%lu' expects argument of type 'long unsigned int', but argument 4 has type 'u32' {aka 'unsigned int'} [-Wformat=] 42 \| __btrfs_error((fmt), ##__VA_ARGS__); \ \| ^~~~~ check/main.c:6496:33: note: in expansion of macro 'error' 6496 \| error( \| ^~~~~ In file included from ./common/utils.h:30, from kernel-shared/volumes.c:32: kernel-shared/volumes.c: In function 'btrfs_check_chunk_valid': ./common/messages.h:42:31: warning: format '%lu' expects argument of type 'long unsigned int', but argument 4 has type 'u32' {aka 'unsigned int'} [-Wformat=] 42 \| __btrfs_error((fmt), ##__VA_ARGS__); \ \| ^~~~~ kernel-shared/volumes.c:2052:17: note: in expansion of macro 'error' 2052 \| error("invalid chunk item size, have %u expect [%zu, %lu)", \| ^~~~~ image/main.c: In function 'search_for_chunk_blocks': ./common/messages.h:42:31: warning: format '%lu' expects argument of type 'long unsigned int', but argument 3 has type 'size_t' {aka 'unsigned int'} [-Wformat=] 42 \| __btrfs_error((fmt), ##__VA_ARGS__); \ \| ^~~~~ image/main.c:2122:33: note: in expansion of macro 'error' 2122 \| error( \| ^~~~~ There are two types of problems: - __BTRFS_LEAF_DATA_SIZE() This macro has no type definition, making it behaves differently on different arches. Fix this by following kernel to use inline function to make its return value fixed to u32. - size_t related output For x86_64 %lu is OK but not for x86. Fix this by using %zu. Signed-off-by: Qu Wenruo <wqu@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2021-11-05 12:50:03 +01:00
Wang Yugui	b1d8f945c9	btrfs-progs: mask out all unwanted profiles in btrfs_group_profile_str Commit ("btrfs-progs: switch btrfs_group_profile_str to use raid table") introduced a regression that raid profile of GlobalReserve will be printed as 'unknown'. $ btrfs filesystem df /mnt/test Data, single: total=5.02TiB, used=4.98TiB System, single: total=4.00MiB, used=624.00KiB Metadata, single: total=11.01GiB, used=6.94GiB GlobalReserve, unknown: total=512.00MiB, used=0.00B Fix it by: - take BTRFS_BLOCK_GROUP_RESERVED into account when masking the block group flags - update the define of BTRFS_BLOCK_GROUP_RESERVED too so it's same as in kernel Reviewed-by: Qu Wenruo <wqu@suse.com> Signed-off-by: Wang Yugui <wangyugui@e16-tech.com> Signed-off-by: David Sterba <dsterba@suse.com>	2021-11-05 12:50:03 +01:00
Qu Wenruo	8f81113021	btrfs-progs: check: fix a lowmem mode crash where fatal error is not properly handled [BUG] When a special image (diverted from fsck/012) has its unused slots (slot number >= nritems) with garbage, lowmem mode btrfs check can crash: (gdb) run check --mode=lowmem ~/downloads/good.img.restored Starting program: /home/adam/btrfs/btrfs-progs/btrfs check --mode=lowmem ~/downloads/good.img.restored ... ERROR: root 5 INODE[5044031582654955520] nlink(257228800) not equal to inode_refs(0) ERROR: root 5 INODE[5044031582654955520] nbytes 474624 not equal to extent_size 0 Program received signal SIGSEGV, Segmentation fault. 0x0000555555639b11 in btrfs_inode_size (eb=0x5555558a7540, s=0x642e6cd1) at ./kernel-shared/ctree.h:1703 1703 BTRFS_SETGET_FUNCS(inode_size, struct btrfs_inode_item, size, 64); (gdb) bt #0 0x0000555555639b11 in btrfs_inode_size (eb=0x5555558a7540, s=0x642e6cd1) at ./kernel-shared/ctree.h:1703 #1 0x0000555555641544 in check_inode_item (root=0x5555556c2290, path=0x7fffffffd960) at check/mode-lowmem.c:2628 [CAUSE] At check_inode_item() we have path->slot[0] at 29, while the tree block only has 26 items. This happens because two reasons: - btrfs_next_item() never reverts its slots Even if we failed to read next leaf. - check_inode_item() doesn't inform the caller that a fatal error happened In check_inode_item(), if btrfs_next_item() failed, it goes to out label, which doesn't really set @err properly. This means, when check_inode_item() fails at btrfs_next_item(), it will increase path->slots[0], while it's already beyond current tree block nritems. When the slot increases furthermore, and if the unused item slots have some garbage, we will get invalid btrfs_item_ptr() result, and causing above segfault. [FIX] Fix the problems by two ways: - Make btrfs_next_item() to revert its path->slots[0] on failure - Properly detect fatal error from check_inode_item() By this, we will no longer crash on the crafted image. Reported-by: Wang Yugui <wangyugui@e16-tech.com> Issue: #412 Signed-off-by: Qu Wenruo <wqu@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2021-11-04 20:56:42 +01:00
Qu Wenruo	eacdd1606c	btrfs-progs: print-tree: fix chunk/block group flags output [BUG] Commit ("btrfs-progs: use raid table for profile names in print-tree.c") introduced one bug in block group and chunk flags output and changed the behavior: item 1 key (FIRST_CHUNK_TREE CHUNK_ITEM 13631488) itemoff 16105 itemsize 80 length 8388608 owner 2 stripe_len 65536 type SINGLE ... item 2 key (FIRST_CHUNK_TREE CHUNK_ITEM 22020096) itemoff 15993 itemsize 112 length 8388608 owner 2 stripe_len 65536 type DUP ... item 3 key (FIRST_CHUNK_TREE CHUNK_ITEM 30408704) itemoff 15881 itemsize 112 length 268435456 owner 2 stripe_len 65536 type DUP ... Note that, the flag string only contains the profile (SINGLE/DUP/etc...) no type (DATA/METADATA/SYSTEM). And we have new "SINGLE" string, even that profile has no extra bit to indicate that. [CAUSE] The "SINGLE" part is caused by the raid array which has a name for SINGLE profile, even it doesn't have the corresponding bit. The missing type string is caused by a code bug: strcpy(buf, name); while (tmp) { tmp = toupper(tmp); tmp++; } strcpy(ret, buf); The last strcpy() call overrides the existing string in @ret. [FIX] - Enhance string handling using strn()/snprintf() - Add extra "UKNOWN.0x%llx" output for unknown profiles - Call proper strncat() to merge type and profile - Add extra handling for "SINGLE" to keep the old output Signed-off-by: Qu Wenruo <wqu@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2021-10-20 18:59:24 +02:00
Qu Wenruo	f4c712e024	btrfs-progs: rename data parameter to profile in extent allocation path In function btrfs_reserve_extent(), we call find_free_extent() passing "u64 profile" into "int data". This is definitely a width reduction, but when looking further into the code, it's more serious than that, in fact the "int data" parameter is not really to indicate whether it's data extent, but really a block group profile (with block group type). This is not only width reduction, but also confusing. Thankfully so for we don't have any BLOCK_GROUP bits beyond 32 bits, so the width reduction is not causing a big problem. This patch will rename the "int data" parameter to a more proper one, "u64 profile" in all involved call paths. Signed-off-by: Qu Wenruo <wqu@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2021-10-20 18:59:24 +02:00
David Sterba	bc28dc6bea	btrfs-progs: introduce helper for striped profiles There are several profiles like raid0, raid10, raid5 and raid6 that can span as many devices as possible and need special handling for the stripe calculations. Provide a helper to identify the profiles in a simple way. Signed-off-by: David Sterba <dsterba@suse.com>	2021-10-20 18:59:24 +02:00
David Sterba	a25a5cc2c0	btrfs-progs: use btrfs_bg_type_to_nparity in btrfs_stripe_length Signed-off-by: David Sterba <dsterba@suse.com>	2021-10-20 18:59:24 +02:00
David Sterba	11fcdbc35e	btrfs-progs: introduce helper to get allowed profiles for a given device number Use the raid table helper to avoid hard coding profiles for the given number of devices in test_num_disk_vs_raid. Signed-off-by: David Sterba <dsterba@suse.com>	2021-10-20 18:59:24 +02:00
David Sterba	cb0b63cd90	btrfs-progs: use raid table value for sub_stripes in btrfs_check_chunk_valid Signed-off-by: David Sterba <dsterba@suse.com>	2021-10-20 18:59:24 +02:00
David Sterba	4f662d74fd	btrfs-progs: export raid table helper for sub_stripes Signed-off-by: David Sterba <dsterba@suse.com>	2021-10-20 18:59:24 +02:00
David Sterba	833ce53872	btrfs-progs: use btrfs_bg_type_to_nparity in calc_stripe_length Signed-off-by: David Sterba <dsterba@suse.com>	2021-10-20 18:59:24 +02:00
David Sterba	e3355b43b4	btrfs-progs: use raid table for min devs in btrfs_check_chunk_valid Replace the hard coded values with the raid table reference. Signed-off-by: David Sterba <dsterba@suse.com>	2021-10-20 18:59:24 +02:00
David Sterba	3d05b20435	btrfs-progs: use btrfs_bg_type_to_nparity in chunk_bytes_by_type Signed-off-by: David Sterba <dsterba@suse.com>	2021-10-20 18:59:23 +02:00
David Sterba	359710d4dd	btrfs-progs: use btrfs_bg_type_to_nparity in get_dev_extent_len Stripe calculation with hard coded parity, use the helper. Signed-off-by: David Sterba <dsterba@suse.com>	2021-10-20 18:59:23 +02:00
David Sterba	f759e9bbad	btrfs-progs: introduce a public helper for raid parity There's a private helper for parity and there are many open coded calculations of parity for the RAID56 profiles. The helper will be used to remove that and use the raid table values. Signed-off-by: David Sterba <dsterba@suse.com>	2021-10-20 18:59:23 +02:00
David Sterba	447bf2fb37	btrfs-progs: zoned: factor out supported profiles to a helper The enumeration could get out of date, like fixed in previous commit. Create a helper that will hide the implementation details. Signed-off-by: David Sterba <dsterba@suse.com>	2021-10-20 18:59:23 +02:00
David Sterba	15eb03ca1d	btrfs-progs: use raid table for profile names in print-tree.c Pick the names from the raid table and do the uppercase conversion. Signed-off-by: David Sterba <dsterba@suse.com>	2021-10-20 18:59:23 +02:00
David Sterba	80714610f3	btrfs-progs: use raid table for ncopies There's opencoded value of raid table ncopies in print_filesystem_usage_overall, add a helper and use it. Signed-off-by: David Sterba <dsterba@suse.com>	2021-10-20 18:59:23 +02:00
David Sterba	b29f1603b0	btrfs-progs: use raid table for devs_min and replace local helper Another duplication of the raid table, in this case missing the changes to raid10 and raid0 minimum devices changed in `a177ef7dd4` ("btrfs-progs: mkfs: allow degenerate raid0/raid10"). Define and use a helper using the table value. Signed-off-by: David Sterba <dsterba@suse.com>	2021-10-20 18:59:23 +02:00
Naohiro Aota	e9696b06f0	btrfs-progs: use direct-io for zoned device We need to use direct-IO for zoned devices to preserve the write ordering. Instead of detecting if the device is zoned or not, we simply use direct-IO for any kind of device (even if emulated zoned mode on a regular device). Signed-off-by: Naohiro Aota <naohiro.aota@wdc.com> Signed-off-by: David Sterba <dsterba@suse.com>	2021-10-20 18:59:23 +02:00
Naohiro Aota	4a8d85f730	btrfs-progs: temporarily set zoned flag for initial tree reading Functions to read data/metadata e.g. read_extent_from_disk() now depend on the fs_info->zoned flag to determine if they do direct-IO or not. The flag (and zone_size) is not known before reading the chunk tree and it set to 0 while in the initial chunk tree setup process. That will cause btrfs_pread() to fail because it does not align the buffer. Use fcntl() to find out the file descriptor is opened with O_DIRECT or not, and if it is, set the zoned flag to 1 temporally for this initial process. Reviewed-by: Johannes Thumshirn <johannes.thumshirn@wdc.com> Signed-off-by: Naohiro Aota <naohiro.aota@wdc.com> Signed-off-by: David Sterba <dsterba@suse.com>	2021-10-20 18:59:23 +02:00
Naohiro Aota	ae0dfb246d	btrfs-progs: introduce btrfs_pread wrapper for pread Wrap pread with btrfs_pread as well. Reviewed-by: Johannes Thumshirn <johannes.thumshirn@wdc.com> Signed-off-by: Naohiro Aota <naohiro.aota@wdc.com> Signed-off-by: David Sterba <dsterba@suse.com>	2021-10-20 18:59:23 +02:00
Naohiro Aota	c821e5545f	btrfs-progs: introduce btrfs_pwrite wrapper for pwrite Wrap pwrite with btrfs_pwrite(). It simply calls pwrite() on non-zoned btrfs (opened without O_DIRECT). On zoned mode (opened with O_DIRECT), it allocates an aligned bounce buffer, copies the contents and uses it for direct-IO writing. Writes in device_zero_blocks() and btrfs_wipe_existing_sb() are a little tricky. We don't have fs_info on our hands, so use zinfo to determine it is a zoned device or not. Reviewed-by: Johannes Thumshirn <johannes.thumshirn@wdc.com> Signed-off-by: Naohiro Aota <naohiro.aota@wdc.com> Signed-off-by: David Sterba <dsterba@suse.com>	2021-10-20 18:59:23 +02:00
Naohiro Aota	40ab7530df	btrfs-progs: set eb::fs_info properly everywhere Several extent_buffer initializations miss fs_info initialization. This is OK before the following patch ("btrfs-progs: use direct-io for zoned device") as eb->fs_info is not always necessary. But, after that patch, we will use fs_info to determine it is zoned or not and that causes segfault in such cases. Properly set fs_info when initializing extent_buffers to fix the issue. Signed-off-by: Naohiro Aota <naohiro.aota@wdc.com> Signed-off-by: David Sterba <dsterba@suse.com>	2021-10-08 20:47:04 +02:00
David Sterba	8bb13015bd	btrfs-progs: don't include btrfs-list.h unless necessary We don't need to include this besides btrfs-list.c itself and subvolume.c that does use the btrfs_list_* API. Signed-off-by: David Sterba <dsterba@suse.com>	2021-10-08 20:47:03 +02:00
Naohiro Aota	585ac14d1a	btrfs-progs: use btrfs_device_size() instead of device_get_partition_size_fd() device_get_partition_size_fd() fails if we pass a regular file. This can happen when trying to create an emulated zoned filesystem on a regular file. Signed-off-by: Naohiro Aota <naohiro.aota@wdc.com> Signed-off-by: David Sterba <dsterba@suse.com>	2021-10-08 20:46:35 +02:00
David Sterba	785218efb1	btrfs-progs: remove direct calls to crc32c from ctree.h Make the helpers using crc32c not inline so the crc32c.h can be removed from the public headers exported by libbtrfs. Signed-off-by: David Sterba <dsterba@suse.com>	2021-10-08 20:46:35 +02:00
David Sterba	732d73dc1f	btrfs-progs: remove btrfs_crc32c alias There's an ancient macro btrfs_crc32c which is just wrapping crc32c and not doing anything else, so we can use the crc helper directly. Signed-off-by: David Sterba <dsterba@suse.com>	2021-10-08 20:46:35 +02:00
David Sterba	979bda6fb5	btrfs-progs: libbtrfs: replace SZ_ constants and drop sizes.h To drop sizes.h from exported headers, replace the few SZ_ constants from the existing exported headers (ctree.h, send.h). It would be nice to use them in the long run but right now it would prevent unexporting the sizes.h file. Signed-off-by: David Sterba <dsterba@suse.com>	2021-10-08 20:46:35 +02:00
David Sterba	38356d456b	btrfs-progs: libbtrfs: drop radix-tree.h from exported headers The header is only included from ctree.h but not actually used, we can drop it from the exported files. Signed-off-by: David Sterba <dsterba@suse.com>	2021-10-08 20:46:35 +02:00
Nikolay Borisov	39c6e0b79c	btrfs-progs: add btrfs_uuid_tree_remove It will be used to clear received data on RW snapshots that were received. The function is copied from kernel sources. Signed-off-by: Nikolay Borisov <nborisov@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2021-10-08 20:46:34 +02:00
Nikolay Borisov	97640a5b81	btrfs-progs: remove root argument from btrfs_truncate_item This function lies in the kernel-shared directory and is supposed to be close to 1:1 copy with its kernel counterpart, yet it takes one extra argument - root. But this is now unused to simply remove it. Reviewed-by: Qu Wenruo <wqu@suse.com> Signed-off-by: Nikolay Borisov <nborisov@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2021-10-08 20:46:34 +02:00
Nikolay Borisov	c3584b4fc0	btrfs-progs: remove fs_info argument from leaf_data_end The function already takes an extent_buffer which has a reference to the owning filesystem's fs_info. This also brings the function in line with the kernel's signature. Reviewed-by: Qu Wenruo <wqu@suse.com> Signed-off-by: Nikolay Borisov <nborisov@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2021-10-08 20:46:34 +02:00
Nikolay Borisov	7c58b09548	btrfs-progs: remove root argument from btrfs_fixup_low_keys It's not used, so just remove it. Reviewed-by: Qu Wenruo <wqu@suse.com> Signed-off-by: Nikolay Borisov <nborisov@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2021-10-08 20:46:34 +02:00
David Sterba	d0ea2b2af4	btrfs-progs: zoned: also exclude raid1c3 and raid1c4 from supported profiles The enumeration of profiles not available for zoned mode in btrfs_load_block_group_zone_info was lacking the 3 and 4 copy raid1, add them. Signed-off-by: David Sterba <dsterba@suse.com>	2021-10-07 18:39:58 +02:00
David Sterba	27207d651a	btrfs-progs: dump-tree: print complete root_item The output of root_item in the 'inspect dump-tree' command lacks some items and some of them are printed conditionally. As the dump utility is for debugging, it's better to print all the items, with names matching the structure members and order. Some values will inevitably be all zeros like uuids or various timestamps, but that's a minor issue and affecting only a few trees. Example: item 0 key (EXTENT_TREE ROOT_ITEM 0) itemoff 15844 itemsize 439 generation 5 root_dirid 0 bytenr 30523392 byte_limit 0 bytes_used 16384 last_snapshot 0 flags 0x0(none) refs 1 drop_progress key (0 UNKNOWN.0 0) drop_level 0 level 0 generation_v2 5 uuid 00000000-0000-0000-0000-000000000000 parent_uuid 00000000-0000-0000-0000-000000000000 received_uuid 00000000-0000-0000-0000-000000000000 ctransid 0 otransid 0 stransid 0 rtransid 0 ctime 0.0 (1970-01-01 01:00:00) otime 0.0 (1970-01-01 01:00:00) stime 0.0 (1970-01-01 01:00:00) rtime 0.0 (1970-01-01 01:00:00) item 3 key (FS_TREE ROOT_ITEM 0) itemoff 14949 itemsize 439 generation 4 root_dirid 256 bytenr 30408704 byte_limit 0 bytes_used 16384 last_snapshot 0 flags 0x0(none) refs 1 drop_progress key (0 UNKNOWN.0 0) drop_level 0 level 0 generation_v2 4 uuid ec4669b6-6d21-46ab-857e-d60cafde45b3 parent_uuid 00000000-0000-0000-0000-000000000000 received_uuid 00000000-0000-0000-0000-000000000000 ctransid 0 otransid 0 stransid 0 rtransid 0 ctime 1633021823.0 (2021-09-30 19:10:23) otime 1633021823.0 (2021-09-30 19:10:23) stime 0.0 (1970-01-01 01:00:00) rtime 0.0 (1970-01-01 01:00:00) Signed-off-by: David Sterba <dsterba@suse.com>	2021-10-07 18:39:44 +02:00
Josef Bacik	dd8e7477f7	btrfs-progs: remove data extents from the free space tree Dave reported a failure of mkfs-test 009 with the free space tree enabled by default. This is because 009 pre-populates the file system with a given directory, and for some reason our data allocation path isn't the same as in the kernel. Fix this by making sure when we allocate a data extent we remove the space from the free space tree, and with this our mkfs tests now pass. Issue: #410 Signed-off-by: Josef Bacik <josef@toxicpanda.com> Signed-off-by: David Sterba <dsterba@suse.com>	2021-10-06 16:49:52 +02:00
Naohiro Aota	85e102f212	btrfs-progs: properly format btrfs_header in btrfs_create_root() Enabling quota in zoned mored hits the following assertion: $ mkfs.btrfs -f -d single -m single -R quota /dev/nullb0 btrfs-progs v5.11 See http://btrfs.wiki.kernel.org for more information. Zoned: /dev/nullb0: host-managed device detected, setting zoned feature Resetting device zones /dev/nullb0 (1600 zones) ... bad tree block 25395200, bytenr mismatch, want=25395200, have=0 kernel-shared/disk-io.c:549: write_tree_block: BUG_ON `1` triggered, value 1 ./mkfs.btrfs(+0x26aaa)[0x564d1a7ccaaa] ./mkfs.btrfs(write_tree_block+0xb8)[0x564d1a7cee29] ./mkfs.btrfs(__commit_transaction+0x91)[0x564d1a7e3740] ./mkfs.btrfs(btrfs_commit_transaction+0x135)[0x564d1a7e39aa] ./mkfs.btrfs(main+0x1fe9)[0x564d1a7b442a] /lib64/libc.so.6(__libc_start_main+0xcd)[0x7f36377d37fd] ./mkfs.btrfs(_start+0x2a)[0x564d1a7b1fda] zsh: IOT instruction sudo ./mkfs.btrfs -f -d single -m single -R quota /dev/nullb0 The issue occurs because btrfs_create_root() is not formatting the root node properly. This is fine in regular mode, because it's fortunately reusing an once freed buffer. As the previous tree node allocation kindly formatted the header, it will see the proper bytenr and pass the checks. However, we never reuse a once freed buffer on zoned filesystem. As a result, we have zero-filled bytenr, FSID, and chunk-tree UUID, hitting the asserts in check_tree_block(). Reported-by: Johannes Thumshirn <Johannes.Thumshirn@wdc.com> Signed-off-by: Naohiro Aota <naohiro.aota@wdc.com> Signed-off-by: David Sterba <dsterba@suse.com>	2021-10-06 16:49:11 +02:00
Johannes Thumshirn	c22e9487a7	btrfs-progs: remove max_zone_append_size logic max_zone_append_size is unused and can as well be removed just like we did on the kernel side. Keep one sanity check though, so we're not adding devices to a zoned FS that aren't supporting zone append. Signed-off-by: Johannes Thumshirn <johannes.thumshirn@wdc.com> Signed-off-by: David Sterba <dsterba@suse.com>	2021-10-06 16:49:07 +02:00
Naohiro Aota	53ec59ead0	btrfs-progs: do not zone reset on emulated zoned mode We cannot zone reset a regular file with emulated zones. So, mkfs.btrfs on such a file causes the following error. ERROR: zoned: failed to reset device '/home/naota/tmp/btrfs.img' zones: Inappropriate ioctl for device Introduce btrfs_zoned_device_info->emulated to distinguish the zones are emulated or not. And, use it to decide it needs zone reset or not. Signed-off-by: Naohiro Aota <naohiro.aota@wdc.com> Signed-off-by: David Sterba <dsterba@suse.com>	2021-10-06 16:48:56 +02:00
Qu Wenruo	60651ad9da	btrfs-progs: introduce OPEN_CTREE_ALLOW_TRANSID_MISMATCH flag [BUG] There is a report that, btrfstune can even work while the fs has transid mismatch problems. $ btrfstune -f -u /dev/sdb1 Current fsid: b2b5ae8d-4c49-45f0-b42e-46fe7dcfcb07 New fsid: b2b5ae8d-4c49-45f0-b42e-46fe7dcfcb07 Set superblock flag CHANGING_FSID Change fsid in extents parent transid verify failed on 792854528 wanted 20103 found 20091 parent transid verify failed on 792854528 wanted 20103 found 20091 parent transid verify failed on 792854528 wanted 20103 found 20091 Ignoring transid failure parent transid verify failed on 792870912 wanted 20103 found 20091 parent transid verify failed on 792870912 wanted 20103 found 20091 parent transid verify failed on 792870912 wanted 20103 found 20091 Ignoring transid failure parent transid verify failed on 792887296 wanted 20103 found 20091 parent transid verify failed on 792887296 wanted 20103 found 20091 parent transid verify failed on 792887296 wanted 20103 found 20091 Ignoring transid failure ERROR: child eb corrupted: parent bytenr=38010880 item=69 parent level=1 child level=1 ERROR: failed to change UUID of metadata: -5 ERROR: btrfstune failed This leaves a corrupted fs even more corrupted, and due to the extra CHANGING_FSID flag, btrfs check will not even try to run on it: Opening filesystem to check... ERROR: Filesystem UUID change in progress ERROR: cannot open file system [CAUSE] Unlike kernel, btrfs-progs has a less strict check on transid mismatch. In read_tree_block() we will fall back to use the tree block even its transid mismatch if we can't find any better copy. However not all commands in btrfs-progs needs this feature, only btrfs-check (which may fix the problem) and btrfs-restore (it just tries to ignore any problems) really utilize this feature. [FIX] Introduce a new open ctree flag, OPEN_CTREE_ALLOW_TRANSID_MISMATCH, to be explicit about whether we really want to ignore transid error. Currently only btrfs-check and btrfs-restore will utilize this new flag. Also add btrfs-image to allow opening such fs with transid error. Link: https://www.reddit.com/r/btrfs/comments/pivpqk/failure_during_btrfstune_u/ Signed-off-by: Qu Wenruo <wqu@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2021-09-20 12:17:29 +02:00
David Sterba	96a5cf0719	btrfs-progs: handle EINVAL when reading zone size on older kernels A combination of new progs and old kernel may lead to problems with detecting zone size by ioctl. Fixed by #376 but still incomplete because old kernels may return EINVAL for unsupported ioctl. This should be ENOTTY but hasn't been like that until kernel 5.11. As we always pass valid arguments to the ioctl we can't conflate the two and can EINVAL the same way as ENOTTY. Issue: #399 Signed-off-by: David Sterba <dsterba@suse.com>	2021-09-20 11:31:09 +02:00
David Sterba	ee17bcec33	btrfs-progs: remove stale declaration from send.h We don't use this header for kernel compilation so the guarded declaration is pointless. Signed-off-by: David Sterba <dsterba@suse.com>	2021-09-07 19:27:59 +02:00
David Sterba	e86425242f	btrfs-progs: move send.h to kernel-shared/ The header contains the protocol definitions and is almost exactly the same as the kernel version, move it to the proper directory. Signed-off-by: David Sterba <dsterba@suse.com>	2021-09-07 19:26:46 +02:00
David Sterba	76ab1fa364	btrfs-progs: rename and move group_profile_max_safe_loss The helper belongs to the others that translate bg flags to the raid attr table member. Signed-off-by: David Sterba <dsterba@suse.com>	2021-09-07 16:38:56 +02:00
Qu Wenruo	9a11b1b792	btrfs-progs: backport btrfs_check_node() from kernel The btrfs_check_node() has far less meaningful error message compared to kernel counterpart, and it even lacks certain checks like level check. Backport btrfs_check_node() to btrfs-progs to not only unify the code but greatly improve the readability of the error messages. Extra modification includes: - No fs_info needed As we don't need to output fsid. - Remove unlikely() macro - Extra BTRFS_TREE_BLOCK_* error type - Btrfs-progs specific error handling To record the corrupted tree blocks. Signed-off-by: Qu Wenruo <wqu@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2021-09-07 14:20:41 +02:00
Qu Wenruo	8f8cafa2ce	btrfs-progs: backport btrfs_check_leaf() from kernel Currently btrfs_check_leaf() provides almost meaningless messages for things like invalid item offset: incorrect offsets 8492 3707786077 While kernel tree-checker is doing a way better job, so it's wise to backport btrfs_check_leaf() from kernel. There are some modification needed: - New generic_err() helper - Remove unlikely() macro - Remove empty essential tree check Mkfs still needs to create empty essential trees. - Using BTRFS_TREE_BLOCK_* return value Original mode check still relies on them to do certain repair. - No need for btrfs_fs_info We no longer need fsid output, thus no need for btrfs_fs_info. - No item contents check - Still using the fail: label for btrfs-progs specific error handling The new output looks like: corrupt leaf: root=2 block=72164753408 slot=109, unexpected item end, have 3707786077 expect 8492 Signed-off-by: Qu Wenruo <wqu@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2021-09-07 14:19:54 +02:00
Qu Wenruo	1f8dfe681f	btrfs-progs: use btrfs_key for btrfs_check_node() and btrfs_check_leaf() In kernel space we hardly use btrfs_disk_key, unless for very lowlevel code. There is no need to intentionally use btrfs_disk_key in btrfs-progs either. Signed-off-by: Qu Wenruo <wqu@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2021-09-07 13:58:44 +02:00
David Sterba	c3ee6a8a09	btrfs-progs: unify GPL header comments Add the GPL v2 header to files where it was missing and is not from an external source, update to the most recent version with the address. Signed-off-by: David Sterba <dsterba@suse.com>	2021-09-07 13:58:44 +02:00
David Sterba	7572839a74	btrfs-progs: add and use bit masks for RAID1 and RAID56 profiles Many test conditions can be simplified in case they check all the related profiles. Signed-off-by: David Sterba <dsterba@suse.com>	2021-09-06 16:36:18 +02:00
David Sterba	7fe4396467	btrfs-progs: copy some raid_attr helpers from kernel There are convenience helpers for the raid attr table, copy them from kernel for further cleanups. Signed-off-by: David Sterba <dsterba@suse.com>	2021-09-06 16:36:17 +02:00
Josef Bacik	79e534def9	btrfs-progs: add the incompat flag for extent tree v2 I will have a lot of preparatory patches to reduce the review pain of this large feature. In order to enable that work define the incompat flag. Once all of the work lands to support the feature there will be a patch to actually enable us to select it and manipulate file systems with that incompat flag set. Signed-off-by: Josef Bacik <josef@toxicpanda.com> Signed-off-by: David Sterba <dsterba@suse.com>	2021-09-06 16:36:17 +02:00
Josef Bacik	826e466028	btrfs-progs: add add_block_group_free_space helper This exists in the kernel free-space-tree.c but not in progs. We need it to generate the free space items for new block groups, which is needed when we start creating the free space tree in make_btrfs(). Signed-off-by: Josef Bacik <josef@toxicpanda.com> Signed-off-by: David Sterba <dsterba@suse.com>	2021-09-06 16:36:17 +02:00
Josef Bacik	3d870a491f	btrfs-progs: make sure track_dirty and ref_cows is set properly Adding support for the per-block group roots means we will be reading the roots directly in different places. Make sure we set ->track_dirty and ->ref_cows properly in the helper so we don't have to do this everywhere. Signed-off-by: Josef Bacik <josef@toxicpanda.com> Signed-off-by: David Sterba <dsterba@suse.com>	2021-09-03 15:33:53 +02:00
David Sterba	a177ef7dd4	btrfs-progs: mkfs: allow degenerate raid0/raid10 Kernel patch b2f78e88052bc0bee ("btrfs: allow degenerate raid0/raid10") in 5.15 will allow mounting and converting to single device raid0 or two device raid10. Let mkfs create such filesystem. "The motivation is to allow to preserve the profile type as long as it possible for some intermediate state (device removal, conversion), or when there are disks of different size, with raid0 the otherwise unusable space of the last device will be used too. Similarly for raid10, though the two largest devices would need to be the same." Signed-off-by: David Sterba <dsterba@suse.com>	2021-08-27 15:40:53 +02:00

... 2 3 4 5 6 ...

434 Commits