btrfs-progs

mirror of https://github.com/kdave/btrfs-progs synced 2024-12-28 00:52:17 +00:00

Author	SHA1	Message	Date
Josef Bacik	2b9fba7974	btrfs-progs: rename btrfs_item_end to btrfs_item_data_end This matches what we did in the kernel, btrfs_item_data_end is more inline with what the helper does, which is give us the offset of the end of the data portion of the item, not the offset of the end of the item itself. Signed-off-by: Josef Bacik <josef@toxicpanda.com> Signed-off-by: David Sterba <dsterba@suse.com>	2022-11-28 18:57:43 +01:00
Josef Bacik	53f8b8fd01	btrfs-progs: make btrfs_qgroup_level helper match the kernel We return __u16 in the kernel, as this is actually the size of btrfs_qgroup_level. Adjust the existing helpers and update all the callers to deal with the new size appropriately. This will make syncing the kernel code cleaner. Signed-off-by: Josef Bacik <josef@toxicpanda.com> Signed-off-by: David Sterba <dsterba@suse.com>	2022-11-28 18:57:43 +01:00
Josef Bacik	03451430de	btrfs-progs: rename qgroup items to match the kernel naming scheme We're going to sync the kernel source into btrfs-progs, and in the kernel we have all these qgroup fields named with short names instead of the full name, so rename referenced -> rfer compressed -> cmpr exclusive -> excl to match the kernel and update all the users, this will make the sync cleaner. ioctl.h is a public header but there are no users of the btrfs_qgroup_limit structure. Signed-off-by: Josef Bacik <josef@toxicpanda.com> Signed-off-by: David Sterba <dsterba@suse.com>	2022-11-28 18:57:43 +01:00
David Sterba	44efde7fa0	btrfs-progs: unify naming of qgroup subvolid helpers Kernel function name is btrfs_qgroup_subvolid so rename it in progs. The libbtrfs can't API be changed without versioning so at least add the new helper. Signed-off-by: David Sterba <dsterba@suse.com>	2022-10-26 09:36:44 +02:00
David Sterba	ccb2d4aa45	btrfs-progs: device-utils: rename btrfs_device_size There's a group of helpers to read device size, the btrfs_device_size should be one of them. Rename it and so minor cleanup. Signed-off-by: David Sterba <dsterba@suse.com>	2022-10-11 09:08:10 +02:00
David Sterba	a827bb2db8	btrfs-progs: use template for transaction commit error messages Signed-off-by: David Sterba <dsterba@suse.com>	2022-10-11 09:08:10 +02:00
David Sterba	8fcafae04a	btrfs-progs: use template for transaction start error messages Signed-off-by: David Sterba <dsterba@suse.com>	2022-10-11 09:08:10 +02:00
David Sterba	c2be0e2ce0	btrfs-progs: use template for out of memory error messages Signed-off-by: David Sterba <dsterba@suse.com>	2022-10-11 09:08:09 +02:00
David Sterba	b5cf79f107	btrfs-progs: check: rename global repair option Make the repair mode more visible with prefix opt_ for option and also add 'check' for context. Signed-off-by: David Sterba <dsterba@suse.com>	2022-10-11 09:08:09 +02:00
David Sterba	2267708bfe	btrfs-progs: move repair.c from common/ to check/ Signed-off-by: David Sterba <dsterba@suse.com>	2022-10-11 09:08:09 +02:00
David Sterba	e012d1fedb	btrfs-progs: check: factor out code for clearing caches Move helpers related to clearing space and inode cache from main.co to a separate file. Signed-off-by: David Sterba <dsterba@suse.com>	2022-10-11 09:08:09 +02:00
David Sterba	f7224bdd12	btrfs-progs: check: move global variables to common headers Add declarations for global fs_info and task context so they can be accessed from any .c file once the main.c will be split. Add prefix "g_" for the task. Signed-off-by: David Sterba <dsterba@suse.com>	2022-10-11 09:08:09 +02:00
David Sterba	feef6aaaf6	btrfs-progs: kernel-lib: remove radix-tree The radix-tree is not used in userspace code. In kernel it's for tracking unpersisted and in-memory structures and has been replaced by the xarray. Signed-off-by: David Sterba <dsterba@suse.com>	2022-10-11 09:08:07 +02:00
David Sterba	0e38e1c4f2	btrfs-progs: use error helper for messages in non-kernel code Lots of code still uses fprintf(stderr, "...") that should be the error() helper. The kernel-shared code is left out of the conversion for now. Signed-off-by: David Sterba <dsterba@suse.com>	2022-10-11 09:08:07 +02:00
David Sterba	b5aac254c7	btrfs-progs: common: update include lists, part 1 The tool IWYU (include what you use) suggests to remove and add some includes. This is only partial to avoid accidental build breakage, the includes are entangled and will have to be cleaned in the future again. Signed-off-by: David Sterba <dsterba@suse.com>	2022-10-11 09:08:07 +02:00
David Sterba	6edd4b2121	btrfs-progs: factor string helpers out of utils.c Utils is the catch-all file, we can now separate some string utility functions. Signed-off-by: David Sterba <dsterba@suse.com>	2022-10-11 09:06:13 +02:00
David Sterba	bd94a00de0	btrfs-progs: check: update include lists The tool IWYU (include what you use) suggests to remove and add some includes. Signed-off-by: David Sterba <dsterba@suse.com>	2022-10-11 09:06:12 +02:00
David Sterba	8802a81eb8	btrfs-progs: check: reorder includes The preferred order: - system headers - standard headers - libraries - kernel library - kernel shared - common headers - other tools - own headers Signed-off-by: David Sterba <dsterba@suse.com>	2022-10-11 09:06:12 +02:00
Qu Wenruo	1430b41427	btrfs-progs: separate block group tree from extent tree v2 Block group tree feature is completely a standalone feature, and it has been over 5 years before the initial introduction to solve the long mount time. I don't really want to waste another 5 years waiting for a feature which may or may not work, but definitely not properly reviewed for its preparation patches. So this patch will separate the block group tree feature into a standalone compat RO feature. There is a catch, in mkfs create_block_group_tree(), current tree-checker only accepts block group item with valid chunk_objectid, but the existing code from extent-tree-v2 didn't properly initialize it. This patch will also fix above mentioned problem so kernel can mount it correctly. Now mkfs/fsck should be able to handle the fs with block group tree. Signed-off-by: Qu Wenruo <wqu@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2022-09-12 18:25:32 +02:00
Su Yue	c7d5638824	btrfs-progs: free extent buffer after repairing wrong transid eb In read_tree_block, extent buffer EXTENT_BAD_TRANSID flagged will be added into fs_info->recow_ebs with an increment of its refs. The corresponding free_extent_buffer should be called after we fix transid error by cowing extent buffer then remove them from fs_info->recow_ebs. Otherwise, extent buffers will be leaked as fsck-tests/002 reports: =================================================================== ====== RUN CHECK /root/btrfs-progs/btrfs check --repair --force ./default_case.img.restored parent transid verify failed on 29360128 wanted 9 found 755944791 parent transid verify failed on 29360128 wanted 9 found 755944791 parent transid verify failed on 29360128 wanted 9 found 755944791 Ignoring transid failure [1/7] checking root items Fixed 0 roots. [2/7] checking extents [3/7] checking free space cache [4/7] checking fs roots [5/7] checking only csums items (without verifying data) [6/7] checking root refs [7/7] checking quota groups skipped (not enabled on this FS) extent buffer leak: start 29360128 len 4096 enabling repair mode =================================================================== Fixes: `c64485544b` ("Btrfs-progs: keep track of transid failures and fix them if possible") Signed-off-by: Su Yue <glass@fydeos.io> Signed-off-by: David Sterba <dsterba@suse.com>	2022-09-12 15:31:23 +02:00
Qu Wenruo	0a348d3c81	btrfs-progs: check: verify the underlying block device size is valid [BUG] There is a bug report that, one btrfs got its underlying device shrunk accidentally. Fortunately the user has no data at the truncated range. However kernel will reject such filesystem, while btrfs-check reports nothing wrong with it. This can be easily reproduced by: # truncate -s 1G test.img # mkfs.btrfs test.img # truncate -s 996M test.img # btrfs check test.img Opening filesystem to check... Checking filesystem on test.img UUID: dbf0a16d-f158-4383-9025-29d7f4c43f17 [1/7] checking root items [2/7] checking extents [3/7] checking free space tree [4/7] checking fs roots [5/7] checking only csums items (without verifying data) [6/7] checking root refs [7/7] checking quota groups skipped (not enabled on this FS) found 16527360 bytes used, no error found ^^^^^^^^^^^^^^ total csum bytes: 13836 total tree bytes: 2359296 total fs tree bytes: 2162688 total extent tree bytes: 65536 btree space waste bytes: 503569 file data blocks allocated: 14168064 referenced 14168064 [CAUSE] Btrfs check really only checks the metadata cross references, not really bothering if the underlying device has correct size. Thus we completely ignored such size mismatch. [FIX] For both regular and lowmem mode, add extra check against the underlying block device size. If the block device size is smaller than its total_bytes, gives a error message and error out. Now the check looks like this for both modes: ... [2/7] checking extents ERROR: block device size is smaller than total_bytes in device item, has 1046478848 expect >= 1073741824 ERROR: errors found in extent allocation tree or chunk allocation [3/7] checking free space tree ... found 16527360 bytes used, error(s) found Issue: #504 Signed-off-by: Qu Wenruo <wqu@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2022-09-12 15:31:15 +02:00
Qu Wenruo	650282a04c	btrfs-progs: fix eb leak caused by missing btrfs_release_path() call. [BUG] Commit `06b6ad5e01` ("btrfs-progs: check: check for invalid free space tree entries") makes btrfs check to report eb leakage even on newly created btrfs: # mkfs.btrfs -f test.img # btrfs check test.img Opening filesystem to check... Checking filesystem on test.img UUID: 13c26b6a-3b2c-49b3-94c7-80bcfa4e494b [1/7] checking root items [2/7] checking extents [3/7] checking free space tree [4/7] checking fs roots [5/7] checking only csums items (without verifying data) [6/7] checking root refs [7/7] checking quota groups skipped (not enabled on this FS) found 147456 bytes used, no error found total csum bytes: 0 total tree bytes: 147456 total fs tree bytes: 32768 total extent tree bytes: 16384 btree space waste bytes: 140595 file data blocks allocated: 0 referenced 0 extent buffer leak: start 30572544 len 16384 <<< Extent buffer leakage [CAUSE] The patch in mailinglist uses a dynamically allocated path while the committed one has been converted to on-stack path, which is preferred. However, the cleanup was not done properly. We only release the path inside the while loop, no at out label. This means, if we hit error or even just exhausted free space tree as expected, we will leak the path to free space tree root. Thus leading to the above leak report. [FIX] Fix the bug by calling btrfs_release_path() at out: label too. This should make the code behave the same as the patch submitted to the mailing list. Fixes: `06b6ad5e01` ("btrfs-progs: check: check for invalid free space tree entries") Signed-off-by: Qu Wenruo <wqu@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2022-09-10 22:50:47 +02:00
Josef Bacik	06b6ad5e01	btrfs-progs: check: check for invalid free space tree entries While testing some changes to how we reclaim block groups I started hitting failures with my TEST_DEV. This occurred because I had a bug and failed to properly remove a block groups free space tree entries. However this wasn't caught in testing when it happened because btrfs check only checks that the free space cache for the existing block groups is valid, it doesn't check for free space entries that don't have a corresponding block group. Fix this by checking for free space entries that don't have a corresponding block group. Additionally add a test image to validate this fix. Reviewed-by: Qu Wenruo <wqu@suse.com> Signed-off-by: Josef Bacik <josef@toxicpanda.com> Signed-off-by: David Sterba <dsterba@suse.com>	2022-08-16 15:18:12 +02:00
David Sterba	a33af50c52	btrfs-progs: add constant for initial getopt values Add constant for initial value to avoid unexpected clashes with user defined getopt values and shift the common size getopt values. Signed-off-by: David Sterba <dsterba@suse.com>	2022-08-16 15:18:11 +02:00
Qu Wenruo	9bded24a46	btrfs-progs: do not use btrfs_commit_transaction() just to update super blocks There are several call sites utilizing btrfs_commit_transaction() just to update members in super blocks, without any metadata update. This can be problematic for some simple call sites, like zero_log_tree() or check_and_repair_super_num_devs(). If we have big problems preventing the fs to be mounted in the first place, and need to clear the log or super block size, but by some other problems in extent tree, we're unable to allocate new blocks. Then we fall into a deadlock that, we need to mount (even ro,rescue=all) to collect extra info, but btrfs-progs can not do any super block updates. Fix the problem by allowing the following super blocks only operations to be done without using btrfs_commit_transaction(): - btrfs_fix_super_size() - check_and_repair_super_num_devs() - zero_log_tree(). There are some exceptions in btrfstune.c, related to the csum type conversion and seed flags. In those btrfstune cases, we in fact wants to proper error report in btrfs_commit_transaction(), as those operations are not mount critical, and any early error can be helpful to expose any problems in the fs. Signed-off-by: Qu Wenruo <wqu@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2022-05-20 15:54:16 +02:00
Qu Wenruo	90850c26fa	btrfs-progs: remove unused header check/btrfsck.h It should have been deleted, as CHANGES mentioned this in v5.14, but obvious it's not. Signed-off-by: Qu Wenruo <wqu@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2022-05-17 21:12:19 +02:00
Qu Wenruo	23c6d24668	btrfs-progs: check: lowmem, fix path leak when dev extents are invalid [BUG] When testing my new RAID56J code, there is a bug causing dev extents overlapping. Although both modes can detect the problem, lowmem has leaked some extent buffers: $ btrfs check --mode=lowmem /dev/test/scratch1 Opening filesystem to check... Checking filesystem on /dev/test/scratch1 UUID: 65775ce9-bb9d-4f61-a210-beea52eef090 [1/7] checking root items [2/7] checking extents ERROR: dev extent devid 1 offset 1095761920 len 1073741824 overlap with previous dev extent end 1096810496 ERROR: dev extent devid 2 offset 1351614464 len 1073741824 overlap with previous dev extent end 1352663040 ERROR: dev extent devid 3 offset 1351614464 len 1073741824 overlap with previous dev extent end 1352663040 ERROR: errors found in extent allocation tree or chunk allocation [3/7] checking free space tree [4/7] checking fs roots [5/7] checking only csums items (without verifying data) [6/7] checking root refs done with fs roots in lowmem mode, skipping [7/7] checking quota groups skipped (not enabled on this FS) found 3221372928 bytes used, error(s) found total csum bytes: 0 total tree bytes: 147456 total fs tree bytes: 32768 total extent tree bytes: 16384 btree space waste bytes: 136231 file data blocks allocated: 3221225472 referenced 3221225472 extent buffer leak: start 30752768 len 16384 extent buffer leak: start 30752768 len 16384 extent buffer leak: start 30752768 len 16384 [CAUSE] In the function check_dev_item(), we iterate through all the dev extents, but when we found overlapping extents, we exit without releasing the path, causing extent buffer leakage. [FIX] Just release the path before we exit the function. Reviewed-by: Nikolay Borisov <nborisov@suse.com> Signed-off-by: Qu Wenruo <wqu@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2022-05-17 21:12:19 +02:00
Qu Wenruo	3ff9d35257	btrfs-progs: use read_data_from_disk() to replace read_extent_from_disk() and replace read_extent_data() The function read_extent_from_disk() is only a wrapper to read tree block. And read_extent_data() is just a while loop to eliminate short read caused by stripe boundary. In fact, a lot of call sites of read_extent_data() are either reading metadata (thus no possible short read) or doing extra loop by themselves. This patch will replace those two functions with read_data_from_disk(), making it the only entrance for data/metadata read. And update read_data_from_disk() to return the read bytes, so caller can do a simple while loop. For the few callers of read_extent_data(), open-code a small while loop for them. This will allow later RAID56 read repair using P/Q much easier. Signed-off-by: Qu Wenruo <wqu@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2022-04-25 19:08:30 +02:00
Qu Wenruo	0dc8b8b6a4	btrfs-progs: check: fix wrong total bytes check for seed device [BUG] The following script can lead to false positive from btrfs check: mkfs.btrfs -f $dev1 mount $dev1 $mnt btrfstune -S1 $dev1 mount $dev1 $mnt btrfs dev add -f $dev2 $mnt umount $mnt # Now dev1 is seed, and dev2 is the rw fs. btrfs check $dev2 ... [2/7] checking extents WARNING: minor unaligned/mismatch device size detected WARNING: recommended to use 'btrfs rescue fix-device-size' to fix it ... This false positive only happens on $dev2, $dev1 is completely fine. [CAUSE] The warning is from is_super_size_valid(), in that function we verify the super block total bytes (@super_bytes) is correct against the total device bytes (@total_bytes). However the when calculating @total_bytes, we only use devices in current fs_devices, which only contains RW devices. Thus all bytes from seed device are not taken into consideration, and trigger the false positive. [FIX] Fix it by also iterating seed devices. Since we're here, also output @total_bytes and @super_bytes when outputting the warning message, to allow end users have a better idea on what's going wrong. Reviewed-by: Su Yue <l@damenly.su> Signed-off-by: Qu Wenruo <wqu@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2022-04-25 18:59:44 +02:00
Mark Harmstone	ef194732d5	btrfs-progs: check: add check for overlong xattr names While working on my Windows driver, I found that it was inadvertently allowing users to create xattrs with names longer than 255 bytes, which wasn't being picked up by btrfs-check. If the Linux driver encounters a file with an invalid xattr like this, it makes the whole directory it's in inaccessible. If it's the root directory, it'll refuse to mount the filesystem entirely. Pull-request: #456 Signed-off-by: Mark Harmstone <mark@harmstone.com> Signed-off-by: David Sterba <dsterba@suse.com>	2022-04-25 18:52:49 +02:00
Qu Wenruo	b18ceb8585	btrfs-progs: check: add check and repair ability for super num devs mismatch [BUG] There is a bug report of kernel rejecting fs which has a mismatch in super num devices and num devices found in chunk tree. But btrfs-check reports no problem about the fs. [CAUSE] We just didn't verify super num devices against the result found in chunk tree. [FIX] Add such check and repair ability for btrfs-check. The ability is mode independent. Reported-by: Luca Béla Palkovics <luca.bela.palkovics@gmail.com> Link: https://lore.kernel.org/linux-btrfs/CA+8xDSpvdm_U0QLBAnrH=zqDq_cWCOH5TiV46CKmp3igr44okQ@mail.gmail.com/ Signed-off-by: Qu Wenruo <wqu@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2022-03-24 00:36:13 +01:00
Qu Wenruo	fd500029eb	btrfs-progs: check: fix two error messages used in qgroup verification [BUG] There is a weird error message when running btrfs check on a specific image: [7/7] checking quota groups ERROR: out of memory ERROR: Loading qgroups from disk: -2 ERROR: failed to check quota groups [CAUSE] The "Out of memory" one is in load_quota_info(), which is output in two cases: - No memory can be allocated for btrfs_qgroup_list AKA, real -ENOMEM. - No qgroup can be found for either the child or the parent qgroup This returnes -ENOENT. Obvious the image has hit -ENOENT case, but the error message is fixed to ENOMEM case. [FIX] Fix it by using %m to output the real reason of failure. Reported-by: Andrei Borzenkov <arvidjaar@gmail.com> Link: https://forums.opensuse.org/showthread.php/567851-btrfs-fails-to-load-qgroups-from-disk-with-error-2-(out-of-memory) Signed-off-by: Qu Wenruo <wqu@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2022-03-23 18:18:31 +01:00
Josef Bacik	e8f7f034af	btrfs-progs: check: don't do the root item check for extent tree v2 With the current set of changes we could probably do this check, but it would involve changing the code quite a bit, and in the future we're not going to track the metadata in the extent tree at all. Since this check was for a very old kernel just skip it for extent tree v2. Signed-off-by: Josef Bacik <josef@toxicpanda.com> Signed-off-by: David Sterba <dsterba@suse.com>	2022-03-09 18:07:34 +01:00
Josef Bacik	c36d8ea235	btrfs-progs: check: handle the block group tree properly We need to make sure we process the block group root, and mark its blocks as used for the free space tree checking. Signed-off-by: Josef Bacik <josef@toxicpanda.com> Signed-off-by: David Sterba <dsterba@suse.com>	2022-03-09 18:07:12 +01:00
Josef Bacik	37d05e9b1e	btrfs-progs: check: add helper to reinit the root based on a key In the case of per-bg roots we may be missing the root items. To re-initialize them we want to add the root item as well as allocate the empty block. To achieve this extract out the reinit root logic to a helper that just takes the root key and then does the appropriate work to allocate an empty root and update the root item. Fix the normal reinit root helper to use this new helper. Signed-off-by: Josef Bacik <josef@toxicpanda.com> Signed-off-by: David Sterba <dsterba@suse.com>	2022-03-09 18:07:10 +01:00
Josef Bacik	f9b65a54c0	btrfs-progs: check: make free space tree validation extent tree v2 aware The free space tree needs to be validated against all referenced blocks in the file system, so use the btrfs_mark_used_blocks() helper to check the free space tree and free space cache against. This will do the right thing for both extent tree v1 and extent tree v2. Signed-off-by: Josef Bacik <josef@toxicpanda.com> Signed-off-by: David Sterba <dsterba@suse.com>	2022-03-09 18:07:08 +01:00
Josef Bacik	b4d714a47a	btrfs-progs: qgroup-verify: scan extents based on block groups When we switch to per-block group extent roots we'll need to scan each individual extent root. To make this easier in the future go ahead and use the range of the block groups to scan the extents. Signed-off-by: Josef Bacik <josef@toxicpanda.com> Signed-off-by: David Sterba <dsterba@suse.com>	2022-03-09 18:07:06 +01:00
Josef Bacik	1bbfc9c9a6	btrfs-progs: check: add block group tree support This makes the appropriate changes to enable the block group tree checking for both lowmem and normal check modes. This is relatively straightforward, simply need to use the helper to get the right root for dealing with block groups. Signed-off-by: Josef Bacik <josef@toxicpanda.com> Signed-off-by: David Sterba <dsterba@suse.com>	2022-03-09 18:07:04 +01:00
Josef Bacik	9f1f88c642	btrfs-progs: check-lowmem: use the btrfs_block_group_root helper When we're messing with block group items use the btrfs_block_group_root() helper to get the correct root to search, and this will do the right thing based on the file system flags. Signed-off-by: Josef Bacik <josef@toxicpanda.com> Signed-off-by: David Sterba <dsterba@suse.com>	2022-03-09 18:06:57 +01:00
Josef Bacik	360103e610	btrfs-progs: mkfs: use the btrfs_block_group_root helper Instead of accessing the extent root directory for modifying block groups, use the helper which will do the correct thing based on the flags of the file system. Signed-off-by: Josef Bacik <josef@toxicpanda.com> Signed-off-by: David Sterba <dsterba@suse.com>	2022-03-09 18:06:56 +01:00
Josef Bacik	4f184cc911	btrfs-progs: make all of the item/key_ptr offset helpers take an eb When we change the size of the btrfs_header we're going to need to change how these helpers calculate where to find the start of items or block ptrs. To prepare for that make these helpers take the extent_buffer as an argument so we can do the appropriate math based on the version type. Reviewed-by: Nikolay Borisov <nborisov@suse.com> Signed-off-by: Josef Bacik <josef@toxicpanda.com> Signed-off-by: David Sterba <dsterba@suse.com>	2022-03-09 15:13:14 +01:00
Josef Bacik	5dc3964aaa	btrfs-progs: remove the _nr from the item helpers Now that all callers are using the _nr variations we can simply rename these helpers to btrfs_item_##member/btrfs_set_item_##member and change the actual item SETGET funcs to raw_item_##member/set_raw_item_##member and then change all callers to drop the _nr part. Reviewed-by: Nikolay Borisov <nborisov@suse.com> Signed-off-by: Josef Bacik <josef@toxicpanda.com> Signed-off-by: David Sterba <dsterba@suse.com>	2022-03-09 15:13:13 +01:00
Josef Bacik	f3be0ff01a	btrfs-progs: rename btrfs_item_end_nr to btrfs_item_end All callers use the btrfs_item_end_nr() variation, simply drop btrfs_item_end() and make btrfs_item_end_nr() use the _nr() variations of the item get helpers. Reviewed-by: Nikolay Borisov <nborisov@suse.com> Signed-off-by: Josef Bacik <josef@toxicpanda.com> Signed-off-by: David Sterba <dsterba@suse.com>	2022-03-09 15:13:13 +01:00
Josef Bacik	49539423fa	btrfs-progs: change btrfs_file_extent_inline_item_len to take a slot This matches how the kernel does it, simply pass in the slot and fix up btrfs_file_extent_inline_item_len to use the btrfs_item_nr() helper and the correct define. Fixup all the callers to use the slot now instead of passing in the btrfs_item. Reviewed-by: Nikolay Borisov <nborisov@suse.com> Signed-off-by: Josef Bacik <josef@toxicpanda.com> Signed-off-by: David Sterba <dsterba@suse.com>	2022-03-09 15:13:13 +01:00
Josef Bacik	04ffea07e4	btrfs-progs: add btrfs_set_item__nr() helpers We have a lot of the following patterns item = btrfs_item_nr(nr); btrfs_set_item_(eb, item, val); btrfs_set_item_*(eb, btrfs_item_nr(nr), val); in a lot of places in our code. Instead add _nr variations of these helpers and convert all of the users to this new helper. Reviewed-by: Nikolay Borisov <nborisov@suse.com> Signed-off-by: Josef Bacik <josef@toxicpanda.com> Signed-off-by: David Sterba <dsterba@suse.com>	2022-03-09 15:13:13 +01:00
Josef Bacik	80ff3f2c8e	btrfs-progs: btrfs_item_size_nr/btrfs_item_offset_nr everywhere We have this pattern in a lot of places item = btrfs_item_nr(slot); btrfs_item_size(leaf, item); btrfs_item_offset(leaf, item); when we could simply use btrfs_item_size_nr(leaf, slot); btrfs_item_offset_nr(leaf, slot); Fix all callers of btrfs_item_size() and btrfs_item_offset() to use the _nr variation of the helpers. Reviewed-by: Nikolay Borisov <nborisov@suse.com> Signed-off-by: Josef Bacik <josef@toxicpanda.com> Signed-off-by: David Sterba <dsterba@suse.com>	2022-03-09 15:13:13 +01:00
Josef Bacik	c9703d8726	btrfs-progs: handle orphan directories properly When implementing the GC tree I started getting btrfsck errors when a test rm -rf <directory> with files inside of it and immediately unmount, leaving behind orphaned directory items that have GC items for them. This made me realize that we don't actually handle this case currently for our normal orphan path. If we fail to clean everything up and leave behind the orphan items we'll fail fsck. Fix this by not processing any backrefs we find if we found an inode item and its nlink is 0. This allows us to pass the test case I've provided to validate this patch. Signed-off-by: Josef Bacik <josef@toxicpanda.com> Signed-off-by: David Sterba <dsterba@suse.com>	2022-03-08 18:40:45 +01:00
Josef Bacik	65dde617b1	btrfs-progs: check: fix check_global_roots_uptodate While running make test on other patches I noticed we are now segfaulting on the fuzz tests. This is because when I converted us to a rb tree for the global roots we lost the ability to catch that there's no extent root at all. Before we'd populate a dummy fs_info->extent_root with a not uptodate node, but now you simply don't get an extent root in the rb_tree. Fix the check_global_roots_uptodate helper to count how many roots we find and make sure it matches the number we expect. If it doesn't then we can return -EIO. Signed-off-by: Josef Bacik <josef@toxicpanda.com> Signed-off-by: David Sterba <dsterba@suse.com>	2022-03-08 18:18:00 +01:00
Qu Wenruo	1a8f283c97	btrfs-progs: check: orig: reject bad metadata backref with invalid level [BUG] There is a bug report that kernel tree-checker rejected an invalid metadata item: corrupt leaf: block=934474399744 slot=68 extent bytenr=425173254144 len=16384 invalid tree level, have 33554432 expect [0, 7] But original mode btrfs-check reports nothing wrong. (lowmem mode will just crash, and fixed in previous patch). [CAUSE] For original mode it doesn't really check tree level, thus didn't find the problem. [FIX] I don't have a good idea to completely make original mode to verify the level in backref and in the tree block (while lowmem does that). But at least we can detect obviously corrupted level just like kernel. Now original mode will detect such problem: ... [2/7] checking extents ERROR: tree block 30457856 has bad backref level, has 256 expect [0, 7] ref mismatch on [30457856 16384] extent item 0, found 1 tree backref 30457856 root 5 not found in extent tree backpointer mismatch on [30457856 16384] ERROR: errors found in extent allocation tree or chunk allocation [3/7] checking free space tree ... Reported-by: Stickstoff <stickstoff@posteo.de> Link: https://lore.kernel.org/linux-btrfs/6ed4cd5a-7430-f326-4056-25ae7eb44416@posteo.de/ Signed-off-by: Qu Wenruo <wqu@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2022-02-01 18:42:01 +01:00
Qu Wenruo	bf21c82675	btrfs-progs: check: lowmem: fix crash when METADATA_ITEM has invalid level [BUG] When running lowmem mode with METADATA_ITEM which has invalid level, it will crash with the following backtrace: (gdb) bt #0 0x0000555555616b0b in btrfs_header_bytenr (eb=0x4) at ./kernel-shared/ctree.h:2134 #1 0x0000555555620c78 in check_tree_block_backref (root_id=5, bytenr=30457856, level=256) at check/mode-lowmem.c:3818 #2 0x0000555555621f6c in check_extent_item (path=0x7fffffffd9c0) at check/mode-lowmem.c:4334 #3 0x00005555556235a5 in check_leaf_items (root=0x555555688e10, path=0x7fffffffd9c0, nrefs=0x7fffffffda30, account_bytes=1) at check/mode-lowmem.c:4835 #4 0x0000555555623c6d in walk_down_tree (root=0x555555688e10, path=0x7fffffffd9c0, level=0x7fffffffd984, nrefs=0x7fffffffda30, check_all=1) at check/mode-lowmem.c:4967 #5 0x000055555562494f in check_btrfs_root (root=0x555555688e10, check_all=1) at check/mode-lowmem.c:5266 #6 0x00005555556254ee in check_chunks_and_extents_lowmem () at check/mode-lowmem.c:5556 #7 0x00005555555f0b82 in do_check_chunks_and_extents () at check/main.c:9114 #8 0x00005555555f50ea in cmd_check (cmd=0x55555567c640 <cmd_struct_check>, argc=3, argv=0x7fffffffdec0) at check/main.c:10892 #9 0x000055555556b2b1 in cmd_execute (argv=0x7fffffffdec0, argc=3, cmd=0x55555567c640 <cmd_struct_check>) at cmds/commands.h:125 [CAUSE] For function check_extent_item() it will go through inline extent items and then check their backrefs. But for METADATA_ITEM, it doesn't really validate key.offset, which is u64 and can contain value way larger than BTRFS_MAX_LEVEL (mostly caused by bit flip). In that case, if we have a larger value like 256 in key.offset, then later check_tree_block_backref() will use 256 as level, and overflow path->nodes[level] and crash. [FIX] Just verify the level, no matter if it's from btrfs_tree_block_level() (which is just u8), or it's from key.offset (which is u64). To do the check properly and detect higher bits corruption, also change the type of @level from u8 to u64. Now lowmem mode can detect the problem properly: ... [2/7] checking extents ERROR: tree block 30457856 has bad backref level, has 256 expect [0, 7] ERROR: extent[30457856 16384] level mismatch, wanted: 0, have: 256 ERROR: errors found in extent allocation tree or chunk allocation [3/7] checking free space tree ... Reviewed-by: Su Yue <l@damenly.su> Signed-off-by: Qu Wenruo <wqu@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2022-02-01 18:41:58 +01:00
Qu Wenruo	ec36c043ba	btrfs-progs: check: handle csum generation properly for --init-csum-tree --init-extent-tree When using `--init-csum-tree` with `--init-extent-tree`, csum tree population will be done by iterating all file extent items. This allow us to skip preallocated extents, but it still has the following problems: - Inodes with NODATASUM - Hole file extents - Written preallocated extents We will generate csum for the whole extent, while other part may still be unwritten. Make it to follow the same behavior of recently introduced fill_csum_for_file_extent(), so we can generate correct csum. Signed-off-by: Qu Wenruo <wqu@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2022-02-01 18:41:16 +01:00
Qu Wenruo	9e349ff18c	btrfs-progs: check: don't calculate csum for preallocated file extents [BUG] If a btrfs filesystem has preallocated file extents, 'btrfs check --init-csum-tree' will create csum item for preallocated extents, and cause error: # mkfs.btrfs -f test.img # mount test.img /mnt/btrfs # fallocate -l 32K /mnt/btrfs/file # umount /mnt/btrfs # btrfs check --init-csum-tree --force test.img ... [4/7] checking fs roots root 5 inode 257 errors 800, odd csum item ERROR: errors found in fs roots found 376832 bytes used, error(s) found And the csum tree is not empty, containing csum for the preallocated extent: $ btrfs ins dump-tree -t csum test.img btrfs-progs v5.15.1 checksum tree key (CSUM_TREE ROOT_ITEM 0) leaf 30408704 items 1 free space 16226 generation 9 owner CSUM_TREE leaf 30408704 flags 0x1(WRITTEN) backref revision 1 fs uuid ecc79835-5611-4609-b985-e4ccd6f15b54 chunk uuid b1c75553-5b82-4aa6-bbbe-e7f50643b1a8 item 0 key (EXTENT_CSUM EXTENT_CSUM 13631488) itemoff 16251 itemsize 32 range start 13631488 end 13664256 length 32768 [CAUSE] For `--init-csum-tree` alone, we will use extent tree to iterate each data extent, and calculate csum for them. But extent items alone can not tell us if the file extent belongs to a NODATASUM inode, nor if it's preallocated. Thus we create csums for those data extents, and cause the problem. [FIX] However the fix is not that simple, we can not just generate csum for non-preallocated range. As the following case we still need csum for the un-referred part: xfs_io -f -c "pwrite 0 8K" -c "sync" -c "punch 0 4K" So here we have to go another direction by: - Always generate csum for the whole data extent This is the same as the old code - Iterate the file extents, and delete csum for preallocated range or NODATASUM inodes Issue: #430 Signed-off-by: Qu Wenruo <wqu@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2022-02-01 18:41:15 +01:00
Qu Wenruo	f8bc08cb75	btrfs-progs: check: move csum tree population into mode-common.[ch] This part has no mode specific operations, just move them into mode-common.[ch]. Signed-off-by: Qu Wenruo <wqu@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2022-02-01 18:41:13 +01:00
Sidong Yang	7f94ccb20a	btrfs-progs: check: change commit condition in fixup_extent_refs() This patch fixes potential bugs in fixup_extent_refs(). If btrfs_start_transaction() fails in some way and returns error ptr, It goes to out logic. But old code checkes whether it is null and it calls commit. This patch solves the problem with make that it calls only if ret is no error. Issue: #409 Reviewed-by: Nikolay Borisov <nborisov@suse.com> Signed-off-by: Sidong Yang <realwakka@gmail.com> Signed-off-by: David Sterba <dsterba@suse.com>	2021-11-30 19:16:22 +01:00
Josef Bacik	e68e73318f	btrfs-progs: check: fill csum root from all extent roots We may have multiple extent roots, so cycle through all of the extent roots and populate the csum tree based on the content of every extent root we have in the file system. Signed-off-by: Josef Bacik <josef@toxicpanda.com> Signed-off-by: David Sterba <dsterba@suse.com>	2021-11-30 18:59:34 +01:00
Josef Bacik	bb575e3818	btrfs-progs: check: check all of the csum roots Use the global roots tree to find all of the csum roots in the system and check all of them as appropriate. Signed-off-by: Josef Bacik <josef@toxicpanda.com> Signed-off-by: David Sterba <dsterba@suse.com>	2021-11-30 18:59:20 +01:00
Josef Bacik	d26c0b4414	btrfs-progs: check: check the global roots for uptodate root nodes Instead of checking the csum and extent tree individually, walk through the global roots and validate them all. This will work properly if we have extent tree v1 or extent tree v2. Signed-off-by: Josef Bacik <josef@toxicpanda.com> Signed-off-by: David Sterba <dsterba@suse.com>	2021-11-30 18:57:45 +01:00
Josef Bacik	bf0ce5cb86	btrfs-progs: check: make reinit work per found root item Instead of looking for the first extent root or csum root in memory, scan through the tree root and re-init any root items that match the given objectid. This will allow reinit to work with both extent tree v1 and extent tree v2 global roots when using the --reinit option. Signed-off-by: Josef Bacik <josef@toxicpanda.com> Signed-off-by: David Sterba <dsterba@suse.com>	2021-11-30 18:57:43 +01:00
Josef Bacik	db2ab47823	btrfs-progs: stop accessing ->extent_root directly When we switch to multiple global trees we'll need to access the appropriate extent root depending on the block group or possibly root. To handle this, use a helper in most places and then the actual root in places where it is required. We will whittle down the direct accessors with future patches, but this does the bulk of the preparatory work. Signed-off-by: Josef Bacik <josef@toxicpanda.com> Signed-off-by: David Sterba <dsterba@suse.com>	2021-11-30 18:56:54 +01:00
Josef Bacik	d67bf1b365	btrfs-progs: check: move btrfs_mark_used_tree_blocks to common This is going to be used for the extent tree v2 stuff more commonly, so move it out so that it is accessible from everywhere that we need it. Signed-off-by: Josef Bacik <josef@toxicpanda.com> Signed-off-by: David Sterba <dsterba@suse.com>	2021-11-22 21:45:37 +01:00
Josef Bacik	17f6374cfa	btrfs-progs: check: abstract out the used marking helpers We will walk all referenced tree blocks during check in order to avoid writing over any referenced blocks during fsck. However in the future we're going to need to do this for things like fixing block group accounting with extent tree v2. This is because extent tree v2 will not refer to all of the allocated blocks in the extent tree. Refactor the code some to allow us to send down an arbitrary extent_io_tree so we can use this helper for any case where we need to figure out where all the used space is on an extent tree v2 file system. Signed-off-by: Josef Bacik <josef@toxicpanda.com> Signed-off-by: David Sterba <dsterba@suse.com>	2021-11-22 21:45:37 +01:00
Josef Bacik	639b1fc2e7	btrfs-progs: stop accessing ->csum_root directly With extent tree v2 we will have per-block group checksums, so add a helper to access the csum root and rename the fs_info csum_root to _csum_root to catch all the places that are accessing it directly. Convert everybody to use the helper except for internal things. Signed-off-by: Josef Bacik <josef@toxicpanda.com> Signed-off-by: David Sterba <dsterba@suse.com>	2021-11-22 21:45:37 +01:00
Josef Bacik	8742a38d5f	btrfs-progs: check: stop passing csum root around We pass the csum root from way high in the call chain in check down to where we actually need it. However we can just get it from the fs_info in these places, so clean up the functions to skip passing around the csum root needlessly. Reviewed-by: Qu Wenruo <wqu@suse.com> Signed-off-by: Josef Bacik <josef@toxicpanda.com> Signed-off-by: David Sterba <dsterba@suse.com>	2021-11-22 21:45:37 +01:00
Josef Bacik	08b63c0fc5	btrfs-progs: stop passing root to csum related functions We are going to need to start looking up the csum root based on the bytenr with extent tree v2. To that end stop passing the root to the csum related functions so that can be done in the helper functions themselves. There's an unrelated deletion of a function prototype that no longer exists. Reviewed-by: Qu Wenruo <wqu@suse.com> Signed-off-by: Josef Bacik <josef@toxicpanda.com> Signed-off-by: David Sterba <dsterba@suse.com>	2021-11-22 21:45:37 +01:00
Josef Bacik	94729e9c5a	btrfs-progs: check: don't walk down non fs-trees for qgroup check We will lookup implied refs for every root id for every block that we find when verifying qgroups, but we don't need to worry about non fstrees, so skip them here. Reviewed-by: Qu Wenruo <wqu@suse.com> Signed-off-by: Josef Bacik <josef@toxicpanda.com> Signed-off-by: David Sterba <dsterba@suse.com>	2021-11-22 21:45:37 +01:00
Josef Bacik	9fdb291b8e	btrfs-progs: check: fix set_extent_dirty range I screwed up a fix where we're setting the bytenr range as dirty when marking all tree blocks used, I was looking at btrfs_pin_extent and put ->nodesize for end instead of the actual end, which is bytenr + ->nodesize - 1. Fix this up so it's correct. Signed-off-by: Josef Bacik <josef@toxicpanda.com> Signed-off-by: David Sterba <dsterba@suse.com>	2021-11-22 21:45:37 +01:00
Mark Harmstone	945cfa1ae3	btrfs-progs: check: add check for too many csum entries My tool ntfs2btrfs has been creating btrfs volumes in a way that the kernel doesn't like, but which isn't picked up by btrfs check - see maharmstone/ntfs2btrfs#23 for the details, including a backtrace. This patch adds a check for when a csum item contains too many entries - effectively it ensures that there's always at least sizeof(struct btrfs_item) bytes free in the tree, otherwise btrfs_del_csums can throw an error. max_entries is the value of the __MAX_CSUM_ITEMS macro in fs/btrfs/file-item.c. Pull-request: #401 Signed-off-by: Mark Harmstone <mark@harmstone.com> Signed-off-by: David Sterba <dsterba@suse.com>	2021-11-18 10:17:21 +01:00
Qu Wenruo	c4ff87c3d1	btrfs-progs: cache csum_size and csum_type in btrfs_fs_info Just like kernel commit 22b6331d9617 ("btrfs: store precalculated csum_size in fs_info"), we can cache csum_size and csum_type in btrfs_fs_info. Furthermore, there is already a 32 bits hole in btrfs_fs_info, and we can fit csum_type and csum_size into the hole without increase the size of btrfs_fs_info. Signed-off-by: Qu Wenruo <wqu@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2021-11-05 12:50:03 +01:00
Qu Wenruo	5bee5c99bf	btrfs-progs: fix printf formats on 32bit x86 When compiling btrfs-progs on 32bit x86 using GCC 11.1.0, there are several warnings: In file included from ./common/utils.h:30, from check/main.c:36: check/main.c: In function 'run_next_block': ./common/messages.h:42:31: warning: format '%lu' expects argument of type 'long unsigned int', but argument 4 has type 'u32' {aka 'unsigned int'} [-Wformat=] 42 \| __btrfs_error((fmt), ##__VA_ARGS__); \ \| ^~~~~ check/main.c:6496:33: note: in expansion of macro 'error' 6496 \| error( \| ^~~~~ In file included from ./common/utils.h:30, from kernel-shared/volumes.c:32: kernel-shared/volumes.c: In function 'btrfs_check_chunk_valid': ./common/messages.h:42:31: warning: format '%lu' expects argument of type 'long unsigned int', but argument 4 has type 'u32' {aka 'unsigned int'} [-Wformat=] 42 \| __btrfs_error((fmt), ##__VA_ARGS__); \ \| ^~~~~ kernel-shared/volumes.c:2052:17: note: in expansion of macro 'error' 2052 \| error("invalid chunk item size, have %u expect [%zu, %lu)", \| ^~~~~ image/main.c: In function 'search_for_chunk_blocks': ./common/messages.h:42:31: warning: format '%lu' expects argument of type 'long unsigned int', but argument 3 has type 'size_t' {aka 'unsigned int'} [-Wformat=] 42 \| __btrfs_error((fmt), ##__VA_ARGS__); \ \| ^~~~~ image/main.c:2122:33: note: in expansion of macro 'error' 2122 \| error( \| ^~~~~ There are two types of problems: - __BTRFS_LEAF_DATA_SIZE() This macro has no type definition, making it behaves differently on different arches. Fix this by following kernel to use inline function to make its return value fixed to u32. - size_t related output For x86_64 %lu is OK but not for x86. Fix this by using %zu. Signed-off-by: Qu Wenruo <wqu@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2021-11-05 12:50:03 +01:00
Qu Wenruo	8f81113021	btrfs-progs: check: fix a lowmem mode crash where fatal error is not properly handled [BUG] When a special image (diverted from fsck/012) has its unused slots (slot number >= nritems) with garbage, lowmem mode btrfs check can crash: (gdb) run check --mode=lowmem ~/downloads/good.img.restored Starting program: /home/adam/btrfs/btrfs-progs/btrfs check --mode=lowmem ~/downloads/good.img.restored ... ERROR: root 5 INODE[5044031582654955520] nlink(257228800) not equal to inode_refs(0) ERROR: root 5 INODE[5044031582654955520] nbytes 474624 not equal to extent_size 0 Program received signal SIGSEGV, Segmentation fault. 0x0000555555639b11 in btrfs_inode_size (eb=0x5555558a7540, s=0x642e6cd1) at ./kernel-shared/ctree.h:1703 1703 BTRFS_SETGET_FUNCS(inode_size, struct btrfs_inode_item, size, 64); (gdb) bt #0 0x0000555555639b11 in btrfs_inode_size (eb=0x5555558a7540, s=0x642e6cd1) at ./kernel-shared/ctree.h:1703 #1 0x0000555555641544 in check_inode_item (root=0x5555556c2290, path=0x7fffffffd960) at check/mode-lowmem.c:2628 [CAUSE] At check_inode_item() we have path->slot[0] at 29, while the tree block only has 26 items. This happens because two reasons: - btrfs_next_item() never reverts its slots Even if we failed to read next leaf. - check_inode_item() doesn't inform the caller that a fatal error happened In check_inode_item(), if btrfs_next_item() failed, it goes to out label, which doesn't really set @err properly. This means, when check_inode_item() fails at btrfs_next_item(), it will increase path->slots[0], while it's already beyond current tree block nritems. When the slot increases furthermore, and if the unused item slots have some garbage, we will get invalid btrfs_item_ptr() result, and causing above segfault. [FIX] Fix the problems by two ways: - Make btrfs_next_item() to revert its path->slots[0] on failure - Properly detect fatal error from check_inode_item() By this, we will no longer crash on the crafted image. Reported-by: Wang Yugui <wangyugui@e16-tech.com> Issue: #412 Signed-off-by: Qu Wenruo <wqu@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2021-11-04 20:56:42 +01:00
David Sterba	00610f5853	btrfs-progs: libbtrfs: remove unneeded BTRFS_FLAT_INCLUDES protections Remove the switch for local and system-wide headers in headers that are not part of libbtrfs anymore. Signed-off-by: David Sterba <dsterba@suse.com>	2021-10-08 20:47:03 +02:00
David Sterba	38356d456b	btrfs-progs: libbtrfs: drop radix-tree.h from exported headers The header is only included from ctree.h but not actually used, we can drop it from the exported files. Signed-off-by: David Sterba <dsterba@suse.com>	2021-10-08 20:46:35 +02:00
Nikolay Borisov	7c58b09548	btrfs-progs: remove root argument from btrfs_fixup_low_keys It's not used, so just remove it. Reviewed-by: Qu Wenruo <wqu@suse.com> Signed-off-by: Nikolay Borisov <nborisov@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2021-10-08 20:46:34 +02:00
Qu Wenruo	9d2460fb3f	btrfs-progs: check: fix indentation of --clear-ino-cache option Help string for "--clear-ino-cache" option is not following the indentation of other help strings: repair options: --init-csum-tree create a new CRC tree --init-extent-tree create a new extent tree --clear-space-cache v1\|v2 clear space cache for v1 or v2 --clear-ino-cache clear ino cache leftover items The problem is caused by the usage of tab instead of space. Signed-off-by: Qu Wenruo <wqu@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2021-10-06 16:49:00 +02:00
Qu Wenruo	60651ad9da	btrfs-progs: introduce OPEN_CTREE_ALLOW_TRANSID_MISMATCH flag [BUG] There is a report that, btrfstune can even work while the fs has transid mismatch problems. $ btrfstune -f -u /dev/sdb1 Current fsid: b2b5ae8d-4c49-45f0-b42e-46fe7dcfcb07 New fsid: b2b5ae8d-4c49-45f0-b42e-46fe7dcfcb07 Set superblock flag CHANGING_FSID Change fsid in extents parent transid verify failed on 792854528 wanted 20103 found 20091 parent transid verify failed on 792854528 wanted 20103 found 20091 parent transid verify failed on 792854528 wanted 20103 found 20091 Ignoring transid failure parent transid verify failed on 792870912 wanted 20103 found 20091 parent transid verify failed on 792870912 wanted 20103 found 20091 parent transid verify failed on 792870912 wanted 20103 found 20091 Ignoring transid failure parent transid verify failed on 792887296 wanted 20103 found 20091 parent transid verify failed on 792887296 wanted 20103 found 20091 parent transid verify failed on 792887296 wanted 20103 found 20091 Ignoring transid failure ERROR: child eb corrupted: parent bytenr=38010880 item=69 parent level=1 child level=1 ERROR: failed to change UUID of metadata: -5 ERROR: btrfstune failed This leaves a corrupted fs even more corrupted, and due to the extra CHANGING_FSID flag, btrfs check will not even try to run on it: Opening filesystem to check... ERROR: Filesystem UUID change in progress ERROR: cannot open file system [CAUSE] Unlike kernel, btrfs-progs has a less strict check on transid mismatch. In read_tree_block() we will fall back to use the tree block even its transid mismatch if we can't find any better copy. However not all commands in btrfs-progs needs this feature, only btrfs-check (which may fix the problem) and btrfs-restore (it just tries to ignore any problems) really utilize this feature. [FIX] Introduce a new open ctree flag, OPEN_CTREE_ALLOW_TRANSID_MISMATCH, to be explicit about whether we really want to ignore transid error. Currently only btrfs-check and btrfs-restore will utilize this new flag. Also add btrfs-image to allow opening such fs with transid error. Link: https://www.reddit.com/r/btrfs/comments/pivpqk/failure_during_btrfstune_u/ Signed-off-by: Qu Wenruo <wqu@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2021-09-20 12:17:29 +02:00
David Sterba	400b66689e	btrfs-progs: move btrfsck.h to check/ This is part of the checker and unfortunately also the public header, so we can only copy it to the right directory. Signed-off-by: David Sterba <dsterba@suse.com>	2021-09-07 19:49:33 +02:00
Qu Wenruo	1f8dfe681f	btrfs-progs: use btrfs_key for btrfs_check_node() and btrfs_check_leaf() In kernel space we hardly use btrfs_disk_key, unless for very lowlevel code. There is no need to intentionally use btrfs_disk_key in btrfs-progs either. Signed-off-by: Qu Wenruo <wqu@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2021-09-07 13:58:44 +02:00
Qu Wenruo	325dba6432	btrfs-progs: check: output proper csum values for --check-data-csum [BUG] When running "btrfs check --check-data-csum" on fs with corrupted data, the error message almost makes no sense: $ btrfs check --check-data-csum /dev/test/test Opening filesystem to check... Checking filesystem on /dev/test/test UUID: c31afe0a-55bc-4e7d-aba0-9dfa9ddf8090 [1/7] checking root items [2/7] checking extents [3/7] checking free space cache [4/7] checking fs roots [5/7] checking csums against data mirror 1 bytenr 13631488 csum 19 expected csum 152 <<< ERROR: errors found in csum tree [6/7] checking root refs [7/7] checking quota groups skipped (not enabled on this FS) found 147456 bytes used, error(s) found total csum bytes: 16 total tree bytes: 131072 total fs tree bytes: 32768 total extent tree bytes: 16384 btree space waste bytes: 124799 file data blocks allocated: 16384 referenced 16384 [CAUSE] We're just outputting the first byte and in decimal, which is completely different from what we did in kernel space, nor what we did for metadata csum mismatch. [FIX] Use btrfs_format_csum() for btrfs-check to output csum. Now the result looks much better: [5/7] checking csums against data mirror 1 bytenr 13631488 csum 0x13fec125 expected csum 0x98757625 Signed-off-by: Qu Wenruo <wqu@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2021-08-26 14:27:40 +02:00
Qu Wenruo	773afad3e6	btrfs-progs: slightly enhance btrfs_format_csum() - Change it void The old one always return csum_size. - Use snprintf() Signed-off-by: Qu Wenruo <wqu@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2021-08-26 14:27:01 +02:00
Josef Bacik	e64af00bd1	btrfs-progs: check: detect and fix problems with super_bytes_used We do not detect problems with our bytes_used counter in the super block. Thankfully the same method to fix block groups is used to re-set the value in the super block, so simply add some extra code to validate the bytes_used field and then piggy back on the repair code for block groups. Reviewed-by: Qu Wenruo <wqu@suse.com> Signed-off-by: Josef Bacik <josef@toxicpanda.com> Signed-off-by: David Sterba <dsterba@suse.com>	2021-08-25 15:39:48 +02:00
Josef Bacik	04520da77b	btrfs-progs: check: do not infinite loop on corrupt keys with lowmem mode By enabling the lowmem checks properly I uncovered the case where test fsck/007 will infinite loop at the detection stage. This is because when checking the inode item we will just btrfs_next_item(), and because we ignore check tree block failures at read time we don't get an -EIO from btrfs_next_leaf. Generally what check usually does is validate the leaves/nodes as we hit them, but in this case we're not doing that. Fix this by checking the leaf if we move to the next one and if it fails bail. This allows us to pass the fsck/007 test with lowmem. Signed-off-by: Josef Bacik <josef@toxicpanda.com> Signed-off-by: David Sterba <dsterba@suse.com>	2021-08-25 15:39:48 +02:00
Josef Bacik	4a1863d638	btrfs-progs: check: do not double add unaligned extent records The repair cycle in the main check will drop all of our cache and loop through again to make sure everything is still good to go. Unfortunately we record our unaligned extent records on a per-root list so they can be retrieved when we're checking the fs roots. This isn't straightforward to clean up, so instead simply check our current list of unaligned extent records when we are adding a new one to make sure we're not duplicating our efforts. This makes us able to pass fsck/001 with my super bytes_used fix applied. Signed-off-by: Josef Bacik <josef@toxicpanda.com> Signed-off-by: David Sterba <dsterba@suse.com>	2021-08-25 15:38:54 +02:00
Josef Bacik	a4190da45e	btrfs-progs: check btrfs_super_used in lowmem check We can already fix this problem with the block accounting code, we just need to keep track of how much we should have used on the file system, and then check it against the bytes_super. The repair just piggy backs on the block group used repair. Signed-off-by: Josef Bacik <josef@toxicpanda.com> Signed-off-by: David Sterba <dsterba@suse.com>	2021-08-25 15:38:54 +02:00
Josef Bacik	c3521f8a57	btrfs-progs: propagate extent item errors in lowmem mode Test 044 was failing with lowmem because it was not bubbling up the error to the user. This is because we try to allow repair the opportunity to clear the error, however if repair isn't set we simply do not add the temporary error to the main error return variable. Fix this by adding the tmp_err to err before moving on to the next item. Reviewed-by: Qu Wenruo <wqu@suse.com> Signed-off-by: Josef Bacik <josef@toxicpanda.com> Signed-off-by: David Sterba <dsterba@suse.com>	2021-08-25 15:38:54 +02:00
Josef Bacik	477775946d	btrfs-progs: propagate fs root errors in lowmem mode We have a check that will return an error only if ret < 0, but we return the lowmem specific errors which are all > 0. Fix this by simply checking if (ret). This allows test 010 to pass with lowmem properly. Reviewed-by: Qu Wenruo <wqu@suse.com> Signed-off-by: Josef Bacik <josef@toxicpanda.com> Signed-off-by: David Sterba <dsterba@suse.com>	2021-08-25 15:38:54 +02:00
Josef Bacik	a832d36b59	btrfs-progs: check: detect and fix invalid used for block groups The lowmem mode validates the used field of the block group item, but the normal mode does not. Fix this by keeping a running tally of what we think the used value for the block group should be, and then if it mismatches report an error and fix the problem if we have repair set. We have to keep track of pending extents because we process leaves as we see them, so it could be much later in the process that we find the block group item to associate the extents with. Reviewed-by: Qu Wenruo <wqu@suse.com> Signed-off-by: Josef Bacik <josef@toxicpanda.com> Signed-off-by: David Sterba <dsterba@suse.com>	2021-08-25 15:38:54 +02:00
Qu Wenruo	9df3619a9d	btrfs-progs: require full nodesize alignement for subpage support For the incoming extra page size support for subpage (sectorsize < PAGE_SIZE) cases, the support for metadata will be a critical point. Currently for subpage support, we require 64K page size, so that no matter whatever the nodesize is, it will be contained inside one page. And we will reject any tree block which crosses page boundary. But for other page size, especially 16K page size, we must support nodesize differently. For nodesize < PAGE_SIZE, we will have the same requirement (tree blocks can't cross page boundary). While for nodesize >= PAGE_SIZE, we will require the tree blocks to be page aligned. To support such feature, we will make btrfs-check to reports more subpage related warnings for metadata. This patch will report any tree block which is not nodesize aligned as a warning. Existing mkfs/convert has already make sure all new tree blocks are nodesize aligned, this is just for older converted filesystems. Signed-off-by: Qu Wenruo <wqu@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2021-08-20 14:45:58 +02:00
Qu Wenruo	478e98cd01	btrfs-progs: check/original: detect directory inode with nlinks >= 2 Linux VFS doesn't allow directory to have hard links, thus for btrfs on-disk directory inode items, their nlinks should never go beyond 1. Lowmem mode already has the check and will report it without problem. Only original mode needs this update. Reported-by: Pepperpoint <pepperpoint@mb.ardentcoding.com> Link: https://lore.kernel.org/linux-btrfs/162648632340.7.1932907459648384384.10178178@mb.ardentcoding.com/ Reviewed-by: Su Yue <l@damenly.su> Signed-off-by: Qu Wenruo <wqu@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2021-07-30 15:50:48 +02:00
Qu Wenruo	07ecf878c1	btrfs-progs: check: batch v1 space cache inodes when clearing Currently v1 space cache clearing will delete one cache inode just in one transaction, and then start a new transaction to delete the next inode. This is far from efficient and can make the already slow v1 space cache deleting even slower, as large fs has tons of cache inodes to delete. This patch will speed up the process by batching up to 16 inode deletion into one transaction. A quick benchmark of deleting 702 v1 space cache inodes would look like this: Unpatched: 4.898s Patched: 0.087s Which is obviously a big win. Reported-by: Joshua <joshua@mailmag.net> Link: https://lore.kernel.org/linux-btrfs/0b4cf70fc883e28c97d893a3b2f81b11@mailmag.net/ Signed-off-by: Qu Wenruo <wqu@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2021-07-22 16:26:05 +02:00
Qu Wenruo	0ebd1f47c5	btrfs-progs: check: add the ability to reset btrfs_dev_item::bytes_used There is a report from the mailing list that one user got its filesystem with device item bytes_used mismatch. This problem leaves the device item with some ghost bytes_used, meaning even if we delete all device extents of that device, the bytes_used still won't be 0. This itself is not a big deal, but when the user used up all its unallocated space, write time tree-checker can be triggered and make the fs RO, as the new device::bytes_used can be larger than device::total_bytes. Thus we need to fix the problem in btrfs-check to avoid above write-time tree check warning. This patch will add the ability to reset a device's bytes_used to both original mode and lowmem mode. Signed-off-by: Qu Wenruo <wqu@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2021-06-19 22:07:49 +02:00
Su Yue	4c282411eb	btrfs-progs: check: continue to check space cache if sb cache_generation is 0 User reported that test fsck-tests/037-freespacetree-repair fails: # TEST=037\* ./fsck-tests.sh [TEST/fsck] 037-freespacetree-repair btrfs check should have detected corruption test failed for case 037-freespacetree-repair The test tries to corrupt FST, call btrfs check readonly then repair FST using btrfs check. Above case failed at the second readonly check step. Test log said "cache and super generation don't match, space cache will be invalidated" which is printed by validate_free_space_cache(). If cache_generation of the superblock is not -1ULL, validate_free_space_cache() requires that cache_generation must equal to the superblock's generation. Otherwise, it skips the check of space cache(v1, v2) like the above case where the sb cache_generation is 0. Since kernel commit 948462294577 ("btrfs: keep sb cache_generation consistent with space_cache"), sb cache_generation will be set to be 0 once space cache v1 is disabled (nospace_cache/space_cache=v2). But progs check was forgotten to be added the 0 case support. Fix it by adding the condition if sb cache_generation is 0 in validate_free_space_cache() as the 0 case is valid now since the kernel commit mentioned above. Issue: #338 Signed-off-by: Su Yue <l@damenly.su> Reviewed-by: Boris Burkov <boris@bur.io> Signed-off-by: David Sterba <dsterba@suse.com>	2021-05-12 20:44:12 +02:00
David Sterba	c19ac510a7	btrfs-progs: move repair.[ch] to common/ Move the file to common as it's used by several parts, while still keeping the name 'repair' although the only thing it does is adding a corrupted extent. Signed-off-by: David Sterba <dsterba@suse.com>	2021-05-06 16:41:47 +02:00
David Sterba	cfbcfaa4e4	btrfs-progs: mkfs: move btrfs_make_root_dir from utils.c The helper is used in several tools but logically belongs to mkfs, so put it to the common section. Signed-off-by: David Sterba <dsterba@suse.com>	2021-05-06 16:41:47 +02:00
David Sterba	7fa07e2abb	btrfs-progs: split open/close helpers from utils.c There's a group of functions that are related to opening filesystem in various modes, this can be moved to a separate file. Signed-off-by: David Sterba <dsterba@suse.com>	2021-05-06 16:41:47 +02:00
Johannes Thumshirn	94b60b67a9	btrfs-progs: pass in fs_info to btrfs_csum_data For passing authentication keys to the checksumming functions we need a container for the key. Pass in a btrfs_fs_info to btrfs_csum_data() so we can use the fs_info as a container for the authentication key. Note this is not always possible for all callers of btrfs_csum_data() so we're just passing in NULL for now Functions calling btrfs_csum_data() with a NULL fs_info argument are currently not supported in the context of an authenticated file system. Signed-off-by: Johannes Thumshirn <jthumshirn@suse.de> Signed-off-by: David Sterba <dsterba@suse.com>	2021-03-24 22:20:19 +01:00
David Sterba	6bb6d1215d	btrfs-progs: factor open_ctree parameters to a structure Extending open_ctree with more parameters would be difficult, we'll need to add more so factor out the parameters to a structure for easier extension. Signed-off-by: David Sterba <dsterba@suse.com>	2021-03-24 22:20:19 +01:00
Jaak Ristioja	2f9736d051	btrfs-progs: check: fix typos in code comment and 1 typo in warning Pull-request: #356 Signed-off-by: Jaak Ristioja <jaak@ristioja.ee> Signed-off-by: David Sterba <dsterba@suse.com>	2021-03-23 17:04:22 +01:00
Qu Wenruo	33d84be030	btrfs-progs: fix false alert on tree block crossing 64K page boundary [BUG] When btrfs-check is executed on even newly created fs, it can report tree blocks crossing 64K page boundary like this: Opening filesystem to check... Checking filesystem on /dev/test/test UUID: 80d734c8-dcbc-411b-9623-a10bd9e7767f [1/7] checking root items [2/7] checking extents WARNING: tree block [30523392, 30539776) crosses 64K page boudnary, may cause problem for 64K page system [3/7] checking free space cache [4/7] checking fs roots [5/7] checking only csums items (without verifying data) [6/7] checking root refs [7/7] checking quota groups skipped (not enabled on this FS) found 131072 bytes used, no error found total csum bytes: 0 total tree bytes: 131072 total fs tree bytes: 32768 total extent tree bytes: 16384 btree space waste bytes: 125199 file data blocks allocated: 0 referenced 0 [CAUSE] Tree block [30523392, 30539776) is at the last 16K slot of page. As 30523392 % 65536 = 49152, and 30539776 % 65536 = 0. The cross boundary check is using exclusive end, which causes false alerts. [FIX] Use inclusive end to do the cross 64K boundary check. Reported-by: Wang Yugui <wangyugui@e16-tech.com> Issue: #352 Issue: #354 Fixes: `fc38ae7f48` ("btrfs-progs: check: detect and warn about tree blocks crossing 64K page boundary") Signed-off-by: Qu Wenruo <wqu@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2021-03-15 15:40:48 +01:00
Qu Wenruo	fc38ae7f48	btrfs-progs: check: detect and warn about tree blocks crossing 64K page boundary For the incoming subpage support, there is a new requirement for tree blocks. Tree blocks should not cross 64K page boundary. For current btrfs-progs and kernel, there shouldn't be any causes to create such tree blocks. But still, we want to detect such tree blocks in the wild before subpage support fully lands in upstream. This patch will add such check for both lowmem and original mode. Currently it's just a warning, since there aren't many users using 64K page size yet. Signed-off-by: Qu Wenruo <wqu@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2021-02-19 15:24:42 +01:00
Qu Wenruo	501dc44c7f	btrfs-progs: check: only warn if clearing v1 cache and v2 found There are cases where v1 free space cache is still left while user has already enabled v2 cache. In that case, we still want to force v1 space cache cleanup in btrfs-check. This patch will only warn and not exit if v2 is detected while the user asked to clear v1. Signed-off-by: Qu Wenruo <wqu@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2021-01-18 17:49:23 +01:00

1 2 3 4 5 ...

377 Commits