btrfs-progs

mirror of https://github.com/kdave/btrfs-progs synced 2025-05-05 01:17:56 +00:00

Author	SHA1	Message	Date
Qu Wenruo	f8bc08cb75	btrfs-progs: check: move csum tree population into mode-common.[ch] This part has no mode specific operations, just move them into mode-common.[ch]. Signed-off-by: Qu Wenruo <wqu@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2022-02-01 18:41:13 +01:00
Sidong Yang	7f94ccb20a	btrfs-progs: check: change commit condition in fixup_extent_refs() This patch fixes potential bugs in fixup_extent_refs(). If btrfs_start_transaction() fails in some way and returns error ptr, It goes to out logic. But old code checkes whether it is null and it calls commit. This patch solves the problem with make that it calls only if ret is no error. Issue: #409 Reviewed-by: Nikolay Borisov <nborisov@suse.com> Signed-off-by: Sidong Yang <realwakka@gmail.com> Signed-off-by: David Sterba <dsterba@suse.com>	2021-11-30 19:16:22 +01:00
Josef Bacik	e68e73318f	btrfs-progs: check: fill csum root from all extent roots We may have multiple extent roots, so cycle through all of the extent roots and populate the csum tree based on the content of every extent root we have in the file system. Signed-off-by: Josef Bacik <josef@toxicpanda.com> Signed-off-by: David Sterba <dsterba@suse.com>	2021-11-30 18:59:34 +01:00
Josef Bacik	bb575e3818	btrfs-progs: check: check all of the csum roots Use the global roots tree to find all of the csum roots in the system and check all of them as appropriate. Signed-off-by: Josef Bacik <josef@toxicpanda.com> Signed-off-by: David Sterba <dsterba@suse.com>	2021-11-30 18:59:20 +01:00
Josef Bacik	d26c0b4414	btrfs-progs: check: check the global roots for uptodate root nodes Instead of checking the csum and extent tree individually, walk through the global roots and validate them all. This will work properly if we have extent tree v1 or extent tree v2. Signed-off-by: Josef Bacik <josef@toxicpanda.com> Signed-off-by: David Sterba <dsterba@suse.com>	2021-11-30 18:57:45 +01:00
Josef Bacik	bf0ce5cb86	btrfs-progs: check: make reinit work per found root item Instead of looking for the first extent root or csum root in memory, scan through the tree root and re-init any root items that match the given objectid. This will allow reinit to work with both extent tree v1 and extent tree v2 global roots when using the --reinit option. Signed-off-by: Josef Bacik <josef@toxicpanda.com> Signed-off-by: David Sterba <dsterba@suse.com>	2021-11-30 18:57:43 +01:00
Josef Bacik	db2ab47823	btrfs-progs: stop accessing ->extent_root directly When we switch to multiple global trees we'll need to access the appropriate extent root depending on the block group or possibly root. To handle this, use a helper in most places and then the actual root in places where it is required. We will whittle down the direct accessors with future patches, but this does the bulk of the preparatory work. Signed-off-by: Josef Bacik <josef@toxicpanda.com> Signed-off-by: David Sterba <dsterba@suse.com>	2021-11-30 18:56:54 +01:00
Josef Bacik	639b1fc2e7	btrfs-progs: stop accessing ->csum_root directly With extent tree v2 we will have per-block group checksums, so add a helper to access the csum root and rename the fs_info csum_root to _csum_root to catch all the places that are accessing it directly. Convert everybody to use the helper except for internal things. Signed-off-by: Josef Bacik <josef@toxicpanda.com> Signed-off-by: David Sterba <dsterba@suse.com>	2021-11-22 21:45:37 +01:00
Josef Bacik	8742a38d5f	btrfs-progs: check: stop passing csum root around We pass the csum root from way high in the call chain in check down to where we actually need it. However we can just get it from the fs_info in these places, so clean up the functions to skip passing around the csum root needlessly. Reviewed-by: Qu Wenruo <wqu@suse.com> Signed-off-by: Josef Bacik <josef@toxicpanda.com> Signed-off-by: David Sterba <dsterba@suse.com>	2021-11-22 21:45:37 +01:00
Josef Bacik	08b63c0fc5	btrfs-progs: stop passing root to csum related functions We are going to need to start looking up the csum root based on the bytenr with extent tree v2. To that end stop passing the root to the csum related functions so that can be done in the helper functions themselves. There's an unrelated deletion of a function prototype that no longer exists. Reviewed-by: Qu Wenruo <wqu@suse.com> Signed-off-by: Josef Bacik <josef@toxicpanda.com> Signed-off-by: David Sterba <dsterba@suse.com>	2021-11-22 21:45:37 +01:00
Mark Harmstone	945cfa1ae3	btrfs-progs: check: add check for too many csum entries My tool ntfs2btrfs has been creating btrfs volumes in a way that the kernel doesn't like, but which isn't picked up by btrfs check - see maharmstone/ntfs2btrfs#23 for the details, including a backtrace. This patch adds a check for when a csum item contains too many entries - effectively it ensures that there's always at least sizeof(struct btrfs_item) bytes free in the tree, otherwise btrfs_del_csums can throw an error. max_entries is the value of the __MAX_CSUM_ITEMS macro in fs/btrfs/file-item.c. Pull-request: #401 Signed-off-by: Mark Harmstone <mark@harmstone.com> Signed-off-by: David Sterba <dsterba@suse.com>	2021-11-18 10:17:21 +01:00
Qu Wenruo	c4ff87c3d1	btrfs-progs: cache csum_size and csum_type in btrfs_fs_info Just like kernel commit 22b6331d9617 ("btrfs: store precalculated csum_size in fs_info"), we can cache csum_size and csum_type in btrfs_fs_info. Furthermore, there is already a 32 bits hole in btrfs_fs_info, and we can fit csum_type and csum_size into the hole without increase the size of btrfs_fs_info. Signed-off-by: Qu Wenruo <wqu@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2021-11-05 12:50:03 +01:00
Qu Wenruo	5bee5c99bf	btrfs-progs: fix printf formats on 32bit x86 When compiling btrfs-progs on 32bit x86 using GCC 11.1.0, there are several warnings: In file included from ./common/utils.h:30, from check/main.c:36: check/main.c: In function 'run_next_block': ./common/messages.h:42:31: warning: format '%lu' expects argument of type 'long unsigned int', but argument 4 has type 'u32' {aka 'unsigned int'} [-Wformat=] 42 \| __btrfs_error((fmt), ##__VA_ARGS__); \ \| ^~~~~ check/main.c:6496:33: note: in expansion of macro 'error' 6496 \| error( \| ^~~~~ In file included from ./common/utils.h:30, from kernel-shared/volumes.c:32: kernel-shared/volumes.c: In function 'btrfs_check_chunk_valid': ./common/messages.h:42:31: warning: format '%lu' expects argument of type 'long unsigned int', but argument 4 has type 'u32' {aka 'unsigned int'} [-Wformat=] 42 \| __btrfs_error((fmt), ##__VA_ARGS__); \ \| ^~~~~ kernel-shared/volumes.c:2052:17: note: in expansion of macro 'error' 2052 \| error("invalid chunk item size, have %u expect [%zu, %lu)", \| ^~~~~ image/main.c: In function 'search_for_chunk_blocks': ./common/messages.h:42:31: warning: format '%lu' expects argument of type 'long unsigned int', but argument 3 has type 'size_t' {aka 'unsigned int'} [-Wformat=] 42 \| __btrfs_error((fmt), ##__VA_ARGS__); \ \| ^~~~~ image/main.c:2122:33: note: in expansion of macro 'error' 2122 \| error( \| ^~~~~ There are two types of problems: - __BTRFS_LEAF_DATA_SIZE() This macro has no type definition, making it behaves differently on different arches. Fix this by following kernel to use inline function to make its return value fixed to u32. - size_t related output For x86_64 %lu is OK but not for x86. Fix this by using %zu. Signed-off-by: Qu Wenruo <wqu@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2021-11-05 12:50:03 +01:00
David Sterba	38356d456b	btrfs-progs: libbtrfs: drop radix-tree.h from exported headers The header is only included from ctree.h but not actually used, we can drop it from the exported files. Signed-off-by: David Sterba <dsterba@suse.com>	2021-10-08 20:46:35 +02:00
Nikolay Borisov	7c58b09548	btrfs-progs: remove root argument from btrfs_fixup_low_keys It's not used, so just remove it. Reviewed-by: Qu Wenruo <wqu@suse.com> Signed-off-by: Nikolay Borisov <nborisov@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2021-10-08 20:46:34 +02:00
Qu Wenruo	9d2460fb3f	btrfs-progs: check: fix indentation of --clear-ino-cache option Help string for "--clear-ino-cache" option is not following the indentation of other help strings: repair options: --init-csum-tree create a new CRC tree --init-extent-tree create a new extent tree --clear-space-cache v1\|v2 clear space cache for v1 or v2 --clear-ino-cache clear ino cache leftover items The problem is caused by the usage of tab instead of space. Signed-off-by: Qu Wenruo <wqu@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2021-10-06 16:49:00 +02:00
Qu Wenruo	60651ad9da	btrfs-progs: introduce OPEN_CTREE_ALLOW_TRANSID_MISMATCH flag [BUG] There is a report that, btrfstune can even work while the fs has transid mismatch problems. $ btrfstune -f -u /dev/sdb1 Current fsid: b2b5ae8d-4c49-45f0-b42e-46fe7dcfcb07 New fsid: b2b5ae8d-4c49-45f0-b42e-46fe7dcfcb07 Set superblock flag CHANGING_FSID Change fsid in extents parent transid verify failed on 792854528 wanted 20103 found 20091 parent transid verify failed on 792854528 wanted 20103 found 20091 parent transid verify failed on 792854528 wanted 20103 found 20091 Ignoring transid failure parent transid verify failed on 792870912 wanted 20103 found 20091 parent transid verify failed on 792870912 wanted 20103 found 20091 parent transid verify failed on 792870912 wanted 20103 found 20091 Ignoring transid failure parent transid verify failed on 792887296 wanted 20103 found 20091 parent transid verify failed on 792887296 wanted 20103 found 20091 parent transid verify failed on 792887296 wanted 20103 found 20091 Ignoring transid failure ERROR: child eb corrupted: parent bytenr=38010880 item=69 parent level=1 child level=1 ERROR: failed to change UUID of metadata: -5 ERROR: btrfstune failed This leaves a corrupted fs even more corrupted, and due to the extra CHANGING_FSID flag, btrfs check will not even try to run on it: Opening filesystem to check... ERROR: Filesystem UUID change in progress ERROR: cannot open file system [CAUSE] Unlike kernel, btrfs-progs has a less strict check on transid mismatch. In read_tree_block() we will fall back to use the tree block even its transid mismatch if we can't find any better copy. However not all commands in btrfs-progs needs this feature, only btrfs-check (which may fix the problem) and btrfs-restore (it just tries to ignore any problems) really utilize this feature. [FIX] Introduce a new open ctree flag, OPEN_CTREE_ALLOW_TRANSID_MISMATCH, to be explicit about whether we really want to ignore transid error. Currently only btrfs-check and btrfs-restore will utilize this new flag. Also add btrfs-image to allow opening such fs with transid error. Link: https://www.reddit.com/r/btrfs/comments/pivpqk/failure_during_btrfstune_u/ Signed-off-by: Qu Wenruo <wqu@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2021-09-20 12:17:29 +02:00
Qu Wenruo	1f8dfe681f	btrfs-progs: use btrfs_key for btrfs_check_node() and btrfs_check_leaf() In kernel space we hardly use btrfs_disk_key, unless for very lowlevel code. There is no need to intentionally use btrfs_disk_key in btrfs-progs either. Signed-off-by: Qu Wenruo <wqu@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2021-09-07 13:58:44 +02:00
Qu Wenruo	325dba6432	btrfs-progs: check: output proper csum values for --check-data-csum [BUG] When running "btrfs check --check-data-csum" on fs with corrupted data, the error message almost makes no sense: $ btrfs check --check-data-csum /dev/test/test Opening filesystem to check... Checking filesystem on /dev/test/test UUID: c31afe0a-55bc-4e7d-aba0-9dfa9ddf8090 [1/7] checking root items [2/7] checking extents [3/7] checking free space cache [4/7] checking fs roots [5/7] checking csums against data mirror 1 bytenr 13631488 csum 19 expected csum 152 <<< ERROR: errors found in csum tree [6/7] checking root refs [7/7] checking quota groups skipped (not enabled on this FS) found 147456 bytes used, error(s) found total csum bytes: 16 total tree bytes: 131072 total fs tree bytes: 32768 total extent tree bytes: 16384 btree space waste bytes: 124799 file data blocks allocated: 16384 referenced 16384 [CAUSE] We're just outputting the first byte and in decimal, which is completely different from what we did in kernel space, nor what we did for metadata csum mismatch. [FIX] Use btrfs_format_csum() for btrfs-check to output csum. Now the result looks much better: [5/7] checking csums against data mirror 1 bytenr 13631488 csum 0x13fec125 expected csum 0x98757625 Signed-off-by: Qu Wenruo <wqu@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2021-08-26 14:27:40 +02:00
Qu Wenruo	773afad3e6	btrfs-progs: slightly enhance btrfs_format_csum() - Change it void The old one always return csum_size. - Use snprintf() Signed-off-by: Qu Wenruo <wqu@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2021-08-26 14:27:01 +02:00
Josef Bacik	e64af00bd1	btrfs-progs: check: detect and fix problems with super_bytes_used We do not detect problems with our bytes_used counter in the super block. Thankfully the same method to fix block groups is used to re-set the value in the super block, so simply add some extra code to validate the bytes_used field and then piggy back on the repair code for block groups. Reviewed-by: Qu Wenruo <wqu@suse.com> Signed-off-by: Josef Bacik <josef@toxicpanda.com> Signed-off-by: David Sterba <dsterba@suse.com>	2021-08-25 15:39:48 +02:00
Josef Bacik	4a1863d638	btrfs-progs: check: do not double add unaligned extent records The repair cycle in the main check will drop all of our cache and loop through again to make sure everything is still good to go. Unfortunately we record our unaligned extent records on a per-root list so they can be retrieved when we're checking the fs roots. This isn't straightforward to clean up, so instead simply check our current list of unaligned extent records when we are adding a new one to make sure we're not duplicating our efforts. This makes us able to pass fsck/001 with my super bytes_used fix applied. Signed-off-by: Josef Bacik <josef@toxicpanda.com> Signed-off-by: David Sterba <dsterba@suse.com>	2021-08-25 15:38:54 +02:00
Josef Bacik	a832d36b59	btrfs-progs: check: detect and fix invalid used for block groups The lowmem mode validates the used field of the block group item, but the normal mode does not. Fix this by keeping a running tally of what we think the used value for the block group should be, and then if it mismatches report an error and fix the problem if we have repair set. We have to keep track of pending extents because we process leaves as we see them, so it could be much later in the process that we find the block group item to associate the extents with. Reviewed-by: Qu Wenruo <wqu@suse.com> Signed-off-by: Josef Bacik <josef@toxicpanda.com> Signed-off-by: David Sterba <dsterba@suse.com>	2021-08-25 15:38:54 +02:00
Qu Wenruo	9df3619a9d	btrfs-progs: require full nodesize alignement for subpage support For the incoming extra page size support for subpage (sectorsize < PAGE_SIZE) cases, the support for metadata will be a critical point. Currently for subpage support, we require 64K page size, so that no matter whatever the nodesize is, it will be contained inside one page. And we will reject any tree block which crosses page boundary. But for other page size, especially 16K page size, we must support nodesize differently. For nodesize < PAGE_SIZE, we will have the same requirement (tree blocks can't cross page boundary). While for nodesize >= PAGE_SIZE, we will require the tree blocks to be page aligned. To support such feature, we will make btrfs-check to reports more subpage related warnings for metadata. This patch will report any tree block which is not nodesize aligned as a warning. Existing mkfs/convert has already make sure all new tree blocks are nodesize aligned, this is just for older converted filesystems. Signed-off-by: Qu Wenruo <wqu@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2021-08-20 14:45:58 +02:00
Qu Wenruo	478e98cd01	btrfs-progs: check/original: detect directory inode with nlinks >= 2 Linux VFS doesn't allow directory to have hard links, thus for btrfs on-disk directory inode items, their nlinks should never go beyond 1. Lowmem mode already has the check and will report it without problem. Only original mode needs this update. Reported-by: Pepperpoint <pepperpoint@mb.ardentcoding.com> Link: https://lore.kernel.org/linux-btrfs/162648632340.7.1932907459648384384.10178178@mb.ardentcoding.com/ Reviewed-by: Su Yue <l@damenly.su> Signed-off-by: Qu Wenruo <wqu@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2021-07-30 15:50:48 +02:00
Qu Wenruo	07ecf878c1	btrfs-progs: check: batch v1 space cache inodes when clearing Currently v1 space cache clearing will delete one cache inode just in one transaction, and then start a new transaction to delete the next inode. This is far from efficient and can make the already slow v1 space cache deleting even slower, as large fs has tons of cache inodes to delete. This patch will speed up the process by batching up to 16 inode deletion into one transaction. A quick benchmark of deleting 702 v1 space cache inodes would look like this: Unpatched: 4.898s Patched: 0.087s Which is obviously a big win. Reported-by: Joshua <joshua@mailmag.net> Link: https://lore.kernel.org/linux-btrfs/0b4cf70fc883e28c97d893a3b2f81b11@mailmag.net/ Signed-off-by: Qu Wenruo <wqu@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2021-07-22 16:26:05 +02:00
Qu Wenruo	0ebd1f47c5	btrfs-progs: check: add the ability to reset btrfs_dev_item::bytes_used There is a report from the mailing list that one user got its filesystem with device item bytes_used mismatch. This problem leaves the device item with some ghost bytes_used, meaning even if we delete all device extents of that device, the bytes_used still won't be 0. This itself is not a big deal, but when the user used up all its unallocated space, write time tree-checker can be triggered and make the fs RO, as the new device::bytes_used can be larger than device::total_bytes. Thus we need to fix the problem in btrfs-check to avoid above write-time tree check warning. This patch will add the ability to reset a device's bytes_used to both original mode and lowmem mode. Signed-off-by: Qu Wenruo <wqu@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2021-06-19 22:07:49 +02:00
Su Yue	4c282411eb	btrfs-progs: check: continue to check space cache if sb cache_generation is 0 User reported that test fsck-tests/037-freespacetree-repair fails: # TEST=037\* ./fsck-tests.sh [TEST/fsck] 037-freespacetree-repair btrfs check should have detected corruption test failed for case 037-freespacetree-repair The test tries to corrupt FST, call btrfs check readonly then repair FST using btrfs check. Above case failed at the second readonly check step. Test log said "cache and super generation don't match, space cache will be invalidated" which is printed by validate_free_space_cache(). If cache_generation of the superblock is not -1ULL, validate_free_space_cache() requires that cache_generation must equal to the superblock's generation. Otherwise, it skips the check of space cache(v1, v2) like the above case where the sb cache_generation is 0. Since kernel commit 948462294577 ("btrfs: keep sb cache_generation consistent with space_cache"), sb cache_generation will be set to be 0 once space cache v1 is disabled (nospace_cache/space_cache=v2). But progs check was forgotten to be added the 0 case support. Fix it by adding the condition if sb cache_generation is 0 in validate_free_space_cache() as the 0 case is valid now since the kernel commit mentioned above. Issue: #338 Signed-off-by: Su Yue <l@damenly.su> Reviewed-by: Boris Burkov <boris@bur.io> Signed-off-by: David Sterba <dsterba@suse.com>	2021-05-12 20:44:12 +02:00
David Sterba	c19ac510a7	btrfs-progs: move repair.[ch] to common/ Move the file to common as it's used by several parts, while still keeping the name 'repair' although the only thing it does is adding a corrupted extent. Signed-off-by: David Sterba <dsterba@suse.com>	2021-05-06 16:41:47 +02:00
David Sterba	cfbcfaa4e4	btrfs-progs: mkfs: move btrfs_make_root_dir from utils.c The helper is used in several tools but logically belongs to mkfs, so put it to the common section. Signed-off-by: David Sterba <dsterba@suse.com>	2021-05-06 16:41:47 +02:00
David Sterba	7fa07e2abb	btrfs-progs: split open/close helpers from utils.c There's a group of functions that are related to opening filesystem in various modes, this can be moved to a separate file. Signed-off-by: David Sterba <dsterba@suse.com>	2021-05-06 16:41:47 +02:00
Johannes Thumshirn	94b60b67a9	btrfs-progs: pass in fs_info to btrfs_csum_data For passing authentication keys to the checksumming functions we need a container for the key. Pass in a btrfs_fs_info to btrfs_csum_data() so we can use the fs_info as a container for the authentication key. Note this is not always possible for all callers of btrfs_csum_data() so we're just passing in NULL for now Functions calling btrfs_csum_data() with a NULL fs_info argument are currently not supported in the context of an authenticated file system. Signed-off-by: Johannes Thumshirn <jthumshirn@suse.de> Signed-off-by: David Sterba <dsterba@suse.com>	2021-03-24 22:20:19 +01:00
David Sterba	6bb6d1215d	btrfs-progs: factor open_ctree parameters to a structure Extending open_ctree with more parameters would be difficult, we'll need to add more so factor out the parameters to a structure for easier extension. Signed-off-by: David Sterba <dsterba@suse.com>	2021-03-24 22:20:19 +01:00
Qu Wenruo	fc38ae7f48	btrfs-progs: check: detect and warn about tree blocks crossing 64K page boundary For the incoming subpage support, there is a new requirement for tree blocks. Tree blocks should not cross 64K page boundary. For current btrfs-progs and kernel, there shouldn't be any causes to create such tree blocks. But still, we want to detect such tree blocks in the wild before subpage support fully lands in upstream. This patch will add such check for both lowmem and original mode. Currently it's just a warning, since there aren't many users using 64K page size yet. Signed-off-by: Qu Wenruo <wqu@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2021-02-19 15:24:42 +01:00
Qu Wenruo	501dc44c7f	btrfs-progs: check: only warn if clearing v1 cache and v2 found There are cases where v1 free space cache is still left while user has already enabled v2 cache. In that case, we still want to force v1 space cache cleanup in btrfs-check. This patch will only warn and not exit if v2 is detected while the user asked to clear v1. Signed-off-by: Qu Wenruo <wqu@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2021-01-18 17:49:23 +01:00
Nikolay Borisov	04f1977832	btrfs-progs: check: add option to remove ino cache Inode cache feature is going to be removed in kernel 5.11. After this kernel version items left on disk by this feature will take some extra space. Testing showed that the size is actually negligible but for completeness' sake give ability to users to remove such left-overs. This is achieved by iterating every fs root and removing respective items as well as relevant csum extents since the ino cache used the csum tree for csums. Signed-off-by: Nikolay Borisov <nborisov@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2020-12-16 17:08:53 +01:00
Josef Bacik	bc06a75a04	btrfs-progs: only print the parent or ref root for ref mismatches While debugging some corruption, I got confused because it appeared as if we had an invalid parent set on a extent reference, because of this message: tree backref 67014213632 parent 5 root 5 not found in extent tree But it turns out that parent and the root are a union, and we were just printing it out regardless of the type of backref it was. Fix the error message to be consistent with the other mismatch messages, simply print parent or root, depending on the ref type. Signed-off-by: Josef Bacik <josef@toxicpanda.com> Signed-off-by: David Sterba <dsterba@suse.com>	2020-12-16 17:08:53 +01:00
Qu Wenruo	05f25a1460	btrfs-progs: check/original: add ability to repair extent item generation This is pretty much the same as for lowmem mode, it will try to reset the extent item generation using either the tree block generation or current transid. Signed-off-by: Qu Wenruo <wqu@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2020-12-10 17:10:30 +01:00
Qu Wenruo	b858e3b5c2	btrfs-progs: check/original: don't reset extent generation for check_block In check_block(), we unconditionally reset extent_record::generation. This is in fact correct, but this makes original mode fail to detect bad extent item generation. So change to behavior to set the generation if and only if the tree block generation is higher. Signed-off-by: Qu Wenruo <wqu@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2020-12-10 17:10:30 +01:00
Nikolay Borisov	c08983291e	btrfs-progs: check: fix error reporting on root inode If btrfs check detects an error on the root inode of a subvolume it prints: Opening filesystem to check... Checking filesystem on /dev/vdc UUID: 4ac7a216-bf97-4c5f-9899-0f203c20d8af [1/7] checking root items [2/7] checking extents [3/7] checking free space cache [4/7] checking fs roots root 5 root dir 256 error ERROR: errors found in fs roots found 196608 bytes used, error(s) found total csum bytes: 0 total tree bytes: 131072 total fs tree bytes: 32768 total extent tree bytes: 16384 btree space waste bytes: 124376 file data blocks allocated: 65536 referenced 65536 This is not very helpful since there is no specific information about the exact error. This is due to the fact that check_root_dir doesn't set inode_record::errors accordingly. This patch rectifies this and now the output would look like: [1/7] checking root items [2/7] checking extents [3/7] checking free space cache [4/7] checking fs roots root 5 inode 256 errors 2000, link count wrong ERROR: errors found in fs roots found 196608 bytes used, error(s) found total csum bytes: 0 total tree bytes: 131072 total fs tree bytes: 32768 total extent tree bytes: 16384 btree space waste bytes: 124376 file data blocks allocated: 65536 referenced 65536 Reviewed-by: Josef Bacik <josef@toxicpanda.com> Signed-off-by: Nikolay Borisov <nborisov@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2020-08-31 17:16:36 +02:00
Qu Wenruo	0eafacf668	btrfs-progs: check/original: add inode transid detect and repair support The inode transid detect and repair is reusing the existing inode geneartion code. Reviewed-by: Josef Bacik <josef@toxicpanda.com> Signed-off-by: Qu Wenruo <wqu@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2020-08-31 17:09:55 +02:00
Marcos Paulo de Souza	c655b5e4b1	btrfs-progs: make btrfs_lookup_dir_index in parity with kernel code This function exists in kernel side but using the _item suffix, and objectid argument is placed before the name argument. Change the function to reflect the kernel version. Reviewed-by: Josef Bacik <josef@toxicpanda.com> Signed-off-by: Marcos Paulo de Souza <mpdesouza@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2020-08-31 17:09:49 +02:00
David Sterba	0144bcb713	btrfs-progs: move volumes.c to kernel-shared/ Signed-off-by: David Sterba <dsterba@suse.com>	2020-08-31 17:01:06 +02:00
David Sterba	6069bc52a9	btrfs-progs: move transaction.c to kernel-shared/ Signed-off-by: David Sterba <dsterba@suse.com>	2020-08-31 17:01:06 +02:00
David Sterba	abb670f883	btrfs-progs: move ctree.c to kernel-shared/ Signed-off-by: David Sterba <dsterba@suse.com>	2020-08-31 17:01:05 +02:00
David Sterba	772f0da6df	btrfs-progs: move disk-io.c to kernel-shared/ Signed-off-by: David Sterba <dsterba@suse.com>	2020-08-31 17:01:05 +02:00
David Sterba	cf529f36ad	btrfs-progs: move print-tree.c to kernel-shared/ Signed-off-by: David Sterba <dsterba@suse.com>	2020-08-31 17:01:05 +02:00
David Sterba	da90f38ad9	btrfs-progs: move free-space-tree.c to kernel-shared/ Signed-off-by: David Sterba <dsterba@suse.com>	2020-08-31 17:01:04 +02:00
David Sterba	2335cd02b3	btrfs-progs: check: drop unused fs_info Signed-off-by: David Sterba <dsterba@suse.com>	2020-08-31 17:01:03 +02:00
David Sterba	3b86c9fd82	btrfs-progs: check: replace local fs_info by global fs_info in main.c Signed-off-by: David Sterba <dsterba@suse.com>	2020-08-31 17:01:03 +02:00
David Sterba	cfe94be618	btrfs-progs: check: rename global_info Signed-off-by: David Sterba <dsterba@suse.com>	2020-08-31 17:01:03 +02:00
Qu Wenruo	ca4cbfaba5	btrfs-progs: qgroup-verify: allow repair_qgroups to do silent repair Allow repair_qgroups() to do silent repair, so it can acts as offline qgroup rescan. This provides the basis for later mkfs quota support. Signed-off-by: Qu Wenruo <wqu@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2020-05-28 19:59:56 +02:00
Qu Wenruo	f2292a9f17	btrfs-progs: qgroup-verify: move qgroup classification out of report_qgroups The original qgroup-verify integrates qgroup classification into report_qgroups(). This behavior makes silent qgroup repair (or offline rescan) impossible. To repair qgroup, we must call report_qgroups() to trigger bad qgroup classification, which will output error message. This patch moves bad qgroup classification from report_qgroups() to qgroup_verify_all(). Now report_qgroups() is pretty lightweight, only doing basic qgroup difference report thus change it type to void. And since the functionality of qgroup_verify_all() changes, change callers to handle the new return value correctly. Signed-off-by: Qu Wenruo <wqu@suse.com> [ removed some comments ] Signed-off-by: Marcos Paulo de Souza <mpdesouza@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2020-05-28 19:59:56 +02:00
Qu Wenruo	f0a8324d50	btrfs-progs: check: detect checksum item overlap There is a report about checksum item overlap, which makes newer btrfs kernel to reject it due to tree-checker. Now let btrfs-progs have the same ability to detect such problem. Signed-off-by: Qu Wenruo <wqu@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2020-05-25 15:46:11 +02:00
Qu Wenruo	ccad599701	btrfs-progs: rename btrfs_block_group_cache to btrfs_block_group To keep the same naming across kernel and btrfs-progs. Signed-off-by: Qu Wenruo <wqu@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2020-05-11 20:50:00 +02:00
Qu Wenruo	5bc44891c9	btrfs-progs: kill block_group_cache::key This would sync the code between kernel and btrfs-progs, and save at least 1 byte for each btrfs_block_group_cache. Signed-off-by: Qu Wenruo <wqu@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2020-05-11 20:49:50 +02:00
Qu Wenruo	877f512c55	btrfs-progs: sync block group item accessors from kernel Signed-off-by: Qu Wenruo <wqu@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2020-05-11 20:49:46 +02:00
Qu Wenruo	2b6af38f72	btrfs-progs: check: don't exit if maybe_repair_root_item() can't find needed root extent The whole maybe_repair_root_item() and repair_root_items() functions are introduced to handle an ancient bug in v3.17. However in certain extent tree corruption case, such early exit would only exit the whole check process early on, preventing user to know what's really wrong about the fs. So this patch will allow the check to continue, since the ancient bug is no long that common. Signed-off-by: Qu Wenruo <wqu@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2020-05-11 20:44:51 +02:00
Qu Wenruo	2b1be545db	btrfs-progs: remove the duplicated @level parameter for btrfs_bin_search() We can easily get the level from @eb parameter, thus the level is not needed. This is inspired by the work of Marek in U-boot. Cc: Marek Behun <marek.behun@nic.cz> Reviewed-by: Johannes Thumshirn <johannes.thumshirn@wdc.com> Signed-off-by: Qu Wenruo <wqu@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2020-05-04 20:49:01 +02:00
David Sterba	8c85b34420	btrfs-progs: move backref.[ch] to kernel-shared/ The files are very close to kernel versions, for that keep them in the shared directory. Signed-off-by: David Sterba <dsterba@suse.com>	2020-05-04 20:48:35 +02:00
Qu Wenruo	4aa0dcba8c	btrfs-progs: do proper error handling in add_cache_extent() If we have memory allocation failure in add_cache_extent(), it will simply exit with one error message. That's definitely not proper, especially when all but one call sites have handled the error. This patch will return -ENOMEM for add_cache_extent(), and fix the only call site which doesn't handle error from it. Signed-off-by: Qu Wenruo <wqu@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2020-05-04 20:48:35 +02:00
Qu Wenruo	6384f9df92	btrfs-progs: check: sanitize the return value for qgroup error [BUG] btrfs check can return strange return value for shell: [Inferior 1 (process 48641) exited with code 0213] ^^^^ [CAUSE] It's caused by the incorrect handling of qgroup error. qgroup_report_ret can be -117 (-EUCLEAN), using that value with exit() can cause overflow, causing return value not properly recognized. [FIX] Fix it by sanitize the return value to 0 or 1. Reviewed-by: Nikolay Borisov <nborisov@suse.com> Signed-off-by: Qu Wenruo <wqu@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2020-04-01 19:56:42 +02:00
Adam Borowski	ac405aa597	btrfs-progs: check: typo in an error message: "boudnary" Reviewed-by: Marcos Paulo de Souza <mpdesouza@suse.com> Signed-off-by: Adam Borowski <kilobyte@angband.pl> Signed-off-by: David Sterba <dsterba@suse.com>	2020-03-31 18:37:37 +02:00
Qu Wenruo	3c2dbee299	btrfs-progs: check/original: Fix uninitialized memory for newly allocated data_backref [BUG] Valgrind reports the following error for fsck/002 (which only supports original mode): ==97088== Conditional jump or move depends on uninitialised value(s) ==97088== at 0x15BFF6: add_data_backref (main.c:4884) ==97088== by 0x16025C: run_next_block (main.c:6452) ==97088== by 0x165539: deal_root_from_list (main.c:8471) ==97088== by 0x166040: check_chunks_and_extents (main.c:8753) ==97088== by 0x166441: do_check_chunks_and_extents (main.c:8842) ==97088== by 0x169D13: cmd_check (main.c:10324) ==97088== by 0x11CDC6: cmd_execute (commands.h:125) ==97088== by 0x11D712: main (btrfs.c:386) [CAUSE] In alloc_data_backref(), only ref->node is set to 0. While ref->disk_bytenr is not initialized at all. And then in add_data_backref(), if @back is a newly allocated data backref, we use the garbage from back->disk_bytenr to determine if we should reset them. [FIX] Fix it by initialize the whole data_backref structure in alloc_data_backref(). Signed-off-by: Qu Wenruo <wqu@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2020-03-31 18:37:37 +02:00
Qu Wenruo	0a472ef887	btrfs-progs: check/original: Fix uninitialized stack memory access for deal_root_from_list() [BUG] With valgrind, fsck/002 test with original mode would report the following valgrind error: ==90600== Conditional jump or move depends on uninitialised value(s) ==90600== at 0x15C280: pick_next_pending (main.c:4949) ==90600== by 0x15F3CF: run_next_block (main.c:6175) ==90600== by 0x1655CC: deal_root_from_list (main.c:8486) ==90600== by 0x1660C7: check_chunks_and_extents (main.c:8762) ==90600== by 0x166439: do_check_chunks_and_extents (main.c:8842) ==90600== by 0x169D0B: cmd_check (main.c:10324) ==90600== by 0x11CDC6: cmd_execute (commands.h:125) ==90600== by 0x11D712: main (btrfs.c:386) [CAUSE] The problem happens like this: deal_root_from_list(@list is empty) \|- stack @last is not initialized \|- while(!list_empty(list)) {} is skipped \|- run_next_block(&last); \|- pick_next_pending(*last); \|- node_start = last; Since the stack @last is not initialized in deal_root_from_list(), the final node_start = last assignment would just fetch the garbage from stack. [FIX] Fix the problem by initializing @last to 0, as that's exactly what the first while loop did. Signed-off-by: Qu Wenruo <wqu@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2020-03-31 18:37:37 +02:00
Josef Bacik	1173a4ba26	btrfs-progs: fix hole error output in fsck If we don't find holes in our hole rb tree we'll just assume there's a gap from 0 to the length of the file and print that out. But this simply isn't correct, we could have a gap between the last extent and the isize, or 0 and the start of the first extent. Fix the error message to tell us exactly where the hole is. Signed-off-by: Josef Bacik <josef@toxicpanda.com> Signed-off-by: David Sterba <dsterba@suse.com>	2020-03-31 18:37:35 +02:00
Josef Bacik	9123fc6142	btrfs-progs: fix check to catch gaps at the start of the file When writing my test for the i_size patches, I noticed that I was not actually failing without my patches as I should have been. This is because we only check if the inode record extent end is < isize, we don't check if the inode record extent start is > 0. Add this check to make sure we're catching holes that start at the beginning of the file. Signed-off-by: Josef Bacik <josef@toxicpanda.com> Signed-off-by: David Sterba <dsterba@suse.com>	2020-03-31 18:37:35 +02:00
David Sterba	2efe160bc7	btrfs-progs: move name hashing functions to ctree.h and delete hash.h The two functions are trivial and kernel has them in ctree.h, so do the same and remove hash.h. Signed-off-by: David Sterba <dsterba@suse.com>	2020-03-31 18:37:35 +02:00
David Sterba	58bcd4260f	btrfs-progs: move free-space-tree.[ch] to kernel-shared/ The files are very close to kernel versions, for that keep them in the shared directory. Signed-off-by: David Sterba <dsterba@suse.com>	2020-03-31 18:37:34 +02:00
Su Yue	162d891e4a	btrfs-progs: pass @trans to functions working with dirty block groups We are going to touch dirty_bgs in transaction directly, so every call chain should pass @trans to the leaf functions. Reviewed-by: Qu Wenruo <wqu@suse.com> Signed-off-by: Su Yue <Damenly_Su@gmx.com> Signed-off-by: David Sterba <dsterba@suse.com>	2020-03-03 19:58:54 +01:00
Qu Wenruo	b3621da146	btrfs-progs: check/original: Detect invalid extent generation Much like what we have done in lowmem mode, also detect and report invalid extent generation in original mode. Unlike lowmem mode, we have extent_record::generation, which is the higher number of generations in EXTENT_ITEM, EXTENT_DATA or tree block header, so there is no need to check generations in different locations. For repair, we still need to depend on --init-extent-tree, as directly modifying extent items can easily cause conflicts with delayed refs, thus it should be avoided. Reviewed-by: Su Yue <Damenly_Su@gmx.com> Signed-off-by: Qu Wenruo <wqu@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2020-01-09 14:27:10 +01:00
Qu Wenruo	8d45dc270a	btrfs-progs: check: Populate extent generation correctly for data extents [BUG] When using `btrfs check --init-extent-tree`, we will create incorrect generation number for data extents in extent tree: item 10 key (13631488 EXTENT_ITEM 1048576) itemoff 15828 itemsize 53 refs 1 gen 0 flags DATA extent data backref root FS_TREE objectid 257 offset 0 count 1 [CAUSE] Since data extent generation is not as obvious as tree blocks, which has header containing its generations, so for data extents, its extent_record::generation is not really updated, resulting such 0 generation. [FIX] To get generation of a data extent, there are two sources we can rely: - EXTENT_ITEM There is always a btrfs_extent_item::generation can be utilized. Although this is not possible for --init-extent-tree use case. - EXTENT_DATA We have btrfs_file_extent_item::generation for regular and preallocated data extents. Since --init-extent-tree will go through subvolume trees, this would be the main source for extent data generation. Then we only need to make add_data_backref() to accept @gen parameter, and pass it down to extent_record structure. And for the final extent item generation update, here we add extra fallback values, if we can't find FILE_EXTENT items. In that case, we just fall back to current transid. With this modification, recreated data EXTENT_ITEM now has correct generation number: item 10 key (13631488 EXTENT_ITEM 1048576) itemoff 15828 itemsize 53 refs 1 gen 6 flags DATA extent data backref root FS_TREE objectid 257 offset 0 count 1 Reviewed-by: Su Yue <Damenly_Su@gmx.com> Signed-off-by: Qu Wenruo <wqu@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2020-01-09 14:27:10 +01:00
Qu Wenruo	9a85b7db02	btrfs-progs: check: Initialize extent_record::generation member [BUG] When using `btrfs check --init-extent-tree`, there is a pretty high chance that the result fs can't pass tree-checker: BTRFS critical (device dm-3): corrupt leaf: block=5390336 slot=149 extent bytenr=20115456 len=4096 invalid generation, have 16384 expect (0, 360] BTRFS error (device dm-3): block=5390336 read time tree block corruption detected BTRFS error (device dm-3): failed to read block groups: -5 BTRFS error (device dm-3): open_ctree failed [CAUSE] The result fs has a pretty screwed up EXTENT_ITEMs for data extents: item 148 key (20111360 EXTENT_ITEM 4096) itemoff 8777 itemsize 53 refs 1 gen 0 flags DATA extent data backref root FS_TREE objectid 841 offset 0 count 1 item 149 key (20115456 EXTENT_ITEM 4096) itemoff 8724 itemsize 53 refs 1 gen 16384 flags DATA extent data backref root FS_TREE objectid 906 offset 0 count 1 Kernel tree-checker will accept 0 generation, but that 16384 generation is definitely going to trigger the alarm. Looking into the code, it's add_extent_rec_nolookup() allocating a new extent_rec, but not copying all members from parameter @tmpl, resulting generation not properly initialized. [FIX] Just copy tmpl->generation in add_extent_rec_nolookup(). And since all call sites have set all members of @tmpl to 0 before add_extent_rec_nolookup(), we shouldn't get garbage values. For the 0 generation problem, it will be solved in another patch. Issue: #225 (Not the initial report, but extent tree rebuild result) Reviewed-by: Su Yue <Damenly_Su@gmx.com> Signed-off-by: Qu Wenruo <wqu@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2020-01-09 14:27:10 +01:00
Qu Wenruo	3dcce48fd7	btrfs-progs: check/original: Do extra verification on file extent item [BUG] For certain fuzzed image, `btrfs check` will fail with the following call trace: Checking filesystem on issue_213.raw UUID: 99e50868-0bda-4d89-b0e4-7e8560312ef9 [1/7] checking root items [2/7] checking extents Program received signal SIGABRT, Aborted. 0x00007ffff7c88f25 in raise () from /usr/lib/libc.so.6 (gdb) bt #0 0x00007ffff7c88f25 in raise () from /usr/lib/libc.so.6 #1 0x00007ffff7c72897 in abort () from /usr/lib/libc.so.6 #2 0x00005555555abc3e in run_next_block (...) at check/main.c:6398 #3 0x00005555555b0f36 in deal_root_from_list (...) at check/main.c:8408 #4 0x00005555555b1a3d in check_chunks_and_extents (fs_info=0x5555556a1e30) at check/main.c:8690 #5 0x00005555555b1e3e in do_check_chunks_and_extents (fs_info=0x5555556a1e30) a #6 0x00005555555b5710 in cmd_check (cmd=0x555555696920 <cmd_struct_check>, argc #7 0x0000555555568dc7 in cmd_execute (cmd=0x555555696920 <cmd_struct_check>, ar #8 0x0000555555569713 in main (argc=2, argv=0x7fffffffde70) at btrfs.c:386 [CAUSE] This fuzzed images has a corrupted EXTENT_DATA item in data reloc tree: item 1 key (256 EXTENT_DATA 256) itemoff 16111 itemsize 12 generation 0 type 2 (prealloc) prealloc data disk byte 16777216 nr 0 prealloc data offset 0 nr 0 There are several problems with the item: - Bad item size 12 is too small. - Bad key offset offset of EXTENT_DATA type key represents file offset, which should always be aligned to sector size (4K in this particular case). [FIX] Do extra item size and key offset check for original mode, and remove the abort() call in run_next_block(). And to show off how robust lowmem mode is, lowmem can handle it without any hiccup. With this fix, original mode can detect the problem properly: Checking filesystem on issue_213.raw UUID: 99e50868-0bda-4d89-b0e4-7e8560312ef9 [1/7] checking root items [2/7] checking extents ERROR: invalid file extent item size, have 12 expect (21, 16283] ERROR: errors found in extent allocation tree or chunk allocation [3/7] checking free space cache [4/7] checking fs roots root 18446744073709551607 root dir 256 error root 18446744073709551607 inode 256 errors 62, no orphan item, odd file extent, bad file extent ERROR: errors found in fs roots found 131072 bytes used, error(s) found total csum bytes: 0 total tree bytes: 131072 total fs tree bytes: 32768 total extent tree bytes: 16384 btree space waste bytes: 124774 file data blocks allocated: 0 referenced 0 Issue: #213 Signed-off-by: Qu Wenruo <wqu@suse.com> Reviewed-by: Su Yue <Damenly_Su@gmx.com> Signed-off-by: David Sterba <dsterba@suse.com>	2020-01-09 14:27:09 +01:00
Johannes Thumshirn	e388bf386b	btrfs-progs: check: warn users about the possible dangers of --repair The manual page of btrfsck clearly states 'btrfs check --repair' is a dangerous operation. Although this warning is in place users do not read the manual page and/or are used to the behaviour of fsck utilities which repair the filesystem, and thus potentially cause harm. Similar to 'btrfs balance' without any filters, add a warning and a countdown, so users can bail out before eventual corrupting the filesystem more than it already is. To override the timeout, let --force skip it and continue. Signed-off-by: Johannes Thumshirn <jthumshirn@suse.de> Signed-off-by: David Sterba <dsterba@suse.com>	2019-11-22 19:07:20 +01:00
Qu Wenruo	989a99b5f8	btrfs-progs: Replace btrfs_block_group_cache::item with dedicated members We access btrfs_block_group_cache::item mostly for @used and @flags. @flags is already a dedicated member in btrfs_block_group_cache, only @used doesn't have a dedicated member. This patch will remove btrfs_block_group_cache::item and add btrfs_block_group_cache::used. It's the btrfs-progs equivalent of the following kernel patches: btrfs: move block_group_item::used to block group btrfs: move block_group_item::flags to block group btrfs: remove embedded block_group_cache::item Signed-off-by: Qu Wenruo <wqu@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2019-11-18 19:21:09 +01:00
Qu Wenruo	e46281d6fb	btrfs-progs: Refactor excluded extent functions to use fs_info The following functions are just using @root to reach fs_info: - exclude_super_stripes - free_excluded_extents - add_excluded_extent Refactor them to use fs_info directly. Signed-off-by: Qu Wenruo <wqu@suse.com> Reviewed-by: Johannes Thumshirn <jthumshirn@suse.de> Signed-off-by: David Sterba <dsterba@suse.com>	2019-11-18 19:21:08 +01:00
Qu Wenruo	b9ea7c1b23	btrfs-progs: check/original: Add check and repair for invalid inode generation There are at least two bug reports of kernel tree-checker complaining about invalid inode generation. All offending inodes seem to be caused by old kernel around 2014, with inode generation overflow. So add such check and repair ability to lowmem mode check first. This involves: - Calculate the inode generation upper limit Unlike the lowmem mode context, we don't have anyway to determine if this inode belongs to log tree. So we use super_generation + 1 as upper limit, just like what we did in kernel tree checker. - Check if the inode generation is larger than the upper limit - Repair by resetting inode generation to current transaction generation The difference is, in original mode, we have a common trans handle for all repair and reset path for each repair. Reported-by: Charles Wright <charles.v.wright@gmail.com> Reviewed-by: Nikolay Borisov <nborisov@suse.com> Tested-by: Nikolay Borisov <nborisov@suse.com> Signed-off-by: Qu Wenruo <wqu@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2019-11-18 19:21:07 +01:00
Qu Wenruo	eae0c8e32f	btrfs-progs: check/original: Fix inode mode in subvolume trees To make original mode to repair imode error in subvolume trees, this patch will do: - Remove the show-stopper checks for root->objectid. Now repair_imode_original() will accept inodes in subvolume trees. - Export detect_imode() for original mode Due to the call requirement, original mode must use an existing trans handler to do the repair, thus we need to re-implement most of the work done in repair_imode_common(). - Make repair_imode_original() to use detect_imode(). - Free the path after reset_imode() reset_imode() keeps the path, as lowmem mode uses path to locate its current check position. But for original mode, the unreleased path can cause later repair to report warning, so we need to manually release the path. - Update rec->imode after imode reset So later repair depending on rec->imode can get correct value. - Move the repair before repair_inode_nlinks() repair_inode_nlinks() needs correct imode to add DIR_INDEX/DIR_ITEM. So moving the repair before repair_inode_nlinks() makes the latter repair happier. Signed-off-by: Qu Wenruo <wqu@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2019-11-18 19:21:07 +01:00
Qu Wenruo	87207654f1	btrfs-progs: check: Export btrfs_type_to_imode This function will be later used by common mode code, so export it. Signed-off-by: Qu Wenruo <wqu@suse.com> Reviewed-by: Nikolay Borisov <nborisov@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2019-11-18 19:21:07 +01:00
Rosen Penev	5d72055066	btrfs-progs: Fix printf formats Discovered with cppcheck. Fix signed/unsigned int mismatches, sizeof and long formats. Pull-request: #197 Signed-off-by: Rosen Penev <rosenp@gmail.com> Signed-off-by: David Sterba <dsterba@suse.com>	2019-10-14 17:31:05 +02:00
Johannes Thumshirn	ed33908b44	btrfs-progs: update checksumming api Update the checksumming API to be able to cope with more checksum types than just CRC32C. The finalization call is merged into btrfs_csum_data. There are some fixme's and asserts added that need to be resolved. Co-developed-by: David Sterba <dsterba@suse.com> Signed-off-by: Johannes Thumshirn <jthumshirn@suse.de> Signed-off-by: David Sterba <dsterba@suse.com>	2019-10-14 17:28:34 +02:00
Johannes Thumshirn	7b4f1035a6	btrfs-progs: pass checksum type to btrfs_csum_data()/btrfs_csum_final() In preparation to supporting new checksum algorithm pass the checksum type to btrfs_csum_data/btrfs_csum_final, this allows us to encapsulate any differences in processing into the respective functions Signed-off-by: Johannes Thumshirn <jthumshirn@suse.de> Signed-off-by: David Sterba <dsterba@suse.com>	2019-10-14 17:28:28 +02:00
Johannes Thumshirn	de68086e35	btrfs-progs: don't assume checksums are always 4 bytes Pass pointer to a generic buffer instead of fixed size that crc32c currently uses. Signed-off-by: Johannes Thumshirn <jthumshirn@suse.de> Signed-off-by: David Sterba <dsterba@suse.com>	2019-10-14 17:28:25 +02:00
David Sterba	0889161127	btrfs-progs: check: move device_record to main.c The structure is used only inside main.c, no need to export it. Signed-off-by: David Sterba <dsterba@suse.com>	2019-10-14 17:27:41 +02:00
David Sterba	1b42e71053	btrfs-progs: copy btrfsck.h to check/common.h We want just one header for the check API (similar to what mkfs does) but as btrfsck.h is exported header (libbtrfs), it needs some deprecation beriod before it's moved through there are probably no users of that header file in particular. Copy the header to check, all modifications and cleanups won't affect the public header. Signed-off-by: David Sterba <dsterba@suse.com>	2019-10-14 17:27:37 +02:00
David Sterba	8e68287d88	btrfs-progs: move qgroup-verify.[ch] to check/ The only user of the functionality is in check. Signed-off-by: David Sterba <dsterba@suse.com>	2019-10-14 17:27:35 +02:00
Josef Bacik	73ff25c65d	btrfs-progs: check: don't check nbytes on unlinked files We don't update the inode when evicting it, so the nbytes will be wrong in between transaction commits. This isn't a problem, stop complaining about it to make generic/269 stop randomly failing. The orphan outdated inodes can be still present but check will not skip them. Signed-off-by: Josef Bacik <josef@toxicpanda.com> Signed-off-by: David Sterba <dsterba@suse.com>	2019-09-04 16:08:11 +02:00
Qu Wenruo	1baf729743	btrfs-progs: check/original: Check and repair root item geneartion Add such ability to original mode to fix root generation mismatch, which can be rejected by kernel. Reviewed-by: Nikolay Borisov <nborisov@suse.com> Signed-off-by: Qu Wenruo <wqu@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2019-09-04 16:06:52 +02:00
Qu Wenruo	ebf29e39c3	btrfs-progs: check/lowmem: Check and repair root generation Since kernel is going to reject any root item which is newer than super block generation, we need to provide a way to fix such problem in btrfs-check. This patch addes the ability to report and repair root generation in lowmem mode. This is done by cowing the root node, so we will update the root generation along with the root node generation at commit transaction time. Reviewed-by: Nikolay Borisov <nborisov@suse.com> Signed-off-by: Qu Wenruo <wqu@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2019-09-04 16:06:50 +02:00
Adam Borowski	90a87e817d	btrfs-progs: check: fix option parsing for -E The option -E has a mandatory argument that was missing from the short option spec, thus it always crashed. Signed-off-by: Adam Borowski <kilobyte@angband.pl> Signed-off-by: David Sterba <dsterba@suse.com>	2019-09-04 16:06:24 +02:00
Naohiro Aota	8ca6c0c3c7	btrfs-progs: check: initialize qgroup_item_count in earlier stage "btrfsck -Q" segfaults because it does not call qgroup_set_item_count_ptr() properly: # btrfsck -Q /dev/sdk Opening filesystem to check... Checking filesystem on /dev/sdk UUID: 34a35bbc-43f8-40f0-8043-65ed33f2e6c3 Print quota groups for /dev/sdk UUID: 34a35bbc-43f8-40f0-8043-65ed33f2e6c3 Segmentation fault (core dumped) Since "struct task_ctx ctx" is global, we can just move qgroup_set_item_count_ptr() much earlier stage in the check process to avoid to forget initializing it. Signed-off-by: Naohiro Aota <naohiro.aota@wdc.com> Reviewed-by: Qu Wenruo <wqu@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2019-07-26 17:46:42 +02:00
Josef Bacik	42a1aaeec4	btrfs-progs: deal with drop_progress properly in fsck While testing snapshot deletion with dm-log-writes I saw that I was failing the fsck sometimes when the fs was actually in the correct state. This is because we only skip blocks on the same level of root_item->drop_level. If the drop_level < the root level then we could very well walk into nodes that we wouldn't actually walk into on fs mount, because the drop_progress is further ahead in the slot of the root. Instead only process the slots of the nodes that are above the drop_progress key. With this patch in place we no longer improperly fail to check fs'es that have a drop_progress set with a drop_level < root level. Signed-off-by: Josef Bacik <josef@toxicpanda.com> Signed-off-by: David Sterba <dsterba@suse.com>	2019-07-26 17:46:42 +02:00
David Sterba	c8bea2b84b	btrfs-progs: move rbtree-utils.[ch] to common/ Signed-off-by: David Sterba <dsterba@suse.com>	2019-07-03 20:49:04 +02:00
David Sterba	c07960c8be	btrfs-progs: move utils.[ch] to common/ Update include paths and remove some duplicates. Signed-off-by: David Sterba <dsterba@suse.com>	2019-07-03 20:49:04 +02:00
David Sterba	aac564aca6	btrfs-progs: move commonh to common/ Signed-off-by: David Sterba <dsterba@suse.com>	2019-07-03 20:49:03 +02:00
David Sterba	f93b471143	btrfs-progs: move help.[ch] to common/ Signed-off-by: David Sterba <dsterba@suse.com>	2019-07-03 20:49:03 +02:00
David Sterba	d0970a05cd	btrfs-progs: move task-utils.[ch] to common/ Signed-off-by: David Sterba <dsterba@suse.com>	2019-07-03 20:49:03 +02:00
Nikolay Borisov	32f75c87a0	btrfs-progs: check: Remove duplicated and commented functions Commit `756105181e` ("btrfs-progs: check: supplement extent backref list with rbtree") changed the backref implementation to use rb tree and also commented the old implementations. It's been almost 2 years since that change and it's unlikely the old version will ever be used, so just remove it. Signed-off-by: Nikolay Borisov <nborisov@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2019-07-03 13:31:16 +02:00
Jeff Mahoney	91a1476d53	btrfs-progs: pass cmd_struct to usage() Now that every call site has a cmd_struct, we can just pass the cmd_struct to usage to print the usager information. This allows us to interpret the format flags we'll add later in this series to inform the user of which output formats any given command supports. Signed-off-by: Jeff Mahoney <jeffm@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2019-07-03 13:31:15 +02:00
Jeff Mahoney	b44445131f	btrfs-progs: pass cmd_struct to command callback function This patch passes the cmd_struct to the command callback function. This has several purposes: It allows the command callback to identify which command was used to call it. It also gives us direct access to the usage associated with that command. Signed-off-by: Jeff Mahoney <jeffm@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2019-07-03 13:31:15 +02:00
Jeff Mahoney	82d48463ea	btrfs-progs: use cmd_struct as command entry point Rather than having global command usage and callbacks used to create cmd_structs in the command array, establish the cmd_struct structures separately and use those. The next commit in the series passes the cmd_struct to the command callbacks such that we can access flags and determine which of several potential command we were called as. This establishes several macros to more easily define the commands within each command's source. Signed-off-by: Jeff Mahoney <jeffm@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2019-07-03 13:31:15 +02:00
Qu Wenruo	085445e793	btrfs-progs: Cleanup BTRFS_COMPAT_EXTENT_TREE_V0 BTRFS_COMPAT_EXTENT_TREE_V0 is introduced for a short time in kernel, and it's over 10 years ago. Nowadays there should be no user for that feature, and kernel has remove this support in Jun, 2018. There is no need for btrfs-progs to support it. This patch will remove EXTENT_TREE_V0 related code and replace those BUG_ON() to a more graceful error message. Reviewed-by: Nikolay Borisov <nborisov@suse.com> Signed-off-by: Qu Wenruo <wqu@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2019-06-05 18:00:07 +02:00
Qu Wenruo	d73427b76d	btrfs-progs: check/original: Add checks for compressed extent without csum There is one report of compressed extent happens in btrfs, but has no csum and then leads to possible decompress error screwing up kernel memory. Although it's a kernel bug, and won't cause problem until compressed data get corrupted, let's catch such problem in advance. This patch will catch any unexpected compressed extent with: 1) 0 or less than expected csum 2) nodatasum flag set in the inode item This is for original mode. Reported-by: James Harvey <jamespharvey20@gmail.com> Signed-off-by: Qu Wenruo <wqu@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2019-05-17 12:32:38 +02:00
Jeff Mahoney	50faf73f22	btrfs-progs: check: fixup_extent_flags needs to deal with non-skinny metadata When repairing a file system created by a very old kernel, I ran into issues fixing up the extent flags since fixup_extent_flags assumed that a METADATA_ITEM would be present if the record was for metadata. Since METADATA_ITEMs don't exist without skinny metadata, we need to fall back to EXTENT_ITEMs. This also falls back to EXTENT_ITEMs even with skinny metadata enabled as other parts of the tools do. Reviewed-by: Filipe Manana <fdmanana@suse.com> Signed-off-by: Jeff Mahoney <jeffm@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2019-05-17 12:32:38 +02:00
Qu Wenruo	427990ad74	btrfs-progs: check/original: Check and repair free space cache inode item Just like lowmem mode, also check and repair free space cache inode item. And since we don't really have a good timing/function to check free space chace inodes, we use the same common mode check_repair_free_space_inode() when iterating root tree. Signed-off-by: Qu Wenruo <wqu@suse.com>	2019-04-16 09:04:25 +08:00
Qu Wenruo	11fd6cff82	btrfs-progs: check/original: Repair invalid inode mode in root tree This patch will reuse the mode independent repair_imode() function, to repair invalid inode mode. Signed-off-by: Qu Wenruo <wqu@suse.com>	2019-04-16 09:04:25 +08:00
Qu Wenruo	23f1e9a13f	btrfs-progs: check/original: Add inode mode check Just like lowmem mode, check inode mode, specially for S_IFMT bits and beyond. Please note that, this check only applies to inodes in fs/subvol trees. It doesn't apply to free space cache inodes. Reported-by: Thorsten Hirsch <t.hirsch@web.de> Signed-off-by: Qu Wenruo <wqu@suse.com>	2019-04-16 09:04:25 +08:00
Qu Wenruo	db51d8d8f6	btrfs-progs: Use @fs_info to replace @root for btrfs_check_leaf/node() Signed-off-by: Qu Wenruo <wqu@suse.com>	2019-04-16 09:04:25 +08:00
Su Yanjun	cedbfc2561	btrfs-progs: check: Delete file extent item with unaligned disk bytenr For test case fsck-tests/001-bad-file-extent-bytenr, we have an obviously hand crafted image with unaligned file extent: item 7 key (257 EXTENT_DATA 0) itemoff 3453 itemsize 53 generation 6 type 1 (regular) extent data disk byte 755944791 nr 1048576 extent data offset 0 nr 1048576 ram 1048576 extent compression 0 (none) disk bytenr 755944791 is obviously unaligned (not even). For such obviously corrupted file extent, we should just delete the file extent. Signed-off-by: Su Yanjun <suyj.fnst@cn.fujitsu.com> [Update commit message and comment] Signed-off-by: Qu Wenruo <wqu@suse.com>	2019-04-16 09:04:25 +08:00
Su Yanjun	3b35deeadd	btrfs-progs: check: fix wrong @offset used in find_possible_backrefs() Function find_possible_backrefs() is used to locate the file extents referring to an data extent. For data extent backref, its btrfs_extent_data_ref structure has the following members: - root Which root refers to this data extent - objectid Which inode refers to this data extent - offset Search hint. Its value is @file_offset - @extent_offset. While for @file_offset, it's directly recorded in (INO EXTENT_DATA FILE_OFFSET) key. So when searching the file extents refers to this data extent, we can't use btrfs_extent_data_ref::offset as search key::offset. We must search from file offset 0, and iterate all file extents until we hit a file extent matches the data backref. Thankfully such time consuming behavior is not triggered frequently, it only gets called for repair, so it shouldn't affect normal check routine. Signed-off-by: Su Yanjun <suyj.fnst@cn.fujitsu.com> [Update commit message] Signed-off-by: Qu Wenruo <wqu@suse.com>	2019-04-16 09:04:25 +08:00
Su Yanjun	b6a0d97cba	Revert "btrfs-progs: Record orphan data extent ref to corresponding root." Commit `0ddf63c09f` ("btrfs-progs: Record orphan data extent ref to corresponding root.") introduces the ability to record a file extent even all other related info is lost (data backref, inode item). However this patch only records such info without doing any proper repair, further more, it could even record invalid file extents, and the report part only happens after all check is done. Since we will later introduce proper file extent repair functionality, we could revert that patch. Signed-off-by: Su Yanjun <suyj.fnst@cn.fujitsu.com> [Update commit message, solve merge conflicts] Signed-off-by: Qu Wenruo <wqu@suse.com>	2019-04-16 09:03:51 +08:00
Su Yanjun	872c116c75	Revert "btrfs-progs: Add repair and report function for orphan file extent." Commit `ad03f840f0` ("btrfs-progs: Add repair and report function for orphan file extent.") will record and try to repair orphan file extents by: - Removing the orphan file extent item if no extent backref can be found Or - Re-insert a file extent using data backref Especially the later case is far from ideal, as normally extent tree is more fragile and corruption prone. Use any data from extent tree to try to repair could easily lead to further corruption. So here we revert commit `ad03f840f0` ("btrfs-progs: Add repair and report function for orphan file extent.") to cleanup the space for later proper repair in original mode. Signed-off-by: Su Yanjun <suyj.fnst@cn.fujitsu.com> [Update commit message, solve conflicts with DIR_ITEM hash mismatch patchset] Signed-off-by: Qu Wenruo <wqu@suse.com>	2019-04-16 09:03:02 +08:00
Qu Wenruo	e76fb663a7	btrfs-progs: check: Fix false alert about uninitialized variable GCC 8.2.1 will report the following error: check/main.c: In function 'try_repair_inode': check/main.c:2606:5: warning: 'ret' may be used uninitialized in this function [-Wmaybe-uninitialized] if (!ret) { ^ check/main.c:2584:6: note: 'ret' was declared here int ret; ^~~ The offending code is in repair_mismatch_dir_hash(): int ret; printf( "Deleting bad dir items with invalid hash for root %llu ino %llu\n", root->root_key.objectid, rec->ino); while (!list_empty(&rec->mismatch_dir_hash)) { /* do some repair / } if (!ret) { <<< Here / do some fix */ } The truth is, to enter try_repair_inode(), we must have I_ERR_MISMATCH_DIR_HASH bit set for rec->errors. And just after we set I_ERR_MISMATCH_DIR_HASH, we call add_mismatch_dir_hash() and handled its error correctly. So it's impossible to to skip the while loop. Fix it by initializing @ret to -EUCLEAN, so even we hit some impossible case, repair_mismatch_dir_hash() won't falsely consider the mismatch hash fixed. Signed-off-by: Qu Wenruo <wqu@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2019-03-05 12:57:41 +01:00
Jeff Mahoney	8bda5132d8	btrfs-progs: fix stray error message in check Commit `e578b59bf6` ("btrfs-progs: convert strerror to implicit %m") missed adding braces after a conditional so we will see the following message whenever a tree block needs repair, regardless of whether repair was successful: "Failed to repair btree: Success" Signed-off-by: Jeff Mahoney <jeffm@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2019-01-15 18:42:14 +01:00
Qu Wenruo	af816ca930	btrfs-progs: check: orig: Add ability to repair dir item with invalid hash The repair function is reusing delete_corrupted_dir_item(). Since the error can happen for root dir inode, also call try_repair_inode() on root dir inode. This is especially important for old filesystems, since later kernel introduces stricter tree-checker, which could detect such hash mismatch and refuse to read the corrupted leaf. With this repair ability, user could repair with btrfs check --repair. Link: https://bugzilla.opensuse.org/show_bug.cgi?id=1111991 Signed-off-by: Qu Wenruo <wqu@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2019-01-15 18:42:13 +01:00
Qu Wenruo	846f838797	btrfs-progs: check: orig: Use mismatch_dir_hash_record to record bad dir items This changes reporting from current in-place, like: ERROR: DIR_ITEM[256 751495445] name foor.WvG1c1TdU namelen 13 filetype 1 mismatch with its hash, wanted 751495445 have 2870353892 root 5 root dir 256 error To new summary report at the end of the pass: root 5 root dir 256 error root 5 inode 256 errors 40000 Dir items with mismatch hash: name: foor.WvG1c1Td namelen: 13 wanted 0xab161fe4 has 0x2ccae915 Also, with mismatch_dir_hash_record structure, it provides the base for later original mode repair. Signed-off-by: Qu Wenruo <wqu@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2019-01-15 18:42:13 +01:00
Qu Wenruo	39ea41f00b	btrfs-progs: check: orig: Add dev_item check for used bytes and total bytes Signed-off-by: Qu Wenruo <wqu@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2019-01-15 18:42:13 +01:00
Qu Wenruo	6b4ee5612e	btrfs-progs: check: orig: Add ability to detect bad dev extents Unlike lowmem mode check, we don't have good place for original mode to check overlapping device extents. So this patch introduces a new function, btrfs_check_dev_extents(), to handle such extents. Reported-by: Hans van Kranenburg <hans.van.kranenburg@mendix.com> Signed-off-by: Qu Wenruo <wqu@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2019-01-15 18:42:13 +01:00
Nikolay Borisov	f7717d8cdb	btrfs-progs: Remove fsid/metdata_uuid fields from fs_info Signed-off-by: Nikolay Borisov <nborisov@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2019-01-15 16:20:09 +01:00
Nikolay Borisov	c4aadd9af2	btrfs-progs: Add support for metadata_uuid field Add support for a new metadata_uuid field. This is just a preparatory commit which switches all users of the fsid field for metdata comparison purposes to utilize the new field. This more or less mirrors the kernel patch, additionally: * Update 'btrfs inspect-internal dump-super' to account for the new field. This involes introducing the 'metadata_uuid' line to the output and updating the logic for comparing the fs uuid to the dev_item uuid. Signed-off-by: Nikolay Borisov <nborisov@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2018-12-06 12:51:36 +01:00
Josh Soref	33d375bc20	btrfs-progs: check: fix typo in device_extent_record::chunk_objectid This header is exported to /usr/include/btrfs but there are no known users, so the change should be safe. Generated by https://github.com/jsoref/spelling Issue: #154 Author: Josh Soref <jsoref@users.noreply.github.com> Signed-off-by: David Sterba <dsterba@suse.com>	2018-11-26 18:24:49 +01:00
Josh Soref	2cd4a76ea9	btrfs-progs: fix typos in user-visible strings * error messages * help strings Generated by https://github.com/jsoref/spelling Issue: #154 Author: Josh Soref <jsoref@users.noreply.github.com> Signed-off-by: David Sterba <dsterba@suse.com>	2018-11-26 18:24:49 +01:00
Josh Soref	b1d39a42a4	btrfs-progs: fix typos in comments Generated by https://github.com/jsoref/spelling Issue: #154 Author: Josh Soref <jsoref@users.noreply.github.com> Signed-off-by: David Sterba <dsterba@suse.com>	2018-11-26 18:24:48 +01:00
David Sterba	e578b59bf6	btrfs-progs: convert strerror to implicit %m Similar to the changes where strerror(errno) was converted, continue with the remaining cases where the argument was stored in another variable. The savings in object size are about 4500 bytes: $ size btrfs.old btrfs.new text data bss dec hex filename 805055 24248 19748 849051 cf49b btrfs.old 804527 24248 19748 848523 cf28b btrfs.new Signed-off-by: David Sterba <dsterba@suse.com>	2018-10-31 18:24:14 +01:00
Nikolay Borisov	be32eb0557	btrfs-progs: check: Add support for freespace tree fixing Now that all the prerequisite code for proper support of free space tree repair is in, it's time to wire it in. This is achieved by first hooking the freespace tree to the __free_extent/alloc_reserved_tree_block functions. And then introducing a wrapper function to contains the existing check_space_cache and the newly introduced repair code. Finally, it's important to note that FST repair code first clears the existing FST in case of any problem found and rebuilds it from scratch. Signed-off-by: Nikolay Borisov <nborisov@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2018-10-25 16:11:40 +02:00
Nikolay Borisov	909357e867	btrfs-progs: Wire up delayed refs This commit enables the delayed refs infrastructures. This entails doing the following: 1. Replacing existing calls of btrfs_extent_post_op (which is the equivalent of delayed refs) with the proper btrfs_run_delayed_refs. As well as eliminating open-coded calls to finish_current_insert and del_pending_extents which execute the delayed ops. 2. Wiring up the addition of delayed refs when freeing extents (btrfs_free_extent) and when adding new extents (alloc_tree_block). 3. Adding calls to btrfs_run_delayed refs in the transaction commit path alongside comments why every call is needed, since it's not always obvious (those call sites were derived empirically by running and debugging existing tests) 4. Correctly flagging the transaction in which we are reinitialising the extent tree. 5. Moving btrfs_write_dirty_block_groups to btrfs_write_dirty_block_groups since blockgroups should be written to disk after the last delayed refs have been run. Signed-off-by: Nikolay Borisov <nborisov@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2018-10-23 14:48:41 +02:00
Qu Wenruo	b02b426a78	btrfs-progs: fix infinite loop when bad key order repair fails An infinite loop can be triggered during fuzz/003: ====== RUN MAYFAIL btrfs check --repair tests/fuzz-tests/images/bko-199833-reloc-recovery-crash.raw.restored [1/7] checking root items Fixed 0 roots. [2/7] checking extents ctree.c:1650: leaf_space_used: Warning: assertion `data_len < 0` failed, value 1 bad key ordering 18 19 ctree.c:1650: leaf_space_used: Warning: assertion `data_len < 0` failed, value 1 bad key ordering 18 19 ctree.c:1650: leaf_space_used: Warning: assertion `data_len < 0` failed, value 1 bad key ordering 18 19 [CAUSE] In try_to_fix_bad_block() it's possible that btrfs_find_all_roots() finds no root referring to that tree block, thus we can't do any repair. However in that case, we still return 0 since the last caller assigning @ret is btrfs_find_all_roots(), and the ulist while loop doesn't get run at all. And since try_to_fix_bad_block() returns 0, check_block() in check/main.c will return -EAGAIN to re-check the tree block. This leads to the infinite loop. [FIX] Change the default return value from 0 to -EIO in try_to_fix_bad_block(), so if there is no tree referring to the bad tree block, it won't cause infinite loop anymore. Signed-off-by: Qu Wenruo <wqu@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2018-10-23 14:48:40 +02:00
Qu Wenruo	0adf626374	btrfs-progs: exit gracefully when root dir item repair fails Another BUG_ON() during fuzz/003: ====== RUN MAYFAIL btrfs check --init-csum-tree tests/fuzz-tests/images/bko-161821.raw.restored [1/7] checking root items Fixed 0 roots. [2/7] checking extents parent transid verify failed on 4198400 wanted 14 found 1114126 parent transid verify failed on 4198400 wanted 14 found 1114126 Ignoring transid failure owner ref check failed [4198400 4096] repair deleting extent record: key [4198400,169,0] adding new tree backref on start 4198400 len 4096 parent 0 root 5 Repaired extent references for 4198400 ref mismatch on [4222976 4096] extent item 1, found 0 backref 4222976 root 7 not referenced back 0x55e9cc694780 incorrect global backref count on 4222976 found 1 wanted 0 backpointer mismatch on [4222976 4096] owner ref check failed [4222976 4096] repair deleting extent record: key [4222976,169,0] Repaired extent references for 4222976 [3/7] checking free space cache [4/7] checking fs roots parent transid verify failed on 4198400 wanted 14 found 1114126 Ignoring transid failure Wrong generation of child node/leaf, wanted: 1114126, have: 14 root 5 missing its root dir, recreating parent transid verify failed on 4198400 wanted 14 found 1114126 Ignoring transid failure ERROR: child eb corrupted: parent bytenr=4222976 item=0 parent level=1 child level=2 check/main.c:2738: check_inode_recs: BUG_ON `ret` triggered, value -5 failed (ignored, ret=134): btrfs check --init-csum-tree tests/fuzz-tests/images/bko-161821.raw.restored mayfail: returned code 134 (SIGABRT), not ignored test failed for case 003-multi-check-unmounted Just abort current transaction and exit gracefully in this case, the caller handles errors. Signed-off-by: Qu Wenruo <wqu@suse.com> Reviewed-by: Nikolay Borisov <nborisov@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2018-10-23 14:48:40 +02:00
David Sterba	4dca0ccf3d	btrfs-progs: check: reorder options to sections Signed-off-by: David Sterba <dsterba@suse.com>	2018-08-06 15:06:24 +02:00
Qu Wenruo	21d1109168	btrfs-progs: check/original: Don't overwrite return value when we failed to repair In check_inode_recs(), for repair mode we always reset @ret to 0. It makes no sense and later we check @ret to determine if the repair is successful. Fix it by removing the offending overwrite. Signed-off-by: Qu Wenruo <wqu@suse.com> Reviewed-by: Gu Jinxiang <gujx@cn.fujitsu.com> Signed-off-by: David Sterba <dsterba@suse.com>	2018-08-06 15:06:22 +02:00
Qu Wenruo	159f177cf9	btrfs-progs: check/original: Avoid infinite loop when failed to repair inode Exposed by fuzz-tests/003-multi-check-unmounted/ on fuzzed image bko-161811.raw.xz. It's caused by the fact when check_fs_roots() finds tree root is modified, it re-search tree root by goto again: label. However again: label. will also reset root objectid to 0. If we failed to repair one fs root but still modified tree root, we will go into such infinite loop. Fix it by recording which root we should skip for repair mode. Signed-off-by: Qu Wenruo <wqu@suse.com> Reviewed-by: Gu Jinxiang <gujx@cn.fujitsu.com> Signed-off-by: David Sterba <dsterba@suse.com>	2018-08-06 15:06:20 +02:00
Qu Wenruo	5f594c3ce3	btrfs-progs: check: Remove the ability to rebuild root overwriting existing tree blocks We have function btrfs_fsck_reinit_root() to reinit csum or extent tree. However this function allows us to let it overwrite existing tree blocks using @overwrite parameter. Such behavior is pretty dangerous while no caller is using this feature explicitly. So just remove @overwrite parameter and allow btrfs_fsck_reinit_root() to error out when it fails to allocate tree block. Signed-off-by: Qu Wenruo <wqu@suse.com> Reviewed-by: Gu Jinxiang <gujx@cn.fujitsu.com> Signed-off-by: David Sterba <dsterba@suse.com>	2018-08-06 15:04:58 +02:00
Stéphane Lesimple	078e9a1cc9	btrfs-progs: check: enhanced progress indicator We reuse the task_position enum and task_ctx struct of the original progress indicator, adding more values and fields for our needs. Then add hooks in all steps of the check to properly record progress. Here's how the output looks like on a 22 Tb 5-disk RAID1 FS: Opening filesystem to check... Checking filesystem on /dev/mapper/luks-ST10000VN0004-XXXXXXXX UUID: xxxxxxxx-xxxx-xxxx-xxxx-xxxxxxxxxxxx [1/7] checking extents (0:20:21 elapsed, 950958 items checked) [2/7] checking root items (0:01:29 elapsed, 15121 items checked) [3/7] checking free space cache (0:00:11 elapsed, 4928 items checked) [4/7] checking fs roots (0:51:31 elapsed, 600892 items checked) [5/7] checking csums (0:14:35 elapsed, 754522 items checked) [6/7] checking root refs (0:00:00 elapsed, 232 items checked) [7/7] checking quota groups skipped (not enabled on this FS) found 5286458060800 bytes used, no error found Signed-off-by: Stéphane Lesimple <stephane_btrfs@lesimple.fr> Signed-off-by: David Sterba <dsterba@suse.com>	2018-08-06 15:03:23 +02:00
Qu Wenruo	c010437455	btrfs-progs: check: orig: Don't panic when unexpected root item is referring to one extent With crafted image, expected root item can refer to certain extent, and original mode uses BUG_ON() to handle such case. Fix it by gracefully return error. Link: https://bugzilla.kernel.org/show_bug.cgi?id=200403 Signed-off-by: Qu Wenruo <wqu@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2018-08-06 15:03:07 +02:00
Nikolay Borisov	f3c7101116	btrfs-progs: check: Drop trans/root arguments from free_extent_hook They are not really needed, what free_extent_hook wants is really a pointer to fs_info so give it to it directly. This is in preparation of delayed refs code. Signed-off-by: Nikolay Borisov <nborisov@suse.com> Reviewed-by: Qu Wenruo <wqu@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2018-08-06 15:01:57 +02:00
David Disseldorp	7490f94dd7	btrfs-progs: check: add experimental flag for lowmem mode The experimental flag is already carried in the manpage, but was removed from the btrfs check usage message as part of refactoring via `87c1bd13c1`. Add it back. Signed-off-by: David Disseldorp <ddiss@suse.de> Reviewed-by: Nikolay Borisov <nborisov@suse.com> Reviewed-by: Su Yue <suy.fnst@cn.fujitsu.com> Signed-off-by: David Sterba <dsterba@suse.com>	2018-08-06 15:01:19 +02:00
Qu Wenruo	e5f50be06f	btrfs-progs: check/original: Detect and repair wrong inline ram_bytes It looks like that around 2014, btrfs kernel has a regression that would cause offset-by-one ram_bytes for inline extent. Add the ability to repair it in original mode. Reported-by: Steve Leung <sjleung@shaw.ca> Tested-by: Steve Leung <sjleung@shaw.ca> Signed-off-by: Qu Wenruo <wqu@suse.com> Reviewed-by: Su Yue <suy.fnst@cn.fujitsu.com> Signed-off-by: David Sterba <dsterba@suse.com>	2018-08-06 15:00:44 +02:00
Qu Wenruo	3c042605db	btrfs-progs: Get rid of the confusing btrfs_file_extent_inline_len() [BUG] If one uncompressed inline extent has incorrect ram_bytes, neither btrfs check nor dump-tree could detect such corruption. [CAUSE] Every caller tries to read inline extent ram_bytes is using btrfs_file_extent_inline_len(), other than directly calling btrfs_file_extent_ram_bytes(). For compressed extent, it's just calling btrfs_file_extent_ram_bytes(). However for uncompressed extent, it falls back to btrfs_file_extent_inline_item_len(), makes us unable to detect anything wrong in ram_bytes. [FIX] Just get rid of such confusing btrfs_file_extent_inline_len() function. Reported-by: Steve Leung <sjleung@shaw.ca> Tested-by: Steve Leung <sjleung@shaw.ca> Signed-off-by: Qu Wenruo <wqu@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2018-08-06 15:00:34 +02:00
Nikolay Borisov	d4f19ad084	btrfs-progs: check: factor out root parsing from check_chunks_and_extents check_chunks_and_extents does quite a number of distinct things. The first of those is going through all root items in the root tree and classify every root depending on whether they have a dropping operation in progress or not. Lets factor out this code and move the variables specific to this in a separate function so clean up check_chunks_and_extents a bit. Accidentally, this patch fixes some reference leaks since in error conditions in the loop the code does "goto out" but at that label we don't really release the path. Having this code extracted in a separate function which always releases the path avoids this problem entirely. Signed-off-by: Nikolay Borisov <nborisov@suse.com> Reviewed-by: Su Yue <suy.fnst@cn.fujitsu.com> Signed-off-by: David Sterba <dsterba@suse.com>	2018-08-06 15:00:08 +02:00
Nikolay Borisov	a7773d7f06	btrfs-progs: check: Fix wrong root parameter of btrfs_next_leaf call The first thing that check_chunks_and_extents does is to iterate all the root items in the root tree and link them to either the "normal_list" or "dropping_trees" list. If a leaf has to be crossed during this operation btrfs_next_leaf is called to do that. However, currently it's called with a wrong argument for its 'root' parameter. Since we are iterating the root tree the passed root should be fs_info->tree_rot, whereas right now we are passing the local variable 'root' which is assigned to the fs_tree. As it stands, this bug is actually benign since the passed root is only passed to reada_for_search, where it's used to reference the fs_info. Nevertheless the code is wrong and at the very least misleading, so fix it by passing the correct root. Signed-off-by: Nikolay Borisov <nborisov@suse.com> Reviewed-by: Su Yue <suy.fnst@cn.fujitsu.com> Signed-off-by: David Sterba <dsterba@suse.com>	2018-08-06 14:59:56 +02:00
Su Yue	eef9f3eed6	btrfs-progs: check: check symlinks with append/immutable flags Define new macro I_ERR_ODD_INODE_FLAGS to represents odd inode flags. Symlinks should never have append/immutable flags. While processing inodes, if found a symlink with append/immutable flags, mark the inode record with I_ERR_ODD_INODE_FLAGS. This is for original mode. Issue: #133 Signed-off-by: Su Yue <suy.fnst@cn.fujitsu.com> Reviewed-by: Qu Wenruo <wqu@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2018-06-07 16:37:37 +02:00
Nikolay Borisov	a22b593587	btrfs-progs: Change btrfs_root to btrfs_fs_info argument in btrfs_lookup_extent_info That function really wants an fs_info and not a root. Accidentally, this also makes the kernel/user space signatures to be coherent. Signed-off-by: Nikolay Borisov <nborisov@suse.com> Reviewed-by: Su Yue <suy.fnst@cn.fujitsu.com> Signed-off-by: David Sterba <dsterba@suse.com>	2018-06-07 16:37:37 +02:00
Nikolay Borisov	12e2280d7b	btrfs-progs: check: Remove unused root argument from btrfs_extent_post_op This is no longer used by the callees of that function so remove it. Signed-off-by: Nikolay Borisov <nborisov@suse.com> Reviewed-by: Su Yue <suy.fnst@cn.fujitsu.com> Signed-off-by: David Sterba <dsterba@suse.com>	2018-06-07 16:37:37 +02:00
Nikolay Borisov	b4c4ff9c4a	btrfs-progs: check: Remove root parameter from btrfs_fix_block_accounting It's always set to extent_root and the function already takes a transaction handle where fs_info could be referenced and in turn the extent_tree. Signed-off-by: Nikolay Borisov <nborisov@suse.com> Reviewed-by: Su Yue <suy.fnst@cn.fujitsu.com> Signed-off-by: David Sterba <dsterba@suse.com>	2018-06-07 16:37:36 +02:00
Nikolay Borisov	48663d6e64	btrfs-progs: check: Remove root argument from delete_extent_records This function is always passed the extent_root as "root" parameter. In turn it uses the root parameter to mostly access fs_info and performs only a single call to btrfs_update_block_group where it passses the passed root. This is all redundant since fs_info can be referenced from the transaction handle and in turn extent_root can be referenced from the fs_info. So do that to simplify the function's signature. Signed-off-by: Nikolay Borisov <nborisov@suse.com> Reviewed-by: Su Yue <suy.fnst@cn.fujitsu.com> Signed-off-by: David Sterba <dsterba@suse.com>	2018-06-07 16:37:36 +02:00
Qu Wenruo	41750fe660	btrfs-progs: Remove fs_info parameter from btrfs_leaf_free_space() For btrfs_leaf_free_space(), to get leaf data size, we have two way to get it: 1) leaf->fs_info->nodesize 2) leaf->len Anyway, we could get rid of @fs_info parameter for btrfs_leaf_free_space(). And here we choose method 2), as it provides extra benefit to get leaf free space without initializing a real fs_info. Signed-off-by: Qu Wenruo <wqu@suse.com> Reviewed-by: Su Yue <suy.fnst@cn.fujitsu.com> Signed-off-by: David Sterba <dsterba@suse.com>	2018-06-07 16:37:35 +02:00
Qu Wenruo	4bd7bbb6f6	btrfs-progs: check: Make btrfs check return error for qgroup mismatch Current btrfs-check will check qgroup consistency, but even when it finds something wrong, the return value is still 0. Fix it by allowing report_qgroups() to return int to indicate qgroup mismatch, and also add extra logic to return no error if qgroup repair is successful. Without this patch, fstests can't detect qgroup corruption by its fsck alone. Signed-off-by: Qu Wenruo <wqu@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2018-06-07 16:37:33 +02:00
Nikolay Borisov	873fba7101	btrfs-progs: Remove devid parameter from btrfs_rmap_block This parameter was introduced with the original implementation of the function but has never really been used, so just remove it. Signed-off-by: Nikolay Borisov <nborisov@suse.com> Reviewed-by: Qu Wenruo <wqu@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2018-06-07 16:37:32 +02:00
Su Yue	0ee012318f	btrfs-progs: check: lowmem: exclude extents if init-extent-tree in lowmem If options '--init-extent-tree' and '--mode=lowmem' are both input, all metadata blocks will be traversed twice. First one is done by pin_metadata_blocks() in reinit_extent_tree(). Second one is in check_chunks_and_extents_v2(). Excluding instead of pinning metadata blocks before reinitializing th extent tree in lowmem can save some time. Signed-off-by: Su Yue <suy.fnst@cn.fujitsu.com> Signed-off-by: David Sterba <dsterba@suse.com>	2018-06-07 16:37:30 +02:00

1 2 3 4 5 ...

281 Commits