btrfs-progs

mirror of https://github.com/kdave/btrfs-progs synced 2025-02-19 11:16:51 +00:00

Author	SHA1	Message	Date
Marcos Paulo de Souza	59af046fd2	btrfs-progs: qgroup-verify: Remove duplicated message in report_qgroups Since `1d5b2ad9` ("btrfs-progs: qgroup-verify: Don't treat qgroup difference as error if the fs hasn't initialized a rescan") a new message is printed when the qgroups are inconsistent and the rescan hasn't been executed, so remove the later message sent to stderr. While in this function, simplify the check for a not-executed rescan, since !counts.rescan_running and counts.rescan_running == 0 mean the same thing. Reviewed-by: Qu Wenruo <wqu@suse.com> Signed-off-by: Marcos Paulo de Souza <mpdesouza@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2020-03-31 18:37:36 +02:00
Josef Bacik	1173a4ba26	btrfs-progs: fix hole error output in fsck If we don't find holes in our hole rb tree we'll just assume there's a gap from 0 to the length of the file and print that out. But this simply isn't correct, we could have a gap between the last extent and the isize, or 0 and the start of the first extent. Fix the error message to tell us exactly where the hole is. Signed-off-by: Josef Bacik <josef@toxicpanda.com> Signed-off-by: David Sterba <dsterba@suse.com>	2020-03-31 18:37:35 +02:00
Josef Bacik	f2832d534b	btrfs-progs: fix lowmem check's handling of holes Lowmem check had the opposite problem of normal check, it caught gaps that started at 0, but would still fail with my fixes in place. This is because lowmem check doesn't take into account the isize of the inode. Address this by making sure we do not complain about gaps that are after isize. This makes lowmem pass with my fixes applied, and still fail without my fixes. Signed-off-by: Josef Bacik <josef@toxicpanda.com> Signed-off-by: David Sterba <dsterba@suse.com>	2020-03-31 18:37:35 +02:00
Josef Bacik	9123fc6142	btrfs-progs: fix check to catch gaps at the start of the file When writing my test for the i_size patches, I noticed that I was not actually failing without my patches as I should have been. This is because we only check if the inode record extent end is < isize, we don't check if the inode record extent start is > 0. Add this check to make sure we're catching holes that start at the beginning of the file. Signed-off-by: Josef Bacik <josef@toxicpanda.com> Signed-off-by: David Sterba <dsterba@suse.com>	2020-03-31 18:37:35 +02:00
David Sterba	2efe160bc7	btrfs-progs: move name hashing functions to ctree.h and delete hash.h The two functions are trivial and kernel has them in ctree.h, so do the same and remove hash.h. Signed-off-by: David Sterba <dsterba@suse.com>	2020-03-31 18:37:35 +02:00
David Sterba	58bcd4260f	btrfs-progs: move free-space-tree.[ch] to kernel-shared/ The files are very close to kernel versions, for that keep them in the shared directory. Signed-off-by: David Sterba <dsterba@suse.com>	2020-03-31 18:37:34 +02:00
Su Yue	162d891e4a	btrfs-progs: pass @trans to functions working with dirty block groups We are going to touch dirty_bgs in transaction directly, so every call chain should pass @trans to the leaf functions. Reviewed-by: Qu Wenruo <wqu@suse.com> Signed-off-by: Su Yue <Damenly_Su@gmx.com> Signed-off-by: David Sterba <dsterba@suse.com>	2020-03-03 19:58:54 +01:00
Qu Wenruo	b3621da146	btrfs-progs: check/original: Detect invalid extent generation Much like what we have done in lowmem mode, also detect and report invalid extent generation in original mode. Unlike lowmem mode, we have extent_record::generation, which is the higher number of generations in EXTENT_ITEM, EXTENT_DATA or tree block header, so there is no need to check generations in different locations. For repair, we still need to depend on --init-extent-tree, as directly modifying extent items can easily cause conflicts with delayed refs, thus it should be avoided. Reviewed-by: Su Yue <Damenly_Su@gmx.com> Signed-off-by: Qu Wenruo <wqu@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2020-01-09 14:27:10 +01:00
Qu Wenruo	918ba847bc	btrfs-progs: check/lowmem: Detect invalid EXTENT_ITEM and EXTENT_DATA generation Since older `btrfs check --init-extent-tree` could cause invalid EXTENT_ITEM generation for data extents, add such check to lowmem mode check. Also add such generation check to file extents too. For the repair part, I don't have any good idea yet. So affected user may depend on --init-extent-tree again. Reviewed-by: Su Yue <Damenly_Su@gmx.com> Signed-off-by: Qu Wenruo <wqu@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2020-01-09 14:27:10 +01:00
Qu Wenruo	8d45dc270a	btrfs-progs: check: Populate extent generation correctly for data extents [BUG] When using `btrfs check --init-extent-tree`, we will create an incorrect generation number for data extents in the extent tree: item 10 key (13631488 EXTENT_ITEM 1048576) itemoff 15828 itemsize 53 refs 1 gen 0 flags DATA extent data backref root FS_TREE objectid 257 offset 0 count 1 [CAUSE] Data extent generation is not as obvious as for tree blocks, which have a header containing their generation, so for data extents extent_record::generation is not really updated, resulting in such a 0 generation. [FIX] To get the generation of a data extent, there are two sources we can rely on: - EXTENT_ITEM There is always a btrfs_extent_item::generation that can be utilized, although this is not possible for the --init-extent-tree use case. - EXTENT_DATA We have btrfs_file_extent_item::generation for regular and preallocated data extents. Since --init-extent-tree will go through subvolume trees, this would be the main source for extent data generation. Then we only need to make add_data_backref() accept a @gen parameter, and pass it down to the extent_record structure. And for the final extent item generation update, here we add extra fallback values, if we can't find FILE_EXTENT items. In that case, we just fall back to the current transid. With this modification, the recreated data EXTENT_ITEM now has the correct generation number: item 10 key (13631488 EXTENT_ITEM 1048576) itemoff 15828 itemsize 53 refs 1 gen 6 flags DATA extent data backref root FS_TREE objectid 257 offset 0 count 1 Reviewed-by: Su Yue <Damenly_Su@gmx.com> Signed-off-by: Qu Wenruo <wqu@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2020-01-09 14:27:10 +01:00
Qu Wenruo	9a85b7db02	btrfs-progs: check: Initialize extent_record::generation member [BUG] When using `btrfs check --init-extent-tree`, there is a pretty high chance that the result fs can't pass tree-checker: BTRFS critical (device dm-3): corrupt leaf: block=5390336 slot=149 extent bytenr=20115456 len=4096 invalid generation, have 16384 expect (0, 360] BTRFS error (device dm-3): block=5390336 read time tree block corruption detected BTRFS error (device dm-3): failed to read block groups: -5 BTRFS error (device dm-3): open_ctree failed [CAUSE] The result fs has a pretty screwed up EXTENT_ITEMs for data extents: item 148 key (20111360 EXTENT_ITEM 4096) itemoff 8777 itemsize 53 refs 1 gen 0 flags DATA extent data backref root FS_TREE objectid 841 offset 0 count 1 item 149 key (20115456 EXTENT_ITEM 4096) itemoff 8724 itemsize 53 refs 1 gen 16384 flags DATA extent data backref root FS_TREE objectid 906 offset 0 count 1 Kernel tree-checker will accept 0 generation, but that 16384 generation is definitely going to trigger the alarm. Looking into the code, it's add_extent_rec_nolookup() allocating a new extent_rec, but not copying all members from parameter @tmpl, resulting generation not properly initialized. [FIX] Just copy tmpl->generation in add_extent_rec_nolookup(). And since all call sites have set all members of @tmpl to 0 before add_extent_rec_nolookup(), we shouldn't get garbage values. For the 0 generation problem, it will be solved in another patch. Issue: #225 (Not the initial report, but extent tree rebuild result) Reviewed-by: Su Yue <Damenly_Su@gmx.com> Signed-off-by: Qu Wenruo <wqu@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2020-01-09 14:27:10 +01:00
Qu Wenruo	3dcce48fd7	btrfs-progs: check/original: Do extra verification on file extent item [BUG] For certain fuzzed image, `btrfs check` will fail with the following call trace: Checking filesystem on issue_213.raw UUID: 99e50868-0bda-4d89-b0e4-7e8560312ef9 [1/7] checking root items [2/7] checking extents Program received signal SIGABRT, Aborted. 0x00007ffff7c88f25 in raise () from /usr/lib/libc.so.6 (gdb) bt #0 0x00007ffff7c88f25 in raise () from /usr/lib/libc.so.6 #1 0x00007ffff7c72897 in abort () from /usr/lib/libc.so.6 #2 0x00005555555abc3e in run_next_block (...) at check/main.c:6398 #3 0x00005555555b0f36 in deal_root_from_list (...) at check/main.c:8408 #4 0x00005555555b1a3d in check_chunks_and_extents (fs_info=0x5555556a1e30) at check/main.c:8690 #5 0x00005555555b1e3e in do_check_chunks_and_extents (fs_info=0x5555556a1e30) a #6 0x00005555555b5710 in cmd_check (cmd=0x555555696920 <cmd_struct_check>, argc #7 0x0000555555568dc7 in cmd_execute (cmd=0x555555696920 <cmd_struct_check>, ar #8 0x0000555555569713 in main (argc=2, argv=0x7fffffffde70) at btrfs.c:386 [CAUSE] This fuzzed images has a corrupted EXTENT_DATA item in data reloc tree: item 1 key (256 EXTENT_DATA 256) itemoff 16111 itemsize 12 generation 0 type 2 (prealloc) prealloc data disk byte 16777216 nr 0 prealloc data offset 0 nr 0 There are several problems with the item: - Bad item size 12 is too small. - Bad key offset offset of EXTENT_DATA type key represents file offset, which should always be aligned to sector size (4K in this particular case). [FIX] Do extra item size and key offset check for original mode, and remove the abort() call in run_next_block(). And to show off how robust lowmem mode is, lowmem can handle it without any hiccup. 
With this fix, original mode can detect the problem properly: Checking filesystem on issue_213.raw UUID: 99e50868-0bda-4d89-b0e4-7e8560312ef9 [1/7] checking root items [2/7] checking extents ERROR: invalid file extent item size, have 12 expect (21, 16283] ERROR: errors found in extent allocation tree or chunk allocation [3/7] checking free space cache [4/7] checking fs roots root 18446744073709551607 root dir 256 error root 18446744073709551607 inode 256 errors 62, no orphan item, odd file extent, bad file extent ERROR: errors found in fs roots found 131072 bytes used, error(s) found total csum bytes: 0 total tree bytes: 131072 total fs tree bytes: 32768 total extent tree bytes: 16384 btree space waste bytes: 124774 file data blocks allocated: 0 referenced 0 Issue: #213 Signed-off-by: Qu Wenruo <wqu@suse.com> Reviewed-by: Su Yue <Damenly_Su@gmx.com> Signed-off-by: David Sterba <dsterba@suse.com>	2020-01-09 14:27:09 +01:00
Qu Wenruo	36128dff44	btrfs-progs: check/lowmem: Fix a false alert on uninitialized value [BUG] When compiling the devel branch with commit fb8f05e40b458 ("btrfs-progs: check: Make repair_imode_common() handle inodes in subvolume trees"), the following warning will be reported: check/mode-common.c|1071 col 23| warning: ‘imode’ may be used uninitialized in this function [-Wmaybe-uninitialized] 1071 | *imode_ret = (imode | 0700); | ~~~~~~~^~~~~~~ This only occurs for regular build. If compiled with D=1, the warning just disappears. [CAUSE] Looks like a bug in gcc optimization. The code will only set @imode_ret when @found is true. And for every "found = true" assignment we have assigned @imode. So this is just a false alert. [FIX] I hope I can fix the problem of GCC, but obviously I can't (at least for now). So let's assign an initial value 0 to @imode to suppress the false alert. Reviewed-by: Nikolay Borisov <nborisov@suse.com> Signed-off-by: Qu Wenruo <wqu@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2019-11-22 19:09:51 +01:00
Johannes Thumshirn	e388bf386b	btrfs-progs: check: warn users about the possible dangers of --repair The manual page of btrfsck clearly states 'btrfs check --repair' is a dangerous operation. Although this warning is in place, users do not read the manual page and/or are used to the behaviour of fsck utilities which repair the filesystem, and thus potentially cause harm. Similar to 'btrfs balance' without any filters, add a warning and a countdown, so users can bail out before eventually corrupting the filesystem more than it already is. To override the timeout, let --force skip it and continue. Signed-off-by: Johannes Thumshirn <jthumshirn@suse.de> Signed-off-by: David Sterba <dsterba@suse.com>	2019-11-22 19:07:20 +01:00
Qu Wenruo	989a99b5f8	btrfs-progs: Replace btrfs_block_group_cache::item with dedicated members We access btrfs_block_group_cache::item mostly for @used and @flags. @flags is already a dedicated member in btrfs_block_group_cache, only @used doesn't have a dedicated member. This patch will remove btrfs_block_group_cache::item and add btrfs_block_group_cache::used. It's the btrfs-progs equivalent of the following kernel patches: btrfs: move block_group_item::used to block group btrfs: move block_group_item::flags to block group btrfs: remove embedded block_group_cache::item Signed-off-by: Qu Wenruo <wqu@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2019-11-18 19:21:09 +01:00
Qu Wenruo	e46281d6fb	btrfs-progs: Refactor excluded extent functions to use fs_info The following functions are just using @root to reach fs_info: - exclude_super_stripes - free_excluded_extents - add_excluded_extent Refactor them to use fs_info directly. Signed-off-by: Qu Wenruo <wqu@suse.com> Reviewed-by: Johannes Thumshirn <jthumshirn@suse.de> Signed-off-by: David Sterba <dsterba@suse.com>	2019-11-18 19:21:08 +01:00
Qu Wenruo	b9ea7c1b23	btrfs-progs: check/original: Add check and repair for invalid inode generation There are at least two bug reports of kernel tree-checker complaining about invalid inode generation. All offending inodes seem to be caused by old kernel around 2014, with inode generation overflow. So add such check and repair ability to lowmem mode check first. This involves: - Calculate the inode generation upper limit Unlike the lowmem mode context, we don't have anyway to determine if this inode belongs to log tree. So we use super_generation + 1 as upper limit, just like what we did in kernel tree checker. - Check if the inode generation is larger than the upper limit - Repair by resetting inode generation to current transaction generation The difference is, in original mode, we have a common trans handle for all repair and reset path for each repair. Reported-by: Charles Wright <charles.v.wright@gmail.com> Reviewed-by: Nikolay Borisov <nborisov@suse.com> Tested-by: Nikolay Borisov <nborisov@suse.com> Signed-off-by: Qu Wenruo <wqu@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2019-11-18 19:21:07 +01:00
Qu Wenruo	0a6e40bdc1	btrfs-progs: check/lowmem: Add check and repair for invalid inode generation There are at least two bug reports of kernel tree-checker complaining about invalid inode generation. All offending inodes seem to be caused by old kernel around 2014, with inode generation overflow. So add such check and repair ability to lowmem mode check first. This involves: - Calculate the inode generation upper limit If it's an inode from log tree, then the upper limit is super_generation + 1, otherwise it's super_generation. - Check if the inode generation is larger than the upper limit - Repair by resetting inode generation to current transaction generation Reported-by: Charles Wright <charles.v.wright@gmail.com> Tested-by: Nikolay Borisov <nborisov@suse.com> Reviewed-by: Nikolay Borisov <nborisov@suse.com> Signed-off-by: Qu Wenruo <wqu@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2019-11-18 19:21:07 +01:00
Qu Wenruo	eae0c8e32f	btrfs-progs: check/original: Fix inode mode in subvolume trees To make original mode to repair imode error in subvolume trees, this patch will do: - Remove the show-stopper checks for root->objectid. Now repair_imode_original() will accept inodes in subvolume trees. - Export detect_imode() for original mode Due to the call requirement, original mode must use an existing trans handler to do the repair, thus we need to re-implement most of the work done in repair_imode_common(). - Make repair_imode_original() to use detect_imode(). - Free the path after reset_imode() reset_imode() keeps the path, as lowmem mode uses path to locate its current check position. But for original mode, the unreleased path can cause later repair to report warning, so we need to manually release the path. - Update rec->imode after imode reset So later repair depending on rec->imode can get correct value. - Move the repair before repair_inode_nlinks() repair_inode_nlinks() needs correct imode to add DIR_INDEX/DIR_ITEM. So moving the repair before repair_inode_nlinks() makes the latter repair happier. Signed-off-by: Qu Wenruo <wqu@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2019-11-18 19:21:07 +01:00
Qu Wenruo	69eac9de0c	btrfs-progs: check/lowmem: Repair bad imode early For lowmem mode, if we hit a bad inode mode, normally it is reported when we checking the DIR_INDEX/DIR_ITEM of the parent inode. If we didn't repair at that time, the error will be recorded even if we fixed it later. So this patch will check for INODE_ITEM_MISMATCH error type, and if it's really caused by invalid imode, repair it and clear the error. Signed-off-by: Qu Wenruo <wqu@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2019-11-18 19:21:07 +01:00
Qu Wenruo	7f8383b7a6	btrfs-progs: check: Make repair_imode_common() handle inodes in subvolume trees [[PROBLEM]] Before this patch, repair_imode_common() can only handle two types of inodes: - Free space cache inodes - ROOT DIR inodes For inodes in subvolume trees, the core complexity is how to determine the correct imode, thus it was not implemented. However, there are more reports of incorrect imode in subvolume trees, so we need to support such a fix. [[ENHANCEMENT]] So this patch adds a new function, detect_imode(), to detect imode for inodes in subvolume trees. The policy here is, try our best to find a valid imode to recover. If no convincing info can be found, fail out. That function will determine imode by: 1) Search for INODE_REF of the inode If we have INODE_REF, we will then try to find DIR_ITEM/DIR_INDEX. As long as one valid DIR_ITEM or DIR_INDEX can be found, we convert the BTRFS_FT_* to imode, then call it a day. This should be the most accurate way. 2) Search for DIR_INDEX/DIR_ITEM belonging to this inode If the above search fails, we fall back to locating the DIR_INDEX/DIR_ITEM just after the INODE_ITEM. Thus this only works for a non-empty directory. If any can be found, it's definitely a directory. 3) Search for EXTENT_DATA belonging to this inode If EXTENT_DATA can be found, it's either REG or LNK. Thus this only works for a non-empty file or soft link. For this case, we default to REG, as the user can inspect the file to determine if it's a file or just a path. 4) Use rdev to detect BLK/CHR If all of the above fails, but INODE_ITEM has non-zero rdev, then it's either a BLK or CHR file. Then we default to BLK. 5) Fail out if none of the above methods succeeded No educated guess to make things worse. [[SHORTCOMING]] The above search is not perfect, there are cases where we can't repair: E.g. orphan empty regular inode. Since it's already orphan, it has no INODE_REF. And it's a regular empty file, so it has no DIR_INDEX nor EXTENT_DATA nor rdev. Thus we can't recover. 
Although for this case, it really doesn't matter as it's already orphan and will be deleted anyway. Furthermore, due to the DIR_ITEM/DIR_INDEX/INODE_REF repair code which can happen before imode repair, it's possible that DIR_ITEM search code may not be executed. If there is only DIR_ITEM remaining, repair code will remove the DIR_ITEM completely and move the inode to lost+found, leaving us no info to rebuild imode. If there is DIR_INDEX missing, repair code will re-insert the DIR_INDEX, then imode repair code will go DIR_INDEX directly. But overall, the repair code should handle the invalid imode caused by older kernels without problem. Signed-off-by: Qu Wenruo <wqu@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2019-11-18 19:21:07 +01:00
Qu Wenruo	ac9e07a780	btrfs-progs: check: find imode using info from INODE_REF item Introduce a function, find_file_type(), to find filetype using info from INODE_REF, including dir_id from key index/name from inode_ref_item. This function will: - Search DIR_INDEX first DIR_INDEX is easier since there is only one item in it. - Validate the DIR_INDEX item If the DIR_INDEX is valid, use the filetype and call it a day. - Search DIR_ITEM then It needs extra iteration since it's possible to have hash collision. - Validate the DIR_ITEM If valid, call it a day. Or return -ENOENT; This would be used as the primary method to determine the imode in later imode repair code. Signed-off-by: Qu Wenruo <wqu@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2019-11-18 19:21:07 +01:00
Qu Wenruo	87207654f1	btrfs-progs: check: Export btrfs_type_to_imode This function will be later used by common mode code, so export it. Signed-off-by: Qu Wenruo <wqu@suse.com> Reviewed-by: Nikolay Borisov <nborisov@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2019-11-18 19:21:07 +01:00
Rosen Penev	5d72055066	btrfs-progs: Fix printf formats Discovered with cppcheck. Fix signed/unsigned int mismatches, sizeof and long formats. Pull-request: #197 Signed-off-by: Rosen Penev <rosenp@gmail.com> Signed-off-by: David Sterba <dsterba@suse.com>	2019-10-14 17:31:05 +02:00
Johannes Thumshirn	ed33908b44	btrfs-progs: update checksumming api Update the checksumming API to be able to cope with more checksum types than just CRC32C. The finalization call is merged into btrfs_csum_data. There are some fixme's and asserts added that need to be resolved. Co-developed-by: David Sterba <dsterba@suse.com> Signed-off-by: Johannes Thumshirn <jthumshirn@suse.de> Signed-off-by: David Sterba <dsterba@suse.com>	2019-10-14 17:28:34 +02:00
Johannes Thumshirn	7b4f1035a6	btrfs-progs: pass checksum type to btrfs_csum_data()/btrfs_csum_final() In preparation to supporting new checksum algorithm pass the checksum type to btrfs_csum_data/btrfs_csum_final, this allows us to encapsulate any differences in processing into the respective functions Signed-off-by: Johannes Thumshirn <jthumshirn@suse.de> Signed-off-by: David Sterba <dsterba@suse.com>	2019-10-14 17:28:28 +02:00
Johannes Thumshirn	de68086e35	btrfs-progs: don't assume checksums are always 4 bytes Pass pointer to a generic buffer instead of fixed size that crc32c currently uses. Signed-off-by: Johannes Thumshirn <jthumshirn@suse.de> Signed-off-by: David Sterba <dsterba@suse.com>	2019-10-14 17:28:25 +02:00
David Sterba	0889161127	btrfs-progs: check: move device_record to main.c The structure is used only inside main.c, no need to export it. Signed-off-by: David Sterba <dsterba@suse.com>	2019-10-14 17:27:41 +02:00
David Sterba	705f4ecf02	btrfs-progs: check: remove flat include switch from common.h The header will be used only internally, so the FLAT_INCLUDES is not needed. Signed-off-by: David Sterba <dsterba@suse.com>	2019-10-14 17:27:39 +02:00
David Sterba	1b42e71053	btrfs-progs: copy btrfsck.h to check/common.h We want just one header for the check API (similar to what mkfs does) but as btrfsck.h is an exported header (libbtrfs), it needs some deprecation period before it's moved, though there are probably no users of that header file in particular. Copy the header to check, all modifications and cleanups won't affect the public header. Signed-off-by: David Sterba <dsterba@suse.com>	2019-10-14 17:27:37 +02:00
David Sterba	8e68287d88	btrfs-progs: move qgroup-verify.[ch] to check/ The only user of the functionality is in check. Signed-off-by: David Sterba <dsterba@suse.com>	2019-10-14 17:27:35 +02:00
Qu Wenruo	c31bd8db54	btrfs-progs: check/lowmem: Skip nbytes check for orphan inodes For orphan inodes, kernel won't update its nbytes and size since it's a waste of time. So lowmem check can report false alert on some orphan inodes. Fix it by checking if the inode is an orphan before complaining/repairing its nbytes. Signed-off-by: Qu Wenruo <wqu@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2019-09-04 16:08:56 +02:00
Josef Bacik	73ff25c65d	btrfs-progs: check: don't check nbytes on unlinked files We don't update the inode when evicting it, so the nbytes will be wrong in between transaction commits. This isn't a problem, stop complaining about it to make generic/269 stop randomly failing. The orphan outdated inodes can be still present but check will not skip them. Signed-off-by: Josef Bacik <josef@toxicpanda.com> Signed-off-by: David Sterba <dsterba@suse.com>	2019-09-04 16:08:11 +02:00
Qu Wenruo	1baf729743	btrfs-progs: check/original: Check and repair root item generation Add such ability to original mode to fix root generation mismatch, which can be rejected by kernel. Reviewed-by: Nikolay Borisov <nborisov@suse.com> Signed-off-by: Qu Wenruo <wqu@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2019-09-04 16:06:52 +02:00
Qu Wenruo	ebf29e39c3	btrfs-progs: check/lowmem: Check and repair root generation Since kernel is going to reject any root item which is newer than super block generation, we need to provide a way to fix such problem in btrfs-check. This patch adds the ability to report and repair root generation in lowmem mode. This is done by cowing the root node, so we will update the root generation along with the root node generation at commit transaction time. Reviewed-by: Nikolay Borisov <nborisov@suse.com> Signed-off-by: Qu Wenruo <wqu@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2019-09-04 16:06:50 +02:00
Adam Borowski	90a87e817d	btrfs-progs: check: fix option parsing for -E The option -E has a mandatory argument that was missing from the short option spec, thus it always crashed. Signed-off-by: Adam Borowski <kilobyte@angband.pl> Signed-off-by: David Sterba <dsterba@suse.com>	2019-09-04 16:06:24 +02:00
Naohiro Aota	8ca6c0c3c7	btrfs-progs: check: initialize qgroup_item_count in earlier stage "btrfsck -Q" segfaults because it does not call qgroup_set_item_count_ptr() properly: # btrfsck -Q /dev/sdk Opening filesystem to check... Checking filesystem on /dev/sdk UUID: 34a35bbc-43f8-40f0-8043-65ed33f2e6c3 Print quota groups for /dev/sdk UUID: 34a35bbc-43f8-40f0-8043-65ed33f2e6c3 Segmentation fault (core dumped) Since "struct task_ctx ctx" is global, we can just move qgroup_set_item_count_ptr() to a much earlier stage in the check process to avoid forgetting to initialize it. Signed-off-by: Naohiro Aota <naohiro.aota@wdc.com> Reviewed-by: Qu Wenruo <wqu@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2019-07-26 17:46:42 +02:00
Josef Bacik	42a1aaeec4	btrfs-progs: deal with drop_progress properly in fsck While testing snapshot deletion with dm-log-writes I saw that I was failing the fsck sometimes when the fs was actually in the correct state. This is because we only skip blocks on the same level of root_item->drop_level. If the drop_level < the root level then we could very well walk into nodes that we wouldn't actually walk into on fs mount, because the drop_progress is further ahead in the slot of the root. Instead only process the slots of the nodes that are above the drop_progress key. With this patch in place we no longer improperly fail to check fs'es that have a drop_progress set with a drop_level < root level. Signed-off-by: Josef Bacik <josef@toxicpanda.com> Signed-off-by: David Sterba <dsterba@suse.com>	2019-07-26 17:46:42 +02:00
David Sterba	c8bea2b84b	btrfs-progs: move rbtree-utils.[ch] to common/ Signed-off-by: David Sterba <dsterba@suse.com>	2019-07-03 20:49:04 +02:00
David Sterba	c07960c8be	btrfs-progs: move utils.[ch] to common/ Update include paths and remove some duplicates. Signed-off-by: David Sterba <dsterba@suse.com>	2019-07-03 20:49:04 +02:00
David Sterba	aac564aca6	btrfs-progs: move commonh to common/ Signed-off-by: David Sterba <dsterba@suse.com>	2019-07-03 20:49:03 +02:00
David Sterba	f93b471143	btrfs-progs: move help.[ch] to common/ Signed-off-by: David Sterba <dsterba@suse.com>	2019-07-03 20:49:03 +02:00
David Sterba	d0970a05cd	btrfs-progs: move task-utils.[ch] to common/ Signed-off-by: David Sterba <dsterba@suse.com>	2019-07-03 20:49:03 +02:00
David Sterba	d1efe50d0a	btrfs-progs: move messages.[ch] to common/ Signed-off-by: David Sterba <dsterba@suse.com>	2019-07-03 20:49:03 +02:00
David Sterba	f63f29e9e9	btrfs-progs: move internal.h to common/ Create directory for all sources that can be used by anything that's not related to a relevant kernel part, all common functions, helpers, utilities that do not fit any other specific category. The traditional location would be probably lib/ with all things that are statically linked to the main binaries, but we have libbtrfs and libbtrfsutil so this would be confusing. Signed-off-by: David Sterba <dsterba@suse.com>	2019-07-03 20:49:03 +02:00
Nikolay Borisov	32f75c87a0	btrfs-progs: check: Remove duplicated and commented functions Commit `756105181e` ("btrfs-progs: check: supplement extent backref list with rbtree") changed the backref implementation to use rb tree and also commented the old implementations. It's been almost 2 years since that change and it's unlikely the old version will ever be used, so just remove it. Signed-off-by: Nikolay Borisov <nborisov@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2019-07-03 13:31:16 +02:00
Jeff Mahoney	91a1476d53	btrfs-progs: pass cmd_struct to usage() Now that every call site has a cmd_struct, we can just pass the cmd_struct to usage to print the usage information. This allows us to interpret the format flags we'll add later in this series to inform the user of which output formats any given command supports. Signed-off-by: Jeff Mahoney <jeffm@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2019-07-03 13:31:15 +02:00
Jeff Mahoney	b44445131f	btrfs-progs: pass cmd_struct to command callback function This patch passes the cmd_struct to the command callback function. This has several purposes: It allows the command callback to identify which command was used to call it. It also gives us direct access to the usage associated with that command. Signed-off-by: Jeff Mahoney <jeffm@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2019-07-03 13:31:15 +02:00
Jeff Mahoney	82d48463ea	btrfs-progs: use cmd_struct as command entry point Rather than having global command usage and callbacks used to create cmd_structs in the command array, establish the cmd_struct structures separately and use those. The next commit in the series passes the cmd_struct to the command callbacks such that we can access flags and determine which of several potential command we were called as. This establishes several macros to more easily define the commands within each command's source. Signed-off-by: Jeff Mahoney <jeffm@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2019-07-03 13:31:15 +02:00
Qu Wenruo	48248693cd	btrfs-progs: check/lowmem: Reset path in repair mode to avoid incorrect item from being passed to lowmem check. In lowmem mode, we check fs roots and free space cache by iterating each root item and inode item, using btrfs_next_item() and a path pointing to the root tree. However in repair mode, check_fs_root() can modify the fs root, thus CoWs the tree root, and the old path in check_fs It could lead to strange behavior, e.g. after repairing a fs tree, the path can point to a fs tree. Since no ROOT_ITEM exists in fs tree, all remaining trees are skipped in repair mode. This bug exists from the early time of lowmem mode repair, and is only exposed by recent free space inode check code. (Fs tree inodes are passed to free space inode check, causing false alerts and repair failure). Signed-off-by: Qu Wenruo <wqu@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2019-06-05 18:02:05 +02:00
Qu Wenruo	085445e793	btrfs-progs: Cleanup BTRFS_COMPAT_EXTENT_TREE_V0 BTRFS_COMPAT_EXTENT_TREE_V0 was introduced for a short time in the kernel, over 10 years ago. Nowadays there should be no user of that feature, and the kernel removed this support in June 2018. There is no need for btrfs-progs to support it. This patch removes EXTENT_TREE_V0 related code and replaces those BUG_ON() calls with a more graceful error message. Reviewed-by: Nikolay Borisov <nborisov@suse.com> Signed-off-by: Qu Wenruo <wqu@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2019-06-05 18:00:07 +02:00
Qu Wenruo	d73427b76d	btrfs-progs: check/original: Add checks for compressed extent without csum There is one report of a compressed extent in btrfs that has no csum, which can lead to a decompression error corrupting kernel memory. Although it's a kernel bug, and won't cause problems until the compressed data gets corrupted, let's catch such problems in advance. This patch will catch any unexpected compressed extent with: 1) 0 or fewer than expected csums 2) nodatasum flag set in the inode item This is for original mode. Reported-by: James Harvey <jamespharvey20@gmail.com> Signed-off-by: Qu Wenruo <wqu@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2019-05-17 12:32:38 +02:00
Qu Wenruo	cc7a35f642	btrfs-progs: check/lowmem: Add checks for compressed extent without csum There is one report of a compressed extent in btrfs that has no csum, which can lead to a decompression error corrupting kernel memory. Although it's a kernel bug, and won't cause problems until the compressed data gets corrupted, let's catch such problems in advance. This patch will catch any unexpected compressed extent with: 1) missing csum 2) nodatasum flag set in the inode item This is for lowmem mode. Reported-by: James Harvey <jamespharvey20@gmail.com> Signed-off-by: Qu Wenruo <wqu@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2019-05-17 12:32:38 +02:00
Jeff Mahoney	50faf73f22	btrfs-progs: check: fixup_extent_flags needs to deal with non-skinny metadata When repairing a file system created by a very old kernel, I ran into issues fixing up the extent flags since fixup_extent_flags assumed that a METADATA_ITEM would be present if the record was for metadata. Since METADATA_ITEMs don't exist without skinny metadata, we need to fall back to EXTENT_ITEMs. This also falls back to EXTENT_ITEMs even with skinny metadata enabled as other parts of the tools do. Reviewed-by: Filipe Manana <fdmanana@suse.com> Signed-off-by: Jeff Mahoney <jeffm@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2019-05-17 12:32:38 +02:00
Qu Wenruo	427990ad74	btrfs-progs: check/original: Check and repair free space cache inode item Just like lowmem mode, also check and repair free space cache inode items. Since we don't really have a good time or function to check free space cache inodes, we use the same mode-common function check_repair_free_space_inode() when iterating the root tree. Signed-off-by: Qu Wenruo <wqu@suse.com>	2019-04-16 09:04:25 +08:00
Qu Wenruo	77fe19ba16	btrfs-progs: check/lowmem: Check and repair free space cache inode mode Unlike inodes in fs roots, we don't really check the inode items in the root tree; in fact we just skip everything other than ROOT_ITEM and ROOT_REF. This lets invalid inode items sneak into the root tree. For example: item 9 key (256 INODE_ITEM 0) itemoff 13702 itemsize 160 generation 30 transid 30 size 65536 nbytes 1507328 block group 0 mode 0 (should be 100600) links 1 uid 0 gid 0 rdev 0 sequence 23 flags 0x1b(NODATASUM|NODATACOW|NOCOMPRESS|PREALLOC) atime 0.0 (1970-01-01 08:00:00) ctime 1553491158.189771625 (2019-03-25 13:19:18) mtime 0.0 (1970-01-01 08:00:00) otime 0.0 (1970-01-01 08:00:00) There is a report of such a problem on the mailing list. This patch will check and repair inode items of free space cache inodes in lowmem mode. Since free space cache inodes don't have an INODE_REF but still have 1 link, we can't use check_inode_item() directly. Instead we only check the inode mode, as that's the important part. The check and repair function, check_repair_free_space_inode(), is also exported for original mode. Signed-off-by: Qu Wenruo <wqu@suse.com>	2019-04-16 09:04:25 +08:00
Qu Wenruo	11fd6cff82	btrfs-progs: check/original: Repair invalid inode mode in root tree This patch will reuse the mode independent repair_imode() function, to repair invalid inode mode. Signed-off-by: Qu Wenruo <wqu@suse.com>	2019-04-16 09:04:25 +08:00
Qu Wenruo	64ca43a8ad	btrfs-progs: check/lowmem: Repair invalid inode mode in root tree In the root tree, we only have 2 types of inodes: - ROOT_TREE_DIR inode Its mode is fixed to 40755 - free space cache inodes Their mode is fixed to 100600 This patch adds the ability to repair such inodes to lowmem mode. For fs/subvolume tree errors, we haven't seen such corruption yet, so we don't need to rush to fix corruption in fs trees. The repair functions, reset_imode() and repair_imode_common(), can be reused by a later original mode patch, so they are placed in check/mode-common.c. Signed-off-by: Qu Wenruo <wqu@suse.com>	2019-04-16 09:04:25 +08:00
Qu Wenruo	23f1e9a13f	btrfs-progs: check/original: Add inode mode check Just like lowmem mode, check the inode mode, especially the S_IFMT bits and beyond. Please note that this check only applies to inodes in fs/subvol trees. It doesn't apply to free space cache inodes. Reported-by: Thorsten Hirsch <t.hirsch@web.de> Signed-off-by: Qu Wenruo <wqu@suse.com>	2019-04-16 09:04:25 +08:00
Qu Wenruo	c06c5eef88	btrfs-progs: check/lowmem: Add inode mode check There is one report about an invalid free space cache inode mode. Normally a free space cache inode should have mode 100600 (regular file, no uid/gid/sticky bit, rw------- bits). But in that report, the free space cache inode mode is 0. So btrfs check should at least report an invalid inode mode. This patch makes btrfs check in lowmem mode detect this problem. Please note that this check only applies to inodes in fs/subvol trees. It doesn't apply to free space cache inodes. Reported-by: Thorsten Hirsch <t.hirsch@web.de> Signed-off-by: Qu Wenruo <wqu@suse.com>	2019-04-16 09:04:25 +08:00
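A minimal sketch of the kind of inode mode check described in the two commits above. This is not the btrfs-progs implementation: the function name and the SK_* constants are hypothetical, and the octal values are the usual Linux S_IFMT file type encodings, written out locally so the sketch does not depend on feature-test macros.

```c
#include <assert.h>
#include <stdint.h>

/* Hypothetical local copies of the Linux file type bits (octal). */
#define SK_IFMT   0170000u
#define SK_IFSOCK 0140000u
#define SK_IFLNK  0120000u
#define SK_IFREG  0100000u
#define SK_IFBLK  0060000u
#define SK_IFDIR  0040000u
#define SK_IFCHR  0020000u
#define SK_IFIFO  0010000u

/*
 * Return 1 if the mode has exactly one known file type in the S_IFMT
 * bits and no bits set beyond the type and permission bits, else 0.
 */
static int imode_looks_valid_sketch(uint32_t mode)
{
	/* Reject anything set above the type + permission bits. */
	if (mode & ~(SK_IFMT | 07777u))
		return 0;

	switch (mode & SK_IFMT) {
	case SK_IFSOCK:
	case SK_IFLNK:
	case SK_IFREG:
	case SK_IFBLK:
	case SK_IFDIR:
	case SK_IFCHR:
	case SK_IFIFO:
		return 1;
	default:
		/* Includes mode 0, the value seen in the report. */
		return 0;
	}
}
```

With this sketch, 0100600 (the expected free space cache inode mode) and 040755 (ROOT_TREE_DIR) pass, while mode 0 is rejected.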
Qu Wenruo	db51d8d8f6	btrfs-progs: Use @fs_info to replace @root for btrfs_check_leaf/node() Signed-off-by: Qu Wenruo <wqu@suse.com>	2019-04-16 09:04:25 +08:00
Su Yanjun	cedbfc2561	btrfs-progs: check: Delete file extent item with unaligned disk bytenr For test case fsck-tests/001-bad-file-extent-bytenr, we have an obviously hand crafted image with unaligned file extent: item 7 key (257 EXTENT_DATA 0) itemoff 3453 itemsize 53 generation 6 type 1 (regular) extent data disk byte 755944791 nr 1048576 extent data offset 0 nr 1048576 ram 1048576 extent compression 0 (none) disk bytenr 755944791 is obviously unaligned (not even). For such obviously corrupted file extent, we should just delete the file extent. Signed-off-by: Su Yanjun <suyj.fnst@cn.fujitsu.com> [Update commit message and comment] Signed-off-by: Qu Wenruo <wqu@suse.com>	2019-04-16 09:04:25 +08:00
Su Yanjun	3b35deeadd	btrfs-progs: check: fix wrong @offset used in find_possible_backrefs() Function find_possible_backrefs() is used to locate the file extents referring to a data extent. For a data extent backref, its btrfs_extent_data_ref structure has the following members: - root Which root refers to this data extent - objectid Which inode refers to this data extent - offset Search hint. Its value is @file_offset - @extent_offset. @file_offset, however, is recorded directly in the (INO EXTENT_DATA FILE_OFFSET) key. So when searching for the file extents that refer to this data extent, we can't use btrfs_extent_data_ref::offset as the search key::offset. We must search from file offset 0, and iterate all file extents until we hit a file extent that matches the data backref. Thankfully such time consuming behavior is not triggered frequently; it only gets called for repair, so it shouldn't affect the normal check routine. Signed-off-by: Su Yanjun <suyj.fnst@cn.fujitsu.com> [Update commit message] Signed-off-by: Qu Wenruo <wqu@suse.com>	2019-04-16 09:04:25 +08:00
Su Yanjun	b6a0d97cba	Revert "btrfs-progs: Record orphan data extent ref to corresponding root." Commit `0ddf63c09f` ("btrfs-progs: Record orphan data extent ref to corresponding root.") introduces the ability to record a file extent even if all other related info is lost (data backref, inode item). However this patch only records such info without doing any proper repair; furthermore, it could even record invalid file extents, and the report part only happens after all checks are done. Since we will later introduce proper file extent repair functionality, we can revert that patch. Signed-off-by: Su Yanjun <suyj.fnst@cn.fujitsu.com> [Update commit message, solve merge conflicts] Signed-off-by: Qu Wenruo <wqu@suse.com>	2019-04-16 09:03:51 +08:00
Su Yanjun	872c116c75	Revert "btrfs-progs: Add repair and report function for orphan file extent." Commit `ad03f840f0` ("btrfs-progs: Add repair and report function for orphan file extent.") will record and try to repair orphan file extents by: - Removing the orphan file extent item if no extent backref can be found Or - Re-inserting a file extent using the data backref Especially the latter case is far from ideal, as normally the extent tree is more fragile and corruption prone. Using any data from the extent tree to try to repair could easily lead to further corruption. So here we revert commit `ad03f840f0` ("btrfs-progs: Add repair and report function for orphan file extent.") to clean up the space for later proper repair in original mode. Signed-off-by: Su Yanjun <suyj.fnst@cn.fujitsu.com> [Update commit message, solve conflicts with DIR_ITEM hash mismatch patchset] Signed-off-by: Qu Wenruo <wqu@suse.com>	2019-04-16 09:03:02 +08:00
Su Yue	0617bde3bc	btrfs-progs: lowmem: delete unaligned bytes extent data under repair If an extent data item is found to have an unaligned part, lowmem repair just deletes it. Reviewed-by: Qu Wenruo <wqu@suse.com> Signed-off-by: Su Yue <suy.fnst@cn.fujitsu.com> Signed-off-by: Qu Wenruo <wqu@suse.com>	2019-04-16 09:02:13 +08:00
Su Yue	a170ef86ff	btrfs-progs: lowmem: rename delete_extent_tree_item() to delete_item() The function can delete items in trees besides extent tree. Rename and move it for further use. Reviewed-by: Qu Wenruo <wqu@suse.com> Signed-off-by: Su Yue <suy.fnst@cn.fujitsu.com> [Update comment, solve merge conflicts] Signed-off-by: Qu Wenruo <wqu@suse.com>	2019-04-16 09:02:09 +08:00
Su Yue	2a058a2204	btrfs-progs: lowmem: check unaligned disk_bytenr for extent_data Add support to check unaligned disk_bytenr for extent_data. Reviewed-by: Qu Wenruo <wqu@suse.com> Signed-off-by: Su Yue <suy.fnst@cn.fujitsu.com> Signed-off-by: Qu Wenruo <wqu@suse.com>	2019-04-16 09:02:08 +08:00
Su Yue	cbe66f5804	btrfs-progs: lowmem: fix false alert if extent item has been repaired Previously, @err was assigned immediately after check but before repair. repair_extent_item()'s return value also confuses the caller. If the error has been repaired and it returns 0, check_extent_item() will try to continue checking the nonexistent item and cause false alerts. Here make repair_extent_item()'s return codes only represent the status of the extent item; error bits are handled in the caller of the repair function. Change @err after repair. Reviewed-by: Qu Wenruo <wqu@suse.com> Signed-off-by: Su Yue <suy.fnst@cn.fujitsu.com> [Solve conflicts with DIR_ITEM hash mismatch patchset] Signed-off-by: Qu Wenruo <wqu@suse.com>	2019-04-16 09:02:06 +08:00
Su Yue	0ea38ae08d	btrfs-progs: lowmem: move nbytes check before isize check For files, lowmem repair will try to check nbytes and isize, but the isize check depends on nbytes. Once nbytes has been repaired, isize should be checked and repaired. So move the nbytes check before the isize check. Also set nbytes to extent_size once repaired successfully. Reviewed-by: Qu Wenruo <wqu@suse.com> Signed-off-by: Su Yue <suy.fnst@cn.fujitsu.com> Signed-off-by: Qu Wenruo <wqu@suse.com>	2019-04-16 09:02:04 +08:00
Su Yue	ee455db07e	btrfs-progs: lowmem: add argument path to punch_extent_hole() Since repair will do CoW, the outer path may be invalid. This patch will add an argument, @path, to punch_extent_hole(). When punch_extent_hole() returns, path will still point to the same key. Reviewed-by: Qu Wenruo <wqu@suse.com> Signed-off-by: Su Yue <suy.fnst@cn.fujitsu.com> [Update comment and commit message] Signed-off-by: Qu Wenruo <wqu@suse.com>	2019-04-16 09:02:01 +08:00
Lu Fengqi	fa13420cc7	btrfs-progs: lowmem: fix false alert about the existence of gaps in the check_file_extent The 'end' parameter of check_file_extent tracks the ending offset of the last checked extent. This is used to detect gaps between adjacent extents. Currently such gaps are wrongly detected since for regular extents only the size of the extent is added to the 'end' parameter. This results in wrongly considering all extents of a file as having gaps between them when only 2 of them really have a gap, as seen in the example below. Solution: The extent_end variable should be set to the sum of the offset and the extent_num_bytes of the file extent. Example: Suppose that lowmem checks the following file extents of inode 257. item 6 key (257 EXTENT_DATA 0) itemoff 15813 itemsize 53 generation 6 type 1 (regular) extent data disk byte 13631488 nr 4096 extent data offset 0 nr 4096 ram 4096 extent compression 0 (none) item 7 key (257 EXTENT_DATA 8192) itemoff 15760 itemsize 53 generation 6 type 1 (regular) extent data disk byte 13631488 nr 4096 extent data offset 0 nr 4096 ram 4096 extent compression 0 (none) item 8 key (257 EXTENT_DATA 12288) itemoff 15707 itemsize 53 generation 6 type 1 (regular) extent data disk byte 13631488 nr 4096 extent data offset 0 nr 4096 ram 4096 extent compression 0 (none) For inode 257, check_inode_item sets extent_end to 0, then calls check_file_extent to check items {6,7,8}. item 6) offset(0) == extent_end(0) extent_end = extent_end(0) + extent_num_bytes(4096) item 7) offset(8192) != extent_end(4096) extent_end = extent_end(4096) + extent_num_bytes(4096) ^^^ The old extent_end should be replaced by offset(8192). item 8) offset(12288) != extent_end(8192) ^^^ But there is no gap between items {7,8}. 
Fixes: `d88da10ddd` ("btrfs-progs: check: introduce function to check file extent") Reviewed-by: Qu Wenruo <wqu@suse.com> Signed-off-by: Lu Fengqi <lufq.fnst@cn.fujitsu.com> [Move this patch as the 1st patch, since it's an independent fix] Signed-off-by: Qu Wenruo <wqu@suse.com>	2019-04-16 09:01:53 +08:00
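The extent_end arithmetic in the commit above can be sketched as follows. This is only an illustration under stated assumptions, not the check_file_extent() code: struct file_extent_sketch and count_gaps_sketch() are hypothetical names, and the extents are assumed to be sorted by file offset.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/* Hypothetical, simplified stand-in for a regular file extent item. */
struct file_extent_sketch {
	uint64_t offset;    /* file offset taken from the EXTENT_DATA key */
	uint64_t num_bytes; /* length of the extent within the file */
};

/*
 * Count gaps between adjacent extents.
 * fixed == 0: old behavior, extent_end only grows by num_bytes.
 * fixed != 0: new behavior, extent_end = offset + num_bytes.
 */
static size_t count_gaps_sketch(const struct file_extent_sketch *ext,
				size_t n, int fixed)
{
	uint64_t extent_end = 0;
	size_t gaps = 0;

	assert(ext != NULL || n == 0);

	for (size_t i = 0; i < n; i++) {
		if (ext[i].offset != extent_end)
			gaps++;
		if (fixed)
			extent_end = ext[i].offset + ext[i].num_bytes;
		else
			extent_end += ext[i].num_bytes;
	}
	return gaps;
}
```

For the three extents in the example (offsets 0, 8192 and 12288, each 4096 bytes long), the old arithmetic reports two gaps and the fixed arithmetic reports the single real gap between items 6 and 7.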
Qu Wenruo	e76fb663a7	btrfs-progs: check: Fix false alert about uninitialized variable GCC 8.2.1 will report the following error: check/main.c: In function 'try_repair_inode': check/main.c:2606:5: warning: 'ret' may be used uninitialized in this function [-Wmaybe-uninitialized] if (!ret) { ^ check/main.c:2584:6: note: 'ret' was declared here int ret; ^~~ The offending code is in repair_mismatch_dir_hash(): int ret; printf( "Deleting bad dir items with invalid hash for root %llu ino %llu\n", root->root_key.objectid, rec->ino); while (!list_empty(&rec->mismatch_dir_hash)) { /* do some repair */ } if (!ret) { <<< Here /* do some fix */ } The truth is, to enter try_repair_inode(), we must have the I_ERR_MISMATCH_DIR_HASH bit set for rec->errors. And just after we set I_ERR_MISMATCH_DIR_HASH, we call add_mismatch_dir_hash() and handle its error correctly. So it's impossible to skip the while loop. Fix it by initializing @ret to -EUCLEAN, so even if we hit some impossible case, repair_mismatch_dir_hash() won't falsely consider the mismatched hash fixed. Signed-off-by: Qu Wenruo <wqu@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2019-03-05 12:57:41 +01:00
Jeff Mahoney	8bda5132d8	btrfs-progs: fix stray error message in check Commit `e578b59bf6` ("btrfs-progs: convert strerror to implicit %m") missed adding braces after a conditional so we will see the following message whenever a tree block needs repair, regardless of whether repair was successful: "Failed to repair btree: Success" Signed-off-by: Jeff Mahoney <jeffm@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2019-01-15 18:42:14 +01:00
Qu Wenruo	af816ca930	btrfs-progs: check: orig: Add ability to repair dir item with invalid hash The repair function is reusing delete_corrupted_dir_item(). Since the error can happen for root dir inode, also call try_repair_inode() on root dir inode. This is especially important for old filesystems, since later kernel introduces stricter tree-checker, which could detect such hash mismatch and refuse to read the corrupted leaf. With this repair ability, user could repair with btrfs check --repair. Link: https://bugzilla.opensuse.org/show_bug.cgi?id=1111991 Signed-off-by: Qu Wenruo <wqu@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2019-01-15 18:42:13 +01:00
Qu Wenruo	846f838797	btrfs-progs: check: orig: Use mismatch_dir_hash_record to record bad dir items This changes reporting from current in-place, like: ERROR: DIR_ITEM[256 751495445] name foor.WvG1c1TdU namelen 13 filetype 1 mismatch with its hash, wanted 751495445 have 2870353892 root 5 root dir 256 error To new summary report at the end of the pass: root 5 root dir 256 error root 5 inode 256 errors 40000 Dir items with mismatch hash: name: foor.WvG1c1Td namelen: 13 wanted 0xab161fe4 has 0x2ccae915 Also, with mismatch_dir_hash_record structure, it provides the base for later original mode repair. Signed-off-by: Qu Wenruo <wqu@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2019-01-15 18:42:13 +01:00
Qu Wenruo	38aff3d1ed	btrfs-progs: check: lowmem: Add ability to repair dir item with mismatched hash For DIR_ITEM with mismatch hash, we could just remove the offending dir item from the tree. Lowmem mode will handle the rest, either re-create the correct dir_item or move the orphan inode to lost+found. This is especially important for old filesystems, since later kernel introduces stricter tree-checker, which could detect such hash mismatch and refuse to read the corrupted leaf. With this repair ability, user could repair with 'btrfs check --mode=lowmem --repair'. Link: https://bugzilla.opensuse.org/show_bug.cgi?id=1111991 Signed-off-by: Qu Wenruo <wqu@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2019-01-15 18:42:13 +01:00
Qu Wenruo	39ea41f00b	btrfs-progs: check: orig: Add dev_item check for used bytes and total bytes Signed-off-by: Qu Wenruo <wqu@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2019-01-15 18:42:13 +01:00
Qu Wenruo	607a10d71c	btrfs-progs: check: lowmem: Add dev_item check for used bytes and total bytes Obviously, used bytes can't be larger than total bytes. Signed-off-by: Qu Wenruo <wqu@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2019-01-15 18:42:13 +01:00
Qu Wenruo	6b4ee5612e	btrfs-progs: check: orig: Add ability to detect bad dev extents Unlike lowmem mode check, we don't have good place for original mode to check overlapping device extents. So this patch introduces a new function, btrfs_check_dev_extents(), to handle such extents. Reported-by: Hans van Kranenburg <hans.van.kranenburg@mendix.com> Signed-off-by: Qu Wenruo <wqu@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2019-01-15 18:42:13 +01:00
Qu Wenruo	920d475a98	btrfs-progs: check: lowmem: Add check for overlapping dev extents Add such check to check_dev_item(), since at that time we're also iterating dev extents for dev item accounting. Signed-off-by: Qu Wenruo <wqu@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2019-01-15 18:42:13 +01:00
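A rough sketch of an overlapping device extent check, for illustration only. It assumes the extents of one device are visited in ascending order of start offset; struct dev_extent_sketch and the function name are hypothetical and do not reflect the layout or helpers used in btrfs-progs.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/* Hypothetical, simplified device extent: a byte range on one device. */
struct dev_extent_sketch {
	uint64_t start;
	uint64_t length;
};

/*
 * Return 1 if any extent starts before the previous one ends.
 * Assumes ext[] is sorted by start offset.
 */
static int dev_extents_overlap_sketch(const struct dev_extent_sketch *ext,
				      size_t n)
{
	uint64_t prev_end = 0;

	assert(ext != NULL || n == 0);

	for (size_t i = 0; i < n; i++) {
		if (i > 0 && ext[i].start < prev_end)
			return 1;
		prev_end = ext[i].start + ext[i].length;
	}
	return 0;
}
```

Because the scan stops at the first overlap, tracking only the end of the previous extent is enough: all earlier extents are known to be disjoint at that point.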
Nikolay Borisov	f7717d8cdb	btrfs-progs: Remove fsid/metdata_uuid fields from fs_info Signed-off-by: Nikolay Borisov <nborisov@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2019-01-15 16:20:09 +01:00
Nikolay Borisov	c4aadd9af2	btrfs-progs: Add support for metadata_uuid field Add support for a new metadata_uuid field. This is just a preparatory commit which switches all users of the fsid field for metadata comparison purposes to utilize the new field. This more or less mirrors the kernel patch, additionally: * Update 'btrfs inspect-internal dump-super' to account for the new field. This involves introducing the 'metadata_uuid' line to the output and updating the logic for comparing the fs uuid to the dev_item uuid. Signed-off-by: Nikolay Borisov <nborisov@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2018-12-06 12:51:36 +01:00
Su Yue	dc347d0ec6	btrfs-progs: trivial fix of line break in repair_inode_nbytes_lowmem() Move "\n" at end of the sentence to print. Fixes: `281eec7a9d` ("btrfs-progs: check: repair inode nbytes in lowmem mode") Signed-off-by: Su Yue <suy.fnst@cn.fujitsu.com> Signed-off-by: David Sterba <dsterba@suse.com>	2018-12-05 15:47:06 +01:00
Josh Soref	33d375bc20	btrfs-progs: check: fix typo in device_extent_record::chunk_objectid This header is exported to /usr/include/btrfs but there are no known users, so the change should be safe. Generated by https://github.com/jsoref/spelling Issue: #154 Author: Josh Soref <jsoref@users.noreply.github.com> Signed-off-by: David Sterba <dsterba@suse.com>	2018-11-26 18:24:49 +01:00
Josh Soref	2cd4a76ea9	btrfs-progs: fix typos in user-visible strings * error messages * help strings Generated by https://github.com/jsoref/spelling Issue: #154 Author: Josh Soref <jsoref@users.noreply.github.com> Signed-off-by: David Sterba <dsterba@suse.com>	2018-11-26 18:24:49 +01:00
Josh Soref	b1d39a42a4	btrfs-progs: fix typos in comments Generated by https://github.com/jsoref/spelling Issue: #154 Author: Josh Soref <jsoref@users.noreply.github.com> Signed-off-by: David Sterba <dsterba@suse.com>	2018-11-26 18:24:48 +01:00
Rosen Penev	01e35d9f53	btrfs-progs: treewide: Fix missing declarations Found using -Wmissing-prototypes in GCC. This should improve LTO behavior. Note that set_free_space_tree_thresholds is an unused function. Adding inline seems to remove the unused function warning. Reviewed-by: Nikolay Borisov <nborisov@suse.com> Signed-off-by: Rosen Penev <rosenp@gmail.com> Signed-off-by: David Sterba <dsterba@suse.com>	2018-11-13 13:32:41 +01:00
David Sterba	e578b59bf6	btrfs-progs: convert strerror to implicit %m Similar to the changes where strerror(errno) was converted, continue with the remaining cases where the argument was stored in another variable. The savings in object size are about 4500 bytes: $ size btrfs.old btrfs.new text data bss dec hex filename 805055 24248 19748 849051 cf49b btrfs.old 804527 24248 19748 848523 cf28b btrfs.new Signed-off-by: David Sterba <dsterba@suse.com>	2018-10-31 18:24:14 +01:00
Nikolay Borisov	bf60039cbe	btrfs-progs: check: lowmem: Refactor extent type checks in check_file_extent Make the checks in check_file_extent a bit more explicit. First we check for unknown type and fail accordingly. Then we check for inline extent and handle it in the newly introduced check_file_extent_inline. Finally if none of the above checks triggered then we must have a regular or prealloc extents. Reviewed-by: Qu Wenruo <wqu@suse.com> Reviewed-by: Lu Fengqi <lufq.fnst@cn.fujitsu.com> Signed-off-by: Nikolay Borisov <nborisov@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2018-10-31 18:24:13 +01:00
Nikolay Borisov	a9cddf632f	btrfs-progs: check: lowmem: Refactor extent len test in check_file_extent_inline Instead of having another top-level if which checks for 'extent_num_bytes != item_inline_len' only if we are !compressed, just move the 'if' inside the 'else' branch of the first top-level if, since it has already checked for !compressed or not. No functional changes. Reviewed-by: Qu Wenruo <wqu@suse.com> Reviewed-by: Lu Fengqi <lufq.fnst@cn.fujitsu.com> Signed-off-by: Nikolay Borisov <nborisov@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2018-10-31 18:24:13 +01:00
Nikolay Borisov	2daecf07f3	btrfs-progs: check: lowmem: Factor out inline extent checking code in its own function Since the inline extent code can be largely self-sufficient, factor it out from check_file_extent. No functional changes. Reviewed-by: Qu Wenruo <wqu@suse.com> Reviewed-by: Lu Fengqi <lufq.fnst@cn.fujitsu.com> Signed-off-by: Nikolay Borisov <nborisov@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2018-10-31 18:24:13 +01:00
Nikolay Borisov	be32eb0557	btrfs-progs: check: Add support for freespace tree fixing Now that all the prerequisite code for proper support of free space tree repair is in, it's time to wire it in. This is achieved by first hooking the freespace tree to the __free_extent/alloc_reserved_tree_block functions. And then introducing a wrapper function to contains the existing check_space_cache and the newly introduced repair code. Finally, it's important to note that FST repair code first clears the existing FST in case of any problem found and rebuilds it from scratch. Signed-off-by: Nikolay Borisov <nborisov@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2018-10-25 16:11:40 +02:00
Nikolay Borisov	909357e867	btrfs-progs: Wire up delayed refs This commit enables the delayed refs infrastructure. This entails doing the following: 1. Replacing existing calls of btrfs_extent_post_op (which is the equivalent of delayed refs) with the proper btrfs_run_delayed_refs, as well as eliminating open-coded calls to finish_current_insert and del_pending_extents which execute the delayed ops. 2. Wiring up the addition of delayed refs when freeing extents (btrfs_free_extent) and when adding new extents (alloc_tree_block). 3. Adding calls to btrfs_run_delayed_refs in the transaction commit path alongside comments why every call is needed, since it's not always obvious (those call sites were derived empirically by running and debugging existing tests) 4. Correctly flagging the transaction in which we are reinitialising the extent tree. 5. Moving btrfs_write_dirty_block_groups to btrfs_write_dirty_block_groups since blockgroups should be written to disk after the last delayed refs have been run. Signed-off-by: Nikolay Borisov <nborisov@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2018-10-23 14:48:41 +02:00
Qu Wenruo	b02b426a78	btrfs-progs: fix infinite loop when bad key order repair fails An infinite loop can be triggered during fuzz/003: ====== RUN MAYFAIL btrfs check --repair tests/fuzz-tests/images/bko-199833-reloc-recovery-crash.raw.restored [1/7] checking root items Fixed 0 roots. [2/7] checking extents ctree.c:1650: leaf_space_used: Warning: assertion `data_len < 0` failed, value 1 bad key ordering 18 19 ctree.c:1650: leaf_space_used: Warning: assertion `data_len < 0` failed, value 1 bad key ordering 18 19 ctree.c:1650: leaf_space_used: Warning: assertion `data_len < 0` failed, value 1 bad key ordering 18 19 [CAUSE] In try_to_fix_bad_block() it's possible that btrfs_find_all_roots() finds no root referring to that tree block, thus we can't do any repair. However in that case, we still return 0 since the last caller assigning @ret is btrfs_find_all_roots(), and the ulist while loop doesn't get run at all. And since try_to_fix_bad_block() returns 0, check_block() in check/main.c will return -EAGAIN to re-check the tree block. This leads to the infinite loop. [FIX] Change the default return value from 0 to -EIO in try_to_fix_bad_block(), so if there is no tree referring to the bad tree block, it won't cause infinite loop anymore. Signed-off-by: Qu Wenruo <wqu@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2018-10-23 14:48:40 +02:00
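The [CAUSE]/[FIX] reasoning above comes down to what a function returns when its repair loop never runs. A minimal sketch of that pattern, using hypothetical names (repair_one_root_sketch, try_fix_sketch) rather than the real try_to_fix_bad_block():

```c
#include <assert.h>
#include <errno.h>
#include <stddef.h>

/* Hypothetical per-root repair: succeeds for non-negative root ids. */
static int repair_one_root_sketch(int root_id)
{
	return root_id >= 0 ? 0 : -EINVAL;
}

/*
 * Repair a block through every root that refers to it.
 * The default is -EIO, so an empty root list reports failure instead
 * of a success that would make the caller re-check the block forever.
 */
static int try_fix_sketch(const int *roots, size_t nroots)
{
	int ret = -EIO;

	assert(roots != NULL || nroots == 0);

	for (size_t i = 0; i < nroots; i++) {
		ret = repair_one_root_sketch(roots[i]);
		if (ret)
			break;
	}
	return ret;
}
```

Calling try_fix_sketch() with zero roots returns -EIO, so a caller that retries only on success has a way out of the loop.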
Qu Wenruo	0adf626374	btrfs-progs: exit gracefully when root dir item repair fails Another BUG_ON() during fuzz/003: ====== RUN MAYFAIL btrfs check --init-csum-tree tests/fuzz-tests/images/bko-161821.raw.restored [1/7] checking root items Fixed 0 roots. [2/7] checking extents parent transid verify failed on 4198400 wanted 14 found 1114126 parent transid verify failed on 4198400 wanted 14 found 1114126 Ignoring transid failure owner ref check failed [4198400 4096] repair deleting extent record: key [4198400,169,0] adding new tree backref on start 4198400 len 4096 parent 0 root 5 Repaired extent references for 4198400 ref mismatch on [4222976 4096] extent item 1, found 0 backref 4222976 root 7 not referenced back 0x55e9cc694780 incorrect global backref count on 4222976 found 1 wanted 0 backpointer mismatch on [4222976 4096] owner ref check failed [4222976 4096] repair deleting extent record: key [4222976,169,0] Repaired extent references for 4222976 [3/7] checking free space cache [4/7] checking fs roots parent transid verify failed on 4198400 wanted 14 found 1114126 Ignoring transid failure Wrong generation of child node/leaf, wanted: 1114126, have: 14 root 5 missing its root dir, recreating parent transid verify failed on 4198400 wanted 14 found 1114126 Ignoring transid failure ERROR: child eb corrupted: parent bytenr=4222976 item=0 parent level=1 child level=2 check/main.c:2738: check_inode_recs: BUG_ON `ret` triggered, value -5 failed (ignored, ret=134): btrfs check --init-csum-tree tests/fuzz-tests/images/bko-161821.raw.restored mayfail: returned code 134 (SIGABRT), not ignored test failed for case 003-multi-check-unmounted Just abort current transaction and exit gracefully in this case, the caller handles errors. Signed-off-by: Qu Wenruo <wqu@suse.com> Reviewed-by: Nikolay Borisov <nborisov@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2018-10-23 14:48:40 +02:00
David Sterba	4dca0ccf3d	btrfs-progs: check: reorder options to sections Signed-off-by: David Sterba <dsterba@suse.com>	2018-08-06 15:06:24 +02:00
Qu Wenruo	21d1109168	btrfs-progs: check/original: Don't overwrite return value when we failed to repair In check_inode_recs(), for repair mode we always reset @ret to 0. It makes no sense and later we check @ret to determine if the repair is successful. Fix it by removing the offending overwrite. Signed-off-by: Qu Wenruo <wqu@suse.com> Reviewed-by: Gu Jinxiang <gujx@cn.fujitsu.com> Signed-off-by: David Sterba <dsterba@suse.com>	2018-08-06 15:06:22 +02:00
Qu Wenruo	159f177cf9	btrfs-progs: check/original: Avoid infinite loop when failed to repair inode Exposed by fuzz-tests/003-multi-check-unmounted/ on fuzzed image bko-161811.raw.xz. It's caused by the fact that when check_fs_roots() finds the tree root is modified, it re-searches the tree root by jumping to the again: label. However, the again: label also resets the root objectid to 0. If we fail to repair one fs root but still modify the tree root, we go into an infinite loop. Fix it by recording which root we should skip in repair mode. Signed-off-by: Qu Wenruo <wqu@suse.com> Reviewed-by: Gu Jinxiang <gujx@cn.fujitsu.com> Signed-off-by: David Sterba <dsterba@suse.com>	2018-08-06 15:06:20 +02:00
Qu Wenruo	5f594c3ce3	btrfs-progs: check: Remove the ability to rebuild root overwriting existing tree blocks We have function btrfs_fsck_reinit_root() to reinit csum or extent tree. However this function allows us to let it overwrite existing tree blocks using @overwrite parameter. Such behavior is pretty dangerous while no caller is using this feature explicitly. So just remove @overwrite parameter and allow btrfs_fsck_reinit_root() to error out when it fails to allocate tree block. Signed-off-by: Qu Wenruo <wqu@suse.com> Reviewed-by: Gu Jinxiang <gujx@cn.fujitsu.com> Signed-off-by: David Sterba <dsterba@suse.com>	2018-08-06 15:04:58 +02:00
Stéphane Lesimple	078e9a1cc9	btrfs-progs: check: enhanced progress indicator We reuse the task_position enum and task_ctx struct of the original progress indicator, adding more values and fields for our needs. Then add hooks in all steps of the check to properly record progress. Here's what the output looks like on a 22 TB 5-disk RAID1 FS: Opening filesystem to check... Checking filesystem on /dev/mapper/luks-ST10000VN0004-XXXXXXXX UUID: xxxxxxxx-xxxx-xxxx-xxxx-xxxxxxxxxxxx [1/7] checking extents (0:20:21 elapsed, 950958 items checked) [2/7] checking root items (0:01:29 elapsed, 15121 items checked) [3/7] checking free space cache (0:00:11 elapsed, 4928 items checked) [4/7] checking fs roots (0:51:31 elapsed, 600892 items checked) [5/7] checking csums (0:14:35 elapsed, 754522 items checked) [6/7] checking root refs (0:00:00 elapsed, 232 items checked) [7/7] checking quota groups skipped (not enabled on this FS) found 5286458060800 bytes used, no error found Signed-off-by: Stéphane Lesimple <stephane_btrfs@lesimple.fr> Signed-off-by: David Sterba <dsterba@suse.com>	2018-08-06 15:03:23 +02:00
Su Yue	d53d42fa21	btrfs-progs: lowmem: fix false alerts of referencer count mismatch for blocks relocated Btrfs lowmem check reports such false alerts: ========= ERROR: extent[1419709677568, 1703936] referencer count mismatch (root: 2192, owner: 327635, offset: 0) wanted: 9, have: 13 ========= While in extent tree, the extent has: ========= item 98 key (1419709677568 EXTENT_ITEM 1703936) itemoff 10745 itemsize 92 refs 35 gen 16921 flags DATA extent data backref root 2192 objectid 327635 offset 0 count 9 shared data backref parent 1232784752640 count 4 shared data backref parent 1232784736256 count 13 shared data backref parent 1232784719872 count 9 ========= The extent data 327635 starts in this leaf without BTRFS_HEADER_FLAG_RELOC: ========= leaf 1471521390592 items 146 free space 3671 generation 18581 owner 2192 leaf 1471521390592 flags 0x1(WRITTEN) backref revision 1 ... item 137 key (327635 EXTENT_DATA 0) itemoff 7745 itemsize 53 generation 15770 type 1 (regular) extent data disk byte 1419709677568 nr 1703936 extent data offset 0 nr 131072 ram 1703936 extent compression 0 (none) item 138 key (327635 EXTENT_DATA 131072) itemoff 7692 itemsize 53 generation 15770 type 1 (regular) extent data disk byte 1419709677568 nr 1703936 extent data offset 131072 nr 131072 ram 1703936 extent compression 0 (none) ...(Exactly 9 items here) ========= The next leaf is flagged as relocated, so extent data backrefs are shared not keyed: ========= leaf 1232784752640 items 159 free space 2503 generation 16924 owner 2192 leaf 1232784752640 flags 0x3(WRITTEN|RELOC) backref revision 1 fs uuid 0f43f49d-6e63-4b1b-bc8c-c54da409872d chunk uuid e558e6f7-4f08-4292-91fb-9a775fdd530b item 0 key (327635 EXTENT_DATA 1179648) itemoff 16230 itemsize 53 generation 15770 type 1 (regular) extent data disk byte 1419709677568 nr 1703936 extent data offset 1179648 nr 131072 ram 1703936 extent compression 0 (none) item 1 key (327635 EXTENT_DATA 1310720) itemoff 16177 itemsize 53 generation 15770 type 1 (regular) extent data disk byte 1419709677568 nr 1703936 extent data offset 1310720 nr 131072 ram 1703936 extent compression 0 (none) item 2 key (327635 EXTENT_DATA 1441792) itemoff 16124 itemsize 53 generation 15770 type 1 (regular) extent data disk byte 1419709677568 nr 1703936 extent data offset 1441792 nr 131072 ram 1703936 extent compression 0 (none) item 3 key (327635 EXTENT_DATA 1572864) itemoff 16071 itemsize 53 ======= Lowmem mode starts to count extent data backrefs in first leaf, those backrefs are keyed as wanted. It calls btrfs_next_item to skip to next leaf. However, the next leaf was relocated, so extent data items in this leaf should have shared backrefs and shouldn't be counted. So make lowmem mode not count data backrefs if the leaf is flagged with BTRFS_HEADER_FLAG_RELOC. Reported-by: Chris Murphy <chris@colorremedies.com> Signed-off-by: Su Yue <suy.fnst@cn.fujitsu.com> Signed-off-by: David Sterba <dsterba@suse.com>	2018-08-06 15:03:14 +02:00
Qu Wenruo	c010437455	btrfs-progs: check: orig: Don't panic when unexpected root item is referring to one extent With crafted image, expected root item can refer to certain extent, and original mode uses BUG_ON() to handle such case. Fix it by gracefully return error. Link: https://bugzilla.kernel.org/show_bug.cgi?id=200403 Signed-off-by: Qu Wenruo <wqu@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2018-08-06 15:03:07 +02:00
Nikolay Borisov	f3c7101116	btrfs-progs: check: Drop trans/root arguments from free_extent_hook They are not really needed, what free_extent_hook wants is really a pointer to fs_info so give it to it directly. This is in preparation of delayed refs code. Signed-off-by: Nikolay Borisov <nborisov@suse.com> Reviewed-by: Qu Wenruo <wqu@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2018-08-06 15:01:57 +02:00
David Disseldorp	7490f94dd7	btrfs-progs: check: add experimental flag for lowmem mode The experimental flag is already carried in the manpage, but was removed from the btrfs check usage message as part of refactoring via `87c1bd13c1`. Add it back. Signed-off-by: David Disseldorp <ddiss@suse.de> Reviewed-by: Nikolay Borisov <nborisov@suse.com> Reviewed-by: Su Yue <suy.fnst@cn.fujitsu.com> Signed-off-by: David Sterba <dsterba@suse.com>	2018-08-06 15:01:19 +02:00
Qu Wenruo	eca8dc0757	btrfs-progs: check/lowmem: Repair wrong inline ram_bytes for uncompressed extent Similar to the original mode repair. Reported-by: Steve Leung <sjleung@shaw.ca> Signed-off-by: Qu Wenruo <wqu@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2018-08-06 15:01:03 +02:00
Qu Wenruo	cd28b15496	btrfs-progs: check/lowmem: Prepare check_file_extent() to handle repair Current check_file_extent() doesn't support later repair work, since it doesn't accept btrfs_path structure as parameter, thus it can't modify btrfs trees, or later check will still use the old and wrong path. Use btrfs_path to replace btrfs_key, extent_buffer and slot parameters, so we can modify @path directly for repair, and reduce the number of parameters for check_file_extent(). Signed-off-by: Qu Wenruo <wqu@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2018-08-06 15:00:58 +02:00
Qu Wenruo	e5f50be06f	btrfs-progs: check/original: Detect and repair wrong inline ram_bytes It looks like that around 2014, btrfs kernel has a regression that would cause offset-by-one ram_bytes for inline extent. Add the ability to repair it in original mode. Reported-by: Steve Leung <sjleung@shaw.ca> Tested-by: Steve Leung <sjleung@shaw.ca> Signed-off-by: Qu Wenruo <wqu@suse.com> Reviewed-by: Su Yue <suy.fnst@cn.fujitsu.com> Signed-off-by: David Sterba <dsterba@suse.com>	2018-08-06 15:00:44 +02:00
Qu Wenruo	3c042605db	btrfs-progs: Get rid of the confusing btrfs_file_extent_inline_len() [BUG] If one uncompressed inline extent has incorrect ram_bytes, neither btrfs check nor dump-tree could detect such corruption. [CAUSE] Every caller tries to read inline extent ram_bytes is using btrfs_file_extent_inline_len(), other than directly calling btrfs_file_extent_ram_bytes(). For compressed extent, it's just calling btrfs_file_extent_ram_bytes(). However for uncompressed extent, it falls back to btrfs_file_extent_inline_item_len(), makes us unable to detect anything wrong in ram_bytes. [FIX] Just get rid of such confusing btrfs_file_extent_inline_len() function. Reported-by: Steve Leung <sjleung@shaw.ca> Tested-by: Steve Leung <sjleung@shaw.ca> Signed-off-by: Qu Wenruo <wqu@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2018-08-06 15:00:34 +02:00
Nikolay Borisov	d4f19ad084	btrfs-progs: check: factor out root parsing from check_chunks_and_extents check_chunks_and_extents does quite a number of distinct things. The first of those is going through all root items in the root tree and classify every root depending on whether they have a dropping operation in progress or not. Lets factor out this code and move the variables specific to this in a separate function so clean up check_chunks_and_extents a bit. Accidentally, this patch fixes some reference leaks since in error conditions in the loop the code does "goto out" but at that label we don't really release the path. Having this code extracted in a separate function which always releases the path avoids this problem entirely. Signed-off-by: Nikolay Borisov <nborisov@suse.com> Reviewed-by: Su Yue <suy.fnst@cn.fujitsu.com> Signed-off-by: David Sterba <dsterba@suse.com>	2018-08-06 15:00:08 +02:00
Nikolay Borisov	a7773d7f06	btrfs-progs: check: Fix wrong root parameter of btrfs_next_leaf call The first thing that check_chunks_and_extents does is to iterate all the root items in the root tree and link them to either the "normal_list" or "dropping_trees" list. If a leaf has to be crossed during this operation btrfs_next_leaf is called to do that. However, currently it's called with a wrong argument for its 'root' parameter. Since we are iterating the root tree the passed root should be fs_info->tree_root, whereas right now we are passing the local variable 'root' which is assigned to the fs_tree. As it stands, this bug is actually benign since the passed root is only passed to reada_for_search, where it's used to reference the fs_info. Nevertheless the code is wrong and at the very least misleading, so fix it by passing the correct root. Signed-off-by: Nikolay Borisov <nborisov@suse.com> Reviewed-by: Su Yue <suy.fnst@cn.fujitsu.com> Signed-off-by: David Sterba <dsterba@suse.com>	2018-08-06 14:59:56 +02:00
Misono Tomohiro	26892ad5f1	btrfs-progs: check: Initialize all fields of btrfs_inode_item in insert_inode_item() Initialize all fields of btrfs_inode_item to zero in order to prevent having some garbage, especially in the flags field. Signed-off-by: Misono Tomohiro <misono.tomohiro@jp.fujitsu.com> Signed-off-by: David Sterba <dsterba@suse.com>	2018-06-07 16:37:40 +02:00
Su Yue	42f7017ee6	btrfs-progs: check: lowmem: check symlinks with append/immutable flags Define new error bit INODE_FLAGS_ERROR to represents invalid inode flags error. Symlinks should never have append/immutable flags set. While checking inodes, if found a symlink with append/immutable flags, report and record the inode flags error. This is for lowmem mode. Issue: #133 Signed-off-by: Su Yue <suy.fnst@cn.fujitsu.com> Reviewed-by: Qu Wenruo <wqu@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2018-06-07 16:37:38 +02:00
Su Yue	eef9f3eed6	btrfs-progs: check: check symlinks with append/immutable flags Define new macro I_ERR_ODD_INODE_FLAGS to represents odd inode flags. Symlinks should never have append/immutable flags. While processing inodes, if found a symlink with append/immutable flags, mark the inode record with I_ERR_ODD_INODE_FLAGS. This is for original mode. Issue: #133 Signed-off-by: Su Yue <suy.fnst@cn.fujitsu.com> Reviewed-by: Qu Wenruo <wqu@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2018-06-07 16:37:37 +02:00
Nikolay Borisov	a22b593587	btrfs-progs: Change btrfs_root to btrfs_fs_info argument in btrfs_lookup_extent_info That function really wants an fs_info and not a root. Accidentally, this also makes the kernel/user space signatures to be coherent. Signed-off-by: Nikolay Borisov <nborisov@suse.com> Reviewed-by: Su Yue <suy.fnst@cn.fujitsu.com> Signed-off-by: David Sterba <dsterba@suse.com>	2018-06-07 16:37:37 +02:00
Nikolay Borisov	12e2280d7b	btrfs-progs: check: Remove unused root argument from btrfs_extent_post_op This is no longer used by the callees of that function so remove it. Signed-off-by: Nikolay Borisov <nborisov@suse.com> Reviewed-by: Su Yue <suy.fnst@cn.fujitsu.com> Signed-off-by: David Sterba <dsterba@suse.com>	2018-06-07 16:37:37 +02:00
Nikolay Borisov	b4c4ff9c4a	btrfs-progs: check: Remove root parameter from btrfs_fix_block_accounting It's always set to extent_root and the function already takes a transaction handle where fs_info could be referenced and in turn the extent_tree. Signed-off-by: Nikolay Borisov <nborisov@suse.com> Reviewed-by: Su Yue <suy.fnst@cn.fujitsu.com> Signed-off-by: David Sterba <dsterba@suse.com>	2018-06-07 16:37:36 +02:00
Nikolay Borisov	48663d6e64	btrfs-progs: check: Remove root argument from delete_extent_records This function is always passed the extent_root as "root" parameter. In turn it uses the root parameter mostly to access fs_info and performs only a single call to btrfs_update_block_group where it passes that root on. This is all redundant since fs_info can be referenced from the transaction handle and in turn extent_root can be referenced from the fs_info. So do that to simplify the function's signature. Signed-off-by: Nikolay Borisov <nborisov@suse.com> Reviewed-by: Su Yue <suy.fnst@cn.fujitsu.com> Signed-off-by: David Sterba <dsterba@suse.com>	2018-06-07 16:37:36 +02:00
Qu Wenruo	41750fe660	btrfs-progs: Remove fs_info parameter from btrfs_leaf_free_space() For btrfs_leaf_free_space(), to get leaf data size, we have two ways to get it: 1) leaf->fs_info->nodesize 2) leaf->len Either way, we could get rid of @fs_info parameter for btrfs_leaf_free_space(). And here we choose method 2), as it provides extra benefit to get leaf free space without initializing a real fs_info. Signed-off-by: Qu Wenruo <wqu@suse.com> Reviewed-by: Su Yue <suy.fnst@cn.fujitsu.com> Signed-off-by: David Sterba <dsterba@suse.com>	2018-06-07 16:37:35 +02:00
Nikolay Borisov	5a621e039e	btrfs-progs: check: Remove ext_ref local variable from check_fs_roots_lowmem All real consumers of that variable have inlined the checks since they are simple enough. So just remove it. Signed-off-by: Nikolay Borisov <nborisov@suse.com> Reviewed-by: Su Yue <suy.fnst@cn.fujitsu.com> Signed-off-by: David Sterba <dsterba@suse.com>	2018-06-07 16:37:34 +02:00
Nikolay Borisov	03b6929079	btrfs-progs: check: Drop ext_ref argument from check_fs_root It's no longer used so just remove it. Signed-off-by: Nikolay Borisov <nborisov@suse.com> Reviewed-by: Su Yue <suy.fnst@cn.fujitsu.com> Signed-off-by: David Sterba <dsterba@suse.com>	2018-06-07 16:37:34 +02:00
Nikolay Borisov	511cd09d27	btrfs-progs: check: Drop ext_ref param from check_fs_first_inode It's no longer used in that function so can be dropped altogether. Signed-off-by: Nikolay Borisov <nborisov@suse.com> Reviewed-by: Su Yue <suy.fnst@cn.fujitsu.com> Signed-off-by: David Sterba <dsterba@suse.com>	2018-06-07 16:37:34 +02:00
Nikolay Borisov	d01839e95a	btrfs-progs: check: Remove ext_ref param from walk_down_tree It's no longer used so just remove it. Signed-off-by: Nikolay Borisov <nborisov@suse.com> Reviewed-by: Su Yue <suy.fnst@cn.fujitsu.com> Signed-off-by: David Sterba <dsterba@suse.com>	2018-06-07 16:37:34 +02:00
Nikolay Borisov	cd62501086	btrfs-progs: check: Remove ext_ref param from check_fs_first_inode It's no longer used and can be dropped. Signed-off-by: Nikolay Borisov <nborisov@suse.com> Reviewed-by: Su Yue <suy.fnst@cn.fujitsu.com> Signed-off-by: David Sterba <dsterba@suse.com>	2018-06-07 16:37:34 +02:00
Nikolay Borisov	1e154d9039	btrfs-progs: check: Drop unused ext_ref parameter from process_one_leaf It's no longer used in the function so just remove it Signed-off-by: Nikolay Borisov <nborisov@suse.com> Reviewed-by: Su Yue <suy.fnst@cn.fujitsu.com> Signed-off-by: David Sterba <dsterba@suse.com>	2018-06-07 16:37:34 +02:00
Nikolay Borisov	72cdfeaf62	btrfs-progs: check: Drop ext_ref argument from check_inode_item We can derive this argument from root->fs_info and also make it local to the sole switch case it's used. Signed-off-by: Nikolay Borisov <nborisov@suse.com> Reviewed-by: Su Yue <suy.fnst@cn.fujitsu.com> Signed-off-by: David Sterba <dsterba@suse.com>	2018-06-07 16:37:34 +02:00
Nikolay Borisov	990c79bff5	btrfs-progs: check: Drop ext_ref param from check_dir_item This parameter is no longer used in this function, so drop it Signed-off-by: Nikolay Borisov <nborisov@suse.com> Reviewed-by: Su Yue <suy.fnst@cn.fujitsu.com> Signed-off-by: David Sterba <dsterba@suse.com>	2018-06-07 16:37:34 +02:00
Nikolay Borisov	30c72595ea	btrfs-progs: check: Drop ext_ref parameter from find_inode_ref This is a boolean parameter which signals whether the fs has the EXTENDED_IREF feature flag toggled or not. Since a reference to fs_info can be obtained there is no need to pollute the interface. Signed-off-by: Nikolay Borisov <nborisov@suse.com> Reviewed-by: Su Yue <suy.fnst@cn.fujitsu.com> Signed-off-by: David Sterba <dsterba@suse.com>	2018-06-07 16:37:33 +02:00
Nikolay Borisov	62b7a389c6	btrfs-progs: check: fix DIR_ITEM checking in lowmem When checking the validity of a DIR_ITEM item the index variable is explicitly set to -1 so that the index check in find_inode_ref() is ignored. This is necessary due to possible name collisions in DIR_ITEMS (i.e. multiple files with the same crc32c for their names, resulting in the identical key->offset). Currently the code is broken due to index being set to -1 at the beginning of check_dir_item and subsequently reset to the index found in find_inode_ref. On subsequent iterations of the while loop in check_dir_items we are going to erroneously perform index checking, since index is not -1 but whatever the index value of the last handled INODE_REF ITEM. Without this patch check in lowmem mode produces the following false positive warnings: ERROR: root 5 INODE REF[1991456, 256] name 5ab4e28b~~~~~~~~QBXUT2GBJDRT5CB6ZWAJVDHK filetype 1 missing But the items for this name are in fact correct: ------------------SNIP------------------------ item 13 key (256 DIR_ITEM 4227063046) itemoff 2945 itemsize 140 location key (220890 INODE_ITEM 0) type FILE transid 2278 data_len 0 name_len 40 name: 5ab4e1bb~~~~~~~~65KNTAWVJ5VD4SV2R3BKFCXL location key (1991456 INODE_ITEM 0) type FILE transid 2285 data_len 0 name_len 40 name: 5ab4e28b~~~~~~~~QBXUT2GBJDRT5CB6ZWAJVDHK item 21 key (1991456 INODE_REF 256) itemoff 2104 itemsize 50 index 1934146 namelen 40 name: 5ab4e28b~~~~~~~~QBXUT2GBJDRT5CB6ZWAJVDHK -------------------SNIP------------------------- Fix this by moving the code setting index to the beginning of the while loop. This ensures that for each item in DIR_ITEM we have index set correctly. Fixes: `564901eac7` ("btrfs-progs: check: introduce print_dir_item_err()") Signed-off-by: Nikolay Borisov <nborisov@suse.com> Reviewed-by: Qu Wenruo <wqu@suse.com> Reviewed-by: Su Yue <suy.fnst@cn.fujitsu.com> Signed-off-by: David Sterba <dsterba@suse.com>	2018-06-07 16:37:33 +02:00
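The bug class fixed in 62b7a389c6 is an in/out parameter that doubles as a wildcard and is not reset between loop iterations. A minimal sketch, assuming a made-up `struct toy_ref` and helper `find_ref()` in place of the real INODE_REF lookup:

```c
#include <stddef.h>
#include <stdint.h>
#include <string.h>

#define ANY_INDEX ((uint64_t)-1)	/* wildcard: skip the index comparison */

/* Hypothetical stand-in for an INODE_REF entry. */
struct toy_ref {
	const char *name;
	uint64_t index;
};

/*
 * Match by name and, unless *index is ANY_INDEX, by index as well.
 * On success the index that was found is written back through *index.
 */
static int find_ref(const struct toy_ref *refs, size_t nrefs,
		    const char *name, uint64_t *index)
{
	size_t i;

	for (i = 0; i < nrefs; i++) {
		if (strcmp(refs[i].name, name) != 0)
			continue;
		if (*index != ANY_INDEX && *index != refs[i].index)
			continue;
		*index = refs[i].index;
		return 0;
	}
	return -1;
}

/* Count the names of one colliding DIR_ITEM that have no matching ref. */
size_t count_missing(const char *const *names, size_t nnames,
		     const struct toy_ref *refs, size_t nrefs)
{
	size_t missing = 0;
	size_t i;

	for (i = 0; i < nnames; i++) {
		/* Reset for every name; setting it once before the loop is the bug. */
		uint64_t index = ANY_INDEX;

		if (find_ref(refs, nrefs, names[i], &index) < 0)
			missing++;
	}
	return missing;
}
```

If `index` were declared once outside the loop, the second name would be compared against the first name's index and reported missing even though its ref exists.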
Qu Wenruo	4bd7bbb6f6	btrfs-progs: check: Make btrfs check return error for qgroup mismatch Current btrfs-check will check qgroup consistency, but even when it finds something wrong, the return value is still 0. Fix it by allowing report_qgroups() to return int to indicate qgroup mismatch, and also add extra logic to return no error if qgroup repair is successful. Without this patch, fstests can't detect qgroup corruption by its fsck alone. Signed-off-by: Qu Wenruo <wqu@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2018-06-07 16:37:33 +02:00
Nikolay Borisov	873fba7101	btrfs-progs: Remove devid parameter from btrfs_rmap_block This parameter was introduced with the original implementation of the function but has never really been used, so just remove it. Signed-off-by: Nikolay Borisov <nborisov@suse.com> Reviewed-by: Qu Wenruo <wqu@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2018-06-07 16:37:32 +02:00
Su Yue	04855a16d2	btrfs-progs: check: lowmem: end of removing parameters @trans in lowmem Remove @trans in check_chunks_and_extents_lowmem(). After this patch, lowmem repair works again. Signed-off-by: Su Yue <suy.fnst@cn.fujitsu.com> Signed-off-by: David Sterba <dsterba@suse.com>	2018-06-07 16:37:32 +02:00
Su Yue	ea8763bd05	btrfs-progs: check: lowmem: introduce repair_block_accounting Introduce repair_block_accounting() which calls btrfs_fix_block_accounting() to repair block group accounting. Replace btrfs_fix_block_accounting() with the new function. Note: This patch and next patches cause error in lowmem repair like: "Error: Commit_root already set when starting transaction". Such error will disappear after removing @trans finished. Signed-off-by: Su Yue <suy.fnst@cn.fujitsu.com> Signed-off-by: David Sterba <dsterba@suse.com>	2018-06-07 16:37:31 +02:00
Su Yue	b9fb29a9ba	btrfs-progs: check: lowmem: remove parameter @trans of check_btrfs_root Remove parameters @trans of delete_extent_item() and walk_down_tree_v2(). Note: This patch and next patches cause error in lowmem repair like: "Error: Commit_root already set when starting transaction". Such error will disappear after removing @trans finished. Signed-off-by: Su Yue <suy.fnst@cn.fujitsu.com> Signed-off-by: David Sterba <dsterba@suse.com>	2018-06-07 16:37:31 +02:00
Su Yue	32705f2307	btrfs-progs: check: lowmem: remove parameter @trans of repair_tree_back_ref This patch removes parameter @trans of repair_tree_back_ref(). It calls try_avoid_extents_overwrite() and starts a transaction by itself. Note: This patch and next patches cause error in lowmem repair like: "Error: Commit_root already set when starting transaction". Such error will disappear after removing @trans finished. Signed-off-by: Su Yue <suy.fnst@cn.fujitsu.com> Signed-off-by: David Sterba <dsterba@suse.com>	2018-06-07 16:37:31 +02:00
Su Yue	4c2b21acaa	btrfs-progs: check: lowmem: remove parameter @trans of check_leaf_items This patch removes parameter @trans of check_leaf_items(). Note: This patch and next patches cause error in lowmem repair like: "Error: Commit_root already set when starting transaction". Such error will disappear after removing @trans finished. Signed-off-by: Su Yue <suy.fnst@cn.fujitsu.com> Signed-off-by: David Sterba <dsterba@suse.com>	2018-06-07 16:37:31 +02:00
Su Yue	adb198aa6d	btrfs-progs: check: lowmem: remove parameter @trans of repair_extent_item This patch removes parameter @trans of repair_extent_item(). It calls try_avoid_extents_overwrite() and starts a transaction by itself. Note: This patch and next patches cause error in lowmem repair like: "Error: Commit_root already set when starting transaction". Such error will disappear after removing @trans finished. Signed-off-by: Su Yue <suy.fnst@cn.fujitsu.com> Signed-off-by: David Sterba <dsterba@suse.com>	2018-06-07 16:37:31 +02:00
Su Yue	b4069a192d	btrfs-progs: check: lowmem: remove parameter @trans of repair_chunk_item This patch removes parameter @trans of repair_chunk_item(). It calls try_avoid_extents_overwrite() and starts a transaction by itself. Note: This patch and next patches cause error in lowmem repair like: "Error: Commit_root already set when starting transaction". Such error will disappear after removing @trans finished. Signed-off-by: Su Yue <suy.fnst@cn.fujitsu.com> Signed-off-by: David Sterba <dsterba@suse.com>	2018-06-07 16:37:31 +02:00
Su Yue	e9f0396a42	btrfs-progs: check: lowmem: remove parameter @trans of delete_extent_item This patch removes the parameter @trans of delete_extent_item(). It calls try_avoid_extents_overwrite() and starts a transaction by itself. Note: This patch and next patches cause error in lowmem repair like: "Error: Commit_root already set when starting transaction". Such error will disappear after removing @trans finished. Signed-off-by: Su Yue <suy.fnst@cn.fujitsu.com> Signed-off-by: David Sterba <dsterba@suse.com>	2018-06-07 16:37:31 +02:00
Su Yue	50782a34d7	btrfs-progs: check: lowmem: start to remove parameters @trans in lowmem Since overwriting extents can be avoided by excluding them or by allocating a new chunk, it's not necessary to do all repairs in one transaction. This patch removes parameter @trans of repair_extent_data_item(). repair_extent_data_item() calls try_avoid_extents_overwrite() and starts a transaction by itself. Note: This patch and next patches cause error in lowmem repair like: "Error: Commit_root already set when starting transaction". Such error will disappear after removing @trans finished. Signed-off-by: Su Yue <suy.fnst@cn.fujitsu.com> Signed-off-by: David Sterba <dsterba@suse.com>	2018-06-07 16:37:31 +02:00
Su Yue	0ee012318f	btrfs-progs: check: lowmem: exclude extents if init-extent-tree in lowmem If options '--init-extent-tree' and '--mode=lowmem' are both given, all metadata blocks will be traversed twice. First one is done by pin_metadata_blocks() in reinit_extent_tree(). Second one is in check_chunks_and_extents_v2(). Excluding instead of pinning metadata blocks before reinitializing the extent tree in lowmem can save some time. Signed-off-by: Su Yue <suy.fnst@cn.fujitsu.com> Signed-off-by: David Sterba <dsterba@suse.com>	2018-06-07 16:37:30 +02:00
Su Yue	4e320db9dd	btrfs-progs: check: lowmem: introduce avoid_extents_overwrite Declare a global u64 variable last_allocated_chunk records start of the last chunk allocated by lowmem repair. Although global variable is not nice, it simplifies the code a lot. avoid_extents_overwrite() prefers to allocate a new chunk first. If it failed because of no space or wrong used bytes (like in fsck-tests/004), then it tries to exclude metadata blocks which costs a lot of time in large filesystems. Signed-off-by: Su Yue <suy.fnst@cn.fujitsu.com> Signed-off-by: David Sterba <dsterba@suse.com>	2018-06-07 16:37:30 +02:00
Su Yue	e99f377541	btrfs-progs: check: lowmem: introduce try_force_cow_in_new_chunk Introduce create_chunk_and_block_block_group() to allocate a new chunk and corresponding block group. The new function force_cow_in_new_chunk() firstly allocates new chunk and records its start. Then it modifies all metadata block groups cached and full. Finally it marks the new block group uncached and unfree. In the next COW, extents states will be updated automatically by cache_block_group(). New function try_to_force_cow_in_new_chunk() will try to mark block groups full, allocate a new chunk and records the start. If the last allocated chunk is almost full, a new chunk will be allocated. Suggested-by: Qu Wenruo <wqu@suse.com> Signed-off-by: Su Yue <suy.fnst@cn.fujitsu.com> Signed-off-by: David Sterba <dsterba@suse.com>	2018-06-07 16:37:30 +02:00
Su Yue	8586c84e8e	btrfs-progs: check: lowmem: introduce mark/clear_block_groups_full Excluding or pinning all metadata blocks is time-inefficient for large filesystems. Here is another way: mark all metadata block groups full and allocate new chunks for COW. Then new reserved extents never overwrite existing extents. Introduce modify_block_groups_cache() to modify the cache state of all block groups and set all extents in block groups unfree in free space cache. mark/clear_block_groups_full() are wrappers of above function. Suggested-by: Qu Wenruo <wqu@suse.com> Signed-off-by: Su Yue <suy.fnst@cn.fujitsu.com> Signed-off-by: David Sterba <dsterba@suse.com>	2018-06-07 16:37:07 +02:00
Su Yue	d8020d5f69	btrfs-progs: check: lowmem: exclude extents of metadata blocks Commit d17d6663c99c ("btrfs-progs: lowmem check: Fix regression which screws up extent allocator") removes pin_metadata_blocks() from lowmem repair. So we have to find another way to exclude extents which should be occupied by existing tree blocks. Modify pin_down_tree_blocks() and rename it to traverse_tree_blocks for sharing code with new function exclude_metadata_blocks(). * exclude_metadata_blocks() traverses and marks extents of all tree blocks dirty in fs_info->excluded_extents. * cleanup_excluded_extents() is responsible for cleanup. Export them to mode-common.h since they will be used both in original and lowmem modes. Signed-off-by: Su Yue <suy.fnst@cn.fujitsu.com> Signed-off-by: David Sterba <dsterba@suse.com>	2018-06-07 16:35:55 +02:00
Su Yue	4fb23ea06f	btrfs-progs: check: move pin_down_tree_blocks to mode-common.c Move pin_down_tree_blocks from main.c to mode-common.c for further patches. And export pin_metadata_blocks to mode-common.h. Signed-off-by: Su Yue <suy.fnst@cn.fujitsu.com> Signed-off-by: David Sterba <dsterba@suse.com>	2018-06-07 16:34:46 +02:00
Qu Wenruo	b3a99bc663	btrfs-progs: check: Distinguish csum checking output for --check-data-csum No matter --check-data-csum is passed or not, btrfs check will always output message like "checking csum". This message could be a little confusing, change it according to --check-data-csum option. Signed-off-by: Qu Wenruo <wqu@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2018-06-07 16:34:19 +02:00
Qu Wenruo	ccb3ddb695	btrfs-progs: check: Continue check even if csum error is found Since data csum mismatch is not a fatal error compared to fs/extent trees, continue check. Signed-off-by: Qu Wenruo <wqu@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2018-06-07 16:32:03 +02:00
Qu Wenruo	40b6bc1cb6	btrfs-progs: check: Fix data csum check return value When --check-data-csum option found csum mismatch, btrfs check still return 0. Fix it so log-replay could automatically pause when it finds csum error. Signed-off-by: Qu Wenruo <wqu@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2018-06-07 16:32:03 +02:00
Qu Wenruo	9be7988677	btrfs-progs: check: Check data csum for all copies Original --check-data-csum option will skip the extra copy if the first copy matches csum. Since offline scrub (with recoverability report) is still out-of-tree, at least enhance --check-data-csum option to handle multiple copies. Signed-off-by: Qu Wenruo <wqu@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2018-06-07 16:32:03 +02:00
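Checking every copy rather than stopping at the first good one can be sketched as below. This is only an illustration: `toy_csum()` is a made-up checksum standing in for the filesystem's real one, and `count_bad_copies()` is a hypothetical helper, not a function from btrfs-progs.

```c
#include <stddef.h>
#include <stdint.h>

/* Hypothetical checksum used only for this sketch. */
static uint32_t toy_csum(const uint8_t *data, size_t len)
{
	uint32_t sum = 0;
	size_t i;

	for (i = 0; i < len; i++)
		sum = sum * 31 + data[i];
	return sum;
}

/*
 * Verify every copy of a data block against the expected checksum and
 * return how many copies are bad. A matching first copy does not end the
 * loop, so a corrupted mirror is still noticed.
 */
size_t count_bad_copies(const uint8_t *const *copies, size_t ncopies,
			size_t len, uint32_t expected)
{
	size_t bad = 0;
	size_t i;

	for (i = 0; i < ncopies; i++) {
		if (toy_csum(copies[i], len) != expected)
			bad++;
	}
	return bad;
}
```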
Su Yue	cb1fbe71ae	btrfs-progs: check: lowmem: assign parent early in repair_extent_data_item The variable @eb is assigned to leaf in fs_tree before insertion of backref. It will cause wrong parent of new inserted backref. Set @parent at beginning to fix the problem. Reviewed-by: Qu Wenruo <wqu@suse.com> Signed-off-by: Su Yue <suy.fnst@cn.fujitsu.com> Signed-off-by: David Sterba <dsterba@suse.com>	2018-06-07 16:32:03 +02:00
Su Yue	bbf1e8a400	btrfs-progs: check: lowmem: record returned errors after walk_down_tree_v2 In lowmem mode with '--repair', check_chunks_and_extents_v2 will fix accounting in block groups and clear the error bit BG_ACCOUNTING_ERROR. However, return value of check_btrfs_root() doesn't contain error bits. If extent tree is on error, lowmem repair always prints error and returns nonzero value even the filesystem is fine after repair. Introduce FATAL_ERROR for lowmem mode to represent negative return values since negative and positive can't be mixed in the bit operations. Then let check_btrfs_root() return error bits. Signed-off-by: Su Yue <suy.fnst@cn.fujitsu.com> Signed-off-by: David Sterba <dsterba@suse.com>	2018-06-07 16:32:03 +02:00
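The reason for a dedicated fatal bit is that a negative errno-style value ORed into a bitmask sets almost every bit, so individual error bits can no longer be cleared after repair. A minimal sketch with made-up bit values (`TOY_BG_ACCOUNTING_ERROR` and `TOY_FATAL_ERROR` are assumptions, as are both helper functions):

```c
#include <stdint.h>

/* Hypothetical bit values; the real lowmem error bits may differ. */
#define TOY_BG_ACCOUNTING_ERROR	(1U << 0)
#define TOY_FATAL_ERROR		(1U << 31)

/* Fold one callee result into an accumulated error bitmask. */
uint32_t fold_result(uint32_t err, int ret)
{
	if (ret < 0)
		return err | TOY_FATAL_ERROR;	/* errno-style failure */
	return err | (uint32_t)ret;		/* already a set of error bits */
}

/* After a successful repair, clear only the bit that was fixed. */
uint32_t clear_repaired(uint32_t err, uint32_t fixed_bit)
{
	return err & ~fixed_bit;
}
```

With this split, a caller can treat a zero mask after `clear_repaired()` as "filesystem is fine" while a surviving fatal bit still propagates as failure.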
Su Yue	3f00e00912	btrfs-progs: check: lowmem: release path in repair_extent_data_item() In repair_extent_data_item(), path is not released if some errors occurs which causes extent buffer leak. Reviewed-by: Qu Wenruo <wqu@suse.com> Signed-off-by: Su Yue <suy.fnst@cn.fujitsu.com> Signed-off-by: David Sterba <dsterba@suse.com>	2018-06-07 16:32:03 +02:00
Qu Wenruo	a5f0e46ac3	btrfs-progs: check: lowmem: Remove corrupted link before readding correct link For repair_ternary_lowmem() used in lowmem mode, if it found 1 of DIR_INDEX/DIR_ITEM/INODE_REF missing, it will try to insert correct link. However for case like invalid type in DIR_INDEX, we should delete the corrupted DIR_INDEX first before inserting the correct link. This patch will remove the corrupted link before re-inserting. This should solve the duplicated DIR_INDEX problem in old lowmem mode repair. Reported-by: Sebastian Andrzej Siewior <sebastian@breakpoint.cc> Signed-off-by: Qu Wenruo <wqu@suse.com> Reviewed-by: Nikolay Borisov <nborisov@suse.com> Reviewed-by: Su Yue <suy.fnst@cn.fujitsu.com> Signed-off-by: David Sterba <dsterba@suse.com>	2018-05-07 20:14:11 +02:00
Qu Wenruo	47c694b0f4	btrfs-progs: Unify btrfs_leaf_free_space() parameter with kernel Instead of struct btrfs_root, use struct btrfs_fs_info, since nodesize is now a per-fs setting, and with the need to pass a @root, caller don't need to wonder which root should be passed. Signed-off-by: Qu Wenruo <wqu@suse.com> Reviewed-by: Lu Fengqi <lufq.fnst@cn.fujitsu.com> Signed-off-by: David Sterba <dsterba@suse.com>	2018-04-24 13:00:11 +02:00
Qu Wenruo	2777e53911	btrfs-progs: check: Skip data csum verification for metadata dump For metadata dump (fs restored by btrfs-image), no data is restored and check sum verification will definitely report error. Add such check in check_csums() and prompt for user input. Issue: #103 Signed-off-by: Qu Wenruo <wqu@suse.com> Reviewed-by: Nikolay Borisov <nborisov@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2018-04-24 13:00:11 +02:00
Qu Wenruo	35a629514e	btrfs-progs: check/original: Remove unused variable first_key This @first_key variable is introduced in `f5c4c4f3b7` ("btrfsck: add code to rebuild extent records"), however it's not only unused, but also assigned incorrectly. It's calling btrfs_item_key_to_cpu() on a node extent buffer. Anyway, just remove it. Fixes: `f5c4c4f3b7` ("btrfsck: add code to rebuild extent records") Signed-off-by: Qu Wenruo <wqu@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2018-03-30 22:15:55 +02:00
Qu Wenruo	046be13c55	btrfs-progs: check/lowmem mode: Check inline extent size Signed-off-by: Qu Wenruo <wqu@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2018-03-30 22:15:54 +02:00
Qu Wenruo	9708f0d54e	btrfs-progs: check: original mode: Check inline extent size For inline compressed file extent, kernel doesn't allow inline extent ram size larger than sector size and on-disk inline extent size should not exceed BTRFS_MAX_INLINE_DATA_SIZE(). For inline uncompressed file extent, kernel doesn't allow inline extent ram and on-disk size larger than either BTRFS_MAX_INLINE_DATA_SIZE() or sector size. Check it in original mode. Signed-off-by: Qu Wenruo <wqu@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2018-03-30 22:15:54 +02:00
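The size rules stated in commit 9708f0d54e above can be sketched as a small predicate. This is an illustration only, not the btrfs-progs code: the function name and parameters are hypothetical, and the limits are passed in as plain numbers rather than derived from BTRFS_MAX_INLINE_DATA_SIZE().

```python
def inline_extent_size_ok(ram_bytes: int, disk_bytes: int, compressed: bool,
                          sectorsize: int, max_inline: int) -> bool:
    """Return True if an inline extent obeys the limits described above.

    sectorsize and max_inline are supplied by the caller (hypothetical
    parameters); max_inline stands in for BTRFS_MAX_INLINE_DATA_SIZE().
    """
    if compressed:
        # Compressed: ram size is bounded by the sector size,
        # on-disk size is bounded by the max inline data size.
        return ram_bytes <= sectorsize and disk_bytes <= max_inline
    # Uncompressed: both sizes must fit under both limits.
    limit = min(sectorsize, max_inline)
    return ram_bytes <= limit and disk_bytes <= limit


# Example values chosen for illustration only.
print(inline_extent_size_ok(4096, 1000, True, sectorsize=4096, max_inline=2048))
print(inline_extent_size_ok(3000, 3000, False, sectorsize=4096, max_inline=2048))
```

The first call passes both compressed limits; the second fails because an uncompressed extent of 3000 bytes exceeds the smaller of the two assumed limits.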
Filipe Manana	457a288cb5	Btrfs-progs: check, fix false error reports for shared prealloc extents In some cases the filesystem checker reports an error when it finds checksum items for an extent that is referenced by an inode as a prealloc extent. Such cases are not an error when the extent is actually shared (was cloned/reflinked) with other inodes and was written through one of those other inodes. Example: $ mkfs.btrfs -f /dev/sdb $ mount /dev/sdb /mnt $ touch /mnt/foo $ xfs_io -c "falloc 0 256K" /mnt/foo $ sync $ xfs_io -c "pwrite -S 0xab 0 256K" /mnt/foo $ touch /mnt/bar $ xfs_io -c "reflink /mnt/foo 0 0 256K" /mnt/bar $ xfs_io -c "fsync" /mnt/bar <power fail> $ mount /dev/sdb /mnt $ umount /mnt $ btrfs check /dev/sdb Checking filesystem on /dev/sdb UUID: 52d3006e-ee3b-40eb-aa21-e56253a03d39 checking extents checking free space cache checking fs roots root 5 inode 257 errors 800, odd csum item ERROR: errors found in fs roots found 688128 bytes used, error(s) found total csum bytes: 256 total tree bytes: 163840 total fs tree bytes: 65536 total extent tree bytes: 16384 btree space waste bytes: 138819 file data blocks allocated: 10747904 referenced 10747904 $ echo $? 1 So teach check to not report such cases as errors by checking if the extent is shared with other inodes and, if so, treating the existence of checksum items as an error only if all those other inodes are referencing the extent as a prealloc extent. This case can be hit often when running the generic/475 testcase from fstests. A test case will follow in a separate patch. Signed-off-by: Filipe Manana <fdmanana@suse.com> Reviewed-by: Nikolay Borisov <nborisov@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2018-03-30 22:15:54 +02:00
Qu Wenruo	8d6c444782	btrfs-progs: check: lowmem: Fix false alert about orphan inode Btrfs can delay inode deletion, and in that case btrfs will unlink the victim inode from its parent dir and insert a marker to inform btrfs to delete it later. In that case, such a victim inode will have nlinks == 0, but is still completely valid. Original mode won't report such a problem, but lowmem mode doesn't check the ORPHAN_ITEM key for such an inode, and can report a false alert like: ------ ERROR: root 257 INODE[28891726] is orphan item ------ Fix such false alerts by checking the orphan item for inodes whose nlink is 0. Signed-off-by: Qu Wenruo <wqu@suse.com> Reviewed-by: Su Yue <suy.fnst@cn.fujitsu.com> Signed-off-by: David Sterba <dsterba@suse.com>	2018-03-30 22:15:54 +02:00
Lu Fengqi	905b3d18ae	btrfs-progs: check/lowmem: Fix false alert of data extent backref lost for snapshot Btrfs lowmem check reports the following false alert: ------ ERROR: file extent[267 2162688] root 256 owner 5 backref lost ------ The file extent is in the leaf which is shared by file tree 256 and fs tree. ------ leaf 30605312 items 46 free space 4353 generation 7 owner 5 ...... item 45 key (267 EXTENT_DATA 2162688) itemoff 5503 itemsize 53 generation 7 type 2 (prealloc) prealloc data disk byte 13631488 nr 65536 prealloc data offset 32768 nr 32768 ------ And there is the corresponding extent_data_ref item in the extent tree. ------ item 1 key (13631488 EXTENT_DATA_REF 1007496934287921081) itemoff 15274 itemsize 28 extent data backref root 5 objectid 267 offset 2129920 count 1 ------ The offset of the EXTENT_DATA_REF key is the hash of the owner root objectid, the inode number and the calculated offset (file offset - extent offset). What caused the false alert is that the code mixes up the owner root objectid and the file tree objectid. Fixes: `b0d360b541` ("btrfs-progs: check: introduce function to check data backref in extent tree") Signed-off-by: Lu Fengqi <lufq.fnst@cn.fujitsu.com> Reviewed-by: Qu Wenruo <wqu@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2018-03-30 22:15:53 +02:00
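The backref lookup described in commit 905b3d18ae above can be sketched with the numbers from its dump. This is a simplified model, not the btrfs-progs implementation: the `DataBackref` tuple and `expected_backref()` helper are hypothetical, and the real key offset is a hash of these three fields, which is deliberately not reproduced here.

```python
from typing import NamedTuple


class DataBackref(NamedTuple):
    """Fields of an extent data backref, as printed in the dump above."""
    root: int      # owner root objectid
    objectid: int  # inode number
    offset: int    # file offset - extent offset


def expected_backref(owner_root: int, inode: int,
                     file_offset: int, extent_offset: int) -> DataBackref:
    """Build the backref a file extent should have in the extent tree."""
    return DataBackref(owner_root, inode, file_offset - extent_offset)


# The single backref recorded in the extent tree (values from the dump).
extent_tree_refs = {DataBackref(root=5, objectid=267, offset=2129920)}

# Looking up with the leaf owner (5) finds the backref ...
right = expected_backref(5, 267, 2162688, 32768)
# ... while using the file tree objectid (256) does not, which is the
# kind of mix-up that produces a false "backref lost" report.
wrong = expected_backref(256, 267, 2162688, 32768)

print(right in extent_tree_refs)
print(wrong in extent_tree_refs)
```

The offset arithmetic matches the dump: file offset 2162688 minus extent offset 32768 gives the backref offset 2129920.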
Lu Fengqi	634e73e9d3	btrfs-progs: check/lowmem: Fix the incorrect error message of check_extent_data_item Instead of the disk_bytenr and disk_num_bytes of the extent_item which the file extent references, we should output the objectid and offset of the file extent. And since the leaf may be shared by the file trees, we should print the objectid of the root and the owner of the leaf. Fixes: `b0d360b541` ("btrfs-progs: check: introduce function to check data backref in extent tree") Signed-off-by: Lu Fengqi <lufq.fnst@cn.fujitsu.com> Signed-off-by: David Sterba <dsterba@suse.com>	2018-03-30 22:15:53 +02:00
Qu Wenruo	4c7094e046	btrfs-progs: free-space-cache: Enhance free space cache free space check When we find a free space difference between the free space cache and the block group item, we just discard this free space cache. Normally such a difference is caused by btrfs_reserve_extent() called by delalloc, which is out of a transaction. And since all btrfs_release_extent() calls are made within a transaction, under a heavy race the free space cache can have less free space than the block group item. Normally the kernel will detect such a difference and just discard that cache. However, we must be more careful if the free space cache has more free space than the block group item; if that happens, paired with the above race, an invalid free space cache can be loaded into the kernel. So if we find any free space cache that has more free space than the block group item, we report it as an error rather than ignoring it. Signed-off-by: Qu Wenruo <wqu@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2018-03-30 22:15:53 +02:00
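The decision described in commit 4c7094e046 above reduces to a three-way comparison. The following is a minimal sketch under that reading; the function name and the returned labels are hypothetical and do not correspond to identifiers in btrfs-progs.

```python
def classify_free_space_cache(cache_free: int, block_group_free: int) -> str:
    """Classify a free space cache against its block group item.

    Both arguments are free byte counts. The labels are illustrative only.
    """
    if cache_free == block_group_free:
        return "ok"
    if cache_free < block_group_free:
        # Explained by the delalloc race; the cache is simply discarded.
        return "discard"
    # More free space in the cache than in the block group item:
    # reported as an error instead of being ignored.
    return "error"


for cache, bg in [(1024, 1024), (512, 1024), (2048, 1024)]:
    print(cache, bg, classify_free_space_cache(cache, bg))
```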
Qu Wenruo	2ac42077c2	btrfs-progs: Move chunk stripe size calculation function to volumes.h Signed-off-by: Qu Wenruo <wqu@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2018-03-06 11:28:26 +01:00
Qu Wenruo	54246115ba	btrfs-progs: btrfs-progs: Fix read beyond boundary bug in build_roots_info_cache() This bug is exposed by fsck-test with D=asan, hit by test case 020, with the following error report: ================================================================= ==10740==ERROR: AddressSanitizer: heap-buffer-overflow on address 0x621000061580 at pc 0x56051f0db6cd bp 0x7ffe170f3e20 sp 0x7ffe170f3e10 READ of size 1 at 0x621000061580 thread T0 #0 0x56051f0db6cc in btrfs_extent_inline_ref_type /home/adam/btrfs/btrfs-progs/ctree.h:1727 #1 0x56051f13b669 in build_roots_info_cache /home/adam/btrfs/btrfs-progs/cmds-check.c:14306 #2 0x56051f13c86a in repair_root_items /home/adam/btrfs/btrfs-progs/cmds-check.c:14450 #3 0x56051f13ea89 in cmd_check /home/adam/btrfs/btrfs-progs/cmds-check.c:14965 #4 0x56051efe75bb in main /home/adam/btrfs/btrfs-progs/btrfs.c:302 #5 0x7f04ddbb0f49 in __libc_start_main (/usr/lib/libc.so.6+0x20f49) #6 0x56051efe68c9 in _start (/home/adam/btrfs/btrfs-progs/btrfs+0x5b8c9) 0x621000061580 is located 0 bytes to the right of 4224-byte region [0x621000060500,0x621000061580) allocated by thread T0 here: #0 0x7f04ded50ce1 in __interceptor_calloc /build/gcc/src/gcc/libsanitizer/asan/asan_malloc_linux.cc:70 #1 0x56051f04685e in __alloc_extent_buffer /home/adam/btrfs/btrfs-progs/extent_io.c:553 #2 0x56051f047563 in alloc_extent_buffer /home/adam/btrfs/btrfs-progs/extent_io.c:687 #3 0x56051efff1d1 in btrfs_find_create_tree_block /home/adam/btrfs/btrfs-progs/disk-io.c:187 #4 0x56051f000133 in read_tree_block /home/adam/btrfs/btrfs-progs/disk-io.c:327 #5 0x56051efeddb8 in read_node_slot /home/adam/btrfs/btrfs-progs/ctree.c:652 #6 0x56051effb0d9 in btrfs_next_leaf /home/adam/btrfs/btrfs-progs/ctree.c:2853 #7 0x56051f13b343 in build_roots_info_cache /home/adam/btrfs/btrfs-progs/cmds-check.c:14267 #8 0x56051f13c86a in repair_root_items /home/adam/btrfs/btrfs-progs/cmds-check.c:14450 #9 0x56051f13ea89 in cmd_check /home/adam/btrfs/btrfs-progs/cmds-check.c:14965 #10 0x56051efe75bb in main /home/adam/btrfs/btrfs-progs/btrfs.c:302 #11 0x7f04ddbb0f49 in __libc_start_main (/usr/lib/libc.so.6+0x20f49) It's completely possible that one extent/metadata item has no inline reference, while build_roots_info_cache() doesn't have such check. Fix it by checking @iref against item end to avoid such problem. Issue: #92 Signed-off-by: Qu Wenruo <wqu@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2018-02-14 16:20:30 +01:00
David Sterba	6d3fea8f46	btrfs-progs: check: rename files after moving code Add prefixes to the files so it's clear they belong to the mode group. Signed-off-by: David Sterba <dsterba@suse.com>	2018-02-02 16:01:59 +01:00
Qu Wenruo	3876f9f1d5	btrfs-progs: check: Cleanup all checkpatch error and warning Since we're moving tons of code, it's a good idea to fix all errors and warnings from checkpatch. Note: we don't rely on checkpatch; take its warnings and errors as a hint and do not blindly fix anything that gets reported. The code follows some style that should be kept and does not match 100% what checkpatch finds. Always apply your common sense and decide if the change makes sense or if it is really making the code better. Additional fixes from dsterba: - %Lu -> %llu - lowercase first word of messages Signed-off-by: Qu Wenruo <wqu@suse.com> [ added note ] Signed-off-by: David Sterba <dsterba@suse.com>	2018-02-02 16:01:59 +01:00
Qu Wenruo	3a939bc7f3	btrfs-progs: check/lowmem: Cleanup unnecessary _v2 suffixes There used to be some functions with a _v2 suffix to distinguish them from original mode, which had similar functions. However, now that the lowmem mode is moved to its own check/lowmem.[ch], clean up the _v2 suffixes, and for functions that really need to be distinguished from original mode (exported functions), change the _v2 suffix to _lowmem. Signed-off-by: Qu Wenruo <wqu@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2018-02-02 16:01:59 +01:00
Qu Wenruo	b8d706e58e	btrfs-progs: check: Move lowmem check code to its own check/lowmem.[ch] Since lowmem mode code is highly internally connected, it's pretty hard to move it piece by piece. In theory it's possible to move part of the functions and temporarily export them, but that would just cause extra temporary modifications. So this patch moves the whole lowmem check part into its own check/lowmem.[ch]. Signed-off-by: Qu Wenruo <wqu@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2018-02-02 16:01:59 +01:00
Qu Wenruo	5dade4b951	btrfs-progs: check: Move reset_cached_block_groups to check/common.c Reviewed-by: Su Yue <suy.fnst@cn.fujitsu.com> Signed-off-by: Qu Wenruo <wqu@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2018-02-02 16:01:59 +01:00
Qu Wenruo	3d3ce73d90	btrfs-progs: check: Move check_child_node to check/common.c Reviewed-by: Su Yue <suy.fnst@cn.fujitsu.com> Signed-off-by: Qu Wenruo <wqu@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2018-02-02 16:01:58 +01:00
Qu Wenruo	fa37a13f73	btrfs-progs: check: move reada_walk_down to check/common.c Both original and lowmem modes share this function to do readahead. Reviewed-by: Su Yue <suy.fnst@cn.fujitsu.com> Signed-off-by: Qu Wenruo <wqu@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2018-02-02 16:01:58 +01:00
Qu Wenruo	d0b191b880	btrfs-progs: check: Move check_dev_size_alignment to check/common.c Reviewed-by: Su Yue <suy.fnst@cn.fujitsu.com> Signed-off-by: Qu Wenruo <wqu@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2018-02-02 16:01:58 +01:00
Qu Wenruo	4e84c8251a	btrfs-progs: check: Move link_inode_to_lostfound function to common.c Reviewed-by: Su Yue <suy.fnst@cn.fujitsu.com> Signed-off-by: Qu Wenruo <wqu@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2018-02-02 16:01:58 +01:00
Qu Wenruo	a20557599e	btrfs-progs: check: Move __create_inode_item function to check/common.c Move __create_inode_item() function to check/common.c and rename it to insert_inode_item(), with comment added. Reviewed-by: Su Yue <suy.fnst@cn.fujitsu.com> Signed-off-by: Qu Wenruo <wqu@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2018-02-02 16:01:58 +01:00
Qu Wenruo	b695ef1ddc	btrfs-progs: check: Move count_csum_range function to check/common.c Besides moving it to check/common.c, also: 1) Add extra comment of the function 2) Change @root parameter to @fs_info Since @root is never used, csum_root is picked from fs_info anyway. Reviewed-by: Su Yue <suy.fnst@cn.fujitsu.com> Signed-off-by: Qu Wenruo <wqu@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2018-02-02 16:01:58 +01:00
Qu Wenruo	d1b55330c2	btrfs-progs: check: Move fs_root_objectid function to check/common.h Just another small wrapper shared between original and lowmem mode. Reviewed-by: Su Yue <suy.fnst@cn.fujitsu.com> Signed-off-by: Qu Wenruo <wqu@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2018-02-02 16:01:58 +01:00
Qu Wenruo	58a19f1ea7	btrfs-progs: check: Move imode_to_type function to check/common.h This function is shared between original and lowmem mode, and it's small enough, so move it to check/common.h. Reviewed-by: Su Yue <suy.fnst@cn.fujitsu.com> Signed-off-by: Qu Wenruo <wqu@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2018-02-02 16:01:58 +01:00
Qu Wenruo	96c6e1f0aa	btrfs-progs: check: Export check global variables to check/common.h There are about a dozen variables which are used as "check global" variables, like @total_csum_bytes or @no_holes. These variables are used freely across the check code; however, since we're splitting check code, they need to be exported so they can be used in other files. This patch just exports them and adds declarations for them in check/common.h. Reviewed-by: Su Yue <suy.fnst@cn.fujitsu.com> Signed-off-by: Qu Wenruo <wqu@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2018-02-02 16:01:58 +01:00
Qu Wenruo	5c9ac8dc48	btrfs-progs: check: Move node_refs structure to check/common.h Reviewed-by: Su Yue <suy.fnst@cn.fujitsu.com> Signed-off-by: Qu Wenruo <wqu@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2018-02-02 16:01:58 +01:00
Qu Wenruo	16e58a4003	btrfs-progs: check: Move definitions of lowmem mode to check/lowmem.h Unlike original mode, lowmem mode mostly uses normal tree operations, so no structure definitions, only a lot of random error bits. Reviewed-by: Su Yue <suy.fnst@cn.fujitsu.com> Signed-off-by: Qu Wenruo <wqu@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2018-02-02 16:01:57 +01:00
Qu Wenruo	c10efe7c1c	btrfs-progs: check: Move original mode definitions to check/original.h Reviewed-by: Su Yue <suy.fnst@cn.fujitsu.com> Signed-off-by: Qu Wenruo <wqu@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2018-02-02 16:01:57 +01:00
Qu Wenruo	c772db30e9	btrfs-progs: Move cmds-check.c to check/main.c Reviewed-by: Su Yue <suy.fnst@cn.fujitsu.com> Signed-off-by: Qu Wenruo <wqu@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2018-02-02 16:01:57 +01:00

334 Commits