btrfs-progs

Commit Graph

Author	SHA1	Message	Date
Neal Gompa	e3232c2abb	btrfs-progs: mkfs: make 4k sectorsize default We have had working subpage support in Btrfs for many cycles now. Generally, we do not want people creating filesystems by default with non-4k sectorsizes since it creates portability problems. As the subpage has stabilized it seems to be safe to do the switch. This may still affect users that relying on the previous behaviour. Issue: #604 Reviewed-by: Anand Jain <anand.jain@oracle.com> Reviewed-by: Qu Wenruo <wqu@suse.com> Reviewed-by: Josef Bacik <josef@toxicpanda.com> Reviewed-by: Eric Curtin <ecurtin@redhat.com> Signed-off-by: Neal Gompa <neal@gompa.dev> Signed-off-by: David Sterba <dsterba@suse.com>	2024-01-18 02:37:45 +01:00
Qu Wenruo	389c959d6d	btrfs-progs: implement arg_strtou64_with_suffix() with a new helper This patch introduces a new parser helper, parse_u64_with_suffix(), which has a better error handling, following all the parse_*() helpers to return non-zero value for errors. This new helper is going to replace parse_size_from_string(), which would directly call exit(1) to stop the whole program. Furthermore most callers of parse_size_from_string() are expecting exit(1) for error, so that they can skip the error handling. For those call sites, introduce a wrapper, arg_strtou64_with_suffix(), to do that. The only disadvantage is a little less detailed error report for why the parse failed, but for most cases the generic error string should be enough. Signed-off-by: Qu Wenruo <wqu@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2024-01-18 02:14:23 +01:00
David Sterba	b1e2de452a	btrfs-progs: mkfs: print zone count for each device In zoned mode print zone count for each device, the zone size must be the same so it's sufficient to print it in the summary. $ mkfs.btrfs -O zoned /dev/nullb[0-3] ... Zoned device: yes Zone size: 16.00MiB ... Devices: ID SIZE ZONES PATH 1 512.00MiB 32 /dev/nullb0 2 256.00MiB 16 /dev/nullb1 3 1.00GiB 64 /dev/nullb2 4 2.00GiB 128 /dev/nullb3 Issue: #693 Signed-off-by: David Sterba <dsterba@suse.com>	2023-11-03 18:04:37 +01:00
David Sterba	2069bfe016	btrfs-progs: mkfs: drop unsigned long long casts for printf The %llu specifier does not need the typecast for ULL for a long time, remove it. Signed-off-by: David Sterba <dsterba@suse.com>	2023-11-03 18:04:37 +01:00
David Sterba	b4f43d72ff	btrfs-progs: mkfs: support parametric zone size In experimental build, read global '--param zone-size=SIZE' and use it as emulated zone size. This is for testing only, will be promoted to a proper option in the future. Signed-off-by: David Sterba <dsterba@suse.com>	2023-11-03 18:04:37 +01:00
David Sterba	9908894102	btrfs-progs: mkfs: validate device uuid set on command line We need to validate the device uuid the same way as the fsid: $ ./mkfs.btrfs --device-uuid 18eabcf0-6766-4fbf-b366-71b4ae725b2- img btrfs-progs v6.5.2 See https://btrfs.readthedocs.io for more information. ERROR: could not parse device UUID: 18eabcf0-6766-4fbf-b366-71b4ae725b2- Signed-off-by: David Sterba <dsterba@suse.com>	2023-10-21 15:51:07 +02:00
David Sterba	d8032c3b8b	btrfs-progs: mkfs: print device uuid if set from command line Print the device uuid in the summary in case it's specified on the command line, not always as it would be confusing and is not usually needed. Can be found in 'btrfs inspect-internal dump-super' as device_item.uuid . Signed-off-by: David Sterba <dsterba@suse.com>	2023-10-21 15:51:07 +02:00
Anand Jain	3f27e60866	btrfs-progs: mkfs: add option to specify device uuid Add option --device-uuid that will set the device uuid item in super block. This is useful for creating a filesystem with a specific device uuid, namely for testing. Signed-off-by: Anand Jain <anand.jain@oracle.com> Signed-off-by: David Sterba <dsterba@suse.com>	2023-10-21 15:51:07 +02:00
Anand Jain	b0c4dfaaac	btrfs-progs: document allowing duplicate fsid The commit ("btrfs-progs: allow duplicate fsid for single device filesystems") lets the duplicate fsid used for a new mkfs document this. Signed-off-by: Anand Jain <anand.jain@oracle.com> Signed-off-by: David Sterba <dsterba@suse.com>	2023-10-21 15:51:07 +02:00
Qu Wenruo	0af60b1faf	btrfs-progs: mkfs: add the missing xattr for the rootdir inode [BUG] When using "mkfs.btrfs" with "--rootdir" option, the top level inode (rootdir) will not get the same xattr from the source dir: mkdir -p source_dir/ touch source_dir/file setfattr -n user.rootdir_xattr source_dir/ setfattr -n user.regular_xattr source_dir/file mkfs.btrfs -f --rootdir source_dir $dev mount $dev $mnt getfattr $mnt # Nothing <<< getfattr $mnt/file # file: $mnt/file user.regular_xattr <<< [CAUSE] In function traverse_directory(), we only call add_xattr_item() for all the child inodes, not really for the rootdir inode itself, leading to the missing xattr items. Not only xattr, in fact we also miss the uid/gid/timestamps/mode for the rootdir inode. [FIX] Extract a dedicated function, copy_rootdir_inode(), to handle every needed attributes for the rootdir inode, including: - xattr - uid - gid - mode - timestamps Issue: #688 Signed-off-by: Qu Wenruo <wqu@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2023-10-21 15:51:07 +02:00
Qu Wenruo	4f5a455d1b	btrfs-progs: mkfs: do not enlarge the target block device [BUG] When running mkfs.btrfs with --rootdir on a block device, and the source directory contains a sparse file, whose size is larger than the block size, then mkfs.btrfs would fail: # lsblk /dev/test/test NAME MAJ:MIN RM SIZE RO TYPE MOUNTPOINTS test-test 253:0 0 10G 0 lvm # mkdir -p /tmp/output # truncate -s 20G /tmp/output/file # mkfs.btrfs -f --rootdir /tmp/output /dev/test/test # sudo mkfs.btrfs -f /dev/test/scratch1 --rootdir /tmp/output/ btrfs-progs v6.3.3 See https://btrfs.readthedocs.io for more information. ERROR: unable to zero the output file [CAUSE] Mkfs.btrfs would try to zero out the target file according to the total size of the directory. However the directory size is calculated using the file size, not the real bytes taken by the file, thus for such sparse file with holes only, it would still take 20G. Then we would use that 20G size to zero out the target file, but if the target file is a block device, we would fail as we can not enlarge a block device. [FIX] When zeroing the file, we only enlarge it if the target is a regular file. Otherwise we warn about the size and continue. Please note that, since "mkfs.btrfs --rootdir" doesn't handle sparse file any differently from regular file, above case would still fail due to ENOSPC, as will write zeros into the target file inside the fs. Proper handling for sparse files would need a new series of patch to address. Issue: #653 Signed-off-by: Qu Wenruo <wqu@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2023-10-21 15:51:07 +02:00
David Sterba	b421fdff95	btrfs-progs: move raid-stripe-tree and squota build out of experimental The kernel patches for RST and squota are queued for 6.7, we need to be able to test the features so it's not necessary to hide the mkfs support under experimental build. The kernel may still need debug build to enable mount. Signed-off-by: David Sterba <dsterba@suse.com>	2023-10-17 19:33:59 +02:00
David Sterba	21aa6777b2	btrfs-progs: clean up includes, using include-what-you-use Signed-off-by: David Sterba <dsterba@suse.com>	2023-10-03 01:11:57 +02:00
Josef Bacik	0fa89a9da7	btrfs-progs: move btrfs_uuid_tree_add into mkfs/main.c This function is only used in mkfs, and doesn't exist in the kernel in ctree.c. Additionally we have a uuid lookup function to see if the uuid exists in the tree, which for mkfs it won't because we just created the tree. Move btrfs_uuid_tree_add into mkfs, and remove the lookup function as it's not needed. Signed-off-by: Josef Bacik <josef@toxicpanda.com> Signed-off-by: David Sterba <dsterba@suse.com>	2023-10-03 01:11:56 +02:00
Josef Bacik	3808db2b3e	btrfs-progs: move btrfs_record_file_extent and code into a new file This function and it's related functions only exist for the utilities that populate existing file systems, and do not exist in the upstream kernel. Move this function and the related function into it's own common source file and out of the kernel-shared sources, and then update all of the users to include the new location of this code. Signed-off-by: Josef Bacik <josef@toxicpanda.com> Signed-off-by: David Sterba <dsterba@suse.com>	2023-10-03 01:11:56 +02:00
Josef Bacik	8069b8b8cd	btrfs-progs: drop btrfs_init_path This simply zero's out the path, and this is used everywhere we use a stack path. Drop this usage and simply init the path's to empty instead of using a function to do the memset. Signed-off-by: Josef Bacik <josef@toxicpanda.com> Signed-off-by: David Sterba <dsterba@suse.com>	2023-10-03 01:11:56 +02:00
Boris Burkov	14ac1a6051	btrfs-progs: mkfs: add support for squota Add the ability to enable simple quotas from mkfs with '-O squota' There is some complication around handling enable_gen while still counting the root node of an fs. To handle this, employ a hack of doing a no-op write on the root node to bump its generation up above that of the qgroup enable generation, which results in counting it properly. Reviewed-by: Josef Bacik <josef@toxicpanda.com> Signed-off-by: Boris Burkov <boris@bur.io> Signed-off-by: David Sterba <dsterba@suse.com>	2023-10-03 01:11:55 +02:00
Anand Jain	ff4c4a3a00	btrfs-progs: allow duplicate fsid for single device filesystems For single device btrfs filesystem, allow duplicate fsid to be created. This should be used with caution as more devices with the same uuid could be confused with each other. Signed-off-by: Anand Jain <anand.jain@oracle.com> Signed-off-by: David Sterba <dsterba@suse.com>	2023-10-02 18:41:08 +02:00
Johannes Thumshirn	fff57d3774	btrfs-progs: load zone info for all zoned devices Signed-off-by: Johannes Thumshirn <johannes.thumshirn@wdc.com> Signed-off-by: David Sterba <dsterba@suse.com>	2023-10-02 18:41:08 +02:00
Johannes Thumshirn	b4ab282686	btrfs-progs: allow zoned RAID Allow for RAID levels 0, 1 and 10 on zoned devices if the RAID stripe tree is used. Signed-off-by: Johannes Thumshirn <johannes.thumshirn@wdc.com> Signed-off-by: David Sterba <dsterba@suse.com>	2023-10-02 18:41:08 +02:00
David Sterba	76c0446bec	btrfs-progs: mkfs: convert int to bool in a few helpers Signed-off-by: David Sterba <dsterba@suse.com>	2023-07-27 14:45:29 +02:00
Anand Jain	d46a0ef6a0	btrfs-progs: rename struct open_ctree_flags to open_ctree_args The struct open_ctree_flags currently holds arguments for open_ctree_fs_info(), it can be confusing when mixed with a local variable named open_ctree_flags as below in the function cmd_inspect_dump_tree(). cmd_inspect_dump_tree() :: struct open_ctree_flags ocf = { 0 }; :: unsigned open_ctree_flags; So rename struct open_ctree_flags to struct open_ctree_args. Reviewed-by: Qu Wenruo <wqu@suse.com> Signed-off-by: Anand Jain <anand.jain@oracle.com> Signed-off-by: David Sterba <dsterba@suse.com>	2023-07-26 15:00:47 +02:00
Dominique Martinet	9362803539	btrfs-progs: mkfs: make --quiet silence the 5.15 default change NOTE mkfs.btrfs help message for --quiet is 'no message except errors' so we probably ought to silence this as well in the quiet case. Author: Dominique Martinet <dominique.martinet@atmark-techno.com> Signed-off-by: David Sterba <dsterba@suse.com>	2023-06-09 11:57:39 +02:00
David Sterba	ae73e89f28	btrfs-progs: mkfs: more verbose output for --rootdir Print the source directory for --rootdir and if --shrink is used. With -vv then print the individual files as added: $ mkfs.btrfs --rootdir dir --shrink -vv img ... Rootdir from: Documentation ADD: /btrfs-progs/Documentation/btrfs-check.rst ... ADD: /btrfs-progs/Documentation/btrfs-send.rst Shrink: yes Label: (null) UUID: 40d3a16f-02d8-40d7-824b-239cee528093 ... The 'Rootdir from' is printed before the files are added so there's now message before the files are added which could take some time. Issue: #627 Signed-off-by: David Sterba <dsterba@suse.com>	2023-05-26 22:17:33 +02:00
David Sterba	95c1fa1871	btrfs-progs: mkfs: remove redundant variable for source dir Validity of source dir can be determined by the variable itself, no need to track it separately. Signed-off-by: David Sterba <dsterba@suse.com>	2023-05-26 22:17:33 +02:00
Qu Wenruo	08a3bd7694	btrfs-progs: tune: add the ability to generate new data checksums This patch would modify btrfs_csum_file_block() to handle csum type other than the one used in the current fs. The new data checksum would use a different objectid (-13) to distinguish with the existing one (-10). This needs to change tree-checker to skip the item size checks, since new csum can be larger than the original csum. After this stage, the resulted csum tree would look like this: item 0 key (CSUM_CHANGE EXTENT_CSUM 13631488) itemoff 8091 itemsize 8192 range start 13631488 end 22020096 length 8388608 item 1 key (EXTENT_CSUM EXTENT_CSUM 13631488) itemoff 7067 itemsize 1024 range start 13631488 end 14680064 length 1048576 Note the itemsize is 8 times the original one, as the original csum is CRC32, while target csum is SHA256, which is 8 times the size. Signed-off-by: Qu Wenruo <wqu@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2023-05-26 18:02:32 +02:00
Qu Wenruo	46364d3766	btrfs-progs: replace write_and_map_eb() by write_data_to_disk() The function write_and_map_eb() is quite abused as a way to write any generic buffer back to disk. But we have a more suitable function already, write_data_to_disk(). This patch would remove the abused write_data_to_disk() calls, and convert the only three valid call sites to write_data_to_disk() instead. Signed-off-by: Qu Wenruo <wqu@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2023-05-26 18:02:31 +02:00
Josef Bacik	f8efe9f724	btrfs-progs: sync file-item.h into progs This patch syncs file-item.h into btrfs-progs. This carries with it an API change for btrfs_del_csums, which takes a root argument in the kernel, so all callsites have been updated accordingly. I didn't sync file-item.c because it carries with it a bunch of bio related helpers which are difficult to adapt to the kernel. Additionally there's a few helpers in the local copy of file-item.c that aren't in the kernel that are required for different tools. This requires more cleanups in both the kernel and progs in order to sync file-item.c, so for now just do file-item.h in order to pull things out of ctree.h. Signed-off-by: Josef Bacik <josef@toxicpanda.com> Signed-off-by: David Sterba <dsterba@suse.com>	2023-05-26 18:02:29 +02:00
Josef Bacik	c979ffd787	btrfs-progs: sync accessors.[ch] from the kernel This syncs accessors.[ch] from the kernel. For the most part accessors.h will remain the same, there's just some helpers that need to be adjusted for eb->data instead of eb->pages. Additionally accessors.c needed to be completely updated to deal with this as well. This is a set of files where we will likely only sync the header going forward, and leave the C file in place as it needs to be specific to btrfs-progs. This forced a few "unrelated" changes - Using btrfs_dir_item_ftype() instead of btrfs_dir_item_type(). This is due to the encryption changes, and was simpler to just do in this patch. - Adjusting some of the print tree code to use the actual helpers and not the btrfs-progs ones. A local definition of static_assert is used to avoid compilation failures on older gcc (< 9) where the 2nd parameter is mandatory. Signed-off-by: Josef Bacik <josef@toxicpanda.com> Signed-off-by: David Sterba <dsterba@suse.com>	2023-05-26 18:02:28 +02:00
Josef Bacik	a754fe29d9	btrfs-progs: sync uapi/btrfs.h into btrfs-progs We want to keep this file locally as we want to be uptodate with upstream, so we can build btrfs-progs regardless of which kernel is currently installed. Sync this with the upstream version and put it in kernel-shared/uapi to maintain some semblance of where this file comes from. There are some changes that need to be synced back to kernel. A local definition of static_assert is used to avoid compilation problems on gcc (< 9) due to mandatory 2nd parameter. Signed-off-by: Josef Bacik <josef@toxicpanda.com> Signed-off-by: David Sterba <dsterba@suse.com>	2023-05-26 18:02:28 +02:00
Josef Bacik	bf0f3db765	btrfs-progs: introduce UASSERT() for purely userspace code While syncing messages.[ch] I had to back out the ASSERT() code in kerncompat.h, which means we now rely on the kernel code for ASSERT(). In order to maintain some semblance of separation introduce UASSERT() and use that in all the purely userspace code. Signed-off-by: Josef Bacik <josef@toxicpanda.com> Signed-off-by: David Sterba <dsterba@suse.com>	2023-05-26 18:02:28 +02:00
psykose	c9abbf6264	btrfs-progs: stop using legacy 64 interfaces The 64 interfaces, such as fstat64, off64_t, etc, are legacy interfaces created at a time when 64-bit file support was still new. They are generally exposed when defining a macro named _LARGEFILE64_SOURCE, as e.g. the glibc docs[0] say. The modern way to utilise largefile support, is to continue to use the regular interfaces (off_t, fstat, ..), and define _FILE_OFFSET_BITS=64. We already use the autoconf macro AC_SYS_LARGEFILE[1] which arranges this and sets this macro for us. Therefore, we can utilise the non-64 names without fear of breaking on 32-bit systems. This fixes the build against musl libc, ever since musl dropped the 64 compat from interfaces by default[2] just for _GNU_SOURCE, unless _LARGEFILE64_SOURCE is defined. However, there are plans for a future removal of the whole 64 header API, and that workaround (adding another define) might cease to exist. So, rename all 64 API use to the regular non-suffixed names. For consistency, rename the internal functions that were 64 named (lstat64_path, ..) too. This should have no regressions on any platform. [0]: https://www.gnu.org/software/libc/manual/html_node/Feature-Test-Macros.html#index-_005fLARGEFILE64_005fSOURCE [1]: https://www.gnu.org/software/autoconf/manual/autoconf-2.67/html_node/System-Services.html [2]: `25e6fee27f` Pull-request: #615 Signed-off-by: psykose <alice@ayaya.dev> Signed-off-by: David Sterba <dsterba@suse.com>	2023-04-25 16:59:42 +02:00
Qu Wenruo	b2a1be83b8	btrfs-progs: mkfs: keep file descriptors open during whole time [BUG] There is an internal bug report that, after mkfs.btrfs there is a chance that no /dev/disk/by-uuid/<uuid> symlink is not created at all. [CAUSE] That uuid symlink is created by udev, which listens to inotify IN_CLOSE_WRITE events from all block devices. After such IN_CLOSE_WRITE event is triggered, udev would disable inotify for that block device, and do a blkid scan on it. After the blkid scan is done, re-enables the inotify listening. This means normally mkfs tools should open the fd, do all the writes, and close the fd after everything is done. But unfortunately for mkfs.btrfs, it's not the case, we have a lot of phases separated by different close() calls: open_ctree() would open fds of each involved device and close them at close_ctree() Only after close_ctree() we have a valid superblock -\ \| \|<------- A -------->\|<--------- B --------->\|<------- C ------->\| \| \| \| `- open a new fd for make_btrfs() \| and close it before open_ctree() \| The device contains invalid sb. \| `- open a new fd for each device, then call btrfs_prepare_device(), then close the fd. The device would contain no valid superblock. If at the close() of phase A udev event is triggered, while doing udev scan we go into phase C (but before the new valid super blocks written), udev would only see no superblock or invalid superblock. Then phase C finished, udev resumes its inotify listening, but at this time mkfs is finished, while udev only sees the premature data from phase A, and misses the IN_CLOSE_WRITE events from phase C. [FIX] Instead of opening and closing a new fd for each device, re-use the fd opened during prepare_one_device(), and close all the fds until close_ctree() is called. By this, although we may still have race between close_ctree() and explicit close() calls, at least udev can always see the properly written super blocks. To compensate the change, some extra cleanups are made: - Do not touch @device_count Which makes later prepare_ctx iteration much easier. - Remove top-level @fd variable Instead go with prepare_ctx[i].fd. - Do not open with O_RDWR in test_dev_for_mkfs() as test_dev_for_mkfs() would close the fd, if we go O_RDWR, it can cause the udev race. Reviewed-by: Anand Jain <anand.jain@oracle.com> Signed-off-by: Qu Wenruo <wqu@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2023-04-25 16:59:41 +02:00
Qu Wenruo	4dbe66ca2f	btrfs-progs: mkfs: make -R\|--runtime-features option deprecated The option -R\|--runtime-features was introduced to support features that don't result in a full incompat flag change, thus things like free-space-tree and quota features are put here. But to end users, such separation of features is not helpful and can be sometimes confusing. Thus we're already migrating those runtime features into -O\|--features option under experimental builds. I believe this is the proper time to move those runtime features into -O\|--features option, and mark the -R\|--runtime-features option deprecated. For now we still keep the old option as for compatibility purposes. Reviewed-by: Anand Jain <anand.jain@oracle.com> Signed-off-by: Qu Wenruo <wqu@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2023-04-17 19:27:53 +02:00
David Sterba	a7fa81f296	btrfs-progs: open code print_usage where applicable After previous change to usage() that now has the return code, there's no purpose of the print_usage() wrapper so it can be removed. Signed-off-by: David Sterba <dsterba@suse.com>	2023-02-28 20:11:23 +01:00
Qu Wenruo	f61b90aff9	btrfs-progs: make usage call properly return an exit value [BUG] Currently cli/009 test case failed with different exit number: ====== RUN CHECK /home/adam/btrfs-progs/btrfstune --help usage: btrfstune [options] device [...] failed: /home/adam/btrfs-progs/btrfstune --help test failed for case 009-btrfstune [CAUSE] In tune/main.c, we have the following call on usage(): static void print_usage(int ret) { usage(&tune_cmd); exit(ret); } However usage() itself would always call exit(1): void usage(const struct cmd_struct *cmd) { usage_command_usagestr(cmd->usagestr, NULL, 0, true, true); exit(1); } This makes prevents any caller of usage() to modify its exit number. [FIX] Add a new argument @error for print_usage(), so we can properly return 0 for -h/--help usage. Signed-off-by: Qu Wenruo <wqu@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2023-02-28 20:11:23 +01:00
David Sterba	24ec095295	btrfs-progs: crypto: add common function for accelerated initialization Prepare a single location that will detect or set accelerated versions of hash algorithms. Right now it's the crc32c, blake2 and sha256 do an if-else switch while crc32c sets a function pointer. Signed-off-by: David Sterba <dsterba@suse.com>	2023-02-28 19:49:31 +01:00
Qu Wenruo	f914949b1a	btrfs-progs: fix set but not used variables [WARNING] Clang 15.0.7 warns about several unused variables: kernel-shared/zoned.c:829:6: warning: variable 'num_sequential' set but not used [-Wunused-but-set-variable] u32 num_sequential = 0, num_conventional = 0; ^ cmds/scrub.c:1174:6: warning: variable 'n_skip' set but not used [-Wunused-but-set-variable] int n_skip = 0; ^ mkfs/main.c:493:6: warning: variable 'total_block_count' set but not used [-Wunused-but-set-variable] u64 total_block_count = 0; ^ image/main.c:2246:6: warning: variable 'bytenr' set but not used [-Wunused-but-set-variable] u64 bytenr = 0; ^ [CAUSE] Most of them are just straightforward set but not used variables. The only exception is total_block_count, which has commented out code relying on it. [FIX] Just remove those variables, and for @total_block_count, also remove the comments. Signed-off-by: Qu Wenruo <wqu@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2023-02-18 17:44:03 +01:00
David Sterba	347c8209e8	btrfs-progs: mkfs: convert help text to option formatter Signed-off-by: David Sterba <dsterba@suse.com>	2023-01-25 19:55:47 +01:00
David Sterba	dfd58c294b	btrfs-progs: mkfs: use help and cmd_struct for printing help text Unify the mkfs help text so it uses the help framework. The cmd struct is set up only partially. Signed-off-by: David Sterba <dsterba@suse.com>	2023-01-25 19:55:47 +01:00
Naohiro Aota	d8c6021727	btrfs-progs: mkfs: check blkid version on zoned filesystems Prior to version 2.38, libblkid fails to detect zoned mode's superblock location resulting in blkid failing to detect btrfs on zoned block devices. This patch suggest to the user to upgrade libblkid if it detects a version lower then 2.38. Reviewed-by: Johannes Thumshirn <johannes.thumshirn@wdc.com> Signed-off-by: Naohiro Aota <naohiro.aota@wdc.com> Signed-off-by: David Sterba <dsterba@suse.com>	2023-01-25 16:19:55 +01:00
David Sterba	f5e07cc60a	btrfs-progs: warn when an experimental functionality is used Print warning when one of the following is requested by some command line option: - btrfstune -b: conversion to block-group-tree - mkfs.btrfs --num-global-roots: extent-tree-v2 - btrfs-image -d: dump image with data Issue: #523 Signed-off-by: David Sterba <dsterba@suse.com>	2022-10-20 16:39:11 +02:00
Qu Wenruo	d8f1bd519f	btrfs-progs: mkfs: fix a stack over-flow when features string are too long [BUG] Even with chunk_objectid bug fixed, mkfs.btrfs can still caused stack overflow when enabling extent-tree-v2 feature (need experimental features enabled): # ./mkfs.btrfs -f -O extent-tree-v2 ~/test.img btrfs-progs v5.19.1 See http://btrfs.wiki.kernel.org for more information. ERROR: superblock magic doesn't match NOTE: several default settings have changed in version 5.15, please make sure this does not affect your deployments: - DUP for metadata (-m dup) - enabled no-holes (-O no-holes) - enabled free-space-tree (-R free-space-tree) Label: (null) UUID: 205c61e7-f58e-4e8f-9dc2-38724f5c554b Node size: 16384 Sector size: 4096 Filesystem size: 512.00MiB Block group profiles: Data: single 8.00MiB Metadata: DUP 32.00MiB System: DUP 8.00MiB SSD detected: no Zoned device: no ================================================================= [... Skip full ASAN output ...] ==65655==ABORTING [CAUSE] For experimental build, we have unified feature output, but the old buffer size is only 64 bytes, which is too small to cover the new full feature string: extref, skinny-metadata, no-holes, free-space-tree, block-group-tree, extent-tree-v2 Above feature string is already 84 bytes, over the 64 on-stack memory size. This can also be proved by the ASAN output: ==65655==ERROR: AddressSanitizer: stack-buffer-overflow on address 0x7ffc4e03b1d0 at pc 0x7ff0fc05fafe bp 0x7ffc4e03ac60 sp 0x7ffc4e03a408 WRITE of size 17 at 0x7ffc4e03b1d0 thread T0 #0 0x7ff0fc05fafd in __interceptor_strcat /usr/src/debug/gcc/libsanitizer/asan/asan_interceptors.cpp:377 #1 0x55cdb7b06ca5 in parse_features_to_string common/fsfeatures.c:316 #2 0x55cdb7b06ce1 in btrfs_parse_fs_features_to_string common/fsfeatures.c:324 #3 0x55cdb7a37226 in main mkfs/main.c:1783 #4 0x7ff0fbe3c28f (/usr/lib/libc.so.6+0x2328f) #5 0x7ff0fbe3c349 in __libc_start_main (/usr/lib/libc.so.6+0x23349) #6 0x55cdb7a2cb34 in _start ../sysdeps/x86_64/start.S:115 [FIX] Introduce a new macro, BTRFS_FEATURE_STRING_BUF_SIZE, along with a new sanity check helper, btrfs_assert_feature_buf_size(). The problem is I can not find a build time method to verify BTRFS_FEATURE_STRING_BUF_SIZE is large enough to contain all feature names, thus have to go the runtime function to do the BUG_ON() to verify the macro size. Now the minimal buffer size for experimental build is 138 bytes, just bump it to 160 for future expansion. And if further features go beyond that number, mkfs.btrfs/btrfs-convert will immediately crash at that BUG_ON(), so we can definitely detect it. Reviewed-by: Anand Jain <anand.jain@oracle.com> Tested-by: Anand Jain <anand.jain@oracle.com> Signed-off-by: Qu Wenruo <wqu@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2022-10-11 09:08:12 +02:00
Qu Wenruo	56e75c9f75	btrfs-progs: mkfs: fix a crash when enabling extent-tree-v2 [BUG] When enabling extent-tree-v2 feature at mkfs time (need to enable experimental features), mkfs.btrfs will crash: # ./mkfs.btrfs -f -O extent-tree-v2 ~/test.img btrfs-progs v5.19.1 See http://btrfs.wiki.kernel.org for more information. ERROR: superblock magic doesn't match NOTE: several default settings have changed in version 5.15, please make sure this does not affect your deployments: - DUP for metadata (-m dup) - enabled no-holes (-O no-holes) - enabled free-space-tree (-R free-space-tree) Segmentation fault (core dumped) [CAUSE] The block group tree looks like this after make_btrfs() call: (gdb) call btrfs_print_tree(root->fs_info->block_group_root->node, 0) leaf 1163264 items 1 free space 16234 generation 1 owner BLOCK_GROUP_TREE leaf 1163264 flags 0x0() backref revision 1 checksum stored f137c1ac checksum calced f137c1ac fs uuid 450d4b15-4954-4574-9801-8c6d248aaec6 chunk uuid 4c4cc54d-f240-4aa4-b88b-bd487db43444 item 0 key (1048576 BLOCK_GROUP_ITEM 4194304) itemoff 16259 itemsize 24 block group used 131072 chunk_objectid 256 flags SYSTEM\|single ^^^ This looks completely sane, but notice that chunk_objectid 256. That 256 value is the expected one for regular non-extent-tree-v2 btrfs, but for extent-tree-v2, chunk_objectid is reused as the global id of extent tree where the block group belongs to. With the old 256 value as chunk_objectid, btrfs will not find an extent tree root for the block group, and return NULL for btrfs_extent_root() call, and trigger segfault. This is a regression caused by commit `1430b41427` ("btrfs-progs: separate block group tree from extent tree v2"), which doesn't take extent-tree-v2 on-disk format into consideration. [FIX] For the initial btrfs created by make_btrfs(), all block group items will be in extent-tree global id 0, thus we can reset chunk_objectid to 0, if and only if extent-tree-v2 is enabled. Reviewed-by: Anand Jain <anand.jain@oracle.com> Tested-by: Anand Jain <anand.jain@oracle.com> Signed-off-by: Qu Wenruo <wqu@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2022-10-11 09:08:12 +02:00
Qu Wenruo	bed70b939f	btrfs-progs: fsfeatures: properly merge -O and -R options [BUG] Commit "btrfs-progs: prepare merging compat feature lists" tries to merged "-O" and "-R" options, as they don't correctly represents btrfs features. But that commit caused the following bug during mkfs for experimental build: $ mkfs.btrfs -f -O block-group-tree /dev/nvme0n1 btrfs-progs v5.19.1 See http://btrfs.wiki.kernel.org for more information. ERROR: superblock magic doesn't match ERROR: illegal nodesize 16384 (not equal to 4096 for mixed block group) [CAUSE] Currently btrfs_parse_fs_features() will return a u64, and reuse the same u64 for both incompat and compat RO flags for experimental branch. This can easily leads to conflicts, as BTRFS_FEATURE_INCOMPAT_MIXED_BLOCK_GROUP and BTRFS_FEATURE_COMPAT_RO_BLOCK_GROUP_TREE both share the same bit (1 << 2). Thus for above case, mkfs.btrfs believe it has set MIXED_BLOCK_GROUP feature, but what we really want is BLOCK_GROUP_TREE. [FIX] Instead of incorrectly re-using the same bits in btrfs_feature, split the old flags into 3 flags: - incompat_flag - compat_ro_flag - runtime_flag The first two flags are easy to understand, the corresponding flag of each feature. The last runtime_flag is to compensate features which doesn't have any on-disk flag set, like QUOTA and LIST_ALL. And since we're no longer using a single u64 as features, we have to introduce a new structure, btrfs_mkfs_features, to contain above 3 flags. This also mean, things like default mkfs features must be converted to use the new structure, thus those old macros are all converted to const static structures: - BTRFS_MKFS_DEFAULT_FEATURES + BTRFS_MKFS_DEFAULT_RUNTIME_FEATURES -> btrfs_mkfs_default_features - BTRFS_CONVERT_ALLOWED_FEATURES -> btrfs_convert_allowed_features And since we're using a structure, it's not longer as easy to implement a disallowed mask. Thus functions with @mask_disallowed are all changed to using an @allowed structure pointer (which can be NULL). Finally if we have experimental features enabled, all features can be specified by -O options, and we can output a unified feature list, instead of the old split ones. Signed-off-by: Qu Wenruo <wqu@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2022-10-11 09:08:11 +02:00
Qu Wenruo	2cdc8dddbf	btrfs-progs: mkfs: offset inode numbers of the source filesystem [BUG] When running mkfs tests on a newly rebooted minimal system, it can cause mkfs/009 to fail. The reproduce steps requires /tmp to has minimal files in the first place. # mkdir /tmp/rootdir # xfs_io -f -c "pwrite 0 16k" /tmp/rootdir # mkfs.btrfs --rootdir /tmp/rootdir -f $dev # btrfs check $dev Opening filesystem to check... Checking filesystem on /dev/test/scratch1 UUID: 6821b3db-f056-4c18-b797-32679dcd4272 [1/7] checking root items [2/7] checking extents data backref 13631488 root 5 owner 170 offset 0 num_refs 0 not found in extent tree incorrect local backref count on 13631488 root 5 owner 170 offset 0 found 1 wanted 0 back 0x55ff6cd72260 backref 13631488 root 5 not referenced back 0x55ff6cd4c1f0 incorrect global backref count on 13631488 found 2 wanted 1 backpointer mismatch on [13631488 16384] ERROR: errors found in extent allocation tree or chunk allocation [CAUSE] The extent tree has the following weird item: item 0 key (13631488 EXTENT_ITEM 16384) itemoff 16250 itemsize 33 refs 1 gen 0 flags DATA tree block backref root FS_TREE This is an extent item for data, thus it should not have an inline tree backref. Then checking the fs tree: item 0 key (170 INODE_ITEM 0) itemoff 16123 itemsize 160 generation 7 transid 0 size 16384 nbytes 16384 block group 0 mode 100600 links 1 uid 1000 gid 1000 rdev 0 sequence 0 flags 0x0(none) atime 1664866393.0 (2022-10-04 14:53:13) ctime 1664863510.0 (2022-10-04 14:05:10) mtime 1664863455.0 (2022-10-04 14:04:15) otime 0.0 (1970-01-01 08:00:00) There is an inode item before the root dir inode. And that inode number 170 is causing the problem. In traverse_directory(), we use the inode number reported from stat() directly as btrfs inode number, and pass it to btrfs_record_file_extent(), which finally calls btrfs_inc_extent_ref(), with above 170 passed as @owner parameter. But inside btrfs_inc_extent_ref() we use that @owner value to determine if it's a data backref. Since we got a smaller than BTRFS_FIRST_FREE_OBJECTID, btrfs treats it as tree block, and cause the above problem. [FIX] As a quick fix, always add BTRFS_FIRST_FREE_OBJECTID to all inode number directly grabbed from stat(). And add an ASSERT() in __btrfs_record_file_extent() to catch unexpected objectid. This is not a perfect solution, as the resulted fs will has a huge gap in its inodes: item 0 key (256 INODE_ITEM 0) itemoff 16123 itemsize 160 item 4 key (426 INODE_ITEM 0) itemoff 15883 itemsize 160 For a proper fix, we should allocate new btrfs inode numbers in a sequential order, but that would be another series of patches. Signed-off-by: Qu Wenruo <wqu@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2022-10-11 09:08:10 +02:00
David Sterba	ccb2d4aa45	btrfs-progs: device-utils: rename btrfs_device_size There's a group of helpers to read device size, the btrfs_device_size should be one of them. Rename it and so minor cleanup. Signed-off-by: David Sterba <dsterba@suse.com>	2022-10-11 09:08:10 +02:00
David Sterba	ea0b894967	btrfs-progs: mkfs: do proper error handling Replace BUG_ON after transaction start failures, all the functions already handle errors and return them to the caller. The other error handling is for impossible conditions. Signed-off-by: David Sterba <dsterba@suse.com>	2022-10-11 09:08:10 +02:00
David Sterba	a827bb2db8	btrfs-progs: use template for transaction commit error messages Signed-off-by: David Sterba <dsterba@suse.com>	2022-10-11 09:08:10 +02:00
David Sterba	8fcafae04a	btrfs-progs: use template for transaction start error messages Signed-off-by: David Sterba <dsterba@suse.com>	2022-10-11 09:08:10 +02:00

1 2 3 4 5 ...

265 Commits