btrfs-progs

mirror of https://github.com/kdave/btrfs-progs synced 2025-01-19 20:20:47 +00:00

Author	SHA1	Message	Date
David Sterba	2d50b98189	btrfs-progs: replace start: add option -K/--nodiscard The commands initializing a new device (mkfs, device add) do discard by default, while this is missing from replace start. For parity add the options with same name and semantics. Issue: #390 Signed-off-by: David Sterba <dsterba@suse.com>	2021-10-06 16:49:35 +02:00
Qu Wenruo	60651ad9da	btrfs-progs: introduce OPEN_CTREE_ALLOW_TRANSID_MISMATCH flag [BUG] There is a report that, btrfstune can even work while the fs has transid mismatch problems. $ btrfstune -f -u /dev/sdb1 Current fsid: b2b5ae8d-4c49-45f0-b42e-46fe7dcfcb07 New fsid: b2b5ae8d-4c49-45f0-b42e-46fe7dcfcb07 Set superblock flag CHANGING_FSID Change fsid in extents parent transid verify failed on 792854528 wanted 20103 found 20091 parent transid verify failed on 792854528 wanted 20103 found 20091 parent transid verify failed on 792854528 wanted 20103 found 20091 Ignoring transid failure parent transid verify failed on 792870912 wanted 20103 found 20091 parent transid verify failed on 792870912 wanted 20103 found 20091 parent transid verify failed on 792870912 wanted 20103 found 20091 Ignoring transid failure parent transid verify failed on 792887296 wanted 20103 found 20091 parent transid verify failed on 792887296 wanted 20103 found 20091 parent transid verify failed on 792887296 wanted 20103 found 20091 Ignoring transid failure ERROR: child eb corrupted: parent bytenr=38010880 item=69 parent level=1 child level=1 ERROR: failed to change UUID of metadata: -5 ERROR: btrfstune failed This leaves a corrupted fs even more corrupted, and due to the extra CHANGING_FSID flag, btrfs check will not even try to run on it: Opening filesystem to check... ERROR: Filesystem UUID change in progress ERROR: cannot open file system [CAUSE] Unlike kernel, btrfs-progs has a less strict check on transid mismatch. In read_tree_block() we will fall back to use the tree block even its transid mismatch if we can't find any better copy. However not all commands in btrfs-progs needs this feature, only btrfs-check (which may fix the problem) and btrfs-restore (it just tries to ignore any problems) really utilize this feature. [FIX] Introduce a new open ctree flag, OPEN_CTREE_ALLOW_TRANSID_MISMATCH, to be explicit about whether we really want to ignore transid error. Currently only btrfs-check and btrfs-restore will utilize this new flag. Also add btrfs-image to allow opening such fs with transid error. Link: https://www.reddit.com/r/btrfs/comments/pivpqk/failure_during_btrfstune_u/ Signed-off-by: Qu Wenruo <wqu@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2021-09-20 12:17:29 +02:00
David Sterba	61e00694e7	btrfs-progs: fix defrag -c option parsing The refactoring f3a132fa1b8c ("btrfs-progs: factor out compression type name parsing to common utils") caused a bug with parsing option -c with defrag: # btrfs fi defrag -v -czstd file ERROR: unknown compression type: zstd # btrfs fi defrag -v -clzo file ERROR: unknown compression type: lzo # btrfs fi defrag -v -czlib file ERROR: unknown compression type: zlib Fix it by properly checking the value representing unknown compression algorithm. Issue: #403 Signed-off-by: David Sterba <dsterba@suse.com>	2021-09-18 17:48:32 +02:00
David Sterba	419cb3011c	btrfs-progs: open code btrfs_list_get_path_rootid The function btrfs_list_get_path_rootid is exported to libbtrfs so it needs to stay, but we can inline the implementation. Signed-off-by: David Sterba <dsterba@suse.com>	2021-09-08 16:35:47 +02:00
David Sterba	50292bc009	btrfs-progs: merge props.c to cmds/property.c The property definitions and handlers are for the command line processing, so merge it with the main source file. Signed-off-by: David Sterba <dsterba@suse.com>	2021-09-07 19:41:24 +02:00
David Sterba	022a560033	btrfs-progs: move props.h to cmds/ This is included only by the command line handler, move it there. Signed-off-by: David Sterba <dsterba@suse.com>	2021-09-07 19:33:24 +02:00
David Sterba	e86425242f	btrfs-progs: move send.h to kernel-shared/ The header contains the protocol definitions and is almost exactly the same as the kernel version, move it to the proper directory. Signed-off-by: David Sterba <dsterba@suse.com>	2021-09-07 19:26:46 +02:00
David Sterba	c1953c36dd	btrfs-progs: move all private definitions to cmds/qgroup.c Move everything related to the output formatting and filtering out of qgroup.h and leave only the structures used by the public API. Signed-off-by: David Sterba <dsterba@suse.com>	2021-09-07 19:21:14 +02:00
David Sterba	f67e5da2fa	btrfs-progs: remove prefix from all static qgroup helpers The exported functions provided by qgroups have been changed, now remove the prefix from the local helpers. Signed-off-by: David Sterba <dsterba@suse.com>	2021-09-07 19:15:55 +02:00
David Sterba	49671bf29e	btrfs-progs: add prefixes to exported qgroup helpers Signed-off-by: David Sterba <dsterba@suse.com>	2021-09-07 19:09:33 +02:00
David Sterba	353b55a5cf	btrfs-progs: unexport local qgroup helpers After merging the files, many functions can be made static, leaving only a few helpers that are used by subvolume. Signed-off-by: David Sterba <dsterba@suse.com>	2021-09-07 19:07:54 +02:00
David Sterba	ae56c6a071	btrfs-progs: merge qgroup.c into cmds/qgroup.c The contents of top level qgroups.c is only for command line output and filtering, we already have cmds/qgroup.c for that so merge the files. Signed-off-by: David Sterba <dsterba@suse.com>	2021-09-07 18:59:43 +02:00
David Sterba	6e0e0d467f	btrfs-progs: move qgroup.h to cmds/ There are declarations that are namely for the command line out put, filters and formatting. Move it to cmds/. Signed-off-by: David Sterba <dsterba@suse.com>	2021-09-07 18:45:59 +02:00
David Sterba	9be1b1c442	btrfs-progs: qgroup create: accept only valid qgroupid Many qgroup commands accept the level/id format and also a path to subvolume, the qgroup id is derived from that. This does not make sense for the create command as we can't create the 0/subvolid qgroup (thus can't be derived from the path), only the higher levels. Signed-off-by: David Sterba <dsterba@suse.com>	2021-09-07 14:20:42 +02:00
David Sterba	ebf500b43d	btrfs-progs: rename parse_qgroupid This helper can parse a qgroupid or a path, so rename it accordingly, so a plain qgroupid parsing can be factored out as a standalone helper. Signed-off-by: David Sterba <dsterba@suse.com>	2021-09-07 14:20:42 +02:00
David Sterba	c3ee6a8a09	btrfs-progs: unify GPL header comments Add the GPL v2 header to files where it was missing and is not from an external source, update to the most recent version with the address. Signed-off-by: David Sterba <dsterba@suse.com>	2021-09-07 13:58:44 +02:00
David Sterba	f3a132fa1b	btrfs-progs: factor out compression type name parsing to common utils Signed-off-by: David Sterba <dsterba@suse.com>	2021-09-07 13:58:44 +02:00
David Sterba	31df7dc295	btrfs-progs: factor out profile parsing to common utils There are some duplicate parsers of the profile names, factor out the one from balance to the common code. Signed-off-by: David Sterba <dsterba@suse.com>	2021-09-07 13:58:44 +02:00
David Sterba	6b93a7336c	btrfs-progs: move number and range parsing helpers to parse-utils.c Some of the parsers in cmds/balance.c are generic enough for the common part. Signed-off-by: David Sterba <dsterba@suse.com>	2021-09-07 13:58:43 +02:00
David Sterba	af56460de8	btrfs-progs: split parsing helpers from utils.c There are various parsing helpers scattered everywhere, unify them to one file and start with helpers already in utils.c. Signed-off-by: David Sterba <dsterba@suse.com>	2021-09-06 17:15:51 +02:00
David Sterba	d15b7b0d58	btrfs-progs: parse profile name from the raid table We can use the raid table to match profile names, additionally make the test case insensitive. The single profile is not represented as a bit and must be set manually for now. Signed-off-by: David Sterba <dsterba@suse.com>	2021-09-06 17:15:13 +02:00
David Sterba	7572839a74	btrfs-progs: add and use bit masks for RAID1 and RAID56 profiles Many test conditions can be simplified in case they check all the related profiles. Signed-off-by: David Sterba <dsterba@suse.com>	2021-09-06 16:36:18 +02:00
David Sterba	33543eb2b1	btrfs-progs: remove stale command declarations The declarations do not correspond to any command descriptors as they have been moved to other command groups. Signed-off-by: David Sterba <dsterba@suse.com>	2021-08-20 14:24:55 +02:00
Qu Wenruo	9d5d3de01c	btrfs-progs: subvol delete: try to delete subvolume by id when its path can't be resolved There is a recent report of ghost subvolumes where such subvolumes has no ROOT_REF/BACKREF, and 0 root ref. But without an orphan item, thus kernel won't queue them for cleanup. Such ghost subvolumes are just here to take up space, and no way to delete them except by btrfs check, which will try to fix the problem by adding orphan item. There is a kernel patch submitted to allow btrfs to detect such ghost subvolumes and queue them for cleanup. But btrfs-progs will not continue to call the ioctl if it can't find the full subvolume path. Thus this patch will loose the restriction by allowing btrfs-progs to continue to call the ioctl even if it can't grab the subvolume path. Signed-off-by: Qu Wenruo <wqu@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2021-08-20 14:24:55 +02:00
Sidong Yang	d302bf5b34	btrfs-progs: cmds: fix build on musl when using NAME_MAX There is some code that using NAME_MAX but it doesn't include header that is defined. This patch adds a line that includes linux/limits.h which defines NAME_MAX. Issue: #386 Issue: #385 Reviewed-by: Qu Wenruo <wqu@suse.com> Signed-off-by: Sidong Yang <realwakka@gmail.com> Signed-off-by: David Sterba <dsterba@suse.com>	2021-07-26 13:28:22 +02:00
David Sterba	96aedd1121	btrfs-progs: fi usage: swap order of Used and zoned information The Used and Free should be together, while all the device information is in the first section. Example: Overall: Device size: 128.00GiB Device allocated: 24.00GiB Device unallocated: 104.00GiB Device missing: 0.00B Device zone unusable: 5.13MiB Device zone size: 256.00MiB Used: 213.33MiB Free (estimated): 111.79GiB (min: 111.79GiB) Free (statfs, df): 111.79GiB Data ratio: 1.00 Metadata ratio: 1.00 Global reserve: 25.58MiB (used: 16.00KiB) Multiple profiles: no Signed-off-by: David Sterba <dsterba@suse.com>	2021-07-02 17:27:53 +02:00
David Sterba	a410805acc	btrfs-progs: fi usage: print zone size in the overview Read device size and print it in the overall overview in zoned mode. The total unusable size is there so the zone size is complementing it. It's read from the first device assuming that kernel mandates that all devices have the same zone size. Example: Overall: Device size: 128.00GiB Device allocated: 24.00GiB Device unallocated: 104.00GiB Device missing: 0.00B Used: 213.33MiB Device zone unusable: 5.13MiB Device zone size: 256.00MiB Free (estimated): 111.79GiB (min: 111.79GiB) Free (statfs, df): 111.79GiB Data ratio: 1.00 Metadata ratio: 1.00 Global reserve: 25.58MiB (used: 16.00KiB) Multiple profiles: no Signed-off-by: David Sterba <dsterba@suse.com>	2021-07-02 17:27:53 +02:00
Sidong Yang	4693e82261	btrfs-progs: device usage: print number of stripes in relevant profiles Print number of stripes for striped profiles in device usage commands. It helps to see profiles easily. The output is like below. /dev/vdc, ID: 1 Device size: 1.00GiB Device slack: 0.00B Data,RAID0/2: 912.62MiB Data,RAID0/3: 912.62MiB Metadata,RAID1: 102.38MiB System,RAID1: 8.00MiB Unallocated: 1.00MiB Multiple lines can appear in case a balance conversion process was interrupted or if there's been a new device added and new data written to the full stripe. Issue: #372 Signed-off-by: Sidong Yang <realwakka@gmail.com> Signed-off-by: David Sterba <dsterba@suse.com>	2021-06-23 13:47:35 +02:00
David Sterba	d51075f05b	btrfs-progs: restore: group help options Group the options in the help text by the purpose for clarity. Signed-off-by: David Sterba <dsterba@suse.com>	2021-06-19 22:07:49 +02:00
David Sterba	9a83a0b5c0	btrfs-progs: restore: remove loop check during directory scan There's another loop protection during scan of directory items. This can fire under invalid conditions, ie. when there's no real endless loop. The layout of b-tree items could trigger that and has been observed in practice. This prevents automated restoration as it requires user attention. The number of loops is 1024, unjustified and without explanation. Errors during traversing the leaves are checked so most errors would be caught. A real loop in the directory items would require some crafting and would not happen on a normal filesystem. Issue: #59 Issue: #164 Issue: #237 Signed-off-by: David Sterba <dsterba@suse.com>	2021-06-19 22:07:49 +02:00
David Sterba	4258b7161f	btrfs-progs: restore: remove loop check during file copy There's some kind of looping protection during copying file extents, mostly likely to avoid endless loops on severely damaged filesystems. This has been bothering users and makes restoring hard to automate as it requires user attention to press 'y' or 'a'. This has not been well documented either. The number of loops is 1024 which looks arbitrary and hard to justify. This eg. means that a file with many fragments hits the interactive question more than once. There are other checks when iterating the leaves that would catch corruptions or other errors, so the looping would happen in some rare and rather artificial case when some kind of loop exists inside the extent items. This is not easily possible if possible at all as the items do not directly reference other. In case there's some genuine error found that would require a looping protection, we'll add it or extend the checks to identify the loop. Issue: #59 Issue: #164 Issue: #237 Signed-off-by: David Sterba <dsterba@suse.com>	2021-06-19 22:07:49 +02:00
David Sterba	afe055f438	btrfs-progs: restore: convert to error message helpers - convert to error() helpers, remove trailing \n - switch messages printing %d as errno to %m - rewording Signed-off-by: David Sterba <dsterba@suse.com>	2021-06-19 22:07:49 +02:00
David Sterba	b1f374dd1d	btrfs-progs: switch %Lu to %llu format The %Lu format is not standard and we use %llu everywhere else, so switch the remaining cases. Signed-off-by: David Sterba <dsterba@suse.com>	2021-06-19 22:07:49 +02:00
David Sterba	9f6c055e38	btrfs-progs: dump-tree: add options to dump checksums Add new options to dumps checksums in node headers and in the checksum items: $ btrfs inspect dump-tree --csum-headers image root tree leaf 471515136 items 19 free space 12186 generation 15 owner ROOT_TREE leaf 471515136 flags 0x1(WRITTEN) backref revision 1 csum 0x756b2d54 fs uuid df0348df-5773-47dd-81e9-a18221461239 For nodes/leaves it's appended on the 2nd line of the header. Checksum items are stored in leaves as EXTENT_CSUM key type, with offset value as the logical offset starting. As the array would be hard to parse or match, each offset value is printed with the checksum. For crc32c it's 4 values on a line, for xxhash it's 2 and for the long 256bit checksums it's one checksum per line. $ btrfs inspect dump-tree --csum-items image leaf 5423104 items 1 free space 30 generation 6 owner CSUM_TREE leaf 5423104 flags 0x1(WRITTEN) backref revision 1 fs uuid bd7c981e-16ff-4081-a734-3ef5d50cafc1 chunk uuid 13f4c76c-7845-4984-88ed-f01b52e05cf8 item 0 key (EXTENT_CSUM EXTENT_CSUM 22020096) itemoff 55 itemsize 16228 range start 22020096 end 38637568 length 16617472 [22020096] 0x8941f998 [22024192] 0x8941f998 [22028288] 0x8941f998 [22032384] 0x8941f998 [22036480] 0x8941f998 [22040576] 0x8941f998 [22044672] 0x8941f998 [22048768] 0x8941f998 ... $ btrfs inspect dump-tree --csum-items image leaf 5718016 items 1 free space 7746 generation 6 owner CSUM_TREE leaf 5718016 flags 0x1(WRITTEN) backref revision 1 fs uuid f453a5b4-8b4a-4fbf-90a2-2925e4fe2335 chunk uuid eb1da63b-248b-44c2-82da-71b2564bf50e item 0 key (EXTENT_CSUM EXTENT_CSUM 52387840) itemoff 7771 itemsize 8512 range start 52387840 end 53477376 length 1089536 [52387840] 0x686ede9288c391e7e05026e56f2f91bfd879987a040ea98445dabc76f55b8e5f [52391936] 0x686ede9288c391e7e05026e56f2f91bfd879987a040ea98445dabc76f55b8e5f ... The options are not on by default, the header checksum is not important for the structures. Data checksums can be quite big so that would make the dump long and without any actual data to match against. Signed-off-by: David Sterba <dsterba@suse.com>	2021-06-19 22:07:49 +02:00
David Sterba	72d710637c	btrfs-progs: print-tree: convert mode to bitmask Replace follow and traverse by one parameter that takes bits to affect the behaviour. This allows to extend btrfs_print_tree output with more modes from one place. Signed-off-by: David Sterba <dsterba@suse.com>	2021-06-09 20:31:49 +02:00
David Sterba	f123c28412	btrfs-progs: fi resize: add support for cancel Recognize special resize amount 'cancel' for resize operation. This will request kernel to stop running any resize operation (most likely shrinking resize). This needs support in kernel, otherwise this will fail due to another exclusive operation running (though could be the same one). The command returns after kernel finishes any work that got interrupted, but this should not take long in kernels 5.10+ that allow interruptible relocation. The waiting inside kernel is interruptible so this command (and the waiting stage) can be interrupted. The resize operation could relocate block groups but the nominal filesystem size will be restored when resize won't finish. It's recommended to review the filesystem state. Note: in kernels 5.10+ sending a fatal signal (TERM, KILL, Ctrl-C) to the process running the resize will cancel it too. Example: $ btrfs fi resize -10G /mnt ... $ btrfs fi resize cancel /mnt Signed-off-by: David Sterba <dsterba@suse.com>	2021-05-20 22:43:29 +02:00
David Sterba	71d68ab8b2	btrfs-progs: device remove: add support for cancel Recognize special name 'cancel' for device deletion, that will request kernel to stop running device deletion. This needs support in kernel, otherwise this will fail due to another exclusive operation running (though could be the same one). The command returns after kernel finishes any work that got interrupted, but this should not take long in kernels 5.10+ that allow interruptible relocation. The waiting inside kernel is interruptible so this command (and the waiting stage) can be interrupted. The device size is restored when deletion does not finish but it's recommended to review the filesystem state. Note: in kernels 5.10+ sending a fatal signal (TERM, KILL, Ctrl-C) to the process running the device deletion will cancel it too. Example: $ btrfs device delete /dev/sdx /mnt ... $ btrfs device delete cancel /mnt Signed-off-by: David Sterba <dsterba@suse.com>	2021-05-20 22:14:18 +02:00
Anand Jain	efe4844e01	btrfs-progs: fix inspect-internal --help incomplete sentence btrfs inspect-internal --help shows incomplete sentence. As shown below: btrfs inspect-internal --help <snip> btrfs inspect-internal min-dev-size [options] <path> Get the minimum size the device can be shrunk to. The btrfs inspect-internal dump-tree [options] <device> [<device> ..] <snip> The short help string can be multi-line but must be in one string. This patch fixes it. Signed-off-by: Anand Jain <anand.jain@oracle.com> Signed-off-by: David Sterba <dsterba@suse.com>	2021-05-17 14:26:32 +02:00
David Sterba	780355d225	btrfs-progs: fi usage: print zone unusable in the overview Print the total zone_unusable size in the summary for 'fi usage' for a filesystem in zoned mode. It's a sum of all the zone_unusable values from 'fi df'. Per-device stats are not implemented and would need more complicated calculations from raw data, kernel does not export that (but it could). As of 5.12, the zone_unusable is stored only in memory so we'd have to map raw block device zones to the block groups and the live extents in the associated block groups to get the exact numbers. Example: # btrfs fi usage /mnt Overall: Device size: 2.00GiB Device allocated: 768.00MiB Device unallocated: 1.25GiB Device missing: 0.00B Device zone unusable: 320.00KiB Used: 128.00KiB Free (estimated): 1.50GiB (min: 1.50GiB) Free (statfs, df): 1.50GiB Data ratio: 1.00 Metadata ratio: 1.00 Global reserve: 3.25MiB (used: 32.00KiB) Multiple profiles: no Data,single: Size:256.00MiB, Used:0.00B (0.00%) /dev/nullb0 256.00MiB Metadata,single: Size:256.00MiB, Used:112.00KiB (0.04%) /dev/nullb0 256.00MiB System,single: Size:256.00MiB, Used:16.00KiB (0.01%) /dev/nullb0 256.00MiB Unallocated: /dev/nullb0 1.25GiB # btrfs fi df Data, single: total=256.00MiB, used=0.00B, zone_unusable=0.00B System, single: total=256.00MiB, used=16.00KiB, zone_unusable=160.00KiB Metadata, single: total=256.00MiB, used=112.00KiB, zone_unusable=160.00KiB GlobalReserve, single: total=3.25MiB, used=32.00KiB Signed-off-by: David Sterba <dsterba@suse.com>	2021-05-08 00:58:50 +02:00
David Sterba	a2495d1e6e	btrfs-progs: export get_zone_unusable and move to utils.c Getting the per bg type zone unusable space will be used in other size reports like 'fi us', so export it to the device utils. Signed-off-by: David Sterba <dsterba@suse.com>	2021-05-08 00:58:50 +02:00
David Sterba	e295a8ad4c	btrfs-progs: fi df: report zone_unusable on zoned filesystem In the zoned mode there are parts of chunks that become unusable once they get COWed and the zone must be reclaimed and reset to make the space usable again. Provide a way to show the total size per block group type in fi df: $ btrfs fi df . Data, single: total=1.00GiB, used=257.51MiB, zone_unusable=238.43MiB System, single: total=256.00MiB, used=16.00KiB, zone_unusable=224.00KiB Metadata, single: total=256.00MiB, used=816.00KiB, zone_unusable=8.61MiB GlobalReserve, single: total=3.25MiB, used=0.00B This will not be shown on non-zoned filesystems. Signed-off-by: David Sterba <dsterba@suse.com>	2021-05-08 00:58:50 +02:00
David Sterba	d591cd7c08	btrfs-progs: split unit related helpers from utils.c Signed-off-by: David Sterba <dsterba@suse.com>	2021-05-06 16:41:47 +02:00
David Sterba	51f15d393a	btrfs-progs: build: remove incomplete android support There is a support to build on android but it's incomplete and there's little interest to fix it. To reinstate we'll need: * fix remaining issues from lore.kernel.org/linux-btrfs/20170802185111.187922-1-filipbystricky@google.com * find CI host with Android support to verify build, either local eg. in docker or in a hosted environment * switch the make-based build to 'soong' (source.android.com/setup/build) Issue: #357 Signed-off-by: David Sterba <dsterba@suse.com>	2021-05-06 16:41:47 +02:00
David Sterba	7fa07e2abb	btrfs-progs: split open/close helpers from utils.c There's a group of functions that are related to opening filesystem in various modes, this can be moved to a separate file. Signed-off-by: David Sterba <dsterba@suse.com>	2021-05-06 16:41:47 +02:00
David Sterba	b19a603d62	btrfs-progs: remove unnecessary linux/*.h includes Decrease dependency on system headers, remove where they're not needed or became stale after code moved. The path-utils.h encapsulate path operations so include linux/limits.h here, that's where PATH_MAX is defined. Signed-off-by: David Sterba <dsterba@suse.com>	2021-05-06 16:41:47 +02:00
David Sterba	51c0ece9f6	btrfs-progs: add prefix to get_partition_size This is a public helper for devices, add the prefix to make it clear. Signed-off-by: David Sterba <dsterba@suse.com>	2021-05-06 16:41:46 +02:00
Naohiro Aota	86f9eb3906	btrfs-progs: zoned: introduce zoned support for device replace This patch checks if the target file system is flagged as ZONED. If it is, the device to be added is flagged PREP_DEVICE_ZONED. Also add checks to prevent mixing non-zoned devices and zoned devices. Signed-off-by: Naohiro Aota <naohiro.aota@wdc.com> Signed-off-by: David Sterba <dsterba@suse.com>	2021-05-06 16:41:46 +02:00
Naohiro Aota	568f9ed26f	btrfs-progs: device add: support adding zoned device Check if the target file system is flagged as ZONED. If it is, the device to be added is flagged PREP_DEVICE_ZONED. Also add checks to prevent mixing non-zoned devices and zoned devices. Signed-off-by: Naohiro Aota <naohiro.aota@wdc.com> Signed-off-by: David Sterba <dsterba@suse.com>	2021-05-06 16:41:46 +02:00
Naohiro Aota	8ef9313cf2	btrfs-progs: zoned: implement log-structured superblock Superblock (and its copies) is the only data structure in btrfs which has a fixed location on a device. Since we cannot overwrite in a sequential write required zone, we cannot place superblock in the zone. One easy solution is limiting superblock and copies to be placed only in conventional zones. However, this method has two downsides: one is reduced number of superblock copies. The location of the second copy of superblock is 256GB, which is in a sequential write required zone on typical devices in the market today. So, the number of superblock and copies is limited to be two. Second downside is that we cannot support devices which have no conventional zones at all. To solve these two problems, we employ superblock log writing. It uses two adjacent zones as a circular buffer to write updated superblocks. Once the first zone is filled up, start writing into the second one. Then, when both zones are filled up and before starting to write to the first zone again, reset the first zone. We can determine the position of the latest superblock by reading write pointer information from a device. One corner case is when both zones are full. For this situation, we read out the last superblock of each zone, and compare them to determine which zone is older. The following zones are reserved as the circular buffer on ZONED btrfs. - primary superblock: offset 0B (and the following zone) - first copy: offset 512G (and the following zone) - Second copy: offset 4T (4096G, and the following zone) If these reserved zones are conventional, superblock is written fixed at the start of the zone without logging. Currently, superblock reading/writing is done by pread/pwrite. This commit replace the call sites with sbread/sbwrite to wrap the functions. For zoned btrfs, btrfs_sb_io which is called from sbread/sbwrite reverses the IO position back to a mirror number, maps the mirror number into the superblock logging position, and do the IO. Reviewed-by: Johannes Thumshirn <johannes.thumshirn@wdc.com> Signed-off-by: Naohiro Aota <naohiro.aota@wdc.com> Signed-off-by: David Sterba <dsterba@suse.com>	2021-05-06 16:41:45 +02:00
Naohiro Aota	acdd22ab68	btrfs-progs: provide fs_info from btrfs_device Likewise in the kernel code, provide fs_info access from struct btrfs_device. This will help to unify the code between the kernel and the userland. Since fs_info can be NULL at the time of btrfs_add_to_fsid(), let's use btrfs_open_devices() to set fs_info to the devices. Signed-off-by: Naohiro Aota <naohiro.aota@wdc.com> Signed-off-by: David Sterba <dsterba@suse.com>	2021-05-06 16:41:45 +02:00

1 2 3 4 5

201 Commits