[lvc-project] [PATCH 2/2] ocfs2: validate cl_bpc in allocator inodes to prevent divide-by-zero
Heming Zhao
heming.zhao at suse.com
Thu Oct 30 05:00:22 MSK 2025
On Wed, Oct 29, 2025 at 05:25:03PM +0300, Dmitry Antipov wrote:
> On 10/29/25 12:53 PM, Joseph Qi wrote:
>
> > On 2025/10/29 13:53, Dmitry Antipov wrote:
> > > From: Deepanshu Kartikey <kartikey406 at gmail.com>
> > >
> > > The chain allocator field cl_bpc (blocks per cluster) is read from disk
> > > and used in division operations without validation. A corrupted filesystem
> > > image with cl_bpc=0 causes a divide-by-zero crash in the kernel:
> > >
> > > divide error: 0000 [#1] PREEMPT SMP KASAN
> > > RIP: 0010:ocfs2_bg_discontig_add_extent fs/ocfs2/suballoc.c:335 [inline]
> > > RIP: 0010:ocfs2_block_group_fill+0x5bd/0xa70 fs/ocfs2/suballoc.c:386
> > > Call Trace:
> > > ocfs2_block_group_alloc+0x7e9/0x1330 fs/ocfs2/suballoc.c:703
> > > ocfs2_reserve_suballoc_bits+0x20a6/0x4640 fs/ocfs2/suballoc.c:834
> > > ocfs2_reserve_new_inode+0x4f4/0xcc0 fs/ocfs2/suballoc.c:1074
> > > ocfs2_mknod+0x83c/0x2050 fs/ocfs2/namei.c:306
> > >
> > > This patch adds validation in ocfs2_validate_inode_block() to ensure cl_bpc
> > > matches the expected value calculated from the superblock's cluster size
> > > and block size for chain allocator inodes (identified by OCFS2_CHAIN_FL).
> > >
> > > Moving the validation to inode validation time (rather than allocation time)
> > > has several benefits:
> > > - Validates once when the inode is read, rather than on every allocation
> > > - Protects all code paths that use cl_bpc (allocation, resize, etc.)
> > > - Follows the existing pattern of inode validation in OCFS2
> > > - Centralizes validation logic
> > >
> > > The validation catches both:
> > > - Zero values that cause divide-by-zero crashes
> > > - Non-zero but incorrect values indicating filesystem corruption or
> > > mismatched filesystem geometry
> > >
> > > With this fix, mounting a corrupted filesystem produces:
> > >
> > > OCFS2: ERROR (device loop0): ocfs2_validate_inode_block: Inode 74
> > > has corrupted cl_bpc: ondisk=0 expected=16
> > >
> > > instead of a kernel crash.
> > >
> > > Link: https://lore.kernel.org/ocfs2-devel/20251026132625.12348-1-kartikey406@gmail.com/T/#u [v1]
> > > Link: https://lore.kernel.org/all/20251027124131.10002-1-kartikey406@gmail.com/T/ [v2]
> > > Reported-by: syzbot+fd8af97c7227fe605d95 at syzkaller.appspotmail.com
> > > Closes: https://syzkaller.appspot.com/bug?extid=fd8af97c7227fe605d95
> > > Tested-by: syzbot+fd8af97c7227fe605d95 at syzkaller.appspotmail.com
> > > Suggested-by: Joseph Qi <joseph.qi at linux.alibaba.com>
> > > Signed-off-by: Deepanshu Kartikey <kartikey406 at gmail.com>
> > > [dmantipov: combine into the series and tweak
> > > the message to fit the commonly used style]
> > > Signed-off-by: Dmitry Antipov <dmantipov at yandex.ru>
> >
> > Reviewed-by: Joseph Qi <joseph.qi at linux.alibaba.com>
> > > ---
> > > fs/ocfs2/inode.c | 8 ++++++++
> > > 1 file changed, 8 insertions(+)
> > >
> > > diff --git a/fs/ocfs2/inode.c b/fs/ocfs2/inode.c
> > > index 1b6bdd9d7755..efb930da0920 100644
> > > --- a/fs/ocfs2/inode.c
> > > +++ b/fs/ocfs2/inode.c
The cl->cl_bpc field represents "bits per cluster". This means:
- for a 4k (min size) cluster block, cl_bpc is 12
- for a 8k, cl_bpc is 13
- for a 16k, cl_bpc is 14
The OCFS2_SB(sb)->s_clustersize_bits is the same value as cl_bpc.
Its values are 4k:12, 8k:13 and 16k:14
> > > @@ -1505,6 +1505,8 @@ int ocfs2_validate_inode_block(struct super_block *sb,
> > > if (le32_to_cpu(di->i_flags) & OCFS2_CHAIN_FL) {
> > > struct ocfs2_chain_list *cl = &di->id2.i_chain;
> > > + u16 bpc = 1 << (OCFS2_SB(sb)->s_clustersize_bits -
> > > + sb->s_blocksize_bits);
The meaning of above line is "bits shift from block to cluster".
Not "bits per cluster"
> > > if (le16_to_cpu(cl->cl_count) != ocfs2_chain_recs_per_inode(sb)) {
> > > rc = ocfs2_error(sb, "Invalid dinode %llu: chain list count %u\n",
> > > @@ -1518,6 +1520,12 @@ int ocfs2_validate_inode_block(struct super_block *sb,
> > > le16_to_cpu(cl->cl_next_free_rec));
> > > goto bail;
> > > }
> > > + if (le16_to_cpu(cl->cl_bpc) != bpc) {
> > > + rc = ocfs2_error(sb, "Invalid dinode %llu: bits per cluster %u\n",
> > > + (unsigned long long)bh->b_blocknr,
> > > + le16_to_cpu(cl->cl_bpc));
> > > + goto bail;
> > > + }
> > > }
> > > rc = 0;
>
> Oops. This seems prevents from mounting filesystems with any block size except 4k
> (mkfs.ocfs2 -b 512, mkfs.ocfs2 -b 1024, mkfs.ocfs2 -b 2048) with the following message:
>
> OCFS2: ERROR (device sdb): int ocfs2_validate_inode_block(struct super_block *, struct buffer_head *): Invalid dinode 23: bits per cluster 1
>
> Dmitry
>
The value of cl_bpc is an intentionally crafted value designed by syzbot
under bsize:4k csice:4k, the patch code:
"1 << (OCFS2_SB(sb)->s_clustersize_bits - sb->s_blocksize_bits)"
=> "1 << (14 - 14)" => "1 << 0" => 1 (just same as cl->cl_bpc)
The values are the same, but it's logically meaningless
Based purely on code logic, the original patch code:
if (le16_to_cpu(cl->cl_bpc) != bpc)
should be changed to:
if (le16_to_cpu(cl->cl_bpc) != OCFS2_SB(sb)->s_clustersize_bits)
(and remove the line: u16 bpc = ...)
- Heming
More information about the lvc-project
mailing list