Diff - 539d8264093560b917ee3afe4c7f74e5da09d6a5^! - linux

commit	539d8264093560b917ee3afe4c7f74e5da09d6a5
author	Sunil Mushran <sunil.mushran@oracle.com>	Mon Jul 14 17:31:10 2008 -0700
committer	Mark Fasheh <mfasheh@suse.com>	Thu Jul 31 16:21:14 2008 -0700
tree	1fce83387272c0b2d61bd945769f4984aa5e79ce
parent	c69991aac71a8beb57c11d651c7fd4b24c32aa8b

[PATCH 2/2] ocfs2: Fix race between mount and recovery

As the fs recovery is asynchronous, there is a small chance that another
node can mount (and thus recover) the slot before the recovery thread
gets to it.

If this happens, the recovery thread will block indefinitely on the
journal/slot lock as that lock will be held for the duration of the mount
(by design) by the node assigned to that slot.

The solution implemented is to keep track of the journal replays using
a recovery generation in the journal inode, which will be incremented by the
thread replaying that journal. The recovery thread, before attempting the
blocking lock on the journal/slot lock, will compare the generation on disk
with what it has cached and skip recovery if it does not match.
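
The generation check described above can be sketched in userspace C as
follows. This is a minimal illustration, not the kernel code from this
patch: the struct names, MAX_SLOTS_SKETCH, and all three helper functions
are hypothetical stand-ins, and the on-disk journal inode is simulated by
a plain struct. Only the field name slot_recovery_generations comes from
the diff.

```c
#include <assert.h>
#include <stdbool.h>
#include <stdint.h>

#define MAX_SLOTS_SKETCH 4	/* hypothetical slot count for this sketch */

/* Hypothetical stand-in for the on-disk journal inode of one slot. */
struct journal_inode_sketch {
	uint32_t recovery_generation;
};

/* Hypothetical stand-in for the per-mount superblock state. */
struct ocfs2_super_sketch {
	/* Generation last seen for each slot, cached in memory. */
	uint32_t slot_recovery_generations[MAX_SLOTS_SKETCH];
};

/* Remember the generation currently on disk for a slot. */
void cache_generation(struct ocfs2_super_sketch *osb, unsigned int slot,
		      const struct journal_inode_sketch *di)
{
	assert(slot < MAX_SLOTS_SKETCH);
	osb->slot_recovery_generations[slot] = di->recovery_generation;
}

/* Whoever replays the journal bumps the generation on disk. */
void replay_journal(struct journal_inode_sketch *di)
{
	di->recovery_generation++;
}

/*
 * Called before taking the blocking journal/slot lock. Returns true
 * only if the on-disk generation still matches the cached one; a
 * mismatch means another node already replayed this journal, so the
 * recovery thread should skip the slot instead of blocking.
 */
bool recovery_still_needed(const struct ocfs2_super_sketch *osb,
			   unsigned int slot,
			   const struct journal_inode_sketch *di)
{
	assert(slot < MAX_SLOTS_SKETCH);
	return osb->slot_recovery_generations[slot] == di->recovery_generation;
}
```

In this sketch the comparison happens without holding the slot lock, which
is the point of the scheme: the recovery thread can decide to skip a slot
before it ever issues the blocking lock request.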

This bug appears to have been inadvertently introduced during the mount/umount
vote removal by mainline commit 34d024f84345807bf44163fac84e921513dde323. In the
mount voting scheme, the messaging would indirectly indicate that the slot
was being recovered.

Signed-off-by: Sunil Mushran <sunil.mushran@oracle.com>
Signed-off-by: Mark Fasheh <mfasheh@suse.com>

diff --git a/fs/ocfs2/ocfs2.h b/fs/ocfs2/ocfs2.h
index 1cb814b..7f625f2 100644
--- a/fs/ocfs2/ocfs2.h
+++ b/fs/ocfs2/ocfs2.h
@@ -204,6 +204,8 @@
 
 	struct ocfs2_slot_info *slot_info;
 
+	u32 *slot_recovery_generations;
+
 	spinlock_t node_map_lock;
 
 	u64 root_blkno;