Re: RAID5: failing an active component during spare rebuild -arrays hangs
am 28.06.2011 04:29:21 von NeilBrownOn Sun, 26 Jun 2011 21:13:17 +0300 Alexander Lyakas
wrote:
> Hello Neil,
> thank you for your response. Meanwhile I have moved to stock ubuntu
> natty 11.04, but it still happens. I have a simple script that
> reproduces the issue for me in less than 1 minute.
> System details:
> Linux ubuntu 2.6.38-8-server #42-Ubuntu SMP Mon Apr 11 03:49:04 UTC
> 2011 x86_64 x86_64 x86_64 GNU/Linux
>
> Here is the script:
> ##################################
> #!/bin/bash
>
> while true
> do
> mdadm --create /dev/md1123 --raid-devices=3 --level=5
> --bitmap=internal --name=1123 --run --auto=md --metadata=1.2
> --homehost=alex --verbose /dev/sda /dev/sdb /dev/sdc
> sleep 6
> mdadm --manage /dev/md1123 --fail /dev/sda
> sleep 1
> if mdadm --stop /dev/md1123
> then
> true
> else
> break
> fi
> done
> #####################################
Thanks for the script. Unfortunately I still cannot reproduce.
I suspect there is some subtle race issue that is heavily dependant on the
particular hardware you have.
It might help if I could get stack traces of the relevant processes. i.e.
md1123_raid5 and md1123_resync.
A previous post contained a trace of _resync, but it wouldn't hurt to get
another one. You can get them by
cat /proc/PROCESS-ID/stack
or possibly
echo w > /proc/sysrq-trigger
then look in the output of 'dmesg'.
Thanks,
NeilBrown
--
To unsubscribe from this list: send the line "unsubscribe linux-raid" in
the body of a message to majordomo@vger.kernel.org
More majordomo info at http://vger.kernel.org/majordomo-info.html