raid1 sector size

am 06.02.2011 17:19:18 von Roberto Spadim

hi guys, there's a way to increase sector size?
instead of 512bytes use 4096bytes?

check:
md0: 512bytes
sda: 4096bytes
sdb: 4096bytes

i want a 512bytes at md0
why? i'm trying to reduce IOPS (more bytes to read = less io / second)
and maybe increase read rate

--
Roberto Spadim
Spadim Technology / SPAEmpresarial
--
To unsubscribe from this list: send the line "unsubscribe linux-raid" in
the body of a message to majordomo@vger.kernel.org
More majordomo info at http://vger.kernel.org/majordomo-info.html

Re: raid1 sector size

am 06.02.2011 17:49:45 von Roberto Spadim

the problem (benchmark) i don't know if it's a cpu intensive use
that's blocking some i/o (try to reduce IOPS on md raid1), or a
problem at read logic (check if disk is ok)

[root@myhost block]# dd if=3D/dev/sda of=3D/dev/zero bs=3D8196
^C80106+0 registros de entrada
80105+0 registros de sa=EDda
656540580 bytes (657 MB) copiados, 5,38142 s, 122 MB/s

[root@myhost block]# dd if=3D/dev/md0 of=3D/dev/zero bs=3D8196
^C52851+0 registros de entrada
52850+0 registros de sa=EDda
433158600 bytes (433 MB) copiados, 10,4623 s, 41,4 MB/s

on iostat -d 1 -k
i see that just sda is running (i'm using the normal unpatched kernel
2.6.37), cpu is: using htop (i coudn't select and copy, maybe sum>100%
but it's a mean value)

using md0:
CPU: 1.3% sy 67.3 ni: 0 si 36% wa: 0%

using sda:
CPU: 1.3% sy 33.6 ni: 0 si 8,4% wa: 50%

maybe cpu don't have time to wait i/o (sda showed wa: 50%)
anyone could help me if i'm undestanding it right?
the best block size for my harddisk is 8196

md0 is in sync

2011/2/6 Roberto Spadim :
> hi guys, there's a way to increase sector size?
> instead of 512bytes use 4096bytes?
>
> check:
> md0: 512bytes
> sda: 4096bytes
> sdb: 4096bytes
>
> i want a 512bytes at md0
> why? i'm trying to reduce IOPS (more bytes to read =3D less io / seco=
nd)
> and maybe increase read rate
>
> --
> Roberto Spadim
> Spadim Technology / SPAEmpresarial
>

--=20
Roberto Spadim
Spadim Technology / SPAEmpresarial
--
To unsubscribe from this list: send the line "unsubscribe linux-raid" i=
n
the body of a message to majordomo@vger.kernel.org
More majordomo info at http://vger.kernel.org/majordomo-info.html

Re: raid1 sector size

am 06.02.2011 19:03:59 von jeromepoulin

On Sun, Feb 6, 2011 at 11:49 AM, Roberto Spadim =
wrote:
> the problem (benchmark) i don't know if it's a cpu intensive use
> that's blocking some i/o (try to reduce IOPS on md raid1), or a
> problem at read logic (check if disk is ok)
>
> [root@myhost block]# dd if=3D/dev/sda of=3D/dev/zero bs=3D8196
> ^C80106+0 registros de entrada
> 80105+0 registros de sa=EDda
> 656540580 bytes (657 MB) copiados, 5,38142 s, 122 MB/s
>
> [root@myhost block]# dd if=3D/dev/md0 of=3D/dev/zero bs=3D8196
> ^C52851+0 registros de entrada
> 52850+0 registros de sa=EDda
> 433158600 bytes (433 MB) copiados, 10,4623 s, 41,4 MB/s
>

You could try with some computer friendly bs=3D value, like 8192.
--
To unsubscribe from this list: send the line "unsubscribe linux-raid" i=
n
the body of a message to majordomo@vger.kernel.org
More majordomo info at http://vger.kernel.org/majordomo-info.html

Re: raid1 sector size

am 06.02.2011 23:30:11 von Stan Hoeppner

Roberto Spadim put forth on 2/6/2011 10:49 AM:

> [root@myhost block]# dd if=/dev/sda of=/dev/zero bs=8196

> [root@myhost block]# dd if=/dev/md0 of=/dev/zero bs=8196

/dev/zero is a read source, not a write target. I'm surprised this doesn't bomb
out. Use /dev/null for output.

--
Stan
--
To unsubscribe from this list: send the line "unsubscribe linux-raid" in
the body of a message to majordomo@vger.kernel.org
More majordomo info at http://vger.kernel.org/majordomo-info.html

Re: raid1 sector size

am 07.02.2011 02:21:49 von Roberto Spadim

=3D] it works heeh =3D]
but the problem is the same, speed is low on /dev/md0 than /dev/sda

2011/2/6 Stan Hoeppner :
> Roberto Spadim put forth on 2/6/2011 10:49 AM:
>
>> [root@myhost block]# dd if=3D/dev/sda of=3D/dev/zero bs=3D8196
>
>> [root@myhost block]# dd if=3D/dev/md0 of=3D/dev/zero bs=3D8196
>
> /dev/zero is a read source, not a write target. =A0I'm surprised this=
doesn't bomb
> out. =A0Use /dev/null for output.
>
> --
> Stan
> --
> To unsubscribe from this list: send the line "unsubscribe linux-raid"=
in
> the body of a message to majordomo@vger.kernel.org
> More majordomo info at =A0http://vger.kernel.org/majordomo-info.html
>

--=20
Roberto Spadim
Spadim Technology / SPAEmpresarial
--
To unsubscribe from this list: send the line "unsubscribe linux-raid" i=
n
the body of a message to majordomo@vger.kernel.org
More majordomo info at http://vger.kernel.org/majordomo-info.html

Periodic RebuildStarted event

am 08.02.2011 00:44:11 von Martin Cracauer

I just got through the RebuildStarted event which seems to be
monthly. This is being triggered by my Debian config, but before I
nuke it I'd like to know a little more.

If a real disk error happens during this rebuild on a raid5, would the
disk go into regular degraded mode or would it count as a double
fault?

I also noticed that recently all the checks for all the arrays happen
simultaneously. That's bad because most of them share the same
physical disks. Am I imagining this or was the system smart enough to
do them one after another until recently?

Do you do period checks? I get lots of device mismatches reported but
apparently that's normal if there's write activity. The whole thing
sound contra-productive to me and might panic new users.

Martin
--
%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%% %%%%%%%
Martin Cracauer http://www.cons.org/cracauer/
--
To unsubscribe from this list: send the line "unsubscribe linux-raid" in
the body of a message to majordomo@vger.kernel.org
More majordomo info at http://vger.kernel.org/majordomo-info.html

Re: Periodic RebuildStarted event

am 08.02.2011 00:52:34 von Roberto Spadim

i don=B4t know if exists, but maybe a internal mdadm command to check
array (if have mirrors or ecc/checksum) with a limited speed (mb/s)
read operation could help with periodic check (instead resync)

2011/2/7 Martin Cracauer :
> I just got through the RebuildStarted event which seems to be
> monthly. =A0This is being triggered by my Debian config, but before I
> nuke it I'd like to know a little more.
>
> If a real disk error happens during this rebuild on a raid5, would th=
e
> disk go into regular degraded mode or would it count as a double
> fault?
>
> I also noticed that recently all the checks for all the arrays happen
> simultaneously. =A0That's bad because most of them share the same
> physical disks. =A0Am I imagining this or was the system smart enough=
to
> do them one after another until recently?
>
> Do you do period checks? I get lots of device mismatches reported but
> apparently that's normal if there's write activity. =A0The whole thin=
g
> sound contra-productive to me and might panic new users.
>
> Martin
> --
> %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%% %%%%%%%
> Martin Cracauer =A0 http://www.cons.org/cracauer/
> --
> To unsubscribe from this list: send the line "unsubscribe linux-raid"=
in
> the body of a message to majordomo@vger.kernel.org
> More majordomo info at =A0http://vger.kernel.org/majordomo-info.html
>

--=20
Roberto Spadim
Spadim Technology / SPAEmpresarial
--
To unsubscribe from this list: send the line "unsubscribe linux-raid" i=
n
the body of a message to majordomo@vger.kernel.org
More majordomo info at http://vger.kernel.org/majordomo-info.html

Re: Periodic RebuildStarted event

am 08.02.2011 00:59:09 von Martin Cracauer

Roberto Spadim wrote on Mon, Feb 07, 2011 at 09:52:34PM -0200:
> i don?t know if exists, but maybe a internal mdadm command to check
> array (if have mirrors or ecc/checksum) with a limited speed (mb/s)
> read operation could help with periodic check (instead resync)

I don't experience performance problems from this. Linux md seems to
be very good at putting the sync at a backburner when there is real
activity. Just today I benchmarked an array that was in progress of
building a raid5 (initial sync). Left most of the cycles to the
benchmark and took endless. That should be configureable, too (for
those who want the sync done with priority but can't kill activity),
but anyway my concern isn't performance.

Martin

> 2011/2/7 Martin Cracauer :
> > I just got through the RebuildStarted event which seems to be
> > monthly. ?This is being triggered by my Debian config, but before I
> > nuke it I'd like to know a little more.
> >
> > If a real disk error happens during this rebuild on a raid5, would the
> > disk go into regular degraded mode or would it count as a double
> > fault?
> >
> > I also noticed that recently all the checks for all the arrays happen
> > simultaneously. ?That's bad because most of them share the same
> > physical disks. ?Am I imagining this or was the system smart enough to
> > do them one after another until recently?
> >
> > Do you do period checks? I get lots of device mismatches reported but
> > apparently that's normal if there's write activity. ?The whole thing
> > sound contra-productive to me and might panic new users.
> >
> > Martin
> > --
> > %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%% %%%%%%%
> > Martin Cracauer ? http://www.cons.org/cracauer/
> > --
> > To unsubscribe from this list: send the line "unsubscribe linux-raid" in
> > the body of a message to majordomo@vger.kernel.org
> > More majordomo info at ?http://vger.kernel.org/majordomo-info.html
> >
>
>
>
> --
> Roberto Spadim
> Spadim Technology / SPAEmpresarial

--
%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%% %%%%%%%
Martin Cracauer http://www.cons.org/cracauer/
--
To unsubscribe from this list: send the line "unsubscribe linux-raid" in
the body of a message to majordomo@vger.kernel.org
More majordomo info at http://vger.kernel.org/majordomo-info.html

Re: Periodic RebuildStarted event

am 08.02.2011 01:25:13 von NeilBrown

On Mon, 7 Feb 2011 18:44:11 -0500 Martin Cracauer wrote:

> I just got through the RebuildStarted event which seems to be
> monthly. This is being triggered by my Debian config, but before I
> nuke it I'd like to know a little more.
>
> If a real disk error happens during this rebuild on a raid5, would the
> disk go into regular degraded mode or would it count as a double
> fault?

The monthly thing is a 'check', not a 'rebuild' (yes, the 'monitor' email is
a little misleading).
So a real disk error will be handled correctly. In fact the main point of a
monthly check is to find and correct these latent read errors.

>
> I also noticed that recently all the checks for all the arrays happen
> simultaneously. That's bad because most of them share the same
> physical disks. Am I imagining this or was the system smart enough to
> do them one after another until recently?

Arrays that share a partition certainly should not be
synced/recovered/checked at the same time (unless you set
sync_force_parallel in sysfs).

If you have evidence that they do I would like to see that evidence.

>
> Do you do period checks? I get lots of device mismatches reported but
> apparently that's normal if there's write activity. The whole thing
> sound contra-productive to me and might panic new users.

Periodic checks are a good thing.
Yes, it can cause confusion. That is not good, but a better approach has not
yet been found. Patches welcome.

What would be really good is to just do an hour of check every night. It is
quite possible to get the kernel to do this, but it requires some non-trivial
scripting that no-one has written yet. You need to record where you are up
to on which array, and when you last did each array. Then start either the
'next' array at the beginning, or the 'current' array at the current point
(write to sync_min).
Then wait for however long you want, abort the check (write 'idle' to
'sync_action') and find out where it got up to (read sync_min) and record
that for next time.

Great project for someone.....

NeilBrown

>
> Martin

--
To unsubscribe from this list: send the line "unsubscribe linux-raid" in
the body of a message to majordomo@vger.kernel.org
More majordomo info at http://vger.kernel.org/majordomo-info.html

Re: Periodic RebuildStarted event

am 08.02.2011 01:28:01 von Roberto Spadim

i don=B4t know how sync is executed, but each write you send to a ssd
make the ssd write period of life smaller
if it=B4s just a READ command no problem, but a WRITE command (sync) is
a problem for ssd
a check (a sync without WRITE) could be nice... it just need tell us
how many mirrors are unsync and how many bytes(sectors) in each mirror
(fail disk if it=B4s not sync, start a resync, these features as a
command line option...)

2011/2/7 Martin Cracauer :
> Roberto Spadim wrote on Mon, Feb 07, 2011 at 09:52:34PM -0200:
>> i don?t know if exists, but maybe a internal mdadm command to check
>> array (if have mirrors or ecc/checksum) with a limited speed (mb/s)
>> read operation could help with periodic check (instead resync)
>
> I don't experience performance problems from this. =A0Linux md seems =
to
> be very good at putting the sync at a backburner when there is real
> activity. =A0Just today I benchmarked an array that was in progress o=
f
> building a raid5 (initial sync). =A0Left most of the cycles to the
> benchmark and took endless. =A0That should be configureable, too (for
> those who want the sync done with priority but can't kill activity),
> but anyway my concern isn't performance.
>
> Martin
>
>> 2011/2/7 Martin Cracauer :
>> > I just got through the RebuildStarted event which seems to be
>> > monthly. ?This is being triggered by my Debian config, but before =
I
>> > nuke it I'd like to know a little more.
>> >
>> > If a real disk error happens during this rebuild on a raid5, would=
the
>> > disk go into regular degraded mode or would it count as a double
>> > fault?
>> >
>> > I also noticed that recently all the checks for all the arrays hap=
pen
>> > simultaneously. ?That's bad because most of them share the same
>> > physical disks. ?Am I imagining this or was the system smart enoug=
h to
>> > do them one after another until recently?
>> >
>> > Do you do period checks? I get lots of device mismatches reported =
but
>> > apparently that's normal if there's write activity. ?The whole thi=
ng
>> > sound contra-productive to me and might panic new users.
>> >
>> > Martin
>> > --
>> > %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%% %%%%%%=
%
>> > Martin Cracauer ? http://www.cons.org/cracauer=
/
>> > --
>> > To unsubscribe from this list: send the line "unsubscribe linux-ra=
id" in
>> > the body of a message to majordomo@vger.kernel.org
>> > More majordomo info at ?http://vger.kernel.org/majordomo-info.html
>> >
>>
>>
>>
>> --
>> Roberto Spadim
>> Spadim Technology / SPAEmpresarial
>
> --
> %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%% %%%%%%%
> Martin Cracauer =A0 http://www.cons.org/cracauer/
> --
> To unsubscribe from this list: send the line "unsubscribe linux-raid"=
in
> the body of a message to majordomo@vger.kernel.org
> More majordomo info at =A0http://vger.kernel.org/majordomo-info.html
>

--=20
Roberto Spadim
Spadim Technology / SPAEmpresarial
--
To unsubscribe from this list: send the line "unsubscribe linux-raid" i=
n
the body of a message to majordomo@vger.kernel.org
More majordomo info at http://vger.kernel.org/majordomo-info.html

Re: Periodic RebuildStarted event

am 15.03.2011 04:16:22 von CoolCold

Hello!

On Tue, Feb 8, 2011 at 3:25 AM, NeilBrown wrote:
[snip]
> Then start either the
> 'next' array at the beginning, or the 'current' array at the current point
> (write to sync_min).
I couldn't find documentation for sync_min/sync_max sysfs params at
least for repo cloned from
git://git.kernel.org/pub/scm/linux/kernel/git/stable/linux-2 .6.37.y.git
coolcold@coolcold:~/gits/linux-2.6.37.y$ grep -qi sync_min
Documentation/md.txt || echo failed find docs
failed find docs

As I could understand from sources - resync_min & resync_max are
expressed in sectors (512bytes?) and are set to 0 & total sectors on
device accordingly. resync_max value should be divisible by array
chunk size (in sectors) . After setting this values, one can trugger
"check" / "repair" into sync_action.

My basic idea is to use this method to clear pending sectors from
SMART checks and looks like this gonna work, am i right?

> Then wait for however long you want, abort the check (write 'idle' to
> 'sync_action') and find out where it got up to (read sync_min) and record
> that for next time.
>
> NeilBrown
>

--
Best regards,
[COOLCOLD-RIPN]
--
To unsubscribe from this list: send the line "unsubscribe linux-raid" in
the body of a message to majordomo@vger.kernel.org
More majordomo info at http://vger.kernel.org/majordomo-info.html

Re: Periodic RebuildStarted event

am 15.03.2011 04:28:24 von Roberto Spadim

does mdadm have check? could it have check with a startup position?
could the mdadm report last checked position?
this could help partial checks..

2011/3/15 CoolCold :
> Hello!
>
> On Tue, Feb 8, 2011 at 3:25 AM, NeilBrown wrote:
> [snip]
>> Then start either the
>> 'next' array at the beginning, or the 'current' array at the current=
point
>> (write to sync_min).
> I couldn't find documentation for sync_min/sync_max sysfs params at
> least for repo cloned from
> git://git.kernel.org/pub/scm/linux/kernel/git/stable/linux-2 .6.37.y.g=
it
> coolcold@coolcold:~/gits/linux-2.6.37.y$ grep -qi sync_min
> Documentation/md.txt || echo failed find docs
> failed find docs
>
> As I could understand from sources - resync_min & resync_max are
> expressed in sectors (512bytes?) =A0and are set to 0 & total sectors =
on
> device accordingly. resync_max value should be divisible by array
> chunk size (in sectors) . After setting this values, one can trugger
> "check" / "repair" into sync_action.
>
> My basic idea is to use this method to clear pending sectors from
> SMART checks and looks like this gonna work, am i right?
>
>> Then wait for however long you want, abort the check (write 'idle' t=
o
>> 'sync_action') and find out where it got up to (read sync_min) and r=
ecord
>> that for next time.
>>
>> NeilBrown
>>
>
>
> --
> Best regards,
> [COOLCOLD-RIPN]
> --
> To unsubscribe from this list: send the line "unsubscribe linux-raid"=
in
> the body of a message to majordomo@vger.kernel.org
> More majordomo info at =A0http://vger.kernel.org/majordomo-info.html
>

--=20
Roberto Spadim
Spadim Technology / SPAEmpresarial
--
To unsubscribe from this list: send the line "unsubscribe linux-raid" i=
n
the body of a message to majordomo@vger.kernel.org
More majordomo info at http://vger.kernel.org/majordomo-info.html

Re: Periodic RebuildStarted event

am 15.03.2011 04:41:40 von NeilBrown

On Tue, 15 Mar 2011 06:16:22 +0300 CoolCold wrote:

> Hello!
>
> On Tue, Feb 8, 2011 at 3:25 AM, NeilBrown wrote:
> [snip]
> > Then start either the
> > 'next' array at the beginning, or the 'current' array at the current point
> > (write to sync_min).
> I couldn't find documentation for sync_min/sync_max sysfs params at
> least for repo cloned from
> git://git.kernel.org/pub/scm/linux/kernel/git/stable/linux-2 .6.37.y.git
> coolcold@coolcold:~/gits/linux-2.6.37.y$ grep -qi sync_min
> Documentation/md.txt || echo failed find docs
> failed find docs

Yes, sorry about that.

>
> As I could understand from sources - resync_min & resync_max are
> expressed in sectors (512bytes?) and are set to 0 & total sectors on
> device accordingly. resync_max value should be divisible by array
> chunk size (in sectors) . After setting this values, one can trugger
> "check" / "repair" into sync_action.

Yes - sectors (multiples of 512 bytes)
Yes - 0 and a big number. sync_max is actually set to MAX_LONG rather than
the actual total number of sectors.

Yes - one can trigger 'check' or 'repair' and it will obey these limits.
When it reaches 'sync_max' it will pause rather than complete. You can
use 'select' or 'poll' on "sync_completed" to wait for that number to
reach sync_max. Then you can either increase sync_max, or can write
"idle" to "sync_action".

>
> My basic idea is to use this method to clear pending sectors from
> SMART checks and looks like this gonna work, am i right?
>

I don't know exactly what "pending sectors" are, but if they are sectors
which return an error to READ and can be fixed by writing data to them, then
you are right, this should 'clear' any pending sectors.

Of course you will need to be careful about mapping the sector number
from smart to the second number given to 'sync_min'. Not only must you
adjust for any partition table, but also the 'data offset' of the
md array must be allowed for.

NeilBrown

> > Then wait for however long you want, abort the check (write 'idle' to
> > 'sync_action') and find out where it got up to (read sync_min) and record
> > that for next time.
> >
> > NeilBrown
> >
>
>

--
To unsubscribe from this list: send the line "unsubscribe linux-raid" in
the body of a message to majordomo@vger.kernel.org
More majordomo info at http://vger.kernel.org/majordomo-info.html

Re: Periodic RebuildStarted event

am 15.03.2011 04:51:15 von CoolCold

On Tue, Mar 15, 2011 at 6:28 AM, Roberto Spadim =
wrote:
> does mdadm have check? could it have check with a startup position?
> could the mdadm report last checked position?
> this could help partial checks..
Testing with loop devices shows it works.
limits are set:
root@nekotaz2:/storage/ovzs/keke# cat /sys/block/md5/md/sync_{min,max}
500000
700000

calling check & viewing status:
root@nekotaz2:/storage/ovzs/keke# echo check > /sys/block/md5/md/sync_a=
ction
root@nekotaz2:/storage/ovzs/keke# cat /proc/mdstat
Personalities : [raid0] [raid1] [raid6] [raid5] [raid4] [raid10]
md5 : active raid1 loop1[1] loop0[0]
1048512 blocks [2/2] [UU]
[====>................] check =3D 24.2% (254608/1048512)
finish=3D5.7min speed=3D2304K/sec

Here 254608 is 500000/2 =3D 250000 + some progress for delay in between
of "echo check" & "cat /proc/mdstat"
In result, it stops on 350000 which is exactly 700000/2

root@nekotaz2:/storage/ovzs/keke# cat /proc/mdstat
Personalities : [raid0] [raid1] [raid6] [raid5] [raid4] [raid10]
md5 : active raid1 loop1[1] loop0[0]
1048512 blocks [2/2] [UU]
[======>..............] check =3D 33.3% (350000/1048=
512)
finish=3D76.4min speed=3D151K/sec

Speed & finish time are still being calculated though.

Also, see Nail's message.

>
> 2011/3/15 CoolCold :
>> Hello!
>>
>> On Tue, Feb 8, 2011 at 3:25 AM, NeilBrown wrote:
>> [snip]
>>> Then start either the
>>> 'next' array at the beginning, or the 'current' array at the curren=
t point
>>> (write to sync_min).
>> I couldn't find documentation for sync_min/sync_max sysfs params at
>> least for repo cloned from
>> git://git.kernel.org/pub/scm/linux/kernel/git/stable/linux-2 .6.37.y.=
git
>> coolcold@coolcold:~/gits/linux-2.6.37.y$ grep -qi sync_min
>> Documentation/md.txt || echo failed find docs
>> failed find docs
>>
>> As I could understand from sources - resync_min & resync_max are
>> expressed in sectors (512bytes?) =A0and are set to 0 & total sectors=
on
>> device accordingly. resync_max value should be divisible by array
>> chunk size (in sectors) . After setting this values, one can trugger
>> "check" / "repair" into sync_action.
>>
>> My basic idea is to use this method to clear pending sectors from
>> SMART checks and looks like this gonna work, am i right?
>>
>>> Then wait for however long you want, abort the check (write 'idle' =
to
>>> 'sync_action') and find out where it got up to (read sync_min) and =
record
>>> that for next time.
>>>
>>> NeilBrown
>>>
>>
>>
>> --
>> Best regards,
>> [COOLCOLD-RIPN]
>> --
>> To unsubscribe from this list: send the line "unsubscribe linux-raid=
" in
>> the body of a message to majordomo@vger.kernel.org
>> More majordomo info at =A0http://vger.kernel.org/majordomo-info.html
>>
>
>
>
> --
> Roberto Spadim
> Spadim Technology / SPAEmpresarial
>

--=20
Best regards,
[COOLCOLD-RIPN]
--
To unsubscribe from this list: send the line "unsubscribe linux-raid" i=
n
the body of a message to majordomo@vger.kernel.org
More majordomo info at http://vger.kernel.org/majordomo-info.html

Re: Periodic RebuildStarted event

am 15.03.2011 05:16:09 von Roberto Spadim

hum...
could we allow start with a initial position, and stop at any time?
maybe a pause too... (set speed to 0bytes/sec)
with pause we could pause get the actual position, stop, and start
from that position later

2011/3/15 CoolCold :
> On Tue, Mar 15, 2011 at 6:28 AM, Roberto Spadim r> wrote:
>> does mdadm have check? could it have check with a startup position?
>> could the mdadm report last checked position?
>> this could help partial checks..
> Testing with loop devices shows it works.
> limits are set:
> root@nekotaz2:/storage/ovzs/keke# cat /sys/block/md5/md/sync_{min,max=
}
> 500000
> 700000
>
> calling check & viewing status:
> root@nekotaz2:/storage/ovzs/keke# echo check > /sys/block/md5/md/sync=
_action
> root@nekotaz2:/storage/ovzs/keke# cat /proc/mdstat
> Personalities : [raid0] [raid1] [raid6] [raid5] [raid4] [raid10]
> md5 : active raid1 loop1[1] loop0[0]
> =A0 =A0 =A01048512 blocks [2/2] [UU]
> =A0 =A0 =A0[====>................] =A0check =3D 24.2% (254608=
/1048512)
> finish=3D5.7min speed=3D2304K/sec
>
> Here 254608 is 500000/2 =3D 250000 + some progress for delay in betwe=
en
> of "echo check" & "cat /proc/mdstat"
> In result, it stops on 350000 which is exactly 700000/2
>
> root@nekotaz2:/storage/ovzs/keke# cat /proc/mdstat
> Personalities : [raid0] [raid1] [raid6] [raid5] [raid4] [raid10]
> md5 : active raid1 loop1[1] loop0[0]
> =A0 =A0 =A01048512 blocks [2/2] [UU]
> =A0 =A0 =A0[======>..............] =A0check =3D 33.3% (35=
0000/1048512)
> finish=3D76.4min speed=3D151K/sec
>
> Speed & finish time are still being calculated though.
>
> Also, see Nail's message.
>
>>
>> 2011/3/15 CoolCold :
>>> Hello!
>>>
>>> On Tue, Feb 8, 2011 at 3:25 AM, NeilBrown wrote:
>>> [snip]
>>>> Then start either the
>>>> 'next' array at the beginning, or the 'current' array at the curre=
nt point
>>>> (write to sync_min).
>>> I couldn't find documentation for sync_min/sync_max sysfs params at
>>> least for repo cloned from
>>> git://git.kernel.org/pub/scm/linux/kernel/git/stable/linux-2 .6.37.y=
git
>>> coolcold@coolcold:~/gits/linux-2.6.37.y$ grep -qi sync_min
>>> Documentation/md.txt || echo failed find docs
>>> failed find docs
>>>
>>> As I could understand from sources - resync_min & resync_max are
>>> expressed in sectors (512bytes?) =A0and are set to 0 & total sector=
s on
>>> device accordingly. resync_max value should be divisible by array
>>> chunk size (in sectors) . After setting this values, one can trugge=
r
>>> "check" / "repair" into sync_action.
>>>
>>> My basic idea is to use this method to clear pending sectors from
>>> SMART checks and looks like this gonna work, am i right?
>>>
>>>> Then wait for however long you want, abort the check (write 'idle'=
to
>>>> 'sync_action') and find out where it got up to (read sync_min) and=
record
>>>> that for next time.
>>>>
>>>> NeilBrown
>>>>
>>>
>>>
>>> --
>>> Best regards,
>>> [COOLCOLD-RIPN]
>>> --
>>> To unsubscribe from this list: send the line "unsubscribe linux-rai=
d" in
>>> the body of a message to majordomo@vger.kernel.org
>>> More majordomo info at =A0http://vger.kernel.org/majordomo-info.htm=
l
>>>
>>
>>
>>
>> --
>> Roberto Spadim
>> Spadim Technology / SPAEmpresarial
>>
>
>
>
> --
> Best regards,
> [COOLCOLD-RIPN]
> --
> To unsubscribe from this list: send the line "unsubscribe linux-raid"=
in
> the body of a message to majordomo@vger.kernel.org
> More majordomo info at =A0http://vger.kernel.org/majordomo-info.html
>

--=20
Roberto Spadim
Spadim Technology / SPAEmpresarial
--
To unsubscribe from this list: send the line "unsubscribe linux-raid" i=
n
the body of a message to majordomo@vger.kernel.org
More majordomo info at http://vger.kernel.org/majordomo-info.html

Re: Periodic RebuildStarted event

am 15.03.2011 05:21:09 von CoolCold

On Tue, Mar 15, 2011 at 6:41 AM, NeilBrown wrote:
> On Tue, 15 Mar 2011 06:16:22 +0300 CoolCold w=
rote:
>
>> Hello!
>>
>> On Tue, Feb 8, 2011 at 3:25 AM, NeilBrown wrote:
>> [snip]
>> > Then start either the
>> > 'next' array at the beginning, or the 'current' array at the curre=
nt point
>> > (write to sync_min).
>> I couldn't find documentation for sync_min/sync_max sysfs params at
>> least for repo cloned from
>> git://git.kernel.org/pub/scm/linux/kernel/git/stable/linux-2 .6.37.y.=
git
>> coolcold@coolcold:~/gits/linux-2.6.37.y$ grep -qi sync_min
>> Documentation/md.txt || echo failed find docs
>> failed find docs
>
> Yes, sorry about that.
May be I can help and create patch for md.txt after this thread? If
yes, it would be nice to get some link for proper patch providing
instructions, never did patches for kernel ;)

>
>>
>> As I could understand from sources - resync_min & resync_max are
>> expressed in sectors (512bytes?) =A0and are set to 0 & total sectors=
on
>> device accordingly. resync_max value should be divisible by array
>> chunk size (in sectors) . After setting this values, one can trugger
>> "check" / "repair" into sync_action.
>
> Yes - sectors (multiples of 512 bytes)
> Yes - 0 and a big number. =A0sync_max is actually set to MAX_LONG rat=
her than
> =A0 =A0 =A0the actual total number of sectors.
>
> Yes - one can trigger 'check' or 'repair' and it will obey these limi=
ts.
> When it reaches 'sync_max' it will pause rather than complete. =A0You=
can
> use 'select' or 'poll' on "sync_completed" to wait for that number to
> reach sync_max. =A0Then you can either increase sync_max, or can writ=
e
> "idle" to "sync_action".
>
>>
>> My basic idea is to use this method to clear pending sectors from
>> SMART checks and looks like this gonna work, am i right?
>>
>
> I don't know exactly what "pending sectors" are, but if they are sect=
ors
> which return an error to READ and can be fixed by writing data to the=
m, then
> you are right, this should 'clear' any pending sectors.
Yes, i meant that kind.
>
> Of course you will need to be careful about mapping the sector number
> from smart to the second number given to 'sync_min'.
I guess you meant "sector" not "second" here?

> Not only must you
> adjust for any partition table, but also the 'data offset' of the
> md array must be allowed for.
So, for 0.9 metadata format offset is always gonna be 0, right?
And if the bad thing happens - bad block with read error is found on
metadata section, will mdadm with --update will be enought
to do force write?

>
> NeilBrown
>
>
>
>> > Then wait for however long you want, abort the check (write 'idle'=
to
>> > 'sync_action') and find out where it got up to (read sync_min) and=
record
>> > that for next time.
>> >
>> > NeilBrown
>> >
>>
>>
>
>

--=20
Best regards,
[COOLCOLD-RIPN]
--
To unsubscribe from this list: send the line "unsubscribe linux-raid" i=
n
the body of a message to majordomo@vger.kernel.org
More majordomo info at http://vger.kernel.org/majordomo-info.html

Re: Periodic RebuildStarted event

am 15.03.2011 07:17:10 von NeilBrown

On Tue, 15 Mar 2011 07:21:09 +0300 CoolCold wro=
te:

> On Tue, Mar 15, 2011 at 6:41 AM, NeilBrown wrote:
> > On Tue, 15 Mar 2011 06:16:22 +0300 CoolCold =
wrote:
> >
> >> Hello!
> >>
> >> On Tue, Feb 8, 2011 at 3:25 AM, NeilBrown wrote:
> >> [snip]
> >> > Then start either the
> >> > 'next' array at the beginning, or the 'current' array at the cur=
rent point
> >> > (write to sync_min).
> >> I couldn't find documentation for sync_min/sync_max sysfs params a=
t
> >> least for repo cloned from
> >> git://git.kernel.org/pub/scm/linux/kernel/git/stable/linux-2 .6.37.=
y.git
> >> coolcold@coolcold:~/gits/linux-2.6.37.y$ grep -qi sync_min
> >> Documentation/md.txt || echo failed find docs
> >> failed find docs
> >
> > Yes, sorry about that.
> May be I can help and create patch for md.txt after this thread? If
> yes, it would be nice to get some link for proper patch providing
> instructions, never did patches for kernel ;)

Patches always welcome!! So yes please.

Have a read through SubmittingPatches in linux/Documentation
(same directory that 'md.txt' is in).
That should get you close enough.

Thanks,
NeilBrown

>=20
> >
> >>
> >> As I could understand from sources - resync_min & resync_max are
> >> expressed in sectors (512bytes?) =A0and are set to 0 & total secto=
rs on
> >> device accordingly. resync_max value should be divisible by array
> >> chunk size (in sectors) . After setting this values, one can trugg=
er
> >> "check" / "repair" into sync_action.
> >
> > Yes - sectors (multiples of 512 bytes)
> > Yes - 0 and a big number. =A0sync_max is actually set to MAX_LONG r=
ather than
> > =A0 =A0 =A0the actual total number of sectors.
> >
> > Yes - one can trigger 'check' or 'repair' and it will obey these li=
mits.
> > When it reaches 'sync_max' it will pause rather than complete. =A0Y=
ou can
> > use 'select' or 'poll' on "sync_completed" to wait for that number =
to
> > reach sync_max. =A0Then you can either increase sync_max, or can wr=
ite
> > "idle" to "sync_action".
> >
> >>
> >> My basic idea is to use this method to clear pending sectors from
> >> SMART checks and looks like this gonna work, am i right?
> >>
> >
> > I don't know exactly what "pending sectors" are, but if they are se=
ctors
> > which return an error to READ and can be fixed by writing data to t=
hem, then
> > you are right, this should 'clear' any pending sectors.
> Yes, i meant that kind.
> >
> > Of course you will need to be careful about mapping the sector numb=
er
> > from smart to the second number given to 'sync_min'.
> I guess you meant "sector" not "second" here?
>=20
> > Not only must you
> > adjust for any partition table, but also the 'data offset' of the
> > md array must be allowed for.
> So, for 0.9 metadata format offset is always gonna be 0, right?
> And if the bad thing happens - bad block with read error is found on
> metadata section, will mdadm with --update will be enough=
t
> to do force write?
>=20
> >
> > NeilBrown
> >
> >
> >
> >> > Then wait for however long you want, abort the check (write 'idl=
e' to
> >> > 'sync_action') and find out where it got up to (read sync_min) a=
nd record
> >> > that for next time.
> >> >
> >> > NeilBrown
> >> >
> >>
> >>
> >
> >
>=20
>=20
>=20

--
To unsubscribe from this list: send the line "unsubscribe linux-raid" i=
n
the body of a message to majordomo@vger.kernel.org
More majordomo info at http://vger.kernel.org/majordomo-info.html