Can md/mdadm deal with non-standard size of internal bitmap?

on 08.09.2009 15:44:38 by Mario Holbe

Hello (most likely Hello Neil :)),

are md and mdadm able to deal correctly with internal bitmaps of
non-standard size, e.g. a 132k internal bitmap?

As far as I can see from the code, it should be possible: v1 superblocks
store the offset and size of the data area, and bitmap superblocks store
the bitmap chunk size, which in turn determines the number of blocks to
read as bitmap data.
Modifying mdadm to set up a non-standard bitmap size would also be easy:
change choose_bm_space(), recompile, create the array, done. This would
just increase sb->data_offset or decrease sb->data_size a little.
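
Roughly, the kind of change I have in mind (illustrative only -- this is
not mdadm's actual choose_bm_space() body, whose thresholds differ between
versions; it just bumps the large-device reservation from 128k to 132k):

#include <stdio.h>

/* Sketch: return the bitmap reservation in 512-byte sectors, the way
 * choose_bm_space() in super1.c does, but with the large-device case
 * enlarged from 128k to 132k.  Thresholds are approximate. */
static unsigned long choose_bm_space(unsigned long long devsize_sectors)
{
        if (devsize_sectors >= 200ULL * 1024 * 1024 * 2)
                return 132 * 2;         /* was 128 * 2 */
        if (devsize_sectors >= 8ULL * 1024 * 1024 * 2)
                return 64 * 2;
        return 4 * 2;
}

int main(void)
{
        unsigned long long dev = 1465138584ULL * 2;  /* 1.5T component */
        printf("bitmap reservation: %luk\n", choose_bm_space(dev) / 2);
        return 0;
}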

However, I'm not sure whether really everything in md and mdadm can deal
with such a "non-standard" superblock, and what would happen on
subsequent operations like mdadm --grow with an unpatched mdadm.


The reason I'm asking is:

When using a chunked RAID level ([0456], 10), there is usually some
space cut off at the end of the component devices which remains unusable
because it is too small to hold a chunk. For example, with unpartitioned
1.5T = 1465138584k component devices and v1.1 superblocks there is a
space cut of 20k with 32k-256k chunk sizes (even 276k with 512k chunks).
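
For reference, the space-cut arithmetic; it assumes a v1.1 data offset of
132k (4k superblock area plus the default 128k bitmap reservation), which
is an assumption on my part -- adjust for your layout:

#include <stdio.h>

int main(void)
{
        /* 1.5T component in KiB, minus an assumed 132k v1.1 data offset */
        unsigned long long usable_k = 1465138584ULL - 132;
        unsigned int chunk_k[] = { 32, 64, 128, 256, 512 };

        for (unsigned int i = 0; i < sizeof(chunk_k) / sizeof(chunk_k[0]); i++)
                printf("chunk %3uk: space cut %lluk\n",
                       chunk_k[i], usable_k % chunk_k[i]);
        return 0;
}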

When using internal bitmaps, depending on the device size, more or less
of the available bitmap space is actually used. For example, the internal
bitmap of a 3*1.5T RAID gets 8192KB chunks and uses 536550 of the
1048576 available bits (if my calculations are correct :)) - 51%.

If it were possible to add some of the space cut off by RAID chunking
to the internal bitmap, one could optimize the bitmap chunk size. In the
above example, by adding 4k to the internal bitmap, the bitmap chunk size
could be decreased to 4096KB and 1073100 of the then 1081344 available
bits would be used - 99%.
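
The arithmetic behind these figures, for the record; it assumes the bitmap
has to cover the summed raw component size (3 * 1465138584k), which is what
my numbers are based on:

#include <stdio.h>

/* used = chunks needed to cover the array; one bitmap bit per chunk */
static void report(unsigned long long covered_k, unsigned int bitmap_k,
                   unsigned int bm_chunk_k)
{
        unsigned long long avail = (unsigned long long)bitmap_k * 1024 * 8;
        unsigned long long used  = (covered_k + bm_chunk_k - 1) / bm_chunk_k;
        printf("%3uk bitmap, %4uk chunks: %llu of %llu bits - %.0f%%\n",
               bitmap_k, bm_chunk_k, used, avail, 100.0 * used / avail);
}

int main(void)
{
        unsigned long long covered_k = 3ULL * 1465138584ULL;
        report(covered_k, 128, 8192);   /* default 128k bitmap */
        report(covered_k, 132, 4096);   /* 132k bitmap, halved chunk */
        return 0;
}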

Of course, this is not always possible; e.g. a 4*1.5T RAID has a bitmap
used/available ratio of 68% - the 20k space cut wouldn't be enough here
to optimize it.

I'm not sure if it's worth the effort to automate such optimizations,
but being able to do them manually at least would be great.


regards
Mario
--
Tower: "Say fuelstate." Pilot: "Fuelstate."
Tower: "Say again." Pilot: "Again."
Tower: "Arghl, give me your fuel!" Pilot: "Sorry, need it by myself..."


Re: Can md/mdadm deal with non-standard size of internal bitmap?

on 15.09.2009 19:39:31 by Louis-Michel Gelinas

Mario, did you get a [private] answer to this?

Seems interesting. Any reason to dismiss it without a reply?

Neil, did this pass under your radar?

thanks


On Tue, 2009-09-08 at 15:44 +0200, Mario 'BitKoenig' Holbe wrote:
> [original message quoted in full, trimmed; see above]


Re: Can md/mdadm deal with non-standard size of internal bitmap?

on 15.09.2009 23:08:18 by NeilBrown

On Wed, September 16, 2009 3:39 am, Louis-Michel Gelinas wrote:
> Mario, did you get a [private] answer to this?
>
> Seems interesting. Any reason to dismiss it without a reply?
>
> Neil, did this pass under your radar?
>

With 1.x metadata there is no "standard size" for bitmaps.
The metadata records the offset of the bitmap from the superblock,
the offset of the data from the start of the device,
and the size of the data area. You can reserve as much as you
like for the bitmap.

mdadm chooses an amount of space to reserve based roughly on the
size of the device.
It doesn't take the chunk size and alignment into account and so push the
data region to the very end of the device as suggested, though that
would not be particularly hard to do, I suspect.
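
A rough sketch of what that could look like for v1.1: start from the usual
reservation and fold in whatever the chunk alignment would waste anyway.
Numbers and names below are illustrative, not mdadm's actual code:

#include <stdio.h>

int main(void)
{
        /* illustrative values for the 1.5T / v1.1 example in this thread */
        unsigned long long devsize_k = 1465138584ULL;  /* component, KiB   */
        unsigned long long chunk_k   = 256;            /* RAID chunk       */
        unsigned long long sb_k      = 4;              /* superblock area  */
        unsigned long long bitmap_k  = 128;            /* usual reservation */

        unsigned long long data_k  = devsize_k - sb_k - bitmap_k;
        unsigned long long waste_k = data_k % chunk_k; /* lost to alignment */

        printf("bitmap reservation %lluk -> %lluk, data ends chunk-aligned\n",
               bitmap_k, bitmap_k + waste_k);
        return 0;
}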

NeilBrown

Re: Can md/mdadm deal with non-standard size of internal bitmap?

on 18.09.2009 20:53:41 by Mario Holbe

NeilBrown wrote:
> With 1.x metadata there is no "standard size" for bitmaps.

Okay, I was just not sure about superblock updates and things like that.

> mdadm chooses an amount of space to reserve based roughly on the
> size of the device.
> It doesn't take the chunk size and alignment into account and so push the
> data region to the very end of the device as suggested, though that
> would not be particularly hard to do, I suspect.

It would not be hard to implement, but I'm not sure about all the
side effects it would have. Some obvious ones are:
1. it breaks backwards compatibility, especially in the --assume-clean
RAID-reconstruction case
2. it may hurt when the array is grown in the future (e.g. onto bigger
components), because the grown device's net capacity would be reduced
compared to the unaligned case

So if you (or somebody else) intend to do it, please don't make it
the default.


regards
Mario
--
if the nsa would filter spam...
just cut off china
*** Signoff: Chinah (ix.irc.at linz.irc.at)
