Wrong sorting order in croat.conf (spouse all version)

on 25.03.2004 13:36:18 by Dubravko Penezic


Hi,

Attached is a croat.conf file with the changes necessary for MySQL to sort
Croatian characters correctly.

The mistake was that positions 8A and 9A were wrongly mapped to sort order
number 57 (which is correct for win-1250, not for croat), while positions A9
and B9 were wrongly mapped to order numbers A9 and B9 instead of order
number 57.
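
The change described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the `before` table is a hypothetical stand-in (identity everywhere except the entries named in this message), not the real croat.conf contents, and `apply_fix` is a name made up for the example.

```python
# Hypothetical stand-in for the old sort_order array: identity mapping,
# except for the entries described in the message above.
before = list(range(256))
before[0x8A] = 0x57  # old: position 8A mapped to order number 57
before[0x9A] = 0x57  # old: position 9A mapped to order number 57
# before[0xA9] and before[0xB9] already equal 0xA9 / 0xB9 via the identity mapping.


def apply_fix(sort_order):
    """Return a copy of sort_order with the four entries changed as proposed."""
    if len(sort_order) != 256:
        raise ValueError("sort_order must have 256 elements")
    fixed = list(sort_order)
    fixed[0x8A] = 0x8A  # no longer mapped to order number 57
    fixed[0x9A] = 0x9A
    fixed[0xA9] = 0x57  # now mapped to order number 57
    fixed[0xB9] = 0x57
    return fixed


after = apply_fix(before)
changed = [i for i in range(256) if before[i] != after[i]]
for i in changed:
    print(f"{i:02X}: {before[i]:02X} -> {after[i]:02X}")
```

Only the four positions named in the message differ between `before` and `after`; every other entry is left untouched.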

Dubravko Penezic
ISA, University Computing Center
Zagreb, Croatia
http://www.srce.hr/

Attachment: croat.conf (base64-encoded)

IyBDb25maWd1cmF0aW9uIGZpbGUgZm9yIHRoZSBjcm9hdCBjaGFyYWN0ZXIg
c2V0DQoNCiMgY3R5cGUgYXJyYXkgKG11c3QgaGF2ZSAyNTcgZWxlbWVudHMp
DQogIDAwDQogIDIwICAyMCAgMjAgIDIwICAyMCAgMjAgIDIwICAyMCAgMjAg
IDI4ICAyOCAgMjggIDI4ICAyOCAgMjAgIDIwDQogIDIwICAyMCAgMjAgIDIw
ICAyMCAgMjAgIDIwICAyMCAgMjAgIDIwICAyMCAgMjAgIDIwICAyMCAgMjAg
IDIwDQogIDQ4ICAxMCAgMTAgIDEwICAxMCAgMTAgIDEwICAxMCAgMTAgIDEw
ICAxMCAgMTAgIDEwICAxMCAgMTAgIDEwDQogIDg0ICA4NCAgODQgIDg0ICA4
NCAgODQgIDg0ICA4NCAgODQgIDg0ICAxMCAgMTAgIDEwICAxMCAgMTAgIDEw
DQogIDEwICA4MSAgODEgIDgxICA4MSAgODEgIDgxICAwMSAgMDEgIDAxICAw
MSAgMDEgIDAxICAwMSAgMDEgIDAxDQogIDAxICAwMSAgMDEgIDAxICAwMSAg
MDEgIDAxICAwMSAgMDEgIDAxICAwMSAgMTAgIDEwICAxMCAgMTAgIDEwDQog
IDEwICA4MiAgODIgIDgyICA4MiAgODIgIDgyICAwMiAgMDIgIDAyICAwMiAg
MDIgIDAyICAwMiAgMDIgIDAyDQogIDAyICAwMiAgMDIgIDAyICAwMiAgMDIg
IDAyICAwMiAgMDIgIDAyICAwMiAgMTAgIDEwICAxMCAgMTAgIDIwDQogIDAw
ICAwMCAgMDAgIDAwICAwMCAgMDAgIDAwICAwMCAgMDAgIDAxICAwMCAgMDAg
IDAwICAwMSAgMDAgIDAwDQogIDAwICAwMCAgMDAgIDAwICAwMCAgMDAgIDAw
ICAwMCAgMDAgIDAyICAwMCAgMDAgIDAwICAwMiAgMDAgIDAwDQogIDQ4ICAx
MCAgMTAgIDEwICAxMCAgMTAgIDEwICAxMCAgMTAgIDEwICAxMCAgMTAgIDEw
ICAxMCAgMTAgIDEwDQogIDEwICAxMCAgMTAgIDEwICAxMCAgMTAgIDEwICAx
MCAgMTAgIDEwICAxMCAgMTAgIDEwICAxMCAgMTAgIDEwDQogIDAxICAwMSAg
MDEgIDAxICAwMSAgMDEgIDAxICAwMSAgMDEgIDAxICAwMSAgMDEgIDAxICAw
MSAgMDEgIDAxDQogIDAxICAwMSAgMDEgIDAxICAwMSAgMDEgIDAxICAxMCAg
MDEgIDAxICAwMSAgMDEgIDAxICAwMSAgMDEgIDAyDQogIDAyICAwMiAgMDIg
IDAyICAwMiAgMDIgIDAyICAwMiAgMDIgIDAyICAwMiAgMDIgIDAyICAwMiAg
MDIgIDAyDQogIDAyICAwMiAgMDIgIDAyICAwMiAgMDIgIDAyICAxMCAgMDIg
IDAyICAwMiAgMDIgIDAyICAwMiAgMDIgIDAyDQoNCiMgdG9fbG93ZXIgYXJy
YXkgKG11c3QgaGF2ZSAyNTYgZWxlbWVudHMpDQogIDAwICAwMSAgMDIgIDAz
ICAwNCAgMDUgIDA2ICAwNyAgMDggIDA5ICAwQSAgMEIgIDBDICAwRCAgMEUg
IDBGDQogIDEwICAxMSAgMTIgIDEzICAxNCAgMTUgIDE2ICAxNyAgMTggIDE5
ICAxQSAgMUIgIDFDICAxRCAgMUUgIDFGDQogIDIwICAyMSAgMjIgIDIzICAy
NCAgMjUgIDI2ICAyNyAgMjggIDI5ICAyQSAgMkIgIDJDICAyRCAgMkUgIDJG
DQogIDMwICAzMSAgMzIgIDMzICAzNCAgMzUgIDM2ICAzNyAgMzggIDM5ICAz
QSAgM0IgIDNDICAzRCAgM0UgIDNGDQogIDQwICA2MSAgNjIgIDYzICA2NCAg
NjUgIDY2ICA2NyAgNjggIDY5ICA2QSAgNkIgIDZDICA2RCAgNkUgIDZGDQog
IDcwICA3MSAgNzIgIDczICA3NCAgNzUgIDc2ICA3NyAgNzggIDc5ICA3QSAg
NUIgIDVDICA1RCAgNUUgIDVGDQogIDYwICA2MSAgNjIgIDYzICA2NCAgNjUg
IDY2ICA2NyAgNjggIDY5ICA2QSAgNkIgIDZDICA2RCAgNkUgIDZGDQogIDcw
ICA3MSAgNzIgIDczICA3NCAgNzUgIDc2ICA3NyAgNzggIDc5ICA3QSAgN0Ig
IDdDICA3RCAgN0UgIDdGDQogIDgwICA4MSAgODIgIDgzICA4NCAgODUgIDg2
ICA4NyAgODggIDg5ICA5QSAgOEIgIDhDICA4RCAgOUUgIDhGDQogIDkwICA5
MSAgOTIgIDkzICA5NCAgOTUgIDk2ICA5NyAgOTggIDk5ICA5QSAgOUIgIDlD
ICA5RCAgOUUgIDlGDQogIEEwICBBMSAgQTIgIEEzICBBNCAgQTUgIEE2ICBB
NyAgQTggIEE5ICBBQSAgQUIgIEFDICBBRCAgQUUgIEFGDQogIEIwICBCMSAg
QjIgIEIzICBCNCAgQjUgIEI2ICBCNyAgQjggIEI5ICBCQSAgQkIgIEJDICBC
RCAgQkUgIEJGDQogIEUwICBFMSAgRTIgIEUzICBFNCAgRTUgIEU2ICBFNyAg
RTggIEU5ICBFQSAgRUIgIEVDICBFRCAgRUUgIEVGDQogIEYwICBGMSAgRjIg
IEYzICBGNCAgRjUgIEY2ICBENyAgRjggIEY5ICBGQSAgRkIgIEZDICBGRCAg
RkUgIERGDQogIEUwICBFMSAgRTIgIEUzICBFNCAgRTUgIEU2ICBFNyAgRTgg
IEU5ICBFQSAgRUIgIEVDICBFRCAgRUUgIEVGDQogIEYwICBGMSAgRjIgIEYz
ICBGNCAgRjUgIEY2ICBGNyAgRjggIEY5ICBGQSAgRkIgIEZDICBGRCAgRkUg
IEZGDQoNCiMgdG9fdXBwZXIgYXJyYXkgKG11c3QgaGF2ZSAyNTYgZWxlbWVu
dHMpDQogIDAwICAwMSAgMDIgIDAzICAwNCAgMDUgIDA2ICAwNyAgMDggIDA5
ICAwQSAgMEIgIDBDICAwRCAgMEUgIDBGDQogIDEwICAxMSAgMTIgIDEzICAx
NCAgMTUgIDE2ICAxNyAgMTggIDE5ICAxQSAgMUIgIDFDICAxRCAgMUUgIDFG
DQogIDIwICAyMSAgMjIgIDIzICAyNCAgMjUgIDI2ICAyNyAgMjggIDI5ICAy
QSAgMkIgIDJDICAyRCAgMkUgIDJGDQogIDMwICAzMSAgMzIgIDMzICAzNCAg
MzUgIDM2ICAzNyAgMzggIDM5ICAzQSAgM0IgIDNDICAzRCAgM0UgIDNGDQog
IDQwICA0MSAgNDIgIDQzICA0NCAgNDUgIDQ2ICA0NyAgNDggIDQ5ICA0QSAg
NEIgIDRDICA0RCAgNEUgIDRGDQogIDUwICA1MSAgNTIgIDUzICA1NCAgNTUg
IDU2ICA1NyAgNTggIDU5ICA1QSAgNUIgIDVDICA1RCAgNUUgIDVGDQogIDYw
ICA0MSAgNDIgIDQzICA0NCAgNDUgIDQ2ICA0NyAgNDggIDQ5ICA0QSAgNEIg
IDRDICA0RCAgNEUgIDRGDQogIDUwICA1MSAgNTIgIDUzICA1NCAgNTUgIDU2
ICA1NyAgNTggIDU5ICA1QSAgN0IgIDdDICA3RCAgN0UgIDdGDQogIDgwICA4
MSAgODIgIDgzICA4NCAgODUgIDg2ICA4NyAgODggIDg5ICA4QSAgOEIgIDhD
ICA4RCAgOEUgIDhGDQogIDkwICA5MSAgOTIgIDkzICA5NCAgOTUgIDk2ICA5
NyAgOTggIDk5ICA4QSAgOUIgIDlDICA5RCAgOEUgIDlGDQogIEEwICBBMSAg
QTIgIEEzICBBNCAgQTUgIEE2ICBBNyAgQTggIEE5ICBBQSAgQUIgIEFDICBB
RCAgQUUgIEFGDQogIEIwICBCMSAgQjIgIEIzICBCNCAgQjUgIEI2ICBCNyAg
QjggIEI5ICBCQSAgQkIgIEJDICBCRCAgQkUgIEJGDQogIEMwICBDMSAgQzIg
IEMzICBDNCAgQzUgIEM2ICBDNyAgQzggIEM5ICBDQSAgQ0IgIENDICBDRCAg
Q0UgIENGDQogIEQwICBEMSAgRDIgIEQzICBENCAgRDUgIEQ2ICBENyAgRDgg
IEQ5ICBEQSAgREIgIERDICBERCAgREUgIERGDQogIEMwICBDMSAgQzIgIEMz
ICBDNCAgQzUgIEM2ICBDNyAgQzggIEM5ICBDQSAgQ0IgIENDICBDRCAgQ0Ug
IENGDQogIEQwICBEMSAgRDIgIEQzICBENCAgRDUgIEQ2ICBGNyAgRDggIEQ5
ICBEQSAgREIgIERDICBERCAgREUgIEZGDQoNCiMgc29ydF9vcmRlciBhcnJh
eSAobXVzdCBoYXZlIDI1NiBlbGVtZW50cykNCiAgMDAgIDAxICAwMiAgMDMg
IDA0ICAwNSAgMDYgIDA3ICAwOCAgMDkgIDBBICAwQiAgMEMgIDBEICAwRSAg
MEYNCiAgMTAgIDExICAxMiAgMTMgIDE0ICAxNSAgMTYgIDE3ICAxOCAgMTkg
IDFBICAxQiAgMUMgIDFEICAxRSAgMUYNCiAgMjAgIDIxICAyMiAgMjMgIDI0
ICAyNSAgMjYgIDI3ICAyOCAgMjkgIDJBICAyQiAgMkMgIDJEICAyRSAgMkYN
CiAgMzAgIDMxICAzMiAgMzMgIDM0ICAzNSAgMzYgIDM3ICAzOCAgMzkgIDNB
ICAzQiAgM0MgIDNEICAzRSAgM0YNCiAgNDAgIDQxICA0MiAgNDMgIDQ2ICA0
OCAgNDkgIDRBICA0QiAgNEMgIDREICA0RSAgNEYgIDUwICA1MSAgNTINCiAg
NTMgIDU0ICA1NSAgNTYgIDU4ICA1OSAgNUEgIDVCICA1QyAgNUQgIDVFICA1
QiAgNUMgIDVEICA1RSAgNUYNCiAgNjAgIDQxICA0MiAgNDMgIDQ2ICA0OCAg
NDkgIDRBICA0QiAgNEMgIDREICA0RSAgNEYgIDUwICA1MSAgNTINCiAgNTMg
IDU0ICA1NSAgNTYgIDU4ICA1OSAgNUEgIDVCICA1QyAgNUQgIDVFICA3QiAg
N0MgIDdEICA3RSAgN0YNCiAgODAgIDgxICA4MiAgODMgIDg0ICA4NSAgODYg
IDg3ICA4OCAgODkgIDhBICA4QiAgOEMgIDhEICA1RiAgOEYNCiAgOTAgIDkx
ICA5MiAgOTMgIDk0ICA5NSAgOTYgIDk3ICA5OCAgOTkgIDlBICA5QiAgOUMg
IDlEICA1RiAgOUYNCiAgQTAgIEExICBBMiAgQTMgIEE0ICBBNSAgQTYgIEE3
ICBBOCAgNTcgIEFBICBBQiAgQUMgIEFEICBBRSAgQUYNCiAgQjAgIEIxICBC
MiAgQjMgIEI0ICBCNSAgQjYgIEI3ICBCOCAgNTcgIEJBICBCQiAgQkMgIEJE
ICBCRSAgQkYNCiAgNDEgIDQxICA0MSAgNDEgIDVDICA1QiAgNDUgIDQzICA0
NCAgNDUgIDQ1ICA0NSAgNDkgIDQ5ICA0OSAgNDkNCiAgNDcgIDRFICA0RiAg
NEYgIDRGICA0RiAgNUQgIEQ3ICBEOCAgNTUgIDU1ICA1NSAgNTkgIDU5ICBE
RSAgREYNCiAgNDEgIDQxICA0MSAgNDEgIDVDICA1QiAgNDUgIDQzICA0NCAg
NDUgIDQ1ICA0NSAgNDkgIDQ5ICA0OSAgNDkNCiAgNDcgIDRFICA0RiAgNEYg
IDRGICA0RiAgNUQgIEY3ICBEOCAgNTUgIDU1ICA1NSAgNTkgIDU5ICBERSAg
RkYNCg==


--
MySQL Bugs Mailing List
For list archives: http://lists.mysql.com/bugs
To unsubscribe: http://lists.mysql.com/bugs?unsub=gcdmb-bugs@m.gmane.org

Re: Wrong sorting order in croat.conf (spouse all version)

on 25.03.2004 14:59:01 by Sinisa Milivojevic

Thank you for your contribution.

I have forwarded your corrections to our charset team to implement
them.

--

Sincerely,

--
For technical support contracts, go to https://order.mysql.com/?ref=msmi
   __  ___     ___ ____  __
  /  |/  /_ __/ __/ __ \/ /    Mr. Sinisa Milivojevic
 / /|_/ / // /\ \/ /_/ / /__   MySQL AB
/_/  /_/\_, /___/\___\_\___/   Full time Developer and Support Coordinator
       <___/   www.mysql.com   Larnaca, Cyprus

Meet the MySQL at User Conference ! (April 14-16, 2004)
http://www.mysql.com/uc2004/


Re: Wrong sorting order in croat.conf (spouse all version)

on 25.03.2004 16:18:39 by Sinisa Milivojevic

Hi!

After our charset experts analysed your contribution, they concluded
that your charset was created from Windows cp1250 and that you wanted
to reuse it for latin2, which is not doable.

A source of the mistake could be the conversion from 4.0 tables to 4.1,
where croat was renamed to latin2_croatian_ci instead of
cp1250_croatian_ci.

We shall fix that now.

Thank you for your contribution.


Re: Wrong sorting order in croat.conf (spouse all version)

on 26.03.2004 09:16:21 by Dubravko Penezic

>
> Hi!
>
> After our charset experts have analysed your contribution, they
> concluded that your charset was created from Windows cp1250 and you
> wanted it reused in latin2, which is not doable.

Hi!

No, that is not the case. I use MySQL versions 3 and 4, and all data is
stored strictly as ISO-8859-2. For more information, please see the
following URLs:

http://www.laus.hr/sistem/char-sets/grafem.html
http://www.open.hr/hiz/kodsus/primjena.html

All my databases and tables use the croat definition.

What you said is completely wrong, because Š/š (S with caron) is at
positions 8A and 9A in win-1250 and at positions A9 and B9 in ISO-8859-2.

That is what I implemented in croat.conf and sent with the bug report. Your
experts need to check and reread the ISO standard once again.

And finally, with those values changed, all data is ordered correctly.

>
> A source of mistake could be a conversion from 4.0 tables to 4.1,
> where croat was renamed as latin2_croatian_ci, instead of
> cp1250_croatian_ci.

It is not a mistake in the conversion!

It is a good renaming, because more than 85% of all MySQL installations
run on Linux/UNIX, so a name like latin2_croatian_ci is fine in
MySQL 4.1.

If your experts are still not sure what I am talking about, I will send
a complete table for testing.

Dubravko Penezic
ISA, Srce
University Computing Center
Zagreb, Croatia

Re: Wrong sorting order in croat.conf (spouse all version)

on 26.03.2004 14:25:55 by Sinisa Milivojevic

Hi!

This goes way beyond my charset knowledge, so I am involving the
people in charge.

On an unrelated note: if Hrvoje Niksic is still there, please give him
my regards.

Re: Wrong sorting order in croat.conf (spouse all version)

on 30.03.2004 16:06:27 by Alexander Barkov

Hello!

Sinisa Milivojevic wrote:
> Dubravko Penezic writes:
>
>>What you said is completely wrong, because Š/š (S with caron) is at
>>positions 8A and 9A in win-1250 and at positions A9 and B9 in ISO-8859-2.

Yes, I agree. The pages above state this too.
And this is what I wrote in my previous letter
to Sinisa:

> latin2:
>
> 0xA9 0x0160 #LATIN CAPITAL LETTER S WITH CARON
> 0xB9 0x0161 #LATIN SMALL LETTER S WITH CARON
>
> cp1250:
>
> 0x8A 0x0160 #LATIN CAPITAL LETTER S WITH CARON
> 0x9A 0x0161 #LATIN SMALL LETTER S WITH CARON
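
These byte positions can be cross-checked with Python's standard codecs. A small sketch, assuming Python's `iso8859_2` and `cp1250` codecs follow the same mappings as quoted above; the helper name `positions` is made up for this example, and the Z WITH CARON pair is included for comparison.

```python
import unicodedata

# Code points for S and Z with caron, capital and small.
letters = ["\u0160", "\u0161", "\u017D", "\u017E"]


def positions(ch):
    """Return the single-byte position of ch in latin2 and in cp1250."""
    return ch.encode("iso8859_2")[0], ch.encode("cp1250")[0]


for ch in letters:
    latin2, cp1250 = positions(ch)
    print(f"{unicodedata.name(ch):36s} latin2=0x{latin2:02X} cp1250=0x{cp1250:02X}")
```

The same letter lands on a different byte in each character set, which is why a single sort_order array cannot serve both.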

I'm 100% sure about these three facts:

1. "croat" was created for the cp1250 character set,
and it provides the correct sort order for the Croatian
language in cp1250.

2. "croat" was renamed to "latin2_croatian_ci" by mistake
in 4.1; it should have been "cp1250_croatian_ci" instead.

3. We have not had a Croatian sort order for the latin2 character
set in MySQL so far, and it should simply be added now.


Your version could be used as a template for latin2 Croatian,
but in my opinion the patch fixes only half of the problem.
An additional fix is required: CAPITAL LETTER Z WITH CARON must be put
at the same position as SMALL LETTER Z WITH CARON. At the moment they
are sorted differently.

Please confirm this. Or am I mistaken?
If I am, please send the complete table for testing.

Thank you.

--
For technical support contracts, visit https://order.mysql.com/
   __  ___     ___ ____  __
  /  |/  /_ __/ __/ __ \/ /    Mr. Alexander Barkov
 / /|_/ / // /\ \/ /_/ / /__   MySQL AB, Full-Time Developer
/_/  /_/\_, /___/\___\_\___/   Izhevsk, Russia
       <___/   www.mysql.com   +7-912-856-80-21


Re: Wrong sorting order in croat.conf (spouse all version)

on 30.03.2004 21:43:24 by Dubravko Penezic

Hi !

> I'm 100% sure about these three facts:
>
> 1. "croat" was created for the cp1250 character set,
> and it provides the correct sort order for the Croatian
> language in cp1250.

Wrong. Check the code page repository in your own source directory,
../sql/share/charsets/:

croat.conf
win1250.conf

What you are talking about is win1250, which is sometimes called cp1250.

The Croatian language has only one code page standard recommended by
law, and that is ISO Latin 2, i.e. ISO 8859-2.

CP1250, or win1250, or whatever Micro$oft would like to call their
"standard", is only a forced standard, and the situation has changed
radically in the last 5 years.

Also feel free to check the Linux/Unix implementations of Croatian
characters; you will find only latin2 under the ISO-8859-2 encoding scheme.

Also, a very simple ordering test will show you that croat.conf sorts
according to the ISO-8859-2 code page, except for the wrong Š/š position;
inserting characters with win1250/cp1250 codes will destroy that order.

> 2. "croat" was renamed to "latin2_croatian_ci" by mistake
> in 4.1; it should have been "cp1250_croatian_ci" instead.

Once again wrong, see above.

> 3. We have not had a Croatian sort order for the latin2 character
> set in MySQL so far, and it should simply be added now.

Partially true: you have one, but with the wrong Š/š position :)

> Your version could be used as a template for latin2 Croatian,
> but in my opinion the patch fixes only half of the problem.
> An additional fix is required: CAPITAL LETTER Z WITH CARON must be put
> at the same position as SMALL LETTER Z WITH CARON. At the moment they
> are sorted differently.

I will check tomorrow morning what I can do about the other part; I may
need some help with the meaning of the other parts of the table.

> Please confirm this. Or am I mistaken?
> If I am, please send the complete table for testing.

I will send a table for testing tomorrow.

Thank you too. I understand that people outside Croatia, and many inside,
don't understand what is going on, and the government does nothing to
bring order to the code page standards.

Dubravko Penezic

P.S.: I will also try to make a test table for testing ordering and
lower/upper case.
P.P.S.: We use one table which may help in understanding the code pages
that are in use in Croatia.
http://www.open.hr/hiz/kodsus/primjena.html


Re: Wrong sorting order in croat.conf (spouse all version)

on 31.03.2004 12:37:52 by Alexander Barkov

Hello!


Dubravko Penezic wrote:
> Hi !
>
>>I'm 100% sure about these three facts:
>>
>>1. "croat" was created for the cp1250 character set,
>>and it provides the correct sort order for the Croatian
>>language in cp1250.
>
>
> Wrong. Check the code page repository in your own source directory,
> ./sql/share/charsets/

Did you try croat.conf on a Windows machine?
Did it produce the wrong Croatian sort order?

> croat.conf
> win1250.conf
>
> What you are talking about is win1250, which is sometimes called cp1250.

Yes, it is fine for Croatian too (and for some other languages).
That means we had two cp1250+Croatian compatible
configurations and not a single one for latin2+Croatian.

I'm attaching a new sort order array for the latin2+Croatian
configuration. Could you please replace the old one with this
new array and test whether the sort order is fine?

There is also an HTML file attached; it demonstrates the
sort order in a clear manner.

Thank you!

Attachment: latin2-croat.txt

00 01 02 03 04 05 06 07 08 09 0A 0B 0C 0D 0E 0F
10 11 12 13 14 15 16 17 18 19 1A 1B 1C 1D 1E 1F
20 21 22 23 24 25 26 27 28 29 2A 2B 2C 2D 2E 2F
30 31 32 33 34 35 36 37 38 39 3A 3B 3C 3D 3E 3F
40 41 4C 4E 56 5C 66 68 6A 6C 72 74 76 7E 80 86
90 92 94 9A A3 A9 B3 B5 B7 B9 BD C5 C6 C7 C8 C9
CA 41 4C 4E 56 5C 66 68 6A 6C 72 74 76 7E 80 86
90 92 94 9A A3 A9 B3 B5 B7 B9 BD CB CC CD CE CF
D0 D1 D2 D3 D4 D5 D6 D7 D8 D9 DA DB DC DD DE DF
E0 E1 E2 E3 E4 E5 E6 E7 E8 E9 EA EB EC ED EE EF
F0 41 F1 76 F2 76 9A F3 F4 9C 9A A3 BD F5 BF BD
F6 41 F7 76 F8 76 9A F9 FA 9C 9A A3 BD FB BF BD
94 41 41 41 41 76 52 4E 50 5C 5C 5C 5C 6C 6C 56
58 80 80 86 86 86 86 FC 94 A9 A9 A9 A9 B9 A3 9A
94 41 41 41 41 76 52 4E 50 5C 5C 5C 5C 6C 6C 56
58 80 80 86 86 86 86 FD 94 A9 A9 A9 A9 B9 A3 FE
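
One way to try out an array like this without rebuilding the server is to use it as a sort key in a short script. The following is a sketch only: it assumes the array above is applied byte by byte to ISO-8859-2 encoded strings, the sample words are chosen purely for illustration, and `sort_key` is a name invented here, not a MySQL function.

```python
# Sort-order array copied from the latin2-croat.txt attachment above.
LATIN2_CROAT = """
00 01 02 03 04 05 06 07 08 09 0A 0B 0C 0D 0E 0F
10 11 12 13 14 15 16 17 18 19 1A 1B 1C 1D 1E 1F
20 21 22 23 24 25 26 27 28 29 2A 2B 2C 2D 2E 2F
30 31 32 33 34 35 36 37 38 39 3A 3B 3C 3D 3E 3F
40 41 4C 4E 56 5C 66 68 6A 6C 72 74 76 7E 80 86
90 92 94 9A A3 A9 B3 B5 B7 B9 BD C5 C6 C7 C8 C9
CA 41 4C 4E 56 5C 66 68 6A 6C 72 74 76 7E 80 86
90 92 94 9A A3 A9 B3 B5 B7 B9 BD CB CC CD CE CF
D0 D1 D2 D3 D4 D5 D6 D7 D8 D9 DA DB DC DD DE DF
E0 E1 E2 E3 E4 E5 E6 E7 E8 E9 EA EB EC ED EE EF
F0 41 F1 76 F2 76 9A F3 F4 9C 9A A3 BD F5 BF BD
F6 41 F7 76 F8 76 9A F9 FA 9C 9A A3 BD FB BF BD
94 41 41 41 41 76 52 4E 50 5C 5C 5C 5C 6C 6C 56
58 80 80 86 86 86 86 FC 94 A9 A9 A9 A9 B9 A3 9A
94 41 41 41 41 76 52 4E 50 5C 5C 5C 5C 6C 6C 56
58 80 80 86 86 86 86 FD 94 A9 A9 A9 A9 B9 A3 FE
"""

WEIGHTS = [int(token, 16) for token in LATIN2_CROAT.split()]
if len(WEIGHTS) != 256:
    raise ValueError("sort order array must have 256 elements")


def sort_key(word):
    """Replace each ISO-8859-2 byte of word with its weight from the array."""
    return bytes(WEIGHTS[b] for b in word.encode("iso8859_2"))


# Sample words picked for this example only.
words = ["zima", "šuma", "tata", "žaba", "suza", "čaj", "cesta", "ćup", "Šibenik"]

by_weight = sorted(words, key=sort_key)
by_raw_bytes = sorted(words, key=lambda w: w.encode("iso8859_2"))

print("with sort order array:", by_weight)
print("raw byte order:       ", by_raw_bytes)
```

With the array, words starting with Š/š fall between those starting with s and t, and ž follows z; in raw byte order all the accented letters end up after z.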



Re: Wrong sorting order in croat.conf (spouse all version)

on 31.03.2004 12:44:46 by Alexander Barkov


> There is also HTML file attached, it demonstrates the
> sort order in a clear manner.

Sorry, the attachment was malformed last time.
A better version is attached here.



Re: Wrong sorting order in croat.conf (spouse all version)

on 01.04.2004 12:13:23 by Dubravko Penezic

Hello!

I have thoroughly checked both sorting orders for Croatian characters, for
ISO 8859-2 and for CP 1250, and created new tables:

* Croatian sorting order according to Babic's "Hrvatski pravopis"
- characters with an ASCII code below 41 (hex) are sorted according to
their ASCII value
- upper-case and lower-case characters are treated as the same
- characters with additional elements that are not part of the Croatian
alphabet come after the same character without those elements
- every additional character after 41 (hex) that is not part of the
alphabet (special characters, signs) is sorted by its ASCII position and
comes after the last alphabet character
- ISO-8859-2 is mostly used on Unix/Linux platforms
- CP-1250 is implemented on the Microsoft Windows platform
- according to current Croatian law, ISO 8859-2 is the recommendation


# sort_order array (must have 256 elements) Croatian ISO-8859-2
00 01 02 03 04 05 06 07 08 09 0A 0B 0C 0D 0E 0F
10 11 12 13 14 15 16 17 18 19 1A 1B 1C 1D 1E 1F
20 21 22 23 24 25 26 27 28 29 2A 2B 2C 2D 2E 2F
30 31 32 33 34 35 36 37 38 39 3A 3B 3C 3D 3E 3F
40 41 43 44 48 4B 4D 4E 4F 50 52 53 54 56 57 59
5B 5C 5D 5F 62 64 66 67 68 69 6B 6E 6F 70 71 72
73 41 43 44 48 4B 4D 4E 4F 50 52 53 54 56 57 59
5B 5C 5D 5F 62 64 66 67 68 69 6B 7B 7C 7D 7E 7F
80 81 82 83 84 85 86 87 88 89 8A 8B 8C 8D 8E 8F
90 91 92 93 94 95 96 97 98 99 9A 9B 9C 9D 9E 9F
A0 42 A2 55 A4 55 60 A7 A8 61 60 63 6C AD 6D 6C
B0 42 B2 55 B4 55 60 B7 B8 61 60 63 6C BD 6D 6C
5E 42 42 42 42 55 47 45 46 4C 4C 4C 4C 51 51 49
4A 58 58 5A 5A 5A 5A D7 5E 65 65 65 65 6A 63 60
5E 42 42 42 42 55 47 45 46 4C 4C 4C 4C 51 51 49
4A 58 58 5A 5A 5A 5A F7 5E 65 65 65 65 6A 63 FF

# sort_order array (must have 256 elements) Croatian CP-1250
00 01 02 03 04 05 06 07 08 09 0A 0B 0C 0D 0E 0F
10 11 12 13 14 15 16 17 18 19 1A 1B 1C 1D 1E 1F
20 21 22 23 24 25 26 27 28 29 2A 2B 2C 2D 2E 2F
30 31 32 33 34 35 36 37 38 39 3A 3B 3C 3D 3E 3F
40 41 43 44 48 4B 4D 4E 4F 50 52 53 54 56 57 59
5B 5C 5D 5F 62 64 66 67 68 69 6B 6E 6F 70 71 72
73 41 43 44 48 4B 4D 4E 4F 50 52 53 54 56 57 59
5B 5C 5D 5F 62 64 66 67 68 69 6B 7B 7C 7D 7E 7F
80 81 82 83 84 85 86 87 88 89 61 8B 60 63 6D 6C
90 91 92 93 94 95 96 97 98 99 61 9B 60 63 6D 6C
A0 42 A2 55 A4 42 60 A7 A8 61 60 63 6C AD 6D 6C
B0 42 B2 55 B4 55 60 B7 B8 42 60 63 55 BD 55 6C
5E 42 42 42 42 55 47 45 46 4C 4C 4C 4C 51 51 49
4A 58 58 5A 5A 5A 5A D7 5E 65 65 65 65 6A 63 60
5E 42 42 42 42 55 47 45 46 4C 4C 4C 4C 51 51 49
4A 58 58 5A 5A 5A 5A F7 5E 65 65 65 65 6A 63 FF
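
Two of the properties stated above (256 elements, and upper and lower case sharing one position) can be checked mechanically. The sketch below does this for the ISO-8859-2 array only; `parse_sort_order` and `case_mismatches` are helper names invented for this example, and the upper/lower pairing relies on Python's own case mapping and `iso8859_2` codec, which is an assumption about what counts as a case pair.

```python
# ISO-8859-2 array copied from the table above.
ISO_8859_2_CROATIAN = """
00 01 02 03 04 05 06 07 08 09 0A 0B 0C 0D 0E 0F
10 11 12 13 14 15 16 17 18 19 1A 1B 1C 1D 1E 1F
20 21 22 23 24 25 26 27 28 29 2A 2B 2C 2D 2E 2F
30 31 32 33 34 35 36 37 38 39 3A 3B 3C 3D 3E 3F
40 41 43 44 48 4B 4D 4E 4F 50 52 53 54 56 57 59
5B 5C 5D 5F 62 64 66 67 68 69 6B 6E 6F 70 71 72
73 41 43 44 48 4B 4D 4E 4F 50 52 53 54 56 57 59
5B 5C 5D 5F 62 64 66 67 68 69 6B 7B 7C 7D 7E 7F
80 81 82 83 84 85 86 87 88 89 8A 8B 8C 8D 8E 8F
90 91 92 93 94 95 96 97 98 99 9A 9B 9C 9D 9E 9F
A0 42 A2 55 A4 55 60 A7 A8 61 60 63 6C AD 6D 6C
B0 42 B2 55 B4 55 60 B7 B8 61 60 63 6C BD 6D 6C
5E 42 42 42 42 55 47 45 46 4C 4C 4C 4C 51 51 49
4A 58 58 5A 5A 5A 5A D7 5E 65 65 65 65 6A 63 60
5E 42 42 42 42 55 47 45 46 4C 4C 4C 4C 51 51 49
4A 58 58 5A 5A 5A 5A F7 5E 65 65 65 65 6A 63 FF
"""


def parse_sort_order(text):
    """Parse whitespace-separated hex values and enforce the 256-element rule."""
    weights = [int(token, 16) for token in text.split()]
    if len(weights) != 256:
        raise ValueError(f"expected 256 elements, got {len(weights)}")
    return weights


def case_mismatches(weights, encoding):
    """Return (upper_byte, lower_byte) pairs whose weights differ."""
    mismatches = []
    for upper_byte in range(256):
        try:
            ch = bytes([upper_byte]).decode(encoding)
            lower = ch.lower()
            if lower == ch or len(lower) != 1:
                continue  # not an upper-case letter with a single lower-case form
            lower_byte = lower.encode(encoding)[0]
        except UnicodeError:
            continue  # byte undefined in this charset, or lower case not encodable
        if weights[upper_byte] != weights[lower_byte]:
            mismatches.append((upper_byte, lower_byte))
    return mismatches


iso_weights = parse_sort_order(ISO_8859_2_CROATIAN)
print("case mismatches:", case_mismatches(iso_weights, "iso8859_2"))
```

An empty list means every upper/lower pair found this way shares one position; running the same check on an untouched identity table reports every letter pair, including Ž/ž at AE/BE.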


Where can I find the *_ci files? Are they in the 4.1.x or 5.0.x source?

Dubravko Penezic
ISA, SRCE
University Computing Center
Zagreb, Croatia
