Adapt collation in th_TH locale to use the iso14651_t1_common file and sync the collation with CLDR - glibc.git - Mirror of https://sourceware.org/git/glibc.git

diff options

author	Mike FABIAN <mfabian@redhat.com>	2023-06-01 17:02:44 +0200
committer	Mike FABIAN <mfabian@redhat.com>	2023-09-21 10:34:35 +0200
commit	aceda10bd5131cf716830827d66da9c671dec649 (patch)
tree	85fce5090f9abd310d3a9e0f0e9748d729f98ac8 /sysdeps/unix/sysv/linux/fstat64.c
parent	672b91ba1060887aa8897d0b98af83b96d4a52b0 (diff)
download	glibc-aceda10bd5131cf716830827d66da9c671dec649.tar glibc-aceda10bd5131cf716830827d66da9c671dec649.tar.gz glibc-aceda10bd5131cf716830827d66da9c671dec649.tar.bz2 glibc-aceda10bd5131cf716830827d66da9c671dec649.zip

Adapt collation in th_TH locale to use the iso14651_t1_common file and sync the collation with CLDR

I made it to agree as much as possible with the rules from CLDR (see: https://github.com/unicode-org/cldr/blob/main/common/collation/th.xml). It seems to be impossible to follow the CLDR rules &[before 1]๚<ฯ # should be "variable" and &๛<ๆ # should be "variable" exactly though. These ask for a primary difference in punctuation characters whose primary weight should be "IGNORE". But using a secondary differnence instead still sorts the test data correctly and the previously used collation in th_TH used tertiary differences for these characters. There was old localedata/th_TH.in test data in TIS-620 encoding which was not used (it was not in the localedata/Makefile). I converted this to UTF-8 and moved it to localedata/th_TH.UTF-8.in and added it to localedata/Makefile. Using the existing collation rules in the th_TH locale did not sort that test file completely correct, I think my new collation rules based on iso14651_t1 are better.

Diffstat (limited to 'sysdeps/unix/sysv/linux/fstat64.c')

0 files changed, 0 insertions, 0 deletions


context:
space:
mode: