author | Mike FABIAN <mfabian@redhat.com> | 2017-12-11 18:26:22 +0100 |
---|---|---|
committer | Mike FABIAN <mfabian@redhat.com> | 2018-02-27 17:47:50 +0100 |
commit | 159738548130d5ac4fe6178977e940ed5f8cfdc4 (patch) | |
tree | 03f90b90e7bb794cfdbd4b3e66c9fff7ad6a9b24 /localedata/locales/ps_AF | |
parent | ce6636b06b67d6bb9b3d6927bf2a926b9b7478f5 (diff) | |
Adapt collation in several locales to the new iso14651_t1_common file
[BZ #22550] - es_ES locale (and other es_* locales): collation should
treat ñ as a primary different character, sync the collation
for Spanish with CLDR
[BZ #21547] - Tibetan script collation broken (Dzongkha and Tibetan)
* localedata/Makefile: Add new test files.
* localedata/lv_LV.UTF-8.in: Adapt test file to new collation order.
* localedata/sv_SE.ISO-8859-1.in: Adapt test file to new collation order.
* localedata/uk_UA.UTF-8.in: Adapt test file to new collation order.
* localedata/am_ET.UTF-8.in: New test file.
* localedata/az_AZ.UTF-8.in: Likewise.
* localedata/be_BY.UTF-8.in: Likewise.
* localedata/ber_DZ.UTF-8.in: Likewise.
* localedata/ber_MA.UTF-8.in: Likewise.
* localedata/bg_BG.UTF-8.in: Likewise.
* localedata/br_FR.UTF-8.in: Likewise.
* localedata/cmn_TW.UTF-8.in: Likewise.
* localedata/crh_UA.UTF-8.in: Likewise.
* localedata/csb_PL.UTF-8.in: Likewise.
* localedata/cv_RU.UTF-8.in: Likewise.
* localedata/cy_GB.UTF-8.in: Likewise.
* localedata/dz_BT.UTF-8.in: Likewise.
* localedata/eo.UTF-8.in: Likewise.
* localedata/es_ES.UTF-8.in: Likewise.
* localedata/fa_IR.UTF-8.in: Likewise.
* localedata/fi_FI.UTF-8.in: Likewise.
* localedata/fil_PH.UTF-8.in: Likewise.
* localedata/fur_IT.UTF-8.in: Likewise.
* localedata/gez_ER.UTF-8@abegede.in: Likewise.
* localedata/ha_NG.UTF-8.in: Likewise.
* localedata/ig_NG.UTF-8.in: Likewise.
* localedata/ik_CA.UTF-8.in: Likewise.
* localedata/kk_KZ.UTF-8.in: Likewise.
* localedata/ku_TR.UTF-8.in: Likewise.
* localedata/ky_KG.UTF-8.in: Likewise.
* localedata/ln_CD.UTF-8.in: Likewise.
* localedata/mi_NZ.UTF-8.in: Likewise.
* localedata/ml_IN.UTF-8.in: Likewise.
* localedata/mn_MN.UTF-8.in: Likewise.
* localedata/mr_IN.UTF-8.in: Likewise.
* localedata/mt_MT.UTF-8.in: Likewise.
* localedata/nb_NO.UTF-8.in: Likewise.
* localedata/om_KE.UTF-8.in: Likewise.
* localedata/os_RU.UTF-8.in: Likewise.
* localedata/ps_AF.UTF-8.in: Likewise.
* localedata/ro_RO.UTF-8.in: Likewise.
* localedata/ru_RU.UTF-8.in: Likewise.
* localedata/sc_IT.UTF-8.in: Likewise.
* localedata/se_NO.UTF-8.in: Likewise.
* localedata/sq_AL.UTF-8.in: Likewise.
* localedata/sv_SE.UTF-8.in: Likewise.
* localedata/szl_PL.UTF-8.in: Likewise.
* localedata/tg_TJ.UTF-8.in: Likewise.
* localedata/tk_TM.UTF-8.in: Likewise.
* localedata/tt_RU.UTF-8.in: Likewise.
* localedata/tt_RU.UTF-8@iqtelif.in: Likewise.
* localedata/ug_CN.UTF-8.in: Likewise.
* localedata/uz_UZ.UTF-8.in: Likewise.
* localedata/vi_VN.UTF-8.in: Likewise.
* localedata/yi_US.UTF-8.in: Likewise.
* localedata/yo_NG.UTF-8.in: Likewise.
* localedata/zh_CN.UTF-8.in: Likewise.
* localedata/locales/am_ET: Adapt collation rules to new iso14651_t1_common
file and fix bugs in the collation.
* localedata/locales/az_AZ: Likewise.
* localedata/locales/be_BY: Likewise.
* localedata/locales/ber_DZ: Likewise.
* localedata/locales/ber_MA: Likewise.
* localedata/locales/bg_BG: Likewise.
* localedata/locales/br_FR: Likewise.
* localedata/locales/br_FR@euro: Likewise.
* localedata/locales/ca_ES: Likewise.
* localedata/locales/cns11643_stroke: Likewise.
* localedata/locales/crh_UA: Likewise.
* localedata/locales/cs_CZ: Likewise.
* localedata/locales/csb_PL: Likewise.
* localedata/locales/cv_RU: Likewise.
* localedata/locales/cy_GB: Likewise.
* localedata/locales/da_DK: Likewise.
* localedata/locales/dz_BT: Likewise.
* localedata/locales/en_CA: Likewise.
* localedata/locales/eo: Likewise.
* localedata/locales/es_CU: Likewise.
* localedata/locales/es_EC: Likewise.
* localedata/locales/es_ES: Likewise.
* localedata/locales/es_US: Likewise.
* localedata/locales/et_EE: Likewise.
* localedata/locales/fa_IR: Likewise.
* localedata/locales/fi_FI: Likewise.
* localedata/locales/fil_PH: Likewise.
* localedata/locales/fur_IT: Likewise.
* localedata/locales/gez_ER@abegede: Likewise.
* localedata/locales/ha_NG: Likewise.
* localedata/locales/hr_HR: Likewise.
* localedata/locales/hsb_DE: Likewise.
* localedata/locales/hu_HU: Likewise.
* localedata/locales/ig_NG: Likewise.
* localedata/locales/ik_CA: Likewise.
* localedata/locales/is_IS: Likewise.
* localedata/locales/iso14651_t1_pinyin: Likewise.
* localedata/locales/kk_KZ: Likewise.
* localedata/locales/ku_TR: Likewise.
* localedata/locales/ky_KG: Likewise.
* localedata/locales/ln_CD: Likewise.
* localedata/locales/lt_LT: Likewise.
* localedata/locales/lv_LV: Likewise.
* localedata/locales/mi_NZ: Likewise.
* localedata/locales/ml_IN: Likewise.
* localedata/locales/mn_MN: Likewise.
* localedata/locales/mr_IN: Likewise.
* localedata/locales/mt_MT: Likewise.
* localedata/locales/nb_NO: Likewise.
* localedata/locales/om_KE: Likewise.
* localedata/locales/os_RU: Likewise.
* localedata/locales/pl_PL: Likewise.
* localedata/locales/ps_AF: Likewise.
* localedata/locales/ro_RO: Likewise.
* localedata/locales/ru_RU: Likewise.
* localedata/locales/ru_UA: Likewise.
* localedata/locales/sc_IT: Likewise.
* localedata/locales/se_NO: Likewise.
* localedata/locales/si_LK: Likewise.
* localedata/locales/sq_AL: Likewise.
* localedata/locales/sv_FI: Likewise.
* localedata/locales/sv_FI@euro: Likewise.
* localedata/locales/sv_SE: Likewise.
* localedata/locales/szl_PL: Likewise.
* localedata/locales/tg_TJ: Likewise.
* localedata/locales/ti_ER: Likewise.
* localedata/locales/tk_TM: Likewise.
* localedata/locales/tl_PH: Likewise.
* localedata/locales/tr_TR: Likewise.
* localedata/locales/tt_RU: Likewise.
* localedata/locales/tt_RU@iqtelif: Likewise.
* localedata/locales/ug_CN: Likewise.
* localedata/locales/uk_UA: Likewise.
* localedata/locales/uz_UZ: Likewise.
* localedata/locales/uz_UZ@cyrillic: Likewise.
* localedata/locales/vi_VN: Likewise.
* localedata/locales/yi_US: Likewise.
* localedata/locales/yo_NG: Likewise.
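BZ #22550 above asks that ñ differ from n at the primary level. The following is a minimal sketch of what that means for word order, assuming a toy two-level key with made-up weights. `BASE_ORDER` and `make_key` are hypothetical names, and the "secondary" variant is only a contrast case, not a statement about the exact weights glibc used before this commit.

```python
# Toy primary order for unaccented letters; these are NOT glibc's real weights.
BASE_ORDER = "abcdefghijklmnopqrstuvwxyz"


def make_key(enye_is_primary):
    """Build a two-level sort key function for lowercase Spanish words."""

    def key(word):
        primary, secondary = [], []
        for ch in word:
            if ch == "ñ":
                if enye_is_primary:
                    # Own letter: sorts strictly between n and o.
                    primary.append(BASE_ORDER.index("n") + 0.5)
                    secondary.append(0)
                else:
                    # Same primary weight as n, told apart only at level 2.
                    primary.append(BASE_ORDER.index("n"))
                    secondary.append(1)
            else:
                primary.append(BASE_ORDER.index(ch))
                secondary.append(0)
        # All primary weights are compared before any secondary weight.
        return (primary, secondary)

    return key


words = ["caña", "cano", "cana"]
secondary_order = sorted(words, key=make_key(False))
primary_order = sorted(words, key=make_key(True))
print(secondary_order)
print(primary_order)
```

With a primary difference, "caña" sorts after "cano" because ñ outranks n at the third letter. With only a secondary difference, the later o in "cano" decides first and "caña" stays before it.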
Diffstat (limited to 'localedata/locales/ps_AF')
-rw-r--r-- | localedata/locales/ps_AF | 224 |
1 files changed, 142 insertions, 82 deletions
diff --git a/localedata/locales/ps_AF b/localedata/locales/ps_AF
index c20419db11..66f560ef44 100644
--- a/localedata/locales/ps_AF
+++ b/localedata/locales/ps_AF
@@ -56,93 +56,153 @@ map to_outpunct; (<U002E>,<U066B>);(<U002C>,<U066C>)
 END LC_CTYPE
 LC_COLLATE
-% Collation rules updated as per requirement of glibc by Pravin Satpute <psatpute@redhat.com>
-% see rh bug 482881
+% CLDR collation rules for Pashto:
+% (see: https://unicode.org/cldr/trac/browser/trunk/common/collation/ps.xml)
+%
+% <collation type="standard">
+% <cr><![CDATA[
+% [normalization on]
+% [reorder Arab]
+% &َ<<ِ<<ُ<<ً<<ٍ<<ٌ
+% &[before 1]ا<آ
+% &ا<<أ<<ٲ<<ٱ<<إ<<ٳ<ء
+% &ت<ټ<<ٹ
+% &ج<ځ<<حٔ
+% &چ<څ
+% &د<ډ<<ڈ
+% &ر<ړ<<ڑ
+% &ژ<ږ
+% &ک<<*ڪك
+% &ګ<<گ
+% &ڼ<<ڻ
+% &و<<ؤ<<ۇ<<ۉ
+% &ه<<<ۀ<<<هٔ<<*ەہھةۃ
+% &ی<<*ىےيېۍ<<یٔ<<<ىٔ<<<ئ
+% &\u00A0<<\u200C<<\u200D
+% ]]></cr>
+% </collation>
+%
+% And CLDR also lists the following
+% index characters:
+% (see: https://unicode.org/cldr/trac/browser/trunk/common/main/ps.xml)
+%
+% <exemplarCharacters type="index" draft="unconfirmed">[آ ا ء ب پ ت ټ ث ج ځ چ څ ح خ د ډ ذ ر ړ ز ژ ږ س ش ښ ص ض ط ظ ع غ ف ق ک ګ ل م ن ڼ ه و ی]</exemplarCharacters>
+%
+% The following rules implement the same order for glibc.
 copy "iso14651_t1"
-collating-symbol <teh_with_ring>
-collating-symbol <hah_with_hamza_above>
-collating-symbol <hah_with_three_dots>
-collating-symbol <dal_with_ring>
-collating-symbol <reh_with_ring>
-collating-symbol <reh_with_dot_below_and_above>
-collating-symbol <seen_with_dot_below_and_above>
-collating-symbol <kaaf_with_ring>
-collating-symbol <noon_with_ring>
-collating-symbol <farsi_yeh>
-collating-symbol <arabic_letter_e>
-collating-symbol <yeh_with_tail>
-
-reorder-after <teh>
-<teh_with_ring>
-
-reorder-after <jeem>
-<hah_with_hamza_above>
-
-reorder-after <tcheh>
-<hah_with_three_dots>
-
-reorder-after <dal>
-<dal_with_ring>
-
-reorder-after <reh>
-<reh_with_ring>
-
-reorder-after <jeh>
-<reh_with_dot_below_and_above>
-
-reorder-after <sheen>
-<seen_with_dot_below_and_above>
-
-reorder-after <keheh>
-<kaaf_with_ring>
-
-reorder-after <noon>
-<noon_with_ring>
-
-reorder-after <heh>
-<farsi_yeh>
+collating-element <arabic-letter-heh-with-hamza-above> from "<U0647><U0654>" % هٔ
+collating-element <arabic-letter-farsi-yeh-with-hamza-above> from "<U06CC><U0654>" % یٔ
+collating-element <arabic-letter-alef-maksura-with-hamza-above> from "<U0649><U0654>" % ىٔ
-reorder-after <alef_maksura>
-<arabic_letter_e>
-<yeh_with_tail>
+% &َ<<ِ<<ُ<<ً<<ٍ<<ٌ
+reorder-after <FATHA>
+<KASRA>
+<DAMMA>
+<FATHATAN>
+<KASRATAN>
+<DAMMATAN>
+% &[before 1]ا<آ
+reorder-after <S08AC> % ARABIC LETTER ROHINGYA YEH
+<S0622> % ARABIC LETTER ALEF WITH MADDA ABOVE
-reorder-after <U062A>
-<U067C> <teh_with_ring>;<BAS>;<MIN>;IGNORE
-
-reorder-after <U062C>
-<U0681> <hah_with_hamza_above>;<BAS>;<MIN>;IGNORE
-
-reorder-after <U0686>
-<U0685> <hah_with_three_dots>;<BAS>;<MIN>;IGNORE
-
-reorder-after <U062F>
-<U0689> <dal_with_ring>;<BAS>;<MIN>;IGNORE
-
-reorder-after <U0631>
-<U0693> <reh_with_ring>;<BAS>;<MIN>;IGNORE
-
-reorder-after <U0698>
-<U0696> <reh_with_dot_below_and_above>;<BAS>;<MIN>;IGNORE
-
-reorder-after <U0634>
-<U069A> <seen_with_dot_below_and_above>;<BAS>;<MIN>;IGNORE
-
-reorder-after <U06A9>
-<U06AB> <kaaf_with_ring>;<BAS>;<MIN>;IGNORE
-
-reorder-after <U0646>
-<U06BC> <noon_with_ring>;<BAS>;<MIN>;IGNORE
-
-reorder-after <U0648>
-<U0647> <heh>;<BAS>;<MIN>;IGNORE
-<U06CC> <farsi_yeh>;<BAS>;<MIN>;IGNORE
-
-reorder-after <U064A>
-<U06D0> <arabic_letter_e>;<AYE>;<MIN>;IGNORE
-<U0626> "<alef_maksura><hamza>";"<BAS><BAS>";"<MIN><MIN>";IGNORE
-<U06CD> <yeh_with_tail>;<AYE>;<MIN>;IGNORE
+% &ا<<أ<<ٲ<<ٱ<<إ<<ٳ<ء
+% Entry in iso14651_t1_common:
+%
+% <U0627> <S0627>;<BASE>;<MIN>;<U0627> % ARABIC LETTER ALEF
+%
+<U0623> <S0627>;"<BASE><VRNT1>";<MIN>;<U0623> % ARABIC LETTER ALEF WITH HAMZA ABOVE
+<U0672> <S0627>;"<BASE><VRNT2>";<MIN>;<U0672> % ARABIC LETTER ALEF WITH WAVY HAMZA ABOVE
+<U0671> <S0627>;"<BASE><VRNT3>";<MIN>;<U0671> % ARABIC LETTER ALEF WASLA
+<U0625> <S0627>;"<BASE><VRNT4>";<MIN>;<U0625> % ARABIC LETTER ALEF WITH HAMZA BELOW
+<U0673> <S0627>;"<BASE><VRNT5>";<MIN>;<U0673> % ARABIC LETTER ALEF WITH WAVY HAMZA BELOW
+
+reorder-after <S0627> % ARABIC LETTER ALEF
+<S0621> % ARABIC LETTER HAMZA
+
+% &ت<ټ<<ٹ
+reorder-after <S062A> % ARABIC LETTER TEH
+<S067C> % ARABIC LETTER TEH WITH RING
+
+<U067C> <S067C>;"<BASE><BASE>";<MIN>;<U067C> % ARABIC LETTER TEH WITH RING
+<U0679> <S067C>;"<BASE><VRNT1>";<MIN>;<U0679> % ARABIC LETTER TTEH
+
+% &ج<ځ<<حٔ
+reorder-after <S062C> % ARABIC LETTER JEEM
+<S0681> % ARABIC LETTER HAH WITH HAMZA ABOVE
+
+<U0681> <S0681>;"<BASE><BASE>";<MIN>;<U0681> % ARABIC LETTER HAH WITH HAMZA ABOVE
+<U062D> <S0681>;"<BASE><VRNT1>";<MIN>;<U062D> % ARABIC LETTER HAH
+
+% &چ<څ
+reorder-after <S0686> % ARABIC LETTER TCHEH
+<S0685> % ARABIC LETTER HAH WITH THREE DOTS ABOVE
+
+% &د<ډ<<ڈ
+reorder-after <S062F> % ARABIC LETTER DAL
+<S0689> % ARABIC LETTER DAL WITH RING
+
+<U0689> <S0689>;"<BASE><BASE>";<MIN>;<U0689> % ARABIC LETTER DAL WITH RING
+<U0688> <S0689>;"<BASE><VRNT1>";<MIN>;<U0688> % ARABIC LETTER DDAL
+
+% &ر<ړ<<ڑ
+reorder-after <S0631> % ARABIC LETTER REH
+
+<U0693> <S0693>;"<BASE><BASE>";<MIN>;<U0693> % ARABIC LETTER REH WITH RING
+<U0691> <S0693>;"<BASE><VRNT1>";<MIN>;<U0691> % ARABIC LETTER RREH
+
+% &ژ<ږ
+reorder-after <S0698> % ARABIC LETTER JEH
+<S0696> % ARABIC LETTER REH WITH DOT BELOW AND DOT ABOVE
+
+% &ک<<*ڪك
+<U06A9> <S06A9>;"<BASE><BASE>";<MIN>;<U06A9> % ARABIC LETTER KEHEH
+<U06AA> <S06A9>;"<BASE><VRNT1>";<MIN>;<U06AA> % ARABIC LETTER SWASH KAF
+<U0643> <S06A9>;"<BASE><VRNT2>";<MIN>;<U0643> % ARABIC LETTER KAF
+
+% &ګ<<گ
+<U06AB> <S06AB>;"<BASE><BASE>";<MIN>;<U06AB> % ARABIC LETTER KAF WITH RING
+<U06AF> <S06AB>;"<BASE><VRNT1>";<MIN>;<U06AF> % ARABIC LETTER GAF
+
+% &ڼ<<ڻ
+<U06BC> <S06BC>;"<BASE><BASE>";<MIN>;<U06BC> % ARABIC LETTER NOON WITH RING
+<U06BB> <S06BC>;"<BASE><VRNT1>";<MIN>;<U06BB> % ARABIC LETTER RNOON
+
+% &و<<ؤ<<ۇ<<ۉ
+<U0648> <S0648>;"<BASE><BASE>";<MIN>;<U0648> % ARABIC LETTER WAW
+<U0624> <S0648>;"<BASE><VRNT1>";<MIN>;<U0624> % ARABIC LETTER WAW WITH HAMZA ABOVE
+<U06C7> <S0648>;"<BASE><VRNT2>";<MIN>;<U06C7> % ARABIC LETTER U
+<U06C9> <S0648>;"<BASE><VRNT3>";<MIN>;<U06C9> % ARABIC LETTER KIRGHIZ YU
+
+% &ه<<<ۀ<<<هٔ<<*ەہھةۃ
+<U0647> <S0647>;"<BASE><BASE>";"<MIN><VRNT1>";<U0647> % ARABIC LETTER HEH
+<U06C0> <S0647>;"<BASE><BASE>";"<MIN><VRNT2>";<U06C0> % ARABIC LETTER HEH WITH YEH ABOVE
+<arabic-letter-heh-with-hamza-above> <S0647>;"<BASE><BASE>";"<MIN><VRNT3>";"<U0647><U0654>" % ARABIC LETTER HEH WITH HAMZA ABOVE
+
+<U06D5> <S0647>;"<BASE><VRNT1>";<MIN>;<U06D5> % ARABIC LETTER AE
+<U06C1> <S0647>;"<BASE><VRNT2>";<MIN>;<U06C1> % ARABIC LETTER HEH GOAL
+<U06BE> <S0647>;"<BASE><VRNT3>";<MIN>;<U06BE> % ARABIC LETTER HEH DOACHASHMEE
+<U0629> <S0647>;"<BASE><VRNT4>";<MIN>;<U0629> % ARABIC LETTER TEH MARBUTA
+<U06C3> <S0647>;"<BASE><VRNT5>";<MIN>;<U06C3> % ARABIC LETTER TEH MARBUTA GOAL
+
+% &ی<<*ىےيېۍ<<یٔ<<<ىٔ<<<ئ
+% Entry in iso14651_t1_common:
+%
+% <U06CC> <S06CC>;<BASE>;<MIN>;<U06CC> % ARABIC LETTER FARSI YEH
+<U0649> <S06CC>;"<BASE><VRNT1><VRNT1>";<MIN>;<U0649> % ARABIC LETTER ALEF MAKSURA
+<U06D2> <S06CC>;"<BASE><VRNT1><VRNT2>";<MIN>;<U06D2> % ARABIC LETTER YEH BARREE
+<U064A> <S06CC>;"<BASE><VRNT1><VRNT3>";<MIN>;<U064A> % ARABIC LETTER YEH
+<U06D0> <S06CC>;"<BASE><VRNT1><VRNT4>";<MIN>;<U06D0> % ARABIC LETTER E
+<U06CD> <S06CC>;"<BASE><VRNT2><VRNT1>";<MIN>;<U06CD> % ARABIC LETTER YEH WITH TAIL
+<arabic-letter-farsi-yeh-with-hamza-above> <S06CC>;"<BASE><VRNT2><VRNT2>";<MIN>;<U06CC>
+<arabic-letter-alef-maksura-with-hamza-above> <S06CC>;"<BASE><VRNT2><VRNT2>";<CAP>;<U0649>
+<U0626> <S06CC>;"<BASE><VRNT2><VRNT2>";<COMPATCAP>;<U0626> % ARABIC LETTER YEH WITH HAMZA ABOVE
+
+% &\u00A0<<\u200C<<\u200D
+<U00A0> IGNORE;"<VRNT1>";IGNORE;<U00A0> % NO-BREAK SPACE
+<U200C> IGNORE;"<VRNT2>";IGNORE;<U200C> % ZERO WIDTH NON-JOINER
+<U200D> IGNORE;"<VRNT3>";IGNORE;<U200D> % ZERO WIDTH JOINER
 reorder-end
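The weight lines in the ps_AF hunk can be read as per-level sort keys. The following is a rough sketch of how the CLDR rule `&ت<ټ<<ٹ` falls out of them. It makes three assumptions. The TEH entry is not part of this diff and is modeled on the ALEF entry quoted in the hunk's comment. The relative order `BASE < VRNT1 < VRNT2` is assumed rather than taken from iso14651_t1_common. The helper names `parse_weight_line` and `sort_key` are hypothetical. Only the first two levels are compared.

```python
import re

WEIGHT_LINES = [
    # Assumed shape of the unchanged TEH entry in iso14651_t1_common.
    '<U062A> <S062A>;<BASE>;<MIN>;<U062A>',
    # Copied from the ps_AF hunk (trailing comments stripped).
    '<U067C> <S067C>;"<BASE><BASE>";<MIN>;<U067C>',
    '<U0679> <S067C>;"<BASE><VRNT1>";<MIN>;<U0679>',
]

# "reorder-after <S062A>" places <S067C> directly after TEH at level 1.
PRIMARY_ORDER = ["S062A", "S067C"]
# Assumed relative order of the secondary symbols.
SECONDARY_ORDER = ["BASE", "VRNT1", "VRNT2"]

SYMBOL = re.compile(r"<([^<>]+)>")


def parse_weight_line(line):
    """Split '<Uxxxx> w1;w2;w3;w4' into (character, list of symbol lists)."""
    char_field, weights = line.split(None, 1)
    code_point = int(SYMBOL.fullmatch(char_field).group(1)[1:], 16)
    levels = [SYMBOL.findall(level) for level in weights.split(";")]
    return chr(code_point), levels


TABLE = dict(parse_weight_line(line) for line in WEIGHT_LINES)


def sort_key(char):
    """Rank a single character by its primary, then secondary weights."""
    levels = TABLE[char]
    primary = [PRIMARY_ORDER.index(symbol) for symbol in levels[0]]
    secondary = [SECONDARY_ORDER.index(symbol) for symbol in levels[1]]
    return (primary, secondary)


# TTEH, TEH, TEH WITH RING in scrambled order.
ordered = sorted(["\u0679", "\u062a", "\u067c"], key=sort_key)
print([f"U+{ord(c):04X}" for c in ordered])
```

TEH WITH RING gets its own primary symbol and so follows TEH as a separate letter. TTEH shares that primary symbol and differs only by `<VRNT1>` at the second level, which is what the `<<` in the CLDR rule expresses.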