about summary refs log tree commit diff
path: root/localedata
Commit message (Collapse)AuthorAgeFilesLines
* Add new codepage charmaps/IBM858 [BZ #21084]Mike FABIAN2017-09-141-0/+281
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | This code page is identical to code page 850 except that X'D5' has been changed from LI61 (dotless i) to SC20 (euro symbol). The code points from /x01 to /x1f in the /localedata/charmaps/IBM858 file have the same mapping as those in localedata/charmaps/ANSI_X3.4-1968. That means they disagree with with ftp://ftp.software.ibm.com/software/globalization/gcoc/attachments/CP00858.txt in that range. For example, localedata/charmaps/IBM858 and localedata/charmaps/ANSI_X3.4-1968 have: “<U0001> /x01 START OF HEADING (SOH)” whereas CP00858.txt has: “01 SS000000 Smiling Face” That means that CP00858.txt is not really ASCII-compatible and to make it ASCII-compatible we deviate fro CP00858.txt in the code points from /x01 to /x1f. [BZ #21084] * benchtests/strcoll-inputs/filelist#en_US.UTF-8: Add IBM858 and ibm858.c. * iconvdata/Makefile: Add IBM858. * iconvdata/gconv-modules: Add IBM858. * iconvdata/ibm858.c: New file. * iconvdata/tst-tables.sh: Add IBM858 * localedata/charmaps/IBM858: New file.
* Removed redundant data (LC_TIME and LC_MESSAGES) for niu_NZ [BZ #22023]Akhilesh Kumar2017-09-141-78/+2
| | | | | | [BZ #22023] * locales/niu_NZ (LC_TIME): copy "niu_NU". * locales/niu_NZ (LC_MESSAGES): copy "niu_NU".
* Fix LC_TELEPHONE for az_AZ [BZ #22112]Akhilesh Kumar2017-09-141-4/+3
| | | | | | [BZ #22112] * localedata/locales/az_AZ(LC_TELEPHONE): Fix int_select and add tel_int_fmt.
* Finish change from “Bengali” to “Bangla”Mike FABIAN2017-09-091-4/+2
| | | | | | | | | | | | “Bengali” still remained in some comments in the bn_BD locale file, in iso-639.def and in a test input file. Change it there as well. “Bangla” is now used as the English name for this language in CLDR. [BZ #14925] * libio/tst-widetext.input: Change “Bengali” to “Bangla”. * locale/iso-639.def: Change “Bengali” to “Bangla”. * localedata/locales/bn_BD: “Bengali” was still used in some comments. Change it to “Bangla”.
* Use ASCII in LC_TIME of om_ET for better readabilityMike FABIAN2017-09-071-56/+44
| | | | * localedata/locales/om_ET (LC_TIME): use ASCII for better readability.
* Remove redundant LC_TIME data in om_KE which is identical to the data in om_ETAkhilesh Kumar2017-09-071-83/+1
| | | | | [BZ #22100] * locales/om_KE (LC_TIME): copy "om_ET".
* Improve utf8_gen.py to set the width for characters with ↵Mike FABIAN2017-09-064-13/+1652
| | | | | | | | | Prepended_Concatenation_Mark property to 1 [BZ #22070] * localedata/unicode-gen/utf8_gen.py: Set the width for characters with Prepended_Concatenation_Mark property to 1 * localedata/charmaps/UTF-8: Updated using the improved script.
* Write all ranges of neighbouring characters with the same width using the ↵Mike FABIAN2017-09-061-13/+38
| | | | | | | | | | | | | | | | | | | range notation in charmaps/UTF-8 Writing ranges of neighbouring characters with the same with like this <U000E0100>...<U000E01EF> 0 in charmaps/UTF-8 is more efficient than writing many single character lines like: <U000E0100> 0 <U000E0101> 0 ... [BZ #21750] * unicode-gen/utf8_gen.py: Write all ranges of neighbouring characters with the same width using the range notation in charmaps/UTF-8.
* Add miq_NI locale for MiskitoMike FABIAN2017-09-052-0/+189
| | | | | | | [BZ #20498] * locale/iso-639.def: add Miskito. * localedata/SUPPORTED: Add miq_NI/UTF-8. * localedata/locales/miq_NI: New file.
* Move all old ChangeLogs to a top-level ChangeLog.old directory.Zack Weinberg2017-09-011-12593/+0
|
* Fix decimal point and thousands separator in es_CU locale to agree with CLDR.Mike FABIAN2017-08-312-4/+12
| | | | | | | [BZ #15332] * locales/es_CU (LC_MONETARY): use “,” for mon_decimal_point and “.” for mon_thousands_sep (to agree with CLDR) * locales/es_CU (LC_NUMERIC): Likewise.
* Remove redundant data in li_BE which is identical to the data in li_NLAkhilesh Kumar2017-08-312-36/+6
| | | | | [BZ #22044] * locales/li_BE (LC_TIME): copy "li_NL".
* Use “.” in thousands_sep in the ca_ES localeMike FABIAN2017-08-302-4/+10
| | | | | | [BZ #16148] * locales/ca_ES (LC_NUMERIC): Use “.” in thousands_sep. And make it more readable by using ASCII.
* Add country_name for mni_INMike FABIAN2017-08-302-0/+7
| | | | | [BZ #22022] * locales/mni_IN (LC_ADDRESS): Add country_name.
* Fix abbreviated weeks and months for Somali to agree with CLDRAkhilesh Kumar2017-08-305-23/+32
| | | | | | | | | [BZ #22038] * locales/so_DJ (LC_TIME): Fix abday, abmon and make t_fmt in the comment agree with the value of t_fmt. * locales/so_ET (LC_TIME): Fix abday (From Axa to Axd) * locales/so_KE (LC_TIME): Fix abday (From Axa to Axd) * locales/so_SO (LC_TIME): Fix abday (From Axa to Axd)
* Update hanzi collation by strokeWei-Lun Chao2017-08-292-22643/+28242
| | | | | | [BZ #21951] * locales/cns11643_stroke: Update hanzi collation by stroke (Add license declaration, use strokes by component).
* Use “,” as mon_decimal_point for ru_RU and os_RU locales (follow CLDR)Mike FABIAN2017-08-293-17/+13
| | | | | | | | | | [BZ #13805] * locales/ru_RU (LC_MONETARY): Use “,” for mon_decimal_point (to agree with CLDR). * locales/ru_RU (LC_NUMERIC): Write mon_decimal_point in ASCII for readability. * locales/os_RU (LC_MONETARY): Copy from ru_RU, makes it agree with CLDR.
* Add locale for mfe_MUAkhilesh Kumar2017-08-293-0/+190
| | | | | | | | | | | | Add locale for “Morisyen” which is also called “Mauritian Creole” and is spoken in Mauritius. [BZ #21971] * localedata/SUPPORTED: Add mfe_MU/UTF-8. * localedata/locales/mfe_MU: New File. [BZ #21971] * locale/iso-639.def: add Morisyen.
* Use NFC for the strings in “day”, “yesstr”, and “nostr” in the ↵Mike FABIAN2017-08-292-3/+8
| | | | | | | sgs_LT locale * locales/sgs_LT (LC_TIME): Use NFC for the strings in “day”. * locales/sgs_LT (LC_MESSAGES): Use NFC for “yesstr” and “nostr”.
* Fix country_name in zu_ZA to agree with CLDRAkhilesh Kumar2017-08-282-2/+6
| | | | | [BZ #21959] * locales/zu_ZA (LC_ADDRESS): Fix country name to agree with CLDR.
* Use the range notation in charmaps/UTF-8 for all ranges of neighbouring ↵Mike FABIAN2017-08-182-113251/+300
| | | | | | | | characters with the same width [BZ #21750] * charmaps/UTF-8: Use the range notation for all ranges of neighbouring characters with the same width.
* Fix abmon for bem_ZMAkhilesh Kumar2017-08-172-13/+17
| | | | | | | Until now the abbreviated month names were in English. [BZ #21960] * locales/bem_ZM (LC_TIME): Fix abmon, make it agree with CLDR.
* Fix country name for xh_ZAAkhilesh Kumar2017-08-172-3/+7
| | | | | [BZ #21959] * locales/xh_ZA (LC_ADDRESS): Fix country name.
* Refresh generated charmap data and ChangeLogThorsten Glaser2017-08-172-127/+111355
| | | | | [BZ #21750] * charmaps/UTF-8: Refresh.
* Resolve some historically special cases of ambiguous widthThorsten Glaser2017-08-171-0/+12
| | | | | | | | [BZ #21750] * unicode-gen/utf8_gen.py (U+00AD): Set width to 1. * unicode-gen/utf8_gen.py (U+1160..U+11FF): Set width to 0. * unicode-gen/utf8_gen.py (U+3248..U+324F): Set width to 2. * unicode-gen/utf8_gen.py (U+4DC0..U+4DFF): Likewise.
* Handle more cases of combining charactersThorsten Glaser2017-08-171-1/+1
| | | | | [BZ #21750] * unicode-gen/utf8_gen.py: Treat category Me and Mn as combining.
* UnicodeData has precedence over EastAsianWidthThorsten Glaser2017-08-171-17/+9
| | | | | | | | [BZ #19852] [BZ #21750] * unicode-gen/utf8_gen.py: Process EastAsianWidth lines before UnicodeData lines so the latter have precedence; remove hack to group output by EastAsianWidth ranges.
* Change language name in LC_IDENTIFICATION of bn_BD and bn_IN from ↵Mike FABIAN2017-08-143-7/+14
| | | | | | | | | “Bengali” to “Bangla” [BZ #14925] * locales/bn_BD (LC_IDENTIFICATION): Change language name in “title” and “language” from Bengali to Bangla. * locales/bn_IN (LC_IDENTIFICATION): Likewise.
* Use “copy "i18n"” in km_KH localeMike FABIAN2017-08-142-824/+6
| | | | | | | | | | The custom stuff which was in LC_CTYPE of the km_KH locale seems to be a very incomplete subset of what one gets by using “copy "i18n"”. I cannot find anything special there which is not in “copy "i18n"”, only lots of stuff which is missing. [BZ #20008] * locales/km_KH (LC_CTYPE): Use “copy "i18n"”.
* Use two letter abbreviations in abday in all German locales.Mike FABIAN2017-08-147-24/+34
| | | | | | | | | | [BZ #20482] * locales/de_AT (LC_TIME): Use 2 letter abbreviations in abday. * locales/de_BE (LC_TIME): Use 2 letter abbreviations in abday. * locales/de_CH (LC_TIME): Use 2 letter abbreviations in abday. * locales/de_DE (LC_TIME): Use readable ASCII in abday. * locales/de_IT (LC_TIME): Use readable ASCII in abday. * locales/de_LU (LC_TIME): Use 2 letter abbreviations in abday.
* Fix thousands_sep and grouping and use readable ASCII for decimal_point in ↵Julen Ruiz Aizpuru2017-08-142-3/+9
| | | | | | | | eu_ES locale [BZ #12349] * locales/eu_ES (LC_NUMERIC): Fix thousands_sep and grouping and use readable ASCII for decimal_point.
* Adapt test case data to the changes in the thousands separatorsMike FABIAN2017-08-121-1/+1
| | | | | | | | [BZ #20756] * localedata/tst-langinfo.sh: Adapt test case data. * stdlib/tst-strfmon_l.c: Likewise. * stdlib/tst-strtod4.c: Likewise. * stdlib/tst-strtod5i.c: Likewise.
* Use U+202F instead of U+2009 as thousands_sep for es_MXMike FABIAN2017-08-122-1/+5
| | | | | | | | | See also [BZ #20756]. U+202F NARROW NO-BREAK SPACE: a narrow form of a no-break space, typically the width of a thin space or a mid space. U+2009 THIN SPACE.
* Locales: Use Unicode wise thousands separatorStanislav Brabec2017-08-1248-65/+133
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | Many languages use small gap as thousands separator. Thousands separator should not be a plain space, but a narrow space. And additionally, it is not allowed to wrap line in the middle of the number. Locale data were created in a deep age of 8-bit encodings, so most of them use space (incorrect: it allows wrapping the line in the middle of the number), or NBSP (better, but typographically incorrect: space between groups is too wide). Now UNICODE is widely supported, so we should leave legacy characters in favor of correct UNICODE character. UNICODE has a dedicated character for this purpose: NNBSP U+202F NARROW NO-BREAK SPACE: a narrow form of a no-break space, typically the width of a thin space or a mid space The NNBSP exists since Unicode 3.0. Use of NNBSP will prevent line wrapping in the midle of number and improve readability of numbers. [BZ #20756] * locales/aa_DJ (LC_MONETARY): Replace space by NNBSP as thousands separator. * locales/az_AZ (LC_MONETARY): Likewise. * locales/be_BY (LC_MONETARY): Likewise. * locales/be_BY@latin (LC_MONETARY): Likewise. * locales/bg_BG (LC_MONETARY): Likewise. * locales/bs_BA (LC_MONETARY): Likewise. * locales/ce_RU (LC_MONETARY): Likewise. * locales/crh_UA (LC_MONETARY): Likewise. * locales/cs_CZ (LC_MONETARY): Likewise. * locales/cs_CZ (LC_NUMERIC): Likewise. * locales/cv_RU (LC_MONETARY): Likewise. * locales/de_AT (LC_MONETARY): Likewise. * locales/eo (LC_MONETARY): Likewise. * locales/es_CR (LC_MONETARY): Likewise. * locales/es_CR (LC_NUMERIC): Likewise. * locales/es_CU (LC_MONETARY): Likewise. * locales/et_EE (LC_MONETARY): Likewise. * locales/et_EE (LC_NUMERIC): Likewise. * locales/fi_FI (LC_MONETARY): Likewise. * locales/fi_FI (LC_NUMERIC): Likewise. * locales/fr_CA (LC_MONETARY): Likewise. * locales/fr_FR (LC_MONETARY): Likewise. * locales/fr_FR (LC_NUMERIC): Likewise. * locales/fr_LU (LC_MONETARY): Likewise. * locales/fr_LU (LC_NUMERIC): Likewise. * locales/hr_HR (LC_MONETARY): Likewise. * locales/ht_HT (LC_NUMERIC): Likewise. * locales/kk_KZ (LC_MONETARY): Likewise. * locales/kk_KZ (LC_NUMERIC): Likewise. * locales/ky_KG (LC_MONETARY): Likewise. * locales/ky_KG (LC_NUMERIC): Likewise. * locales/lv_LV (LC_MONETARY): Likewise. * locales/lv_LV (LC_NUMERIC): Likewise. * locales/mg_MG (LC_MONETARY): Likewise. * locales/mhr_RU (LC_MONETARY): Likewise. * locales/mk_MK (LC_MONETARY): Likewise. * locales/mk_MK (LC_NUMERIC): Likewise. * locales/mn_MN (LC_MONETARY): Likewise. * locales/nb_NO (LC_MONETARY): Likewise. * locales/nb_NO (LC_NUMERIC): Likewise. * locales/nl_AW (LC_MONETARY): Likewise. * locales/nl_NL (LC_MONETARY): Likewise. * locales/nn_NO (LC_MONETARY): Likewise. * locales/os_RU (LC_MONETARY): Likewise. * locales/pap_AW (LC_MONETARY): Likewise. * locales/pap_CW (LC_MONETARY): Likewise. * locales/ru_RU (LC_MONETARY): Likewise. * locales/ru_RU (LC_NUMERIC): Likewise. * locales/ru_UA (LC_MONETARY): Likewise. * locales/sk_SK (LC_MONETARY): Likewise. * locales/sk_SK (LC_NUMERIC): Likewise. * locales/sl_SI (LC_MONETARY): Likewise. * locales/sl_SI (LC_NUMERIC): Likewise. * locales/sq_MK (LC_MONETARY): Likewise. * locales/sv_SE (LC_MONETARY): Likewise. * locales/sv_SE (LC_NUMERIC): Likewise. * locales/tg_TJ (LC_MONETARY): Likewise. * locales/tt_RU (LC_MONETARY): Likewise. * locales/tt_RU@iqtelif (LC_MONETARY): Likewise. * locales/uk_UA (LC_MONETARY): Likewise. * locales/uk_UA (LC_NUMERIC): Likewise. * locales/unm_US (LC_MONETARY): Likewise. * locales/unm_US (LC_NUMERIC): Likewise. * locales/wo_SN (LC_MONETARY): Likewise.
* Remove redundant data for LC_MONETARY for Indian localesAkhilesh Kumar2017-08-1021-342/+60
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | Reference is taken from https://en.wikipedia.org/wiki/Indian_numbering_system https://en.wikipedia.org/wiki/Indian_rupee CLDR has the currency format pattern “¤#,##,##0.00”. [BZ #21836] * locales/ar_IN (LC_MONETARY) : copy "hi_IN" * locales/as_IN (LC_MONETARY) : copy "hi_IN" * locales/bhb_IN (LC_MONETARY): copy "hi_IN" * locales/bn_IN (LC_MONETARY) : copy "hi_IN" * locales/en_IN (LC_MONETARY) : copy "hi_IN" * locales/gu_IN (LC_MONETARY) : copy "hi_IN" * locales/hi_IN (LC_MONETARY) : Fix mon_grouping, p_sep_by_space and n_sep_by_space * locales/kn_IN (LC_MONETARY) : copy "hi_IN" * locales/kok_IN(LC_MONETARY) : copy "hi_IN" * locales/ks_IN (LC_MONETARY) : copy "hi_IN" * locales/ml_IN (LC_MONETARY) : copy "hi_IN" * locales/mr_IN (LC_MONETARY) : copy "hi_IN" * locales/or_IN (LC_MONETARY) : copy "hi_IN" * locales/pa_IN (LC_MONETARY) : copy "hi_IN" * locales/sa_IN (LC_MONETARY) : copy "hi_IN" * locales/sd_IN (LC_MONETARY) : copy "hi_IN" * locales/ta_IN (LC_MONETARY) : copy "hi_IN" * locales/tcy_IN(LC_MONETARY) : copy "hi_IN" * locales/te_IN (LC_MONETARY) : copy "hi_IN" * locales/ur_IN (LC_MONETARY) : copy "hi_IN"
* cmn_TW: add hanzi collationWei-Lun Chao2017-08-103-18/+70790
| | | | | | | | | | [BZ #17563] [BZ #16905] * locales/cmn_TW (LC_COLLATE): Use cns11643_stroke file for sorting. * locales/cmn_TW (LC_TIME): Improve time and date formats. * locales/cmn_TW (LC_MESSAGES): Add yesstr and nostr. * locales/cns11643_stroke: New file, stroke count collation for traditional Chinese.
* Fix p_cs_precedes, n_cs_precedes, and n_sign_posn for mt_MTAkhilesh Kumar2017-08-092-3/+8
| | | | | | | | | | | | Positive Currency Format: €500.00 Negative Currency Format: -€500.00 References: http://www.kunsilltalmalti.gov.mt/filebank/documents/rapportdwarlismijiettalmunitaewropea.pdf, page 7, top right. CLDR has “¤#,##0.00” as the currency format pattern. [BZ #21920] * locales/mt_MT (LC_MONETARY): Fix p_cs_precedes/n_cs_precedes. * locales/mt_MT (LC_MONETARY): Fix n_sign_posn.
* Remove “% Charset: ...” comments from locale sourcesMike FABIAN2017-08-09221-273/+232
| | | | | | | | | These comments are useless and only confusing. The encodings used to create binary locales from source locales are listed in the localedata/SUPPORTED file. The source files itself are ASCII or UTF-8 encoded where non-ASCII UTF-8 is currently only used in comments. If all locale source files are UTF-8 anyway, there is no need to specify that in a special comment.
* Added new locale en_SCAkhilesh Kumar2017-08-093-0/+126
| | | | | | | | | New locale is added for the Seychelles which is a member of the African Union. English is an offical language for the Seychelles. [BZ #21854] * locales/en_SC: New file. * localedata/SUPPORTED : Add en_SC/UTF-8.
* Fix missing redirects in testsuite targetsAndreas Schwab2017-08-072-2/+7
|
* Add yesstr and nostr for doi_IN, kok_IN, and sat_INMike FABIAN2017-08-034-12/+24
| | | | | | | | | | | | For the locales doi_IN, kok_IN, and sat_IN, the words for “yes” and “no” were apparently in yesexpr and noexpr. Copy them from there to add yesstr and nostr. Also make yesexpr and noexpr more readable by using the POSIX portable character set. * locales/doi_IN (LC_MESSAGES): Add yesstr and nostr. * locales/kok_IN (LC_MESSAGES): Add yesstr and nostr. * locales/sat_IN (LC_MESSAGES): Add yesstr and nostr.
* Revert “Fix yesexpr in en_DK locale”Mike FABIAN2017-08-032-1/+21
| | | | | | | | | | This reverts commit 8f75515080ee099030547ba5bcae59e800a2cc08 Revert “Fix yesexpr in en_DK locale”. * locales/en_DK (LC_MESSAGES): Restore original yesexpr, noexpr, yesstr, nostr. Convert them to ASCII and add a comment why we want to have them like this.
* drop redundant ([eE][sS])? and [oO]? from yesexpr and noexpr in km_KH localeMike FABIAN2017-08-032-2/+7
| | | | | * locales/km_KH (LC_MESSAGES): drop redundant ([eE][sS])? and [oO]? from yesexpr and noexpr
* Drop redundant .* from yesexpr and noexpr in agr_PE localeMike FABIAN2017-08-032-8/+8
| | | | | | | And make the expressions more readable by using the POSIX portable character set instead of Unicode code points. * locales/agr_PE (LC_MESSAGES): drop .* from yesexpr and noexpr
* Added new Locale bho_NPAkhilesh Kumar2017-08-033-0/+99
| | | | | | | | | | | | Bhojpuri is one of the most common languages in Nepal and India. http://codefornepal.org/en/2014/02/top-10-languages-spoken-as-mother-tongues-in-nepal/ https://en.wikipedia.org/wiki/Bhojpuri_language Added "bho_NP" locale and Reference is taken form "bho_IN". [BZ #21845] * locales/bho_NP: New file, Bhojpuri locale for Nepal. * SUPPORTED: Add bho_NP/UTF-8
* Fix yesexpr in en_DK localeAkhilesh Kumar2017-08-032-4/+6
| | | | | [BZ #21867] * locales/en_DK (LC_MESSAGES): Fix yesexpr
* Fix abday in zh_SGAkhilesh Kumar2017-08-032-4/+6
| | | | | | | Make it the same as in zh_CN and zh_TW which agrees with "narrow" in CLDR. [BZ #21853] * locales/zh_SG (LC_TIME): Fix abday
* Remove CFLAG settings that turn off warning (no longer needed).Steve Ellcey2017-08-021-16/+0
| | | | | | | | | | 2017-08-02 Steve Ellcey <sellcey@cavium.com> * localedata/Makefile (CFLAGS-tst_iswalnum.c, CFLAGS-tst_iswalpha.c CFLAGS-tst_iswcntrl.c, CFLAGS-tst_iswdigit.c, CFLAGS-tst_iswgraph.c, CFLAGS-tst_iswlower.c, CFLAGS-tst_iswprint.c, CFLAGS-tst_iswpunct.c, CFLAGS-tst_iswspace.c, CFLAGS-tst_iswupper.c, CFLAGS-tst_iswxdigit.c, CFLAGS-tst_towlower.c, CFLAGS-tst_towupper.c): Remove.
* Fix country name in title of mai_NP localeAkhilesh Kumar2017-07-272-1/+6
| | | | | [BZ #21848] * locales/mai_NP (LC_IDENTIFICATION): Fix country name in title.
* Improve country_name in be_BY@latinIhar Hrachyshka2017-07-272-1/+5
|