about summary refs log tree commit diff
path: root/src/locale/codepages.h
Commit message (Collapse)AuthorAgeFilesLines
* add cp866 (dos cyrillic) to iconvRich Felker2017-12-181-0/+12
|
* add ibm1047 codepage (ebcdic representation of latin1) to iconvRich Felker2017-12-121-0/+20
|
* generalize iconv framework for 8-bit codepagesRich Felker2017-11-131-197/+220
| | | | | | | | | | | | | | | | | | | previously, 8-bit codepages could only remap the high 128 bytes; the low range was assumed/forced to agree with ascii. interpretation of codepage table headers has been changed so that it's possible to represent mappings for up to 256 slots (fewer if the initial portion of the map is elided because it coincides with unicode codepoints). this requires consuming a bit more of the 10-bit space of characters that can be represented in 8-bit codepages, but there's still a plenty left. the size of the legacy_chars table is actually reduced now by eliding the first 256 entries and considering them to map implicitly via the identity map. before these changes, there seem to have been minor bugs/omissions in codepage table generation, so it's likely that some actual bug fixes are silently included in this commit. round-trip testing of a few codepages was performed on the new version of the code, but no differential testing against the old version was done.
* add cp437 and cp850 to available iconv conversionsRich Felker2014-05-131-138/+165
| | | | | | | | | perhaps some additional legacy DOS-era codepages would also be useful to have, but these are the ones for which there has been demand. the size of the diff is due to the fact that legacychars.h is updated in such a way that new characters are inserted into the table in unicode codepoint order; thus other mappings in codepages.h have changed to reflect the new table indices of their characters.
* simplify iconv and support more legacy codepagesRich Felker2011-07-121-0/+238