diff options
author | DJ Delorie <dj@redhat.com> | 2021-09-01 13:17:34 -0400 |
---|---|---|
committer | DJ Delorie <dj@redhat.com> | 2021-09-01 14:29:59 -0400 |
commit | 69623c0db0a540f26ee537bae09446d3dcdf1f80 (patch) | |
tree | 089930c955e3a1d8779d4e19100ea17420e3a42e /posix/PTESTS2C.sed | |
parent | 224edada607ebc6aaa1aadaae423128fae7880df (diff) | |
download | glibc-69623c0db0a540f26ee537bae09446d3dcdf1f80.tar.gz glibc-69623c0db0a540f26ee537bae09446d3dcdf1f80.tar.xz glibc-69623c0db0a540f26ee537bae09446d3dcdf1f80.zip |
posix: remove some iso-8859-encoded characters
With the increasing adoption of UTF-8, modern editors may (will?) replace iso-8859-encoded characters in the range 0x80..0xff with their UTF-8 equivalent, as will mailers and other tools. This breaks our testsuite and corrupts patches. So, this patch starts replacing these problematic characters with \OCTal sequences instead (adding support for those in tst-fnmatch.c) or with plain ASCII characters (PTESTS). Reviewed-by: Carlos O'Donell <carlos@redhat.com>
Diffstat (limited to 'posix/PTESTS2C.sed')
-rw-r--r-- | posix/PTESTS2C.sed | 4 |
1 files changed, 3 insertions, 1 deletions
diff --git a/posix/PTESTS2C.sed b/posix/PTESTS2C.sed index b6850a3754..500b4ec21c 100644 --- a/posix/PTESTS2C.sed +++ b/posix/PTESTS2C.sed @@ -1,6 +1,8 @@ +# Future self: the vertical bar is being used here as a delimiter in +# the input file, not in the usual alternate-choice regex meaning. /^##/d s/^# \(.*\)/ { 0, 0, "\1", NULL, },/ s/^#W \(.*\)/ { 0, 0, NULL, "\1" },/ -s/\([^¦]*\)¦\([^¦]*\)¦\([^¦]*\)¦\([^¦]*\)¦\(.*\)/ { \1, \2, "\3", "\4", \5 },/ +s/\([^|]*\)|\([^|]*\)|\([^|]*\)|\([^|]*\)|\(.*\)/ { \1, \2, "\3", "\4", \5 },/ s/\\/\\\\/g s/ /\\r/g |