about summary refs log tree commit diff
path: root/posix/PTESTS2C.sed
diff options
context:
space:
mode:
authorDJ Delorie <dj@redhat.com>2021-09-01 13:17:34 -0400
committerDJ Delorie <dj@redhat.com>2021-09-01 14:29:59 -0400
commit69623c0db0a540f26ee537bae09446d3dcdf1f80 (patch)
tree089930c955e3a1d8779d4e19100ea17420e3a42e /posix/PTESTS2C.sed
parent224edada607ebc6aaa1aadaae423128fae7880df (diff)
downloadglibc-69623c0db0a540f26ee537bae09446d3dcdf1f80.tar.gz
glibc-69623c0db0a540f26ee537bae09446d3dcdf1f80.tar.xz
glibc-69623c0db0a540f26ee537bae09446d3dcdf1f80.zip
posix: remove some iso-8859-encoded characters
With the increasing adoption of UTF-8, modern editors may (will?)
replace iso-8859-encoded characters in the range 0x80..0xff with
their UTF-8 equivalent, as will mailers and other tools.  This breaks
our testsuite and corrupts patches.

So, this patch starts replacing these problematic characters with
\OCTal sequences instead (adding support for those in tst-fnmatch.c)
or with plain ASCII characters (PTESTS).

Reviewed-by: Carlos O'Donell <carlos@redhat.com>
Diffstat (limited to 'posix/PTESTS2C.sed')
-rw-r--r--posix/PTESTS2C.sed4
1 files changed, 3 insertions, 1 deletions
diff --git a/posix/PTESTS2C.sed b/posix/PTESTS2C.sed
index b6850a3754..500b4ec21c 100644
--- a/posix/PTESTS2C.sed
+++ b/posix/PTESTS2C.sed
@@ -1,6 +1,8 @@
+# Future self: the vertical bar is being used here as a delimiter in
+# the input file, not in the usual alternate-choice regex meaning.
 /^##/d
 s/^# \(.*\)/  { 0, 0, "\1", NULL, },/
 s/^#W \(.*\)/  { 0, 0, NULL, "\1" },/
-s/\([^¦]*\)¦\([^¦]*\)¦\([^¦]*\)¦\([^¦]*\)¦\(.*\)/  { \1, \2, "\3", "\4", \5 },/
+s/\([^|]*\)|\([^|]*\)|\([^|]*\)|\([^|]*\)|\(.*\)/  { \1, \2, "\3", "\4", \5 },/
 s/\\/\\\\/g
 s/
/\\r/g