summaryrefslogtreecommitdiff
path: root/tests/multibyte-white-space
Commit message (Collapse)AuthorAgeFilesLines
* maint: update copyright datesJim Meyering2023-01-011-1/+1
|
* maint: make update-copyrightJim Meyering2022-01-011-1/+1
|
* maint: run "make update-copyright"Paul Eggert2021-01-011-1/+1
|
* maint: update all copyright year number rangesJim Meyering2020-01-011-1/+1
| | | | | | | | Run "make update-copyright" and then... * gnulib: Update to latest with copyright year adjusted. * tests/init.sh: Sync with gnulib to pick up copyright year. * bootstrap: Likewise. * doc/grep.in.1: Use "-" in copyright year ranges, not \en.
* maint: update all copyright dates via "make update-copyright"Jim Meyering2019-01-011-1/+1
| | | | * gnulib: Also update submodule for its copyright updates.
* maint: update gnulib and copyright dates for 2018Jim Meyering2018-01-061-1/+1
| | | | | | * gnulib: Update to latest. * all files: Run "make update-copyright". * bootstrap: Update from gnulib.
* maint: update gnulib and copyright dates for 2017Jim Meyering2017-01-011-1/+1
| | | | | * gnulib: Update to latest. * all files: Run "make update-copyright".
* tests: use "returns_" rather than explicit comparison with "$?"Jim Meyering2016-11-141-4/+2
| | | | | | | | | | | | | | | | | | | | | | | | | | | * tests/sjis-mb (encode): Rearrange to emit desired input into a file, rather than piping directly into grep. That permits the use of returns_ 1 to verify timeout's exit status. * tests/euc-mb: Use "returns_ 1" rather than testing $? = 1 * tests/char-class-multibyte: Likewise. * tests/dfa-heap-overrun: Likewise. * tests/encoding-error: Likewise. * tests/fedora: Likewise. * tests/grep-dev-null: Likewise. * tests/init.cfg (envvar_check_fail): Likewise. * tests/kwset-abuse: Likewise. * tests/mb-non-UTF8-overrun: Likewise. * tests/multibyte-white-space: Likewise. * tests/pcre-infloop: Likewise. * tests/surrogate-pair: Likewise. * tests/warn-char-classes: Likewise. Do the same for other values: * tests/backref-multibyte-slow: Likewise. * tests/euc-mb: Likewise. * tests/pcre-abort: Likewise. * tests/pcre-jitstack: Likewise. * tests/repetition-overflow: Likewise. * tests/reversed-range-endpoints: Likewise. * tests/warn-char-classes: Likewise.
* tests: revamp multibyte-white-space test to be more permissiveJim Meyering2016-09-241-34/+57
| | | | | | | | | | | | | | | | This test elicits too many failures. Whether a system has accurate unicode "whitespace" attributes should not influence whether grep's test suite passes. In many cases, now you will see a warning that some multibyte characters do not pass whitespace-related tests, but this test no longer fails. However, if you run this test on a modern enough system, it does require that \s and \S do work properly with most of the listed characters. * tests/multibyte-white-space: Confirm that Fedora 24's locale tables still declare those four Unicode code points *not* whitespace. Honor a new column telling how to handle failure. Provide more information in each diagnostic. Reported by Nelson H. F. Beebe. https://bugs.gnu.org/24530
* maint: update copyright year, bootstrap, init.shJim Meyering2016-01-011-1/+1
| | | | | | | | Run "make update-copyright" and then... * gnulib: Update to latest. * tests/init.sh: Update from gnulib. * bootstrap: Likewise.
* maint: update copyright year ranges to include 2015Jim Meyering2015-01-011-1/+1
| | | | | Run "make update-copyright". Also, ... * grep.texi: Update manually, converting each "--" to "-".
* tests: avoid awk+printf+\xHH portability trapNorihiro Tanaka2014-11-081-2/+2
| | | | | | | | | | | | | | * tests/init.cfg (hex_printf_): Rewrite in terms of printf and sed. Using awk's printf with \xHH in the format string was not portable to the awk of Solaris 10, AIX 7 or HP-UX 11.23, as reported in http://debbugs.gnu.org/18987. * tests/word-multibyte: Use printf rather than hex_printf_, and give the character we're printing a name: e_acute (rather than A-grave), since that is used in other tests. a trailing \n in the format string, adjust by removing it, and instead invoking echo. * tests/multibyte-white-space: Simply remove each trailing \n. They were not needed.
* maint: move helper function, hex_printf to init.cfgJim Meyering2014-11-071-10/+2
| | | | | | * tests/init.cfg (hex_printf_): New function, from ... * tests/multibyte-white-space: ... here. Reflect the s/hex_print/hex_printf_/ renaming.
* dfa: process all MBCSET constructs via glibc's matcherNorihiro Tanaka2014-10-191-0/+10
| | | | | | | | | | | | | | | The DFA matcher does not support collating symbols or equivalence classes, so ensure that any MBCSET reference is handled by the glibc matcher. dfa.c already handled this in one case, but not the other, so that a command like "printf '\0' |src/grep -aE '^\s?$'" would mistakenly end up using dfa.c's match_mb_charset function rather than glibc's matcher. * src/dfa.c (dfaexec_main): Move that code into the State_transition macro. This renders the match_mb_charset unused by grep. * tests/multibyte-white-space: Add a test to exercise the just-rendered-inaccessible code path.
* maint: update copyright dates for 2014Jim Meyering2014-01-011-1/+1
| | | | Do that by running "make update-copyright".
* tests: port to non-GNU sedJim Meyering2013-11-251-1/+1
| | | | | | | | * tests/multibyte-white-space (utf8_space_characters): The generation of test inputs relied on GNU sed's interpretation of \<, but that is not portable, and caused spurious test failures. Adjust the sed regexp to work on all versions. Reported by Karl Dubost in http://bugs.gnu.org/15953.
* tests: port to bourne shells whose printf doesn't grok hexJim Meyering2013-10-231-2/+10
| | | | | | | | | | | | | | | Use octal escapes, not hex, in printf(1) format strings, and in one case, use $AWK's printf so we can continue to use the table of hex values. * tests/char-class-multibyte: Use printf octal escapes, not hex, for portability to shells like dash and Solaris 10's /bin/sh. * tests/backslash-s-vs-invalid-multitype: Likewise. * tests/surrogate-pair: Likewise. * tests/unibyte-bracket-expr: Count in decimal and convert to octal. * tests/multibyte-white-space (hex_printf): New function. Use it in place of printf so we can retain the table of hex digits without hitting the limitation of some bourne shells. Reported by Paul Eggert in http://bugs.gnu.org/15690#11
* tests: extend the multibyte-white-space testJim Meyering2013-10-191-19/+36
| | | | | | | * tests/multibyte-white-space (utf8_space_characters): Add more single-byte whitespace characters. Align RHS hex values and make the sed substitution less rigid, to accommodate. Also, ensure that grep '\S' exits with status 1.
* dfa: fix \s and \S to work for multibyteJim Meyering2013-10-011-0/+51
* src/dfa.c (lex): In multibyte mode, we can't treat \s and \S as we do in single-byte mode. Map them to [[:space:]] and [^[:space:]] respectively, to make the DFA matcher use the regex-matcher for this term. * tests/multibyte-white-space: New file. Test for the bug. * tests/Makefile.am (TESTS): Add it. This bug was introduced with the addition of DFA support for \s and \S in commit v2.5.4-112-gf979ca0.