| Commit message | Author | Age | Files | Lines |
|
When looking at scripts, we want to look
at the script of the *previous* character.
And then we need to exclude SHY from the
common script.
|
I went a bit overboard in #603. The vertical bar
is too common, and this usage too specialized, to
do it by default. To enable this, we should rather
make log attrs tweakable.
We still replace hyphenation points with hyphens.
Fixes: #630
|
We were not correctly handling overlapping segments
in all cases, with <span segment='word'>. Improve this.
Update affected tests.
|
We were not putting a line break at the end of the
text in cases where a paragraph ends with a newline.
Enforce this in all break APIs.
Update affected tests.
|
Add attributes that let us override word and
sentence boundaries (and, indirectly, line breaks).
Tests included.
|
Print out all the log attrs we have.
Update expected test outputs.
|
We want to insert word start+end if libthai
hands us line breaks between letters. But we must
be careful not to overwrite existing word boundaries,
or we end up with a nonsense sequence.
This was found by checking log attr invariants.
Regenerate affected test outputs.
|
Change the order in test-break output to be
sentence, word, grapheme. Update all affected
test outputs.
|
These don't have a corresponding .break file, so
they are useless.
|
When rendering the text char-by-char, use directional
controls to avoid any bidi reordering.
Update affected tests.
|
Correctly classify tag characters as Grapheme_Extend,
so that we don't end up putting grapheme boundaries
in the middle of tag sequences.
Includes a test.
|
Add grapheme boundaries to the output of test-break,
so we can check those easily, too.
Update expected test output for this.
|
Remove the dashes that cause the runs to break and that seem
to be causing issues on both Debian and newer Fedora.
|
The Thai language requires the word dictionary from the libthai
project for word/sentence breaks.
Add this test case to check the results of sentence breaks.
|
Recent changes changed breaks.
|
test-break was using the is_soft_hyphen log attr
that we've just removed again. Adapt, and update
expected test output.
|
This one is taken from #218.
|
We were not diffing the output against the
expected results.
|
This is useful for exploring line breaking.
|