summaryrefslogtreecommitdiff
path: root/tests
Commit message (Collapse)AuthorAgeFilesLines
* Properly parse inline HTML in md_in_htmlIsaac Muse2020-10-191-0/+160
| | | Fixes #1040 and fixes #1045.
* Account for Etree Elements in HTML StashWaylan Limberg2020-10-141-1/+16
| | | | | | | | | | | | | | | | By calling str on all stash elements we ensure they don't raise an error. Worse case, soemthing like `<Element 'div' at 0x000001B2DAE94900>` gets inserted into the output. However, with the override in the md_in_html extension, we actually serialize and reinsert the original HTML. Worse case, an HTML block which should be parsed as Markdown gets skipped by the extension (`<div markdown="block"></div>` gets inserting into the output). The tricky part is testing as there should be no known cases where this ever occurs. Therefore, we forefully pass an etree Element directly to the method in the test. That said, as #1040 is unresolved at this point, I have tested locally with a real existing case and it works well. Related to #1040.
* Correctly parse raw `script` and `style` tags. (#1038)Waylan Limberg2020-10-121-0/+85
| | | | | | | * Ensure unclosed script tags are parsed correctly by providing a workaround for https://bugs.python.org/issue41989. * Avoid cdata_mode outside of HTML blocks, such as in inline code spans. Fixes #1036.
* Skip tests with pygments version mismatch.Waylan Limberg2020-10-082-261/+282
| | | | | | | | If pygments is installed and the version doesn't match the expected version. then any relevant tests will fail. To avoid failing tests due to different output by pygments, those tests will be skipped. The pygments tox env sets the `PYGMENTS_VERSION environment variable, so that env will always run those tests against the expected version.
* Some test tweaks.Waylan Limberg2020-10-081-1/+6
| | | | | | | | | | * Pygments specific tests now only run when the pygments version installed matches the expected version. That version is defined in an environment variable (PYGMENTS_VERSION) in the 'pygments' tox env (see #1030). * When the Python lib tidylib is installed but the underlying c lib is not, the relevant tests are now skipped rather than fail. This matches the behavior when the Python lib is not installed. The tox envs are now useful on systems which don't have the c lib installed.
* Ensure consistent handling of classes by fenced_code and codehilite (#1033)Waylan Limberg2020-10-081-17/+17
| | | | | | | * All non-language classes should always be assigned to the pre tag. * The language identifying class should never be included with the general list of classes. Fixes #1032
* Update tests for pygments-2.7.1Michał Górny2020-10-072-12/+12
| | | | Closes #1030
* Support unicode ids in toc (#970)Antoine2020-10-011-0/+22
| | | A second function, `slugify_unicode` was added rather than changing the existing function so as to maintain backward compatibility. While an `encoding` parameter was added to the `slugify` function, we can't expect existing third party functions to accept a third parameter. Therefore, the two parameter API was preserved with this change.
* Refactor HTML Parser (#803)Waylan Limberg2020-09-2247-684/+2752
| | | | | | | | | | The HTML parser has been completely replaced. The new HTML parser is built on Python's html.parser.HTMLParser, which alleviates various bugs and simplifies maintenance of the code. The md_in_html extension has been rebuilt on the new HTML Parser, which drastically simplifies it. Note that raw HTML elements with a markdown attribute defined are now converted to ElementTree Elements and are rendered by the serializer. Various bugs have been fixed. Link reference parsing, abbreviation reference parsing and footnote reference parsing has all been moved from preprocessors to blockprocessors, which allows them to be nested within other block level elements. Specifically, this change was necessary to maintain the current behavior in the rebuilt md_in_html extension. A few random edge-case bugs (see the included tests) were resolved in the process. Closes #595, closes #780, closes #830 and closes #1012.
* Fix complex scenarios with definition, ordered, and unordered lists (#1007)Isaac Muse2020-07-271-0/+323
| | | Fixes #918.
* Fix complex scenarios with lists and admonitions (#1006)Isaac Muse2020-07-261-0/+194
| | | | | Add better logic to admonitions to account for more complex list cases Fixes #1004
* Fix HR which follows strong em.Waylan Limberg2020-07-011-0/+16
| | | | Fixes #897.
* Support short reference image links.Waylan Limberg2020-07-011-0/+24
| | | | Fixes #894.
* Add suport for attr_lists in table headers.Waylan Limberg2020-06-301-6/+14
|
* Tune attr list regexWaylan Limberg2020-06-304-24/+72
| | | | | | | | | | | Ignore empty braces. Braces must contain at least one non-whitepsace character to be recognized as an attr list. Attr lists for table cells must be at the end of the cell content and must be seperated from the content by at least one space. This appears to be a breaking change. However, it is consistent with the behavior elsewhere. Fixes #898.
* Fix unescaping of HTML characters <> in CodeHilite. (#990)Rohitt Vashishtha2020-06-291-0/+23
| | | | | | | | | | | Previously, we'd unescape both `&amp;gt;` and `&gt;` to the same string because we were running the &amp; => & replacement first. By changing the order of this replacement, we now convert: `&amp;gt; &gt;` => `&gt; >` as expected. Fixes #988.
* Limit depth of blockquotes using Python's recursion limit. (#991)Waylan Limberg2020-06-291-0/+51
| | | | | | | | | | | | | | | | | If the Python stack comes within 100 frames of the recursion limit, then the nesting limit of blockquotes is met. Any remaining text, including angle brackets, are simply wrapped in a paragraph. To increasing the nesting depth, increase Python's recursion limit. However, be aware that each level of recursion will likely result in multiple frames being added to the Python stack. Therefore, the recursion depth and nesting depth are not one-to-one. Performance is an concern here. However, the current solution seems like a reasonable compromise. It doesn't slow things down too much, but also avoids Markdown input resulting in an error. This is mostly only a concern with contrived input anyway. For the average Markdown document, this will likely never be an issue. Fixes #799.
* Refactor fenced_code & codehilite options (#816)Waylan Limberg2020-06-237-455/+1303
| | | | | | | | | | | | | | | | | | | | * Add `language-` prefix to output when syntax highlighting is disabled for both codehilite and fenced_code extensions. * Add `lang_prefix` config option to customize the prefix. * Add a 'pygments' env to tox which runs the tests with Pygments installed. Pygments is locked to a specific version in the env. * Updated codehilite to accept any Pygments options. * Refactor fenced code attributes. - ID attr is defined on `pre` tag. - Add support for attr_list extension, which allows setting arbitrary attributes. - When syntax highlighting is enabled, any pygments options can be defined per block in the attr list. - For backward compatibility, continue to support `hi_lines` outside of an attr_list. That is the only attr other than lang which is allowed without the brackets (`{}`) of an attr list. Note that if the brackets exist, then everything, including lang and hl_lines, must be within them. * Resolves #775. Resolves #334. Addresses #652.
* Fix issues with complex emphasisfacelessuser2020-06-222-0/+28
| | | | | Resolves issue that can occur with complex emphasis combinations. Fixes #979
* Avoid importing packaging or pkg_resources for version validation (#948)Ran Benita2020-04-271-0/+24
| | | | | | | | | | | | | | Importing the `pkg_resources` module has high memory and startup time cost. A recent change in 102e01c already avoided it for loading extensions, but it's still used for validating that __version__ is correctly formatted. It is possible to avoid it by installing the `packaging` package, but that adds a dependency for something quite trivial. Instead, remove the validation and add tests which check the output is as expected. Since `setuptools` is no longer required at runtime, remove it from `install_required`.
* Correctly report if an extension raises a `TypeError`.Charles Merriam2020-04-191-2/+2
| | | Also Raise a `KeyError` when attempting to delete a nonexistent key from the extension registry.
* Fix typo, in the ten cut and pasted versionsCharles Merriam2020-04-1916-26/+26
|
* TOC fix for AtomicString handling (#934)Isaac Muse2020-04-061-0/+22
| | | Fixes #931.
* Fix escaping of HTML special chars (<, >, &) in `.toc_tokens`Jim Porter2020-02-121-13/+37
| | | | Fixes #906.
* Simplify xml.etree.ElementTree loading (#902)Dmitry Shachnev2020-02-031-50/+47
| | | | | | | | cElementTree is a deprecated alias for ElementTree since Python 3.3. Also drop the recommendation to import etree from markdown.util, and deprecate markdown.util.etree.
* Don't emit stashed HTML tag placeholders in `.toc_tokens` (#901)Jim Porter2020-01-311-24/+78
| | | | | | Note: this slightly changes existing behavior in that raw HTML tags are no longer included in the HTML `.toc`. However, the fact that that worked before was an oversight. The intention was always to strip all markup. Resolves #899.
* Add permalink_title option (#886)Waylan Limberg2019-11-261-0/+20
| | | | | Addes a new `permalink_title` option to the TOC extension, which allows the title attribute of a permalink to be set to something other than the default English string "Permanent link". Fixes #781.
* Add anchorlink_class and permalink_class options to TOCWaylan Limberg2019-11-261-0/+67
| | | | | | | | | Two new configuration options have been added to the toc extension: `anchorlink_class` and `permalink_class` which allows class(es) to be assigned to the `anchorlink` and `permalink` HTML respectively. This allows using icon fonts from CSS for the links. Therefore, an empty string passed to `permalink` now generates an empty `permalink`. Previously no `permalink` would have been generated. Based on #776.
* Unescape IDs in TOC.Waylan Limberg2019-11-251-0/+34
| | | | | | The slugify function will stript the STX and ETX characters from placeholders for backslash excaped characters. Therefore, we need to unescape any text before passing it to slugify. Fixes #864.
* Drop support for Python 2.7 (#865)Hugo van Kemenade2019-10-2422-49/+6
| | | | | | | * Python syntax upgraded using `pyupgrade --py3-plus` * Travis no longer uses `sudo`. See https://blog.travis-ci.com/2018-11-19-required-linux-infrastructure-migration See #760 for Python Version Support Timeline and related dicussion.
* Always wrap CodeHilite code in <code> tags (#862)Tim Martin2019-09-301-9/+9
| | | Pygments added the `wrapcode` option in version 2.4. Users need to have 2.4+ installed to see the change. However, as earlier versions accepted arbitrary keywords passed to the HTMLFormatter, no error will be raised if the user has an older version of Pygments installed.
* Refactor em strong to consolidate code and fix issue #792Isaac Muse2019-09-032-0/+45
|
* Optimize HTML_RE from quadratic time to linear (#804)Anders Kaseorg2019-08-143-24/+31
| | | | | | Remove misleading escaped_chars_in_js test Signed-off-by: Anders Kaseorg <andersk@mit.edu>
* Use more specific asserts throughout testsJon Dufresne2019-06-132-26/+26
| | | | | | | | | | | | | | | | In the event that a test fails, a more specific assert will usually provide more useful information about what went wrong. Prefer assertIs(..., True) over assertTrue() to assert the type as well as the truthy value. This is recommended by Python: https://docs.python.org/3/library/unittest.html#unittest.TestCase.assertTrue > Note that this is equivalent to bool(expr) is True and not to expr is > True (use assertIs(expr, True) for the latter). This method should > also be avoided when more specific methods are available (e.g. > assertEqual(a, b) instead of assertTrue(a == b)), because they provide > a better error message in case of failure.
* Use https:// links where availableJon Dufresne2019-06-121-3/+3
|
* Optimize several regexes from quadratic time to linear timeAnders Kaseorg2019-03-061-1/+1
| | | | | | Part of the discussion in #798. Signed-off-by: Anders Kaseorg <andersk@mit.edu>
* Add support for a range to toc_depth.Klaus Mueller2019-02-221-0/+55
| | | | Closes #786.
* Handle overlapping raw HTML matches (#770)Philip Trauner2019-02-071-0/+53
| | | | | Recursively run substitution to handle overlapping matches. Fixes #458.
* Emphasis pattern treats newlines as whitespace (#785)Waylan Limberg2019-02-071-0/+86
| | | | | | All whitespace characters should be treated the same by inline patterns. Previoulsy, emphasis patterns were only accounting for spaces, but not other whitepsace characters such as newlines. Fixes #783.
* Allow hashes to be escaped in headers (#763)Isaac Muse2018-12-221-0/+20
| | | | | Adjust pattern to allow for escaped hashes, but take care to not treat escaped escapes before hashes as escaped hashes. Close #762.
* Use a PEP562 implementation for deprecating attributes (#757)Isaac Muse2018-12-221-2/+2
| | | Use a vendored Pep562 backport to simulate Python 3.7's new PEP 562 feature. For Python3.7 and later, default to the official implementation.
* Fix typos in tests namesDmitry Shachnev2018-12-191-3/+3
|
* Collapse all whitespace in reference ids (#743)Isaac Muse2018-10-301-0/+37
| | | Previously only newlines preceded by whitespace were collapsed. Fixes #742.
* Deprecate version and version_info (#740)Isaac Muse2018-10-251-0/+39
| | | | This essentially implements the closest we can get to PEP 562 which allows for modules to control `__dir__` and `__getattr__` in order to deprecate attributes. Here we provide a wrapper class for the module in `util`. If a module has attributes that need to deprecated, we derive from the wrapper class and define the attributes as functions with the `property` decorator and the provided `deprecated` decorator. The class is instantiated with the module's `__name__` attribute and the class will properly replace the module with the wrapped module. When accessing the depracted attributes, a warning is raised. Closes #739.
* Ensure block elements are defined per instancefacelessuser2018-10-091-0/+12
| | | | Block level elements should be defined per instance, not as base class variables.
* foot note adjustments (#728)Isaac Muse2018-10-099-2455/+2496
| | | | | | Add a config to set the footnote separator. Also remove rel/rev as they aren't really compatible with HTML5 and we already have classes set for refs and backrefs. Fixes 723.
* Fix double escaping of block code (#727)Isaac Muse2018-10-071-0/+92
| | | | | Fixes #725
* Add a test for the markdown/serializers.py changeDmitry Shachnev2018-09-251-0/+4
|
* Make ENTITY_RE support hexadecimal entitiesissue712Dmitry Shachnev2018-09-251-0/+44
| | | | Fixes #712.
* Support custom labels in TOC. (#700)Waylan Limberg2018-09-051-9/+25
| | | | | | | | New `toc_tokens` attribute on Markdown class. Contains the raw tokens used to build the Table of Contents. Users can use this to build their own custom Table of Contents rather than needing to parse the HTML available on the `toc` attribute of the Markdown class.