Footnote ref number in TOC #660

mustafa0x · 2018-05-10T20:26:55Z

In [13]: t = '''
    ...: # Header with footnote[^1]
    ...: 
    ...: Lorem Ipsum
    ...: 
    ...: [^1]: footnote text
    ...: '''

In [14]: print(markdown.markdown('[TOC]\n\n' + t, extensions=['markdown.extensions.toc', 'markdown.extensions.footnotes']))

<div class="toc">
<ul>
<li><a href="#header-with-footnote1">Header with footnote1</a></li> <!-- Note the '1' in the header id and more importantly the link text -->
</ul>
</div>
<h1 id="header-with-footnote1">Header with footnote<sup id="fnref:1"><a class="footnote-ref" href="#fn:1" rel="footnote">1</a></sup></h1>
<p>Lorem Ipsum</p>
<div class="footnote">
<hr />
<ol>
<li id="fn:1">
<p>footnote text&#160;<a class="footnote-backref" href="#fnref:1" rev="footnote" title="Jump back to footnote 1 in the text">&#8617;</a></p>
</li>
</ol>
</div>

waylan · 2018-05-10T23:16:20Z

So the code which sanitizes the text for use in the TOC is pretty simple. It simply pulls the text from the HTML elements. It could be significantly more complex to exclude footnote refs. And I find it odd that we would need to only do this for a non-standard add-on syntax. Additionally, the fact that this is only being reported now suggests that this is an unusual edge case that not many users will encounter.

That said, it is clearly not what one would expect and should probably be fixed. Of course, pull requests are welcome.

- All postprocessors are run on heading content (not just `RawHtmlPostprocessor`). - Footnote references are stripped from heading content. Fixes Python-Markdown#660. - A more robust `striptags` is provided to convert headings to plain text. Unlike, markupsafe's implementation, HTML entities are not unescaped. - Both the plain text `name` and rich `html` are saved to `toc_tokens`, which means users can now access the full rich text content of the headings directly from the `toc_tokens`. - `data-toc-label` is sanitized separate from heading content. - A `html.unescape` call added to `slugify` and `slugify_unicode`, which ensures `slugify` operates on Unicode characters, rather than HTML entities. By including in the functions, users can override with their own slugify functions if they desire. Note that this first commit includes minimal changes to the tests to show very little change in behavior (mostly the new `html` attribute of the `toc_tokens` was added). A refactoring of the tests will be in a separate commit.

* All postprocessors are run on heading content. * Footnote references are stripped from heading content. Fixes #660. * A more robust `striptags` is provided to convert headings to plain text. Unlike, the `markupsafe` implementation, HTML entities are not unescaped. * The plain text `name`, rich `html` and unescaped raw `data-toc-label` are saved to `toc_tokens`, allowing users to access the full rich text content of the headings directly from `toc_tokens`. * `data-toc-label` is sanitized separate from heading content. * A `html.unescape` call is made just prior to calling `slugify` so that `slugify` only operates on Unicode characters. Note that `html.unescape` is not run on the `name` or `html`. * The `get_name` and `stashedHTML2text` functions defined in the `toc` extension are both **deprecated**. Instead, use some combination of `run_postprocessors`, `render_inner_html` and `striptags`. Co-authored-by: Oleh Prypin <oleh@pryp.in>

waylan added bug extension labels May 10, 2018

This was referenced Aug 15, 2018

is there a way to render math in toc? #699

Closed

Support custom labels in TOC. #700

Merged

waylan added the someday-maybe label Oct 23, 2018

chbndrhnns mentioned this issue Feb 16, 2023

TOC contains footnote numbers squidfunk/mkdocs-material#5057

Closed

4 tasks

waylan mentioned this issue Apr 18, 2023

Heading subscripts are stripped out in TOC extension #935

Closed

waylan mentioned this issue Feb 9, 2024

Refactor TOC sanitation #1441

Merged

waylan closed this as completed in #1441 Mar 8, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Footnote ref number in TOC #660

Footnote ref number in TOC #660

mustafa0x commented May 10, 2018 •

edited

Loading

waylan commented May 10, 2018 •

edited

Loading

Footnote ref number in TOC #660

Footnote ref number in TOC #660

Comments

mustafa0x commented May 10, 2018 • edited Loading

waylan commented May 10, 2018 • edited Loading

mustafa0x commented May 10, 2018 •

edited

Loading

waylan commented May 10, 2018 •

edited

Loading