This module imports a copy of
html.parser.HTMLParser and modifies it heavily through monkey-patches.
A copy is imported rather than the module being directly imported as this ensures that the user can import
and use the unmodified library for their own needs.
Extract raw HTML from text.
Returns char index in
self.rawdata for the start of the current line.
Returns True if current position is at start of line.
Allows for up to three blank spaces at start of line.
Returns the text of the end tag.
If it fails to extract the actual text from the raw data, it builds a closing tag with
Handle empty tags (