{py:mod}abx_plugin_htmltotext.htmltotext
:allowtitles:
Module Contents
Classes
:class: autosummary longtable
:align: left
* - {py:obj}`HTMLTextExtractor <abx_plugin_htmltotext.htmltotext.HTMLTextExtractor>`
-
Functions
:class: autosummary longtable
:align: left
* - {py:obj}`get_output_path <abx_plugin_htmltotext.htmltotext.get_output_path>`
- ```{autodoc2-docstring} abx_plugin_htmltotext.htmltotext.get_output_path
:summary:
```
* - {py:obj}`should_save_htmltotext <abx_plugin_htmltotext.htmltotext.should_save_htmltotext>`
- ```{autodoc2-docstring} abx_plugin_htmltotext.htmltotext.should_save_htmltotext
:summary:
```
* - {py:obj}`save_htmltotext <abx_plugin_htmltotext.htmltotext.save_htmltotext>`
- ```{autodoc2-docstring} abx_plugin_htmltotext.htmltotext.save_htmltotext
:summary:
```
API
:canonical: abx_plugin_htmltotext.htmltotext.get_output_path
```{autodoc2-docstring} abx_plugin_htmltotext.htmltotext.get_output_path
```
:canonical: abx_plugin_htmltotext.htmltotext.HTMLTextExtractor
Bases: {py:obj}`html.parser.HTMLParser`
````{py:attribute} TEXT_ATTRS
:canonical: abx_plugin_htmltotext.htmltotext.HTMLTextExtractor.TEXT_ATTRS
:value: >
['alt', 'cite', 'href', 'label', 'list', 'placeholder', 'title', 'value']
```{autodoc2-docstring} abx_plugin_htmltotext.htmltotext.HTMLTextExtractor.TEXT_ATTRS
```
````
````{py:attribute} NOTEXT_TAGS
:canonical: abx_plugin_htmltotext.htmltotext.HTMLTextExtractor.NOTEXT_TAGS
:value: >
['script', 'style', 'template']
```{autodoc2-docstring} abx_plugin_htmltotext.htmltotext.HTMLTextExtractor.NOTEXT_TAGS
```
````
````{py:attribute} NOTEXT_HREF
:canonical: abx_plugin_htmltotext.htmltotext.HTMLTextExtractor.NOTEXT_HREF
:value: >
['data:', 'javascript:', '#']
```{autodoc2-docstring} abx_plugin_htmltotext.htmltotext.HTMLTextExtractor.NOTEXT_HREF
```
````
````{py:method} _is_text_attr(name, value)
:canonical: abx_plugin_htmltotext.htmltotext.HTMLTextExtractor._is_text_attr
```{autodoc2-docstring} abx_plugin_htmltotext.htmltotext.HTMLTextExtractor._is_text_attr
```
````
````{py:method} _parent_tag()
:canonical: abx_plugin_htmltotext.htmltotext.HTMLTextExtractor._parent_tag
```{autodoc2-docstring} abx_plugin_htmltotext.htmltotext.HTMLTextExtractor._parent_tag
```
````
````{py:method} _in_notext_tag()
:canonical: abx_plugin_htmltotext.htmltotext.HTMLTextExtractor._in_notext_tag
```{autodoc2-docstring} abx_plugin_htmltotext.htmltotext.HTMLTextExtractor._in_notext_tag
```
````
````{py:method} handle_starttag(tag, attrs)
:canonical: abx_plugin_htmltotext.htmltotext.HTMLTextExtractor.handle_starttag
```{autodoc2-docstring} abx_plugin_htmltotext.htmltotext.HTMLTextExtractor.handle_starttag
```
````
````{py:method} handle_endtag(tag)
:canonical: abx_plugin_htmltotext.htmltotext.HTMLTextExtractor.handle_endtag
```{autodoc2-docstring} abx_plugin_htmltotext.htmltotext.HTMLTextExtractor.handle_endtag
```
````
````{py:method} handle_data(data)
:canonical: abx_plugin_htmltotext.htmltotext.HTMLTextExtractor.handle_data
```{autodoc2-docstring} abx_plugin_htmltotext.htmltotext.HTMLTextExtractor.handle_data
```
````
````{py:method} __str__()
:canonical: abx_plugin_htmltotext.htmltotext.HTMLTextExtractor.__str__
````
:canonical: abx_plugin_htmltotext.htmltotext.should_save_htmltotext
```{autodoc2-docstring} abx_plugin_htmltotext.htmltotext.should_save_htmltotext
```
:canonical: abx_plugin_htmltotext.htmltotext.save_htmltotext
```{autodoc2-docstring} abx_plugin_htmltotext.htmltotext.save_htmltotext
```
Last updated