{py:mod}abx_plugin_htmltotext.htmltotext

:allowtitles:

Module Contents

Classes

:class: autosummary longtable
:align: left

* - {py:obj}`HTMLTextExtractor <abx_plugin_htmltotext.htmltotext.HTMLTextExtractor>`
  -

Functions

:class: autosummary longtable
:align: left

* - {py:obj}`get_output_path <abx_plugin_htmltotext.htmltotext.get_output_path>`
  - ```{autodoc2-docstring} abx_plugin_htmltotext.htmltotext.get_output_path
    :summary:
    ```
* - {py:obj}`should_save_htmltotext <abx_plugin_htmltotext.htmltotext.should_save_htmltotext>`
  - ```{autodoc2-docstring} abx_plugin_htmltotext.htmltotext.should_save_htmltotext
    :summary:
    ```
* - {py:obj}`save_htmltotext <abx_plugin_htmltotext.htmltotext.save_htmltotext>`
  - ```{autodoc2-docstring} abx_plugin_htmltotext.htmltotext.save_htmltotext
    :summary:
    ```

API

:canonical: abx_plugin_htmltotext.htmltotext.get_output_path

```{autodoc2-docstring} abx_plugin_htmltotext.htmltotext.get_output_path
```
:canonical: abx_plugin_htmltotext.htmltotext.HTMLTextExtractor

Bases: {py:obj}`html.parser.HTMLParser`

````{py:attribute} TEXT_ATTRS
:canonical: abx_plugin_htmltotext.htmltotext.HTMLTextExtractor.TEXT_ATTRS
:value: >
   ['alt', 'cite', 'href', 'label', 'list', 'placeholder', 'title', 'value']

```{autodoc2-docstring} abx_plugin_htmltotext.htmltotext.HTMLTextExtractor.TEXT_ATTRS
```

````

````{py:attribute} NOTEXT_TAGS
:canonical: abx_plugin_htmltotext.htmltotext.HTMLTextExtractor.NOTEXT_TAGS
:value: >
   ['script', 'style', 'template']

```{autodoc2-docstring} abx_plugin_htmltotext.htmltotext.HTMLTextExtractor.NOTEXT_TAGS
```

````

````{py:attribute} NOTEXT_HREF
:canonical: abx_plugin_htmltotext.htmltotext.HTMLTextExtractor.NOTEXT_HREF
:value: >
   ['data:', 'javascript:', '#']

```{autodoc2-docstring} abx_plugin_htmltotext.htmltotext.HTMLTextExtractor.NOTEXT_HREF
```

````

````{py:method} _is_text_attr(name, value)
:canonical: abx_plugin_htmltotext.htmltotext.HTMLTextExtractor._is_text_attr

```{autodoc2-docstring} abx_plugin_htmltotext.htmltotext.HTMLTextExtractor._is_text_attr
```

````

````{py:method} _parent_tag()
:canonical: abx_plugin_htmltotext.htmltotext.HTMLTextExtractor._parent_tag

```{autodoc2-docstring} abx_plugin_htmltotext.htmltotext.HTMLTextExtractor._parent_tag
```

````

````{py:method} _in_notext_tag()
:canonical: abx_plugin_htmltotext.htmltotext.HTMLTextExtractor._in_notext_tag

```{autodoc2-docstring} abx_plugin_htmltotext.htmltotext.HTMLTextExtractor._in_notext_tag
```

````

````{py:method} handle_starttag(tag, attrs)
:canonical: abx_plugin_htmltotext.htmltotext.HTMLTextExtractor.handle_starttag

```{autodoc2-docstring} abx_plugin_htmltotext.htmltotext.HTMLTextExtractor.handle_starttag
```

````

````{py:method} handle_endtag(tag)
:canonical: abx_plugin_htmltotext.htmltotext.HTMLTextExtractor.handle_endtag

```{autodoc2-docstring} abx_plugin_htmltotext.htmltotext.HTMLTextExtractor.handle_endtag
```

````

````{py:method} handle_data(data)
:canonical: abx_plugin_htmltotext.htmltotext.HTMLTextExtractor.handle_data

```{autodoc2-docstring} abx_plugin_htmltotext.htmltotext.HTMLTextExtractor.handle_data
```

````

````{py:method} __str__()
:canonical: abx_plugin_htmltotext.htmltotext.HTMLTextExtractor.__str__

````
:canonical: abx_plugin_htmltotext.htmltotext.should_save_htmltotext

```{autodoc2-docstring} abx_plugin_htmltotext.htmltotext.should_save_htmltotext
```
:canonical: abx_plugin_htmltotext.htmltotext.save_htmltotext

```{autodoc2-docstring} abx_plugin_htmltotext.htmltotext.save_htmltotext
```

Last updated