mirror of
https://github.com/vinta/awesome-python.git
synced 2025-05-10 05:24:01 +00:00
add textract to Web Content Extracting section
parent a7942efbbf
commit d52ea10139
@@ -565,6 +565,7 @@ A curated list of awesome Python frameworks, libraries and software. Inspired by
 * [Haul](https://github.com/vinta/Haul) - An Extensible Image Crawler.
 * [python-readability](https://github.com/buriy/python-readability) - Fast Python port of arc90's readability tool.
 * [opengraph](https://github.com/erikriver/opengraph) - A Python module to parse the Open Graph Protocol
+* [textract](https://github.com/deanmalmgren/textract) - Extract text from any document, Word documents, PowerPoint presentations, PDFs, etc.
 
 ## Forms
 