mirror of
https://github.com/vinta/awesome-python.git
synced 2024-11-27 22:11:07 +00:00
add textract to Web Content Extracting section
This commit is contained in:
parent
a7942efbbf
commit
d52ea10139
|
@ -565,6 +565,7 @@ A curated list of awesome Python frameworks, libraries and software. Inspired by
|
|||
* [Haul](https://github.com/vinta/Haul) - An Extensible Image Crawler.
|
||||
* [python-readability](https://github.com/buriy/python-readability) - Fast Python port of arc90's readability tool.
|
||||
* [opengraph](https://github.com/erikriver/opengraph) - A Python module to parse the Open Graph Protocol
|
||||
* [textract](https://github.com/deanmalmgren/textract) - Extract text from any document, Word documents, PowerPoint presentations, PDFs, etc.
|
||||
|
||||
## Forms
|
||||
|
||||
|
|
Loading…
Reference in New Issue
Block a user