jgnog 2014-07-01 22:36:40 +01:00
parent 960ffad2c8
commit c3f0aec7b4

@@ -380,7 +380,7 @@ A curated list of awesome Python frameworks, libraries and software. Inspired by
 *Libraries for extracting web contents.*
-* [newspaper](https://github.com/codelucas/newspaper) - News extraction, article extraction and content curation in Pythom.
+* [newspaper](https://github.com/codelucas/newspaper) - News extraction, article extraction and content curation in Python.
 * [html2text](https://github.com/aaronsw/html2text) - Convert HTML to Markdown-formatted text.
 * [python-goose](https://github.com/grangier/python-goose) - HTML Content/Article Extractor.
 * [lassie](https://github.com/michaelhelmick/lassie) - Web Content Retrieval for Humans.