This commit is contained in:
Vinta 2016-03-21 03:38:45 +08:00
parent 74c8446b24
commit f184109b42

View File

@ -220,13 +220,13 @@ Inspired by [awesome-php](https://github.com/ziadoz/awesome-php).
* Office * Office
* [Marmir](https://github.com/brianray/mm) - Takes Python data structures and turns them into spreadsheets. * [Marmir](https://github.com/brianray/mm) - Takes Python data structures and turns them into spreadsheets.
* [openpyxl](https://openpyxl.readthedocs.org/en/latest/) - A library for reading and writing Excel 2010 xlsx/xlsm/xltx/xltm files. * [openpyxl](https://openpyxl.readthedocs.org/en/latest/) - A library for reading and writing Excel 2010 xlsx/xlsm/xltx/xltm files.
* [pyexcel](http://pyexcel.readthedocs.org/en/latest/) - Providing one API for reading, manipulating and writing csv, ods, xls, xlsx and xlsm files.
* [python-docx](https://github.com/python-openxml/python-docx) - Reads, queries and modifies Microsoft Word 2007/2008 docx files. * [python-docx](https://github.com/python-openxml/python-docx) - Reads, queries and modifies Microsoft Word 2007/2008 docx files.
* [relatorio](http://relatorio.tryton.org/) - Templating OpenDocument files.
* [unoconv](https://github.com/dagwieers/unoconv) - Convert between any document format supported by LibreOffice/OpenOffice. * [unoconv](https://github.com/dagwieers/unoconv) - Convert between any document format supported by LibreOffice/OpenOffice.
* [XlsxWriter](https://xlsxwriter.readthedocs.org/en/latest/) - A Python module for creating Excel .xlsx files. * [XlsxWriter](https://xlsxwriter.readthedocs.org/en/latest/) - A Python module for creating Excel .xlsx files.
* [xlwings](http://xlwings.org/) - A BSD-licensed library that makes it easy to call Python from Excel and vice versa. * [xlwings](http://xlwings.org/) - A BSD-licensed library that makes it easy to call Python from Excel and vice versa.
* [xlwt](https://github.com/python-excel/xlwt) / [xlrd](https://github.com/python-excel/xlrd) - Writing and reading data and formatting information from Excel files. * [xlwt](https://github.com/python-excel/xlwt) / [xlrd](https://github.com/python-excel/xlrd) - Writing and reading data and formatting information from Excel files.
* [relatorio](http://relatorio.tryton.org/) - Templating OpenDocument files.
* [pyexcel](http://pyexcel.readthedocs.org/en/latest/) - Python Wrapper that provides one API for reading, manipulating and writing csv, ods, xls, xlsx and xlsm files
* PDF * PDF
* [PDFMiner](https://github.com/euske/pdfminer) - A tool for extracting information from PDF documents. * [PDFMiner](https://github.com/euske/pdfminer) - A tool for extracting information from PDF documents.
* [PyPDF2](https://github.com/mstamy2/PyPDF2) - A library capable of splitting, merging and transforming PDF pages. * [PyPDF2](https://github.com/mstamy2/PyPDF2) - A library capable of splitting, merging and transforming PDF pages.
@ -246,13 +246,13 @@ Inspired by [awesome-php](https://github.com/ziadoz/awesome-php).
*Libraries for working with human languages.* *Libraries for working with human languages.*
* [NLTK](http://www.nltk.org/) - A leading platform for building Python programs to work with human language data. * [NLTK](http://www.nltk.org/) - A leading platform for building Python programs to work with human language data.
* [Jieba](https://github.com/fxsjy/jieba) - Chinese text segmentation.
* [langid.py](https://github.com/saffsd/langid.py) - Stand-alone language identification system.
* [Pattern](http://www.clips.ua.ac.be/pattern) - A web mining module for the Python. * [Pattern](http://www.clips.ua.ac.be/pattern) - A web mining module for the Python.
* [Jieba](https://github.com/fxsjy/jieba) - Chinese text segmentation.
* [SnowNLP](https://github.com/isnowfy/snownlp) - A library for processing Chinese text. * [SnowNLP](https://github.com/isnowfy/snownlp) - A library for processing Chinese text.
* [spaCy](https://spacy.io/) - A library for industrial-strength natural language processing in Python and Cython.
* [TextBlob](http://textblob.readthedocs.org/en/latest/) - Providing a consistent API for diving into common NLP tasks. * [TextBlob](http://textblob.readthedocs.org/en/latest/) - Providing a consistent API for diving into common NLP tasks.
* [TextGrocery](https://github.com/2shou/TextGrocery) - A simple, efficient short-text classification tool based on LibLinear and Jieba. * [TextGrocery](https://github.com/2shou/TextGrocery) - A simple, efficient short-text classification tool based on LibLinear and Jieba.
* [Spacy.io](https://spacy.io/) - spaCy is a library for industrial-strength natural language processing in Python and Cython. * [langid.py](https://github.com/saffsd/langid.py) - Stand-alone language identification system.
## Documentation ## Documentation