From d3b29705a9a9d9afbea6796c8fca00313cfedb33 Mon Sep 17 00:00:00 2001 From: Daniel Reis Date: Thu, 3 Jul 2014 13:07:02 +0100 Subject: [PATCH] Add pyPDF2 to Text processing pyPDF2 is under active maintenance and can be complementary to PDFMiner --- README.md | 1 + 1 file changed, 1 insertion(+) diff --git a/README.md b/README.md index 139101f1..4502b2c8 100644 --- a/README.md +++ b/README.md @@ -140,6 +140,7 @@ A curated list of awesome Python frameworks, libraries and software. Inspired by * [xlwt](https://github.com/python-excel/xlwt) / [xlrd](https://github.com/python-excel/xlrd) - Packages is for writing and reading data and formatting information from Excel files. * [XlsxWriter](https://xlsxwriter.readthedocs.org/) - A Python module for creating Excel .xlsx files. * [PDFMiner](https://github.com/euske/pdfminer) - A tool for extracting information from PDF documents. + * [PyPDF2](https://github.com/mstamy2/PyPDF2) - a pure-python PDF library capable of splitting, merging and transforming PDF pages. * [Python-Markdown](https://github.com/waylan/Python-Markdown) - A Python implementation of John Gruber’s Markdown. * [Mistune](https://github.com/lepture/mistune) - Fastest and full featured pure Python parsers of Markdown. * [PyYAML](http://pyyaml.org/) - YAML implementations for Python.