A curated list of awesome Python frameworks, libraries, software and resources
Go to file
2014-06-30 19:54:32 +05:30
.gitignore add .gitignore 2014-06-28 05:01:39 +08:00
Life is a piece of shit, you need Python.gif Life is short, you need Python. 2014-06-28 15:23:44 +08:00
README.md Add Beaker under caching libraries in the README.md 2014-06-30 19:54:32 +05:30

Awesome Python

A curated list of awesome Python frameworks, libraries and resources. Inspired by awesome-php.

Life is a piece of shit

so you need Python.

Environment Management

Libraries for Python version and environment management.

Package Management

Libraries for package and dependency management.

  • pip / Python Package Index - The package and dependency manager.
  • wheel - The new standard of python distribution and are intended to replace eggs.

Files

Libraries for file manipulation and MIME type detection.

  • mimetypes - (Python standard library) Map filenames to MIME types.
  • imghdr - (Python standard library) Determine the type of an image.
  • python-magic - A Python interface to the libmagic file type identification library.
  • path.py - A module wrapper for os.path.
  • watchdog - API and shell utilities to monitor file system events.
  • django-storages - A collection of custom storage backends for Django.

Date and Time

Libraries for working with dates and times.

  • arrow - Better dates & times for Python.
  • dateutil - Extensions to the standard Python datetime module.
  • delorean - A library for clearing up the inconvenient truths that arise dealing with datetimes in Python.
  • when.py - Providing user-friendly functions to help perform common date and time actions.
  • moment - A Python library for dealing with dates/times. Inspired by Moment.js.

Text Processing

Libraries for parsing and manipulating texts.

  • difflib - (Python standard library) Helpers for computing deltas.
  • Levenshtein - Fast computation of Levenshtein distance and string similarity.
  • fuzzywuzzy - Fuzzy String Matching.
  • esmre - Regular expression accelerator.
  • phonenumbers - Library for parsing, formatting, storing and validating international phone numbers.
  • python-user-agents - Browser user agent parser.
  • shortuuid - A generator library for concise, unambiguous and URL-safe UUIDs.
  • tablib - A module for Tabular Datasets in XLS, CSV, JSON, YAML.
  • sqlparse - A non-validating SQL parser.
  • Pygments - A generic syntax highlighter.
  • python-slugify - A Python slugify library that handles unicode.
  • unicode-slugify - A slugifier that generates unicode slugs. Developed by Mozilla.
  • unidecode - ASCII transliterations of Unicode text.
  • chardet - Python 2/3 compatible character encoding detector.
  • xpinyin - A library to translate chinese hanzi (漢字) to pinyin (拼音).
  • pangu.py - Spacing texts.
  • python-docx - Reads, queries and modifies Microsoft Word 2007/2008 docx files.
  • xlwt / xlrd - A package is for writing and reading data and formatting information from Excel files.
  • XlsxWriter - A Python module for creating Excel .xlsx files.
  • PyYAML - YAML implementations for Python.

Natural Language Processing

Libraries for working with human languages.

  • NLTK - A leading platform for building Python programs to work with human language data.
  • TextBlob - Providing a consistent API for diving into common natural language processing (NLP) tasks.
  • jieba - Chinese Words Segementation Utilities.
  • snownlp - A library for processing Chinese text.
  • loso - Another Chinese segmentation library.

Command-line Tools

Libraries for building command line utilities.

  • click - A package for creating beautiful command line interfaces in a composable way.
  • clint - Python Command-line Application Tools.
  • docopt - Pythonic command line arguments parser.
  • colorama - Cross-platform colored terminal text.

Documentation

Libraries for generating project documentation.

  • Sphinx - Python Documentation generator.

Imagery

Libraries for manipulating images.

  • pillow - Pillow is the friendly PIL fork. PIL is the Python Imaging Library.
  • thumbor - A smart imaging service. It enables on-demand crop, resizing and flipping of images.
  • imgSeek - A project for searching a collection of images using visual similarity.
  • python-qrcode - A pure Python QR Code generator.
  • pygram - Instagram-like image filters.
  • Quads - Computer art based on quadtrees.
  • nude.py - Nudity detection.
  • wand - Python bindings for MagickWand, C API for ImageMagick.

Audio

Libraries for manipulating audio.

  • django-elastic-transcoder - Django + AWS Elastic Transcoder.
  • beets - A music library manager and MusicBrainz tagger.
  • pyechonest - Python client for the Echo Nest API.
  • dejavu - Audio fingerprinting and recognition.
  • pydub - Manipulate audio with a simple and easy high level interface.
  • audioread - Cross-library (GStreamer + Core Audio + MAD + FFmpeg) audio decoding.
  • mutagen - A Python module to handle audio metadata.
  • tinytag - A library for reading music meta data of MP3, OGG, FLAC and Wave files.

Video

Libraries for manipulating video and GIFs.

  • moviepy - A module for script-based movie editing with many formats, including animated GIFs.
  • shorten.tv - Video summarization.
  • youtube-dl - A small command-line program to download videos from YouTube.

Geolocation

Libraries for geocoding addresses and working with latitudes and longitudes.

  • GeoDjango - A world-class geographic web framework.
  • geopy - Python Geocoding Toolbox.
  • pygeoip - Pure Python GeoIP API.
  • GeoIP - Python API for MaxMind GeoIP Legacy Database.
  • geojson - Python bindings and utlities for GeoJSON.

HTTP

Libraries for working with HTTP.

  • requests - HTTP Requests for Humans™.
  • httpie - A command line HTTP client, a user-friendly cURL replacement.

Database Drivers

Libraties for connecting and operating databases

  • mysql-python - The MySQL database connector for Python.
  • psycopg2 - The most popular PostgreSQL adapter for the Python.
  • PyMongo - The official Python client for MongoDB.
  • redis-py - The Redis Python Client.

ORM

Libraries that implement Object-Relational Mapping or datamapping techniques.

  • Relational Databases
  • NoSQL Databases
    • MongoEngine - A Python Object-Document-Mapper for working with MongoDB.
    • redisco - A Python Library for Simple Models and Containers Persisted in Redis.

Web Frameworks

Web development frameworks.

  • Django - The most popular web framework in Python.
  • Flask - A microframework for Python.
  • Bottle - A fast, simple and lightweight WSGI micro web-framework.
  • Pyramid - A small, fast, down-to-earth, open source Python web framework.

CMS

Content Management Systems

  • Mezzanine - A powerful, consistent, and flexible content management platform.
  • Wagtail - A Django content management system
  • django-oscar - An open-source ecommerce framework for Django.

RESTful API

Libraries for developing RESTful APIs.

  • django-rest-framework - A powerful and flexible toolkit that makes it easy to build Web APIs.
  • django-tastypie - Creating delicious APIs for Django apps.
  • flask-api - An implementation of the same web browsable APIs that django-rest-framework provides.
  • flask-restful - An extension for Flask that adds support for quickly building REST APIs.
  • falcon - A high-performance Python framework for building cloud APIs and web app backends.
  • eve - REST API framework powered by Flask, MongoDB and good intentions.
  • sandman - Automated REST APIs for existing database-driven systems.

Authentication and OAuth

Libraries for implementing authentications schemes.

Template Engine

Libraries and tools for templating and lexing.

  • Jinja2 - A modern and designer friendly templating language.

Queue

Libraries for working with event and task queues.

  • celery - An asynchronous task queue/job queue based on distributed message passing.
  • rq - Simple job queues for Python.

Libraries and software for indexing and performing search queries on data.

Asset Management

Tools for managing, compressing and minifying website assets.

  • django-compressor - Compresses linked and inline javascript or CSS into a single cached file.

Caching

Libraries for caching data.

  • Beaker Caching & Sessions - Beaker is a library for caching and sessions for use with web applications and stand-alone Python scripts and applications.

Email

Libraries for sending and parsing email.

Internationalization

Libraries for woking with i18n.

  • Babel - An internationalization library for Python.

URL Manipulation

Libraries for parsing URLs.

  • furl - A small Python library that makes manipulating URLs simple.
  • purl - A simple, immutable URL class with a clean API for interrogation and manipulation.

HTML Manipulation

Libraries for working with HTML and XML.

  • BeautifulSoup - Providing Pythonic idioms for iterating, searching, and modifying HTML or XML.
  • lxml - A very fast, easy-to-use and versatile library for handling HTML and XML.
  • html5lib - A standards-compliant library for parsing and serializing HTML documents and fragments.
  • pyquery - A jQuery-like library for parsing HTML.
  • cssutils - A CSS library for Python.
  • MarkupSafe - Implements a XML/HTML/XHTML Markup safe string for Python.
  • bleach - A whitelist-based HTML sanitization and text linkification library.
  • xmltodict - Working with XML feel like you are working with JSON.

Web Crawling

Libraries for scraping websites.

  • Scrapy - A fast high-level screen scraping and web crawling framework.
  • portia - Visual scraping for Scrapy.
  • feedparser - Universal feed parser.
  • RoboBrowser - A simple, Pythonic library for browsing the web without a standalone web browser.

Web Content Extracting

Libraries for extracting web contents.

  • newspaper - News extraction, article extraction and content curation in Pythom.
  • html2text - Convert HTML to Markdown-formatted text.
  • python-goose - HTML Content/Article Extractor.
  • lassie - Web Content Retrieval for Humans.
  • micawber - A small library for extracting rich content from URLs.
  • sumy - A module for automatic summarization of text documents and HTML pages.
  • Haul - An Extensible Image Crawler.
  • python-readability - Fast Python port of arc90's readability tool.

Forms

Libraries for working with forms.

  • django-crispy-forms - A Django app which lets you create beautiful forms in a very elegant and DRY way.
  • WTForms - A flexible forms validation and rendering library.

Anti-spam

Libraries for fighting spam.

  • TODO

Networking

Libraries for network programming.

  • gevent - A coroutine-based Python networking library that uses greenlet.
  • Twisted - An event-driven networking engine.
  • Tornado - A Web framework and asynchronous networking library.

Admin Panels

Libraries for administrative interfaces.

  • Ajenti - The admin panel your servers deserve.
  • django-xadmin - Drop-in replacement of Django admin comes with lots of goodies.
  • flask-admin - Simple and extensible administrative interface framework for Flask.
  • flower - Real-time monitor and web admin for Celery.

DevOps Tools

Software and libraries for DevOps

  • OpenStack - Open source software for building private and public clouds.
  • Ansible - An IT automation tool.
  • SaltStack - Infrastructure automation and management system.
  • Fabric - Tool for streamlining the use of SSH for application deployment or systems administration tasks.
  • Fabtools - Tools for writing awesome Fabric files.
  • cuisine - Chef-like functionality for Fabric.
  • gunicorn - A WSGI HTTP Server for UNIX, fast clients and sleepy applications.
  • Supervisor - A Process Control System.
  • psutil - A cross-platform process and system utilities module.

GUI

Libraries for working with GUI applications.

  • PySide - Python bindings for the Qt cross-platform application and UI framework.
  • wxPython - A blending of the wxWidgets C++ class library with the Python.
  • kivy - A library for creating NUI applications, running on Windows, Linux, Mac OS X, Android and iOS.

Logging

Libraries for generating and working with log files.

  • logging - (Python standard library) Logging facility for Python.
  • Sentry - A realtime logging and aggregation server.
  • Raven - The Python client for Sentry.

Testing

Libraries for testing codebases and generating test data.

  • Testing Frameworks
    • unittest - (Python standard library) Unit testing framework.
    • nose - nose extends unittest.
    • pytest - A mature full-featured Python testing tool.
  • Mock
    • mock - A Python Mocking and Patching Library for Testing.
    • responses - A utility library for mocking out the requests Python library.
  • Fake Data
    • faker - A Python package that generates fake data.
    • model_mommy - Creating random fixtures for testing in Django.
  • Code Coverage

Code Analysis and Linter

Libraries and tools for analysing, parsing and manipulation codebases.

  • Flake8 - The modular source code checker: pep8, pyflakes and co.
  • Pylint - A source code analyzer.

Debugging Tools

Libraries for dubugging and developing

  • pdb - (Python standard library) The Python Debugger.
  • pyringe - Debugger capable of attaching to and injecting code into Python processes.
  • django-debug-toolbar - Display various debug information about the current request/response.
  • django-devserver - A drop-in replacement for Django's runserver.

Science and Data Aanalysis

Libraries for scientific computing and data analyzing.

  • SciPy - A Python-based ecosystem of open-source software for mathematics, science, and engineering.
  • NumPy - A fundamental package for scientific computing with Python.
  • matplotlib - A Python 2D plotting library.
  • NetworkX - A high-productivity software for complex networks.
  • Pandas - A library providing high-performance, easy-to-use data structures and data analysis tools.
  • PyMC - Markov Chain Monte Carlo sampling toolkit.
  • zipline - A Pythonic algorithmic trading library.
  • ggplot - Same API as ggplot2 for R.

Machine Learning

Libraries for Machine Learning

  • scikit-learn - A Python module for machine learning built on top of SciPy.
  • pattern - Web mining module for Python.
  • NuPIC - Numenta Platform for Intelligent Computing.
  • Pylearn2 - A Machine Learning library based on Theano.
  • hebel - GPU-Accelerated Deep Learning Library in Python.
  • gensim - Topic Modelling for Humans.
  • PyBrain - Another Python Machine Learning Library.
  • Crab - A flexible, fast recommender engine.
  • python-recsys - A Python library for implementing a Recommender System.

MapReduce

Framworks and libraries for MapReduce.

  • PySpark - The Spark Python API.
  • dpark - Python clone of Spark, a MapReduce alike framework in Python.
  • luigi - A module that helps you build complex pipelines of batch jobs.
  • mrjob - Run MapReduce jobs on Hadoop or Amazon Web Services.
  • dumbo - Python module that allows one to easily write and run Hadoop programs.

Third-party APIs

Libraries for accessing third party APIs.

  • apache-libcloud - One Python library for all clouds.
  • boto - Python interface to Amazon Web Services.
  • s3cmd - A command line tool for managing Amazon S3 and CloudFront.
  • twython - A Python wrapper for the Twitter API.

Algorithms and Design Patterns

Collections of algorithms and design patterns.

Miscellaneous

Useful libraries or tools that don't fit in the categories above.

  • IPython - IPython provides a rich toolkit to help you make the most out of using Python interactively.

Resources

Books

  • TODO

Websites

Articles

Contributing

Your contributions are always welcome!