awesome-python/README.md

44 KiB
Raw Blame History

Awesome Python

A curated list of awesome Python frameworks, libraries and software. Inspired by awesome-php.

Environment Management

Libraries for Python version and environment management.

Package Management

Libraries for package and dependency management.

  • pip / Python Package Index - The package and dependency manager.
  • wheel - The new standard of python distribution and are intended to replace eggs.

Distribution

Libraries to create packaged executables for release distribution.

  • cx-Freeze - Freezes Python scripts (cross-platform)
  • py2app - Freezes Python scripts (Mac OS X)
  • py2exe - Freezes Python scripts (Windows)

Files

Libraries for file manipulation and MIME type detection.

  • imghdr - (Python standard library) Determine the type of an image.
  • mimetypes - (Python standard library) Map filenames to MIME types.
  • path.py - A module wrapper for os.path.
  • pathlib - An cross-platform, object-oriented path library (included in Python 3.4)
  • python-magic - A Python interface to the libmagic file type identification library.
  • Unipath - An object-oriented approach to file/directory operations.
  • watchdog - API and shell utilities to monitor file system events.

Date and Time

Libraries for working with dates and times.

  • arrow - Better dates & times for Python.
  • dateutil - Extensions to the standard Python datetime module.
  • delorean - A library for clearing up the inconvenient truths that arise dealing with datetimes in Python.
  • moment - A Python library for dealing with dates/times. Inspired by Moment.js.
  • when.py - Providing user-friendly functions to help perform common date and time actions.

Text Processing

Libraries for parsing and manipulating texts.

  • General
    • chardet - Python 2/3 compatible character encoding detector.
    • difflib - (Python standard library) Helpers for computing deltas.
    • esmre - Regular expression accelerator.
    • fuzzywuzzy - Fuzzy String Matching.
    • Levenshtein - Fast computation of Levenshtein distance and string similarity.
    • pangu.py - Spacing texts.
    • python-slugify - A Python slugify library that handles unicode.
    • shortuuid - A generator library for concise, unambiguous and URL-safe UUIDs.
    • unicode-slugify - A slugifier that generates unicode slugs. Developed by Mozilla.
    • unidecode - ASCII transliterations of Unicode text.
    • xpinyin - A library to translate chinese hanzi (漢字) to pinyin (拼音).
  • Specific Format
    • PDFMiner - A tool for extracting information from PDF documents.
    • python-docx - Reads, queries and modifies Microsoft Word 2007/2008 docx files.
    • PyYAML - YAML implementations for Python.
    • tablib - A module for Tabular Datasets in XLS, CSV, JSON, YAML.
    • XlsxWriter - A Python module for creating Excel .xlsx files.
    • xlwt / xlrd - Packages is for writing and reading data and formatting information from Excel files.
  • Parser
    • phonenumbers - Library for parsing, formatting, storing and validating international phone numbers.
    • Pygments - A generic syntax highlighter.
    • python-nameparser - A simple Python module for parsing human names into their individual components.
    • python-user-agents - Browser user agent parser.
    • schema - A library for validating Python data structures.
    • sqlparse - A non-validating SQL parser.

Natural Language Processing

Libraries for working with human languages.

  • jieba - Chinese Words Segementation Utilities.
  • loso - Another Chinese segmentation library.
  • NLTK - A leading platform for building Python programs to work with human language data.
  • Pattern - A web mining module for the Python programming language. It has tools for natural language processing, machine learning, among others.
  • SnowNLP - A library for processing Chinese text.
  • TextBlob - Providing a consistent API for diving into common natural language processing (NLP) tasks. Stands on the giant shoulders of NLTK and Pattern, and plays nicely with both.

Configuration

Libraries for storing configuration options.

  • config - Hierarchical config from the author of logging.
  • ConfigObj - INI file parser with validation.
  • ConfigParser - (Python standard library) INI file parser.
  • profig - Config from multiple formats with value conversion.

Command-line Tools

Libraries for building command line utilities.

  • click - A package for creating beautiful command line interfaces in a composable way.
  • clint - Python Command-line Application Tools.
  • colorama - Cross-platform colored terminal text.
  • docopt - Pythonic command line arguments parser.

Documentation

Libraries for generating project documentation.

  • Mistune - Fastest and full featured pure Python parsers of Markdown.
  • MkDocs - Markdown friendly documentation generator.
  • Python-Markdown - A Python implementation of John Grubers Markdown.
  • reStructuredText - Markup Syntax and Parser Component of Docutils.
  • Sphinx - Python Documentation generator.

Imagery

Libraries for manipulating images.

  • imgSeek - A project for searching a collection of images using visual similarity.
  • nude.py - Nudity detection.
  • pillow - Pillow is the friendly PIL fork. PIL is the Python Imaging Library.
  • pyBarcode - Create barcodes in python without needing PIL.
  • pygram - Instagram-like image filters.
  • python-qrcode - A pure Python QR Code generator.
  • Quads - Computer art based on quadtrees.
  • thumbor - A smart imaging service. It enables on-demand crop, resizing and flipping of images.
  • wand - Python bindings for MagickWand, C API for ImageMagick.

Audio

Libraries for manipulating audio.

  • audioread - Cross-library (GStreamer + Core Audio + MAD + FFmpeg) audio decoding.
  • beets - A music library manager and MusicBrainz tagger.
  • dejavu - Audio fingerprinting and recognition.
  • django-elastic-transcoder - Django + AWS Elastic Transcoder.
  • mutagen - A Python module to handle audio metadata.
  • pydub - Manipulate audio with a simple and easy high level interface.
  • pyechonest - Python client for the Echo Nest API.
  • tinytag - A library for reading music meta data of MP3, OGG, FLAC and Wave files.

Video

Libraries for manipulating video and GIFs.

  • moviepy - A module for script-based movie editing with many formats, including animated GIFs.
  • shorten.tv - Video summarization.

Geolocation

Libraries for geocoding addresses and working with latitudes and longitudes.

  • GeoDjango - A world-class geographic web framework.
  • GeoIP - Python API for MaxMind GeoIP Legacy Database.
  • geojson - Python bindings and utlities for GeoJSON.
  • geopy - Python Geocoding Toolbox.
  • pygeoip - Pure Python GeoIP API.

HTTP

Libraries for working with HTTP.

  • Clients
    • httpie - A command line HTTP client, a user-friendly cURL replacement.
    • requests - HTTP Requests for Humans™.
  • WSGI Servers
    • bjoern - Asynchronous, very fast and written in C.
    • cherrypy - Multi-threaded and very stable.
    • diesel - Asynchronous (greenlets).
    • eventlet - Asynchronous framework with WSGI support.
    • fapws3 - Asynchronous (network side only), written in C.
    • gevent - Asynchronous (greenlets).
    • gunicorn - Pre-forked, partly written in C.
    • meinheld - Asynchronous, partly written in C.
    • paste - Multi-threaded, stable, tried and tested.
    • rocket - Multi-threaded.
    • tornado - Asynchronous, powers some parts of Facebook.
    • twisted - Asynchronous, well tested but... twisted.
    • waitress - Multi-threaded, poweres Pyramid.
    • wsgiref - (Python standard library) WSGI reference implementation, single-threaded.

Database

Databases implemented in Python.

  • ZODB - A native object database for Python. A key-value and object graph database.

Database Drivers

Libraties for connecting and operating databases.

  • Relational Databases
    • mysql-python - The MySQL database connector for Python.
    • psycopg2 - The most popular PostgreSQL adapter for the Python.
  • NoSQL Databases
    • PyMongo - The official Python client for MongoDB.
    • redis-py - The Redis Python Client.

ORM

Libraries that implement Object-Relational Mapping or datamapping techniques.

  • Relational Databases
    • Django Models - A part of Django.
    • peewee - A small, expressive ORM.
    • PonyORM - ORM that provides a generator-oriented interface to SQL.
    • SQLAlchemy - The Python SQL Toolkit and Object Relational Mapper.
  • NoSQL Databases
    • MongoEngine - A Python Object-Document-Mapper for working with MongoDB.
    • redisco - A Python Library for Simple Models and Containers Persisted in Redis.

Web Frameworks

Web development frameworks.

  • Bluebream - An open-source web application server, framework and library, created by the Zope community and formerly known as Zope 3.
  • Bottle - A fast, simple and lightweight WSGI micro web-framework.
  • CherryPy - A Minimalist Python Web Framework, HTTP/1.1-compliant and WSGI thread-pooled.
  • Django - The most popular web framework in Python.
  • Flask - A microframework for Python.
  • Grok - A framework built on the existing Zope 3 libraries, offers a lot of building blocks for web development.
  • Pyramid - A small, fast, down-to-earth, open source Python web framework.
  • TurboGears - The Web Framework that scales with you. Starts as a microframework and scales up to a fullstack solution.
  • web.py - A web framework for Python that is as simple as it is powerful.
  • web2py - A full stack web framework and platform focused in the ease of use.
  • Zope Toolkit - A set of libraries intended for reuse by projects to develop web applications or web frameworks.

CMS

Content management systems

  • django-cms - An Open source enterprise content management system based on the django framework.
  • django-oscar - An open-source ecommerce framework for Django.
  • Mezzanine - A powerful, consistent, and flexible content management platform.
  • Opps CMS - A Django-based CMS for magazines, newspapers websites and portals with high-traffic.
  • Plone - Content Management System built on top of the open source application server Zope and the accompanying Content Management Framework.
  • Quokka CMS - Flexible, extensible, small CMS powered by Flask and MongoDB.
  • Wagtail - A Django content management system.

RESTful API

Libraries for developing RESTful APIs.

  • cornice - A REST framework for Pyramid
  • django-rest-framework - A powerful and flexible toolkit that makes it easy to build Web APIs.
  • django-tastypie - Creating delicious APIs for Django apps.
  • eve - REST API framework powered by Flask, MongoDB and good intentions.
  • falcon - A high-performance Python framework for building cloud APIs and web app backends.
  • flask-api - An implementation of the same web browsable APIs that django-rest-framework provides.
  • flask-restful - An extension for Flask that adds support for quickly building REST APIs.
  • sandman - Automated REST APIs for existing database-driven systems.

Authentication and OAuth

Libraries for implementing authentications schemes.

  • django-allauth - Authentication app for Django that "just works."
  • django-oauth-toolkit - OAuth2 goodies for the Djangonauts.
  • django-oauth2-provider - Providing OAuth2 access to Django app.
  • Flask-OAuthlib - OAuth 1.0/a, 2.0 implementation of client and provider for Flask.
  • OAuthLib - A generic, spec-compliant, thorough implementation of the OAuth request-signing logic.
  • python-oauth2 - A fully tested, abstract interface to creating OAuth clients and servers.
  • python-social-auth - An easy-to-setup social authentication mechanism.
  • rauth - A Python library for OAuth 1.0/a, 2.0, and Ofly.

Template Engine

Libraries and tools for templating and lexing.

  • Chameleon - Chameleon is an HTML/XML template engine for Python. Modeled after ZPT, optimized for speed.
  • Genshi - Python templating toolkit for generation of web-aware output.
  • Jinja2 - A modern and designer friendly templating language.
  • Mako - Hyperfast and lightweight templating for the Python platform.
  • Spitfire - A very fast Python template compiler.

Queue

Libraries for working with event and task queues.

  • celery - An asynchronous task queue/job queue based on distributed message passing.
  • huey - Little multi-threaded task queue.
  • mrq - Mr. Queue - A distributed worker task queue in Python using Redis & gevent.
  • rq - Simple job queues for Python.

Libraries and software for indexing and performing search queries on data.

Asset Management

Tools for managing, compressing and minifying website assets.

  • django-compressor - Compresses linked and inline javascript or CSS into a single cached file.
  • django-storages - A collection of custom storage backends for Django.
  • fanstatic - Packages, optimizes, and serves static file dependencies as Python packages.
  • fileconveyor - Monitors changes, processes, and transports assets to CDNs and file storage systems.
  • webassets - Bundles, optimizes, and manages unique cache-busting URLs for static resources.

Caching

Libraries for caching data.

  • Beaker Caching & Sessions - Beaker is a library for caching and sessions for use with web applications and stand-alone Python scripts and applications.
  • Dogpile Cache - Dogpile Cache is next generation replacement for Beaker made by same authors.
  • HermesCache - Python caching library with tag-based invalidation and dogpile effect prevention.

Email

Libraries for sending and parsing email.

  • django-celery-ses - Django email backend with AWS SES and Celery.
  • flanker - A email address and Mime parsing library.
  • imbox - Python IMAP for Humans.
  • inbox.py - Python SMTP Server for Humans.
  • lamson - Pythonic SMTP Application Server.
  • marrow.mailer - High-performance extensible mail delivery framework.

Internationalization

Libraries for woking with i18n.

  • Babel - An internationalization library for Python.

URL Manipulation

Libraries for parsing URLs.

  • furl - A small Python library that makes manipulating URLs simple.
  • purl - A simple, immutable URL class with a clean API for interrogation and manipulation.
  • pyshorteners - A pure python URL shortening lib.

HTML Manipulation

Libraries for working with HTML and XML.

  • BeautifulSoup - Providing Pythonic idioms for iterating, searching, and modifying HTML or XML.
  • bleach - A whitelist-based HTML sanitization and text linkification library.
  • cssutils - A CSS library for Python.
  • html5lib - A standards-compliant library for parsing and serializing HTML documents and fragments.
  • lxml - A very fast, easy-to-use and versatile library for handling HTML and XML.
  • MarkupSafe - Implements a XML/HTML/XHTML Markup safe string for Python.
  • pyquery - A jQuery-like library for parsing HTML.
  • xhtml2pdf - HTML/CSS to PDF converter.
  • xmltodict - Working with XML feel like you are working with JSON.

Web Crawling

Libraries for scraping websites.

  • feedparser - Universal feed parser.
  • portia - Visual scraping for Scrapy.
  • RoboBrowser - A simple, Pythonic library for browsing the web without a standalone web browser.
  • Scrapy - A fast high-level screen scraping and web crawling framework.

Web Content Extracting

Libraries for extracting web contents.

  • Haul - An Extensible Image Crawler.
  • html2text - Convert HTML to Markdown-formatted text.
  • lassie - Web Content Retrieval for Humans.
  • micawber - A small library for extracting rich content from URLs.
  • newspaper - News extraction, article extraction and content curation in Python.
  • python-goose - HTML Content/Article Extractor.
  • python-readability - Fast Python port of arc90's readability tool.
  • sumy - A module for automatic summarization of text documents and HTML pages.

Downloader

Libraries for downloading.

  • coursera - Script for downloading Coursera.org videos and naming them.
  • s3cmd - A command line tool for managing Amazon S3 and CloudFront.
  • you-get - A YouTube/Youku/Niconico video downloader written in Python 3.
  • youtube-dl - A small command-line program to download videos from YouTube.

Forms

Libraries for working with forms.

  • Deform - Python HTML form generation library influenced by the formish form generation library.
  • django-crispy-forms - A Django app which lets you create beautiful forms in a very elegant and DRY way.
  • WTForms - A flexible forms validation and rendering library.

Data Validation

Libraries for validating data. Used for forms in many cases.

  • colander - A system for validating and deserializing data obtained via XML, JSON, an HTML form post or any other equally simple data serialization.
  • dictshield - A fast way to validate and trim the values in a dictionary.
  • voluptuous - A Python data validation library. It is primarily intended for validating data coming into Python as JSON, YAML, etc.

Anti-spam

Libraries for fighting spam.

Tagging

Libraries for tagging items.

Build Tools

Compile software from source code.

  • buildout - A build system for creating, assembling and deploying applications from multiple parts, some of which may be non-Python-based.
  • PlatformIO - A console tool to build code with different development platforms.
  • SCons - A software construction tool.

Admin Panels

Libraries for administrative interfaces.

  • Ajenti - The admin panel your servers deserve.
  • django-xadmin - Drop-in replacement of Django admin comes with lots of goodies.
  • flask-admin - Simple and extensible administrative interface framework for Flask.
  • flower - Real-time monitor and web admin for Celery.

Processes and Threads

Libraries for woking with processes or threads

  • envoy - Python Subprocesses for Humans™.
  • sh - A full-fledged subprocess replacement for Python.

Networking

Libraries for network programming.

  • diesel - Greenlet-based event I/O Framework for Python.
  • gevent - A coroutine-based Python networking library that uses greenlet.
  • pulsar - Event-driven concurrent framework for Python.
  • pyzmq - A Python wrapper for the 0MQ message library.
  • Tornado - A Web framework and asynchronous networking library.
  • Twisted - An event-driven networking engine.

WebSocket

Libraries for woking with WebSocket.

Cryptography

  • PyCA's Cryptography - cryptography is a package designed to expose cryptographic primitives and recipes to Python developers.

GUI

Libraries for working with graphical user interface applications.

  • curses - Built-in wrapper for ncurses used to create terminal GUI applications.
  • kivy - A library for creating NUI applications, running on Windows, Linux, Mac OS X, Android and iOS.
  • pyglet - A cross-platform windowing and multimedia library for Python.
  • PyQt - Python bindings for the Qt cross-platform application and UI framework, with support for both Qt v4 and Qt v5 frameworks.
  • PySide - Python bindings for the Qt cross-platform application and UI framework, supporting the Qt v4 framework.
  • Tkinter - Tkinter is Python's de-facto standard GUI package.
  • urwid - A library for creating terminal GUI applications with strong support for widgets, events, rich colors, etc.
  • wxPython - A blending of the wxWidgets C++ class library with the Python.

Game Development

Awesome game development libraries.

  • Cocos2d - cocos2d is a framework for building 2D games, demos, and other graphical/interactive applications. It is based on pyglet.
  • Panda3D - 3D game engine developed by Disney and maintained by Carnegie Mellon's Entertainment Technology Center. Written in C++, completely wrapped in Python.
  • Pygame - Pygame is a set of Python modules designed for writing games.
  • PyOgre - Python bindings for the Ogre 3D render engine, can be used for games, simulations, anything 3D.
  • PyOpenGL - Python ctypes bindings for OpenGL and it's related APIs.
  • PySDL2 - A ctypes based wrapper for the SDL2 library.

Logging

Libraries for generating and working with log files.

  • logging - (Python standard library) Logging facility for Python.
  • Raven - The Python client for Sentry.
  • Sentry - A realtime logging and aggregation server.

Testing

Libraries for testing codebases and generating test data.

  • Testing Frameworks
    • contexts - A modern and flexible Behaviour-Driven-Development framework for Python 3.3 and above, inspired by C#'s Machine.Specifications.
    • mamba - The definitive testing tool for Python. Born under the banner of Behavior Driven Development (BDD).
    • nose - nose extends unittest.
    • pytest - A mature full-featured Python testing tool.
    • unittest - (Python standard library) Unit testing framework.
  • Mock
    • doublex - Powerful test doubles framework for Python.
    • mock - A Python Mocking and Patching Library for Testing.
    • responses - A utility library for mocking out the requests Python library.
  • Fake Data
    • faker - A Python package that generates fake data.
    • model_mommy - Creating random fixtures for testing in Django.
  • Code Coverage
  • Load Testing
    • locust - Scalable user load testing tool written in Python.
  • Error Handler
    • FuckIt.py - FuckIt.py uses state-of-the-art technology to make sure your Python code runs whether it has any right to or not.

Code Analysis and Linter

Libraries and tools for analysing, parsing and manipulation codebases.

  • Flake8 - The modular source code checker: pep8, pyflakes and co.
  • Pylint - A source code analyzer.
  • pysonar2 - A type inferencer and indexer for Python.

Debugging Tools

Libraries for dubugging and developing.

  • django-debug-toolbar - Display various debug information about the current request/response.
  • django-devserver - A drop-in replacement for Django's runserver.
  • ipdb - IPython-enabled pdb.
  • memory_profiler - Monitor Memory usage of Python code.
  • pdb - (Python standard library) The Python Debugger.
  • pyringe - Debugger capable of attaching to and injecting code into Python processes.
  • winpdb - A Platform Independent Python Debugger with GUI.

Science and Data Analysis

Libraries for scientific computing and data analyzing.

  • NetworkX - A high-productivity software for complex networks.
  • Numba - Python JIT (just in time) complier to LLVM aimed at scientific Python by the developers of Cython and NumPy.
  • NumPy - A fundamental package for scientific computing with Python.
  • Pandas - A library providing high-performance, easy-to-use data structures and data analysis tools.
  • PyDy - Short for Python Dynamics, used to assist with workflow in the modeling of dynamic motion based around NumPy, SciPy, IPython, and matplotlib.
  • PyMC - Markov Chain Monte Carlo sampling toolkit.
  • SciPy - A Python-based ecosystem of open-source software for mathematics, science, and engineering.
  • statsmodels - Statistical modeling and econometrics in Python.
  • SymPy - A Python library for symbolic mathematics.
  • zipline - A Pythonic algorithmic trading library.

Data Visualization

Libraries for visualizing data.

  • bokeh - Interactive Web Plotting for Python.
  • d3py - A plottling library for Python, based on D3.js.
  • ggplot - Same API as ggplot2 for R.
  • Kartograph.py - Rendering beautiful SVG maps in Python.
  • matplotlib - A Python 2D plotting library.
  • plotly - Collaborative web plotting for Python and matplotlib.
  • vincent - A Python to Vega translator.

Machine Learning

Libraries for Machine Learning.

  • Crab - A flexible, fast recommender engine.
  • gensim - Topic Modelling for Humans.
  • hebel - GPU-Accelerated Deep Learning Library in Python.
  • NuPIC - Numenta Platform for Intelligent Computing.
  • pattern - Web mining module for Python.
  • PyBrain - Another Python Machine Learning Library.
  • Pylearn2 - A Machine Learning library based on Theano.
  • python-recsys - A Python library for implementing a Recommender System.
  • scikit-learn - A Python module for machine learning built on top of SciPy.

MapReduce

Framworks and libraries for MapReduce.

  • dpark - Python clone of Spark, a MapReduce alike framework in Python.
  • dumbo - Python module that allows one to easily write and run Hadoop programs.
  • luigi - A module that helps you build complex pipelines of batch jobs.
  • mrjob - Run MapReduce jobs on Hadoop or Amazon Web Services.
  • PySpark - The Spark Python API.
  • streamparse - Run Python code against real-time streams of data. Integrates with Apache Storm.

Third-party APIs

Libraries for accessing third party APIs.

DevOps Tools

Software and libraries for DevOps.

  • Ansible - An IT automation tool.
  • cuisine - Chef-like functionality for Fabric.
  • Fabric - Tool for streamlining the use of SSH for application deployment or systems administration tasks.
  • Fabtools - Tools for writing awesome Fabric files.
  • OpenStack - Open source software for building private and public clouds.
  • psutil - A cross-platform process and system utilities module.
  • SaltStack - Infrastructure automation and management system.
  • Supervisor - A Process Control System.

Foreign Function Interface

Libraries for providing foreign function interface.

  • cffi - Foreign Function Interface for Python calling C code.
  • ctypes - (Python standard library) Foreign Function Interface for Python calling C code.

High Performance

Libraries for making Python faster.

  • Cython - Optimizing Static Complier for Python. Uses type mixins to compile Python into C or C++ modules resulting in large performance gains.
  • PyPy - An implmentation of Python in Python. The interpreter uses black magic to make Python very fast without having to add in additional type information.
  • Pyston - A Python implementation built using LLVM and modern JIT techniques with the goal of achieving good performance.
  • Stackless Python - An enhanced version of the Python.

Algorithms and Design Patterns

Collections of algorithms and design patterns.

Miscellaneous

Useful libraries or tools that don't fit in the categories above.

  • IPython - IPython provides a rich toolkit to help you make the most out of using Python interactively.

Editor Plugins

Plugins for various editors.

  • Vim
    • Python-mode - An all in one plugin for turning Vim into a Python IDE.
  • Emacs
    • TODO
  • Sublime Text
    • Anaconda - Anaconda turns your Sublime Text 3 in a full featured Python development IDE.
    • Djaneiro - Django support for Sublime Text.
    • SublimeJEDI - A Sublime Text plugin to the awesome autocomplete library Jedi.

Resources

Where to discover new Python libraries.

Websites

Weekly

  • Pycoder's Weekly - A free weekly newsletter, on Fridays, for those interested in Python development and various topics around Python.
  • Python Weekly - A free weekly newsletter featuring curated news, articles, new releases, jobs etc related to Python.

Twitter

Contributing

Your contributions are always welcome!