awesome-python/README.md
2014-06-30 11:03:25 -07:00

32 KiB

Awesome Python

A curated list of awesome Python frameworks, libraries and resources. Inspired by awesome-php.

Environment Management

Libraries for Python version and environment management.

Package Management

Libraries for package and dependency management.

  • pip / Python Package Index - The package and dependency manager.
  • wheel - The new standard of python distribution and are intended to replace eggs.

Files

Libraries for file manipulation and MIME type detection.

  • mimetypes - (Python standard library) Map filenames to MIME types.
  • imghdr - (Python standard library) Determine the type of an image.
  • python-magic - A Python interface to the libmagic file type identification library.
  • path.py - A module wrapper for os.path.
  • watchdog - API and shell utilities to monitor file system events.
  • django-storages - A collection of custom storage backends for Django.

Date and Time

Libraries for working with dates and times.

  • arrow - Better dates & times for Python.
  • dateutil - Extensions to the standard Python datetime module.
  • delorean - A library for clearing up the inconvenient truths that arise dealing with datetimes in Python.
  • when.py - Providing user-friendly functions to help perform common date and time actions.
  • moment - A Python library for dealing with dates/times. Inspired by Moment.js.

Text Processing

Libraries for parsing and manipulating texts.

  • difflib - (Python standard library) Helpers for computing deltas.
  • Levenshtein - Fast computation of Levenshtein distance and string similarity.
  • fuzzywuzzy - Fuzzy String Matching.
  • esmre - Regular expression accelerator.
  • phonenumbers - Library for parsing, formatting, storing and validating international phone numbers.
  • python-user-agents - Browser user agent parser.
  • shortuuid - A generator library for concise, unambiguous and URL-safe UUIDs.
  • tablib - A module for Tabular Datasets in XLS, CSV, JSON, YAML.
  • sqlparse - A non-validating SQL parser.
  • Pygments - A generic syntax highlighter.
  • python-slugify - A Python slugify library that handles unicode.
  • unicode-slugify - A slugifier that generates unicode slugs. Developed by Mozilla.
  • unidecode - ASCII transliterations of Unicode text.
  • chardet - Python 2/3 compatible character encoding detector.
  • xpinyin - A library to translate chinese hanzi (漢字) to pinyin (拼音).
  • pangu.py - Spacing texts.
  • python-docx - Reads, queries and modifies Microsoft Word 2007/2008 docx files.
  • xlwt / xlrd - A package is for writing and reading data and formatting information from Excel files.
  • XlsxWriter - A Python module for creating Excel .xlsx files.
  • PyYAML - YAML implementations for Python.

Natural Language Processing

Libraries for working with human languages.

  • NLTK - A leading platform for building Python programs to work with human language data.
  • Pattern - A web mining module for the Python programming language. It has tools for natural language processing, machine learning, among others.
  • TextBlob - Providing a consistent API for diving into common natural language processing (NLP) tasks. Stands on the giant shoulders of NLTK and Pattern, and plays nicely with both.
  • jieba - Chinese Words Segementation Utilities.
  • snownlp - A library for processing Chinese text.
  • loso - Another Chinese segmentation library.

Command-line Tools

Libraries for building command line utilities.

  • click - A package for creating beautiful command line interfaces in a composable way.
  • clint - Python Command-line Application Tools.
  • docopt - Pythonic command line arguments parser.
  • colorama - Cross-platform colored terminal text.

Documentation

Libraries for generating project documentation.

  • Sphinx - Python Documentation generator.

Imagery

Libraries for manipulating images.

  • pillow - Pillow is the friendly PIL fork. PIL is the Python Imaging Library.
  • thumbor - A smart imaging service. It enables on-demand crop, resizing and flipping of images.
  • imgSeek - A project for searching a collection of images using visual similarity.
  • python-qrcode - A pure Python QR Code generator.
  • pygram - Instagram-like image filters.
  • Quads - Computer art based on quadtrees.
  • nude.py - Nudity detection.
  • wand - Python bindings for MagickWand, C API for ImageMagick.

Audio

Libraries for manipulating audio.

  • django-elastic-transcoder - Django + AWS Elastic Transcoder.
  • beets - A music library manager and MusicBrainz tagger.
  • pyechonest - Python client for the Echo Nest API.
  • dejavu - Audio fingerprinting and recognition.
  • pydub - Manipulate audio with a simple and easy high level interface.
  • audioread - Cross-library (GStreamer + Core Audio + MAD + FFmpeg) audio decoding.
  • mutagen - A Python module to handle audio metadata.
  • tinytag - A library for reading music meta data of MP3, OGG, FLAC and Wave files.

Video

Libraries for manipulating video and GIFs.

  • moviepy - A module for script-based movie editing with many formats, including animated GIFs.
  • shorten.tv - Video summarization.
  • youtube-dl - A small command-line program to download videos from YouTube.

Geolocation

Libraries for geocoding addresses and working with latitudes and longitudes.

  • GeoDjango - A world-class geographic web framework.
  • geopy - Python Geocoding Toolbox.
  • pygeoip - Pure Python GeoIP API.
  • GeoIP - Python API for MaxMind GeoIP Legacy Database.
  • geojson - Python bindings and utlities for GeoJSON.

HTTP

Libraries for working with HTTP.

  • requests - HTTP Requests for Humans™.
  • httpie - A command line HTTP client, a user-friendly cURL replacement.

Database Drivers

Libraties for connecting and operating databases

  • Relational Databases
    • mysql-python - The MySQL database connector for Python.
    • psycopg2 - The most popular PostgreSQL adapter for the Python.
  • NoSQL Databases
    • PyMongo - The official Python client for MongoDB.
    • redis-py - The Redis Python Client.

ORM

Libraries that implement Object-Relational Mapping or datamapping techniques.

  • Relational Databases
  • NoSQL Databases
    • MongoEngine - A Python Object-Document-Mapper for working with MongoDB.
    • redisco - A Python Library for Simple Models and Containers Persisted in Redis.

Web Frameworks

Web development frameworks.

  • Django - The most popular web framework in Python.
  • Flask - A microframework for Python.
  • Bottle - A fast, simple and lightweight WSGI micro web-framework.
  • Pyramid - A small, fast, down-to-earth, open source Python web framework.
  • web2py - A full stack web framework and platform focused in the ease of use.

CMS

Content Management Systems

  • Mezzanine - A powerful, consistent, and flexible content management platform.
  • Wagtail - A Django content management system.
  • django-oscar - An open-source ecommerce framework for Django.
  • Quokka CMS - Flexible, extensible, small CMS powered by Flask and MongoDB.
  • Opps CMS - A Django-based CMS for magazines, newspapers websites and portals with high-traffic.

RESTful API

Libraries for developing RESTful APIs.

  • django-rest-framework - A powerful and flexible toolkit that makes it easy to build Web APIs.
  • django-tastypie - Creating delicious APIs for Django apps.
  • flask-api - An implementation of the same web browsable APIs that django-rest-framework provides.
  • flask-restful - An extension for Flask that adds support for quickly building REST APIs.
  • falcon - A high-performance Python framework for building cloud APIs and web app backends.
  • eve - REST API framework powered by Flask, MongoDB and good intentions.
  • sandman - Automated REST APIs for existing database-driven systems.

Authentication and OAuth

Libraries for implementing authentications schemes.

Template Engine

Libraries and tools for templating and lexing.

  • Jinja2 - A modern and designer friendly templating language.

Queue

Libraries for working with event and task queues.

  • celery - An asynchronous task queue/job queue based on distributed message passing.
  • rq - Simple job queues for Python.

Libraries and software for indexing and performing search queries on data.

Asset Management

Tools for managing, compressing and minifying website assets.

  • django-compressor - Compresses linked and inline javascript or CSS into a single cached file.

Caching

Libraries for caching data.

  • Beaker Caching & Sessions - Beaker is a library for caching and sessions for use with web applications and stand-alone Python scripts and applications.
  • HermesCache - Python caching library with tag-based invalidation and dogpile effect prevention.

Email

Libraries for sending and parsing email.

Internationalization

Libraries for woking with i18n.

  • Babel - An internationalization library for Python.

URL Manipulation

Libraries for parsing URLs.

  • furl - A small Python library that makes manipulating URLs simple.
  • purl - A simple, immutable URL class with a clean API for interrogation and manipulation.

HTML Manipulation

Libraries for working with HTML and XML.

  • BeautifulSoup - Providing Pythonic idioms for iterating, searching, and modifying HTML or XML.
  • lxml - A very fast, easy-to-use and versatile library for handling HTML and XML.
  • html5lib - A standards-compliant library for parsing and serializing HTML documents and fragments.
  • pyquery - A jQuery-like library for parsing HTML.
  • cssutils - A CSS library for Python.
  • MarkupSafe - Implements a XML/HTML/XHTML Markup safe string for Python.
  • bleach - A whitelist-based HTML sanitization and text linkification library.
  • xmltodict - Working with XML feel like you are working with JSON.

Web Crawling

Libraries for scraping websites.

  • Scrapy - A fast high-level screen scraping and web crawling framework.
  • portia - Visual scraping for Scrapy.
  • feedparser - Universal feed parser.
  • RoboBrowser - A simple, Pythonic library for browsing the web without a standalone web browser.

Web Content Extracting

Libraries for extracting web contents.

  • newspaper - News extraction, article extraction and content curation in Pythom.
  • html2text - Convert HTML to Markdown-formatted text.
  • python-goose - HTML Content/Article Extractor.
  • lassie - Web Content Retrieval for Humans.
  • micawber - A small library for extracting rich content from URLs.
  • sumy - A module for automatic summarization of text documents and HTML pages.
  • Haul - An Extensible Image Crawler.
  • python-readability - Fast Python port of arc90's readability tool.

Forms

Libraries for working with forms.

  • django-crispy-forms - A Django app which lets you create beautiful forms in a very elegant and DRY way.
  • WTForms - A flexible forms validation and rendering library.

Anti-spam

Libraries for fighting spam.

Networking

Libraries for network programming.

  • gevent - A coroutine-based Python networking library that uses greenlet.
  • Twisted - An event-driven networking engine.
  • Tornado - A Web framework and asynchronous networking library.
  • pyzmq - A python wrapper for the 0MQ message library.

Admin Panels

Libraries for administrative interfaces.

  • Ajenti - The admin panel your servers deserve.
  • django-xadmin - Drop-in replacement of Django admin comes with lots of goodies.
  • flask-admin - Simple and extensible administrative interface framework for Flask.
  • flower - Real-time monitor and web admin for Celery.

DevOps Tools

Software and libraries for DevOps

  • OpenStack - Open source software for building private and public clouds.
  • Ansible - An IT automation tool.
  • SaltStack - Infrastructure automation and management system.
  • Fabric - Tool for streamlining the use of SSH for application deployment or systems administration tasks.
  • Fabtools - Tools for writing awesome Fabric files.
  • cuisine - Chef-like functionality for Fabric.
  • gunicorn - A WSGI HTTP Server for UNIX, fast clients and sleepy applications.
  • Supervisor - A Process Control System.
  • psutil - A cross-platform process and system utilities module.

GUI

Libraries for working with GUI applications.

  • PySide - Python bindings for the Qt cross-platform application and UI framework.
  • wxPython - A blending of the wxWidgets C++ class library with the Python.
  • kivy - A library for creating NUI applications, running on Windows, Linux, Mac OS X, Android and iOS.
  • curses - Built-in wrapper for ncurses used to create terminal GUI applications.
  • urwid - A library for creating terminal GUI applications with strong support for widgets, events, rich colors, etc.
  • pyglet - A cross-platform windowing and multimedia library for Python.

Game Development

Awesome game development libraries.

  • Pygame - Pygame is a set of Python modules designed for writing games.
  • Cocos2d - cocos2d is a framework for building 2D games, demos, and other graphical/interactive applications. It is based on pyglet.
  • PySDL2 - A ctypes based wrapper for the SDL2 library.
  • Panda3D - 3D game engine developed by Disney and maintained by Carnegie Mellon's Entertainment Technology Center. Written in C++, completely wrapped in Python.
  • PyOgre - Python bindings for the Ogre 3D render engine, can be used for games, simulations, anything 3D.
  • PyOpenGL - Python ctypes bindings for OpenGL and it's related APIs.

High Performance

Libraries for making Python faster.

  • Cython - Optimizing Static Complier for Python. Uses type mixins to compile Python into C or C++ modules resulting in large performance gains.
  • PyPy - An implmentation of Python in Python. The interpreter uses black magic to make Python very fast without having to add in additional type information.

Logging

Libraries for generating and working with log files.

  • logging - (Python standard library) Logging facility for Python.
  • Sentry - A realtime logging and aggregation server.
  • Raven - The Python client for Sentry.

Testing

Libraries for testing codebases and generating test data.

  • Testing Frameworks
    • unittest - (Python standard library) Unit testing framework.
    • nose - nose extends unittest.
    • pytest - A mature full-featured Python testing tool.
  • Mock
    • mock - A Python Mocking and Patching Library for Testing.
    • responses - A utility library for mocking out the requests Python library.
  • Fake Data
    • faker - A Python package that generates fake data.
    • model_mommy - Creating random fixtures for testing in Django.
  • Code Coverage

Code Analysis and Linter

Libraries and tools for analysing, parsing and manipulation codebases.

  • Flake8 - The modular source code checker: pep8, pyflakes and co.
  • Pylint - A source code analyzer.

Debugging Tools

Libraries for dubugging and developing

  • pdb - (Python standard library) The Python Debugger.
  • ipdb - IPython-enabled pdb.
  • pyringe - Debugger capable of attaching to and injecting code into Python processes.
  • django-debug-toolbar - Display various debug information about the current request/response.
  • django-devserver - A drop-in replacement for Django's runserver.

Science and Data Aanalysis

Libraries for scientific computing and data analyzing.

  • SciPy - A Python-based ecosystem of open-source software for mathematics, science, and engineering.
  • NumPy - A fundamental package for scientific computing with Python.
  • Numba - Python JIT (just in time) complier to LLVM aimed at scientific Python by the developers of Cython and NumPy.
  • matplotlib - A Python 2D plotting library.
  • NetworkX - A high-productivity software for complex networks.
  • Pandas - A library providing high-performance, easy-to-use data structures and data analysis tools.
  • PyMC - Markov Chain Monte Carlo sampling toolkit.
  • zipline - A Pythonic algorithmic trading library.
  • ggplot - Same API as ggplot2 for R.
  • PyDy - Short for Python Dynamics, used to assist with workflow in the modeling of dynamic motion based around NumPy, SciPy, IPython, and matplotlib.

Machine Learning

Libraries for Machine Learning

  • scikit-learn - A Python module for machine learning built on top of SciPy.
  • pattern - Web mining module for Python.
  • NuPIC - Numenta Platform for Intelligent Computing.
  • Pylearn2 - A Machine Learning library based on Theano.
  • hebel - GPU-Accelerated Deep Learning Library in Python.
  • gensim - Topic Modelling for Humans.
  • PyBrain - Another Python Machine Learning Library.
  • Crab - A flexible, fast recommender engine.
  • python-recsys - A Python library for implementing a Recommender System.

MapReduce

Framworks and libraries for MapReduce.

  • PySpark - The Spark Python API.
  • dpark - Python clone of Spark, a MapReduce alike framework in Python.
  • luigi - A module that helps you build complex pipelines of batch jobs.
  • mrjob - Run MapReduce jobs on Hadoop or Amazon Web Services.
  • dumbo - Python module that allows one to easily write and run Hadoop programs.

Third-party APIs

Libraries for accessing third party APIs.

Algorithms and Design Patterns

Collections of algorithms and design patterns.

Miscellaneous

Useful libraries or tools that don't fit in the categories above.

  • IPython - IPython provides a rich toolkit to help you make the most out of using Python interactively.

Resources

Editor Plugins

  • Vim
    • TODO
  • Emacs
    • TODO
  • Sublime Text
    • SublimeJEDI - A Sublime Text plugin to the awesome autocomplete library Jedi.
    • Anaconda - Anaconda turns your Sublime Text 3 in a full featured Python development IDE.
    • Djaneiro - Django support for Sublime Text.

Books

  • TODO

Websites

Weekly

  • Pycoder's Weekly - A free weekly newsletter, on Fridays, for those interested in Python development and various topics around Python.
  • Python Weekly - A free weekly newsletter featuring curated news, articles, new releases, jobs etc related to Python.

Twitter

Articles

Contributing

Your contributions are always welcome!