Text Processing

Libraries for parsing and manipulating plain texts.

fuzzywuzzy9.2K

Fuzzy String Matching.

pypinyin5K

Convert Chinese hanzi (漢字) to pinyin (拼音).

ftfy3.8K

Makes Unicode text less broken and more consistent automagically.

sqlparse3.8K

A non-validating SQL parser.

python-phonenumbers3.5K

Parsing, formatting, storing and validating international phone numbers.

textdistance3.4K

Compute distance between sequences with 30+ algorithms.

ply2.8K

Implementation of lex and yacc parsing tools for Python.

pyparsing2.3K

A general purpose framework for generating parsers.

chardet2.2K

Python 2/3 compatible character encoding detector.

shortuuid2.1K

A generator library for concise, unambiguous and URL-safe UUIDs.

python-slugify1.5K

A Python slugify library that translates unicode to ASCII.

python-user-agents1.5K

Browser user agent parser.

hashids1.4K

Implementation of hashids in Python.

pyfiglet1.4K

An implementation of figlet written in Python.

Levenshtein1.3K

Fast computation of Levenshtein distance and string similarity.

python-nameparser663

Parsing human names into their individual components.

awesome-slugify485

A Python slugify library that can preserve unicode.

unicode-slugify320

A slugifier that generates unicode slugs with Django as a dependency.

pangu.py260

Paranoid text spacing.