Natural Language Processing

Libraries for working with human languages.

funNLP70.5K

A collection of tools and datasets for Chinese NLP.

jieba33.6K

The most popular Chinese text segmentation library.

gensim15.8K

Topic Modeling for Humans.

pattern8.8K

A web mining module.

Stanza7.3K

The Stanford NLP Group's official Python library, supporting 60+ languages.

pkuseg-python6.6K

A toolkit for Chinese word segmentation in various domains.

snownlp6.5K

A library for processing Chinese text.

pytext6.3K

A natural language modeling framework based on PyTorch.

langid.py2.3K

Stand-alone language identification system.

polyglot2.3K

Natural language pipeline supporting hundreds of languages.

PyTorch-NLP2.2K

A toolkit enabling rapid deep learning NLP prototyping for research.