Specific Formats Processing

Libraries for parsing and manipulating specific text formats.

PyPDF2 8.6K

A library capable of splitting, merging and transforming PDF pages.

pdfminer.six 6.1K

A community-maintained fork of the original PDFMiner.

csvkit 6.1K

Utilities for converting to and working with CSV.

python-docx 4.7K

Reads, queries and modifies Microsoft Word 2007/2008 docx files.

tablib 4.7K

A module for Tabular Datasets in XLS, CSV, JSON, YAML.

Python-Markdown 3.9K

A Python implementation of John Gruber’s Markdown.
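
For a sense of the basic usage, here is a short sketch that assumes the `markdown` package is installed; the input string is just a sample.

```python
import markdown

source = "# Title\n\nSome *emphasis*."

# Convert a Markdown string to an HTML fragment.
html = markdown.markdown(source)
print(html)
```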

XlsxWriter 3.7K

A Python module for creating Excel .xlsx files.
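
The sketch below shows one way to create a workbook, assuming XlsxWriter is installed. It writes to an in-memory buffer so nothing touches the disk; the sheet name and cell values are placeholders.

```python
import io

import xlsxwriter

buffer = io.BytesIO()

# A Workbook accepts a filename or a file-like object.
workbook = xlsxwriter.Workbook(buffer, {"in_memory": True})
sheet = workbook.add_worksheet("Report")

sheet.write_row(0, 0, ["item", "qty"])    # header row
sheet.write_row(1, 0, ["apples", 3])
sheet.write_row(2, 0, ["pears", 5])
sheet.write_formula("B4", "=SUM(B2:B3)")  # evaluated by Excel when opened

workbook.close()  # the file is only written on close
xlsx_bytes = buffer.getvalue()
print(len(xlsx_bytes), "bytes written")
```

XlsxWriter only creates files; reading an existing .xlsx needs a different library.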

xlwings 3.1K

A BSD-licensed library that makes it easy to call Python from Excel and vice versa.

unoconv 2.7K

Convert between any document format supported by LibreOffice/OpenOffice.

Mistune 2.6K

A fast, full-featured, pure-Python Markdown parser.

python-pptx 2.5K

Python library for creating and updating PowerPoint (.pptx) files.

xlrd 2.2K

Reads data and formatting information from Excel files; it does not write them.

docxtpl 2.1K

Edits a docx document using a Jinja2 template.

pyexcel 1.2K

Provides one API for reading, manipulating and writing csv, ods, xls, xlsx and xlsm files.

unp 425

A command-line tool that can unpack archives easily.