Libraries for parsing and manipulating specific text formats.
A library capable of splitting, merging and transforming PDF pages.
Pdfminer.six is a community-maintained fork of the original PDFMiner.
Utilities for converting to and working with CSV.
Reads, queries and modifies Microsoft Word 2007/2008 docx files.
A module for tabular datasets in XLS, CSV, JSON and YAML.
A Python implementation of John Gruber’s Markdown.
A Python module for creating Excel .xlsx files.
A BSD-licensed library that makes it easy to call Python from Excel and vice versa.
Convert between any document format supported by LibreOffice/OpenOffice.
A fast, full-featured pure Python Markdown parser.
Python library for creating and updating PowerPoint (.pptx) files.
Writes and reads data and formatting information to and from Excel files.
Edits a docx document using a Jinja2 template.
Provides one API for reading, manipulating and writing csv, ods, xls, xlsx and xlsm files.
A command-line tool that can unpack archives easily.