Pdfplumber
pdfplumber is a Python library for extracting detailed information from PDF files, including text, tables, individual characters, lines, and shapes. It gives precise programmatic access to the underlying PDF content, which makes it useful for data extraction, automation, and analysis tasks.
Language: Python
Latest Release: v0.11.9
License: MIT License
Key Features
- Extracts text from PDF files
- Detects and extracts tables from PDFs
- Provides access to individual PDF elements (characters, lines, shapes)
- Supports complex PDF layouts
- Pythonic API for easy integration
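The features above can be sketched in a few lines. This is a minimal illustration, not a definitive recipe: `table_to_records` and `summarize_pdf` are hypothetical helper names introduced here, and the sketch assumes that the first row of each detected table is a header row. The pdfplumber calls used are `pdfplumber.open`, `page.extract_text()`, `page.extract_tables()`, and the `page.chars`, `page.lines`, and `page.rects` object lists.

```python
from __future__ import annotations

from typing import Any, Optional


def table_to_records(table: list[list[Optional[str]]]) -> list[dict[str, str]]:
    """Convert one extracted table into dicts keyed by its first row.

    Assumption: the first row holds the column names. Empty cells,
    which arrive as None, become empty strings.
    """
    if not table:
        return []
    header = [(cell or "").strip() for cell in table[0]]
    return [
        {name: (cell or "").strip() for name, cell in zip(header, row)}
        for row in table[1:]
    ]


def summarize_pdf(path: str) -> list[dict[str, Any]]:
    """Collect text, tables, and object counts for every page of a PDF."""
    # Imported here so table_to_records stays usable without the
    # third-party dependency (pip install pdfplumber).
    import pdfplumber

    pages = []
    with pdfplumber.open(path) as pdf:
        for page in pdf.pages:
            pages.append(
                {
                    "page_number": page.page_number,
                    # Fall back to "" in case a page yields no text.
                    "text": page.extract_text() or "",
                    # Each table is a list of rows; each row is a list of cells.
                    "tables": [table_to_records(t) for t in page.extract_tables()],
                    # Low-level objects are exposed as lists of dicts.
                    "n_chars": len(page.chars),
                    "n_lines": len(page.lines),
                    "n_rects": len(page.rects),
                }
            )
    return pages
```

Calling `summarize_pdf("report.pdf")`, where `report.pdf` is a placeholder file name, would return one dictionary per page. Table detection depends heavily on how the PDF was produced, so the header-row assumption may need adjusting for a given document.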
Alternative Tools
- pdfminer
- PyPDF2
- Tabula-py
- camelot
Community
Stars: 10.1k
Open Issues: 88
Forks: 875