Home Pdfplumber
Pdfplumber
pdfplumber is a powerful Python library for extracting detailed information from PDF files, including text, tables, individual characters, shapes, and more. It allows precise programmatic access to underlying PDF content, making it highly useful for data extraction, automation, and analysis tasks.
Language
Python
Latest Release
v0.11.8
License
MIT License
Key Features
- Extracts text from PDF files
- Detects and extracts tables from PDFs
- Provides access to individual PDF elements (characters, lines, shapes)
- Supports complex PDF layouts
- Pythonic API for easy integration
Alternative Tools
pdfminerPyPDF2Tabula-pycamelot
Resources
Community
Stars
9.3k
Open Issues
80
Forks
832