Home Pdfplumber

Pdfplumber

pdfplumber is a powerful Python library for extracting detailed information from PDF files, including text, tables, individual characters, shapes, and more. It allows precise programmatic access to underlying PDF content, making it highly useful for data extraction, automation, and analysis tasks.

Language
Python
Latest Release
v0.11.8
License
MIT License

Key Features

  • Extracts text from PDF files
  • Detects and extracts tables from PDFs
  • Provides access to individual PDF elements (characters, lines, shapes)
  • Supports complex PDF layouts
  • Pythonic API for easy integration

Alternative Tools

pdfminerPyPDF2Tabula-pycamelot


Community

Stars
9.3k
Open Issues
80
Forks
832