搜索关键词:pdf-extraction,共找到 17 个结果
排序方式:
当前按 更新时间 降序 排列

libresign/pdf-signature-validator

High-quality PDF signature extraction and validation primitives for LibreSign and external consumers.

版本:v0.2.1 下载:247 Stars:1 点击:15

时间:2026-04-23 23:33

content-extract/content-processor

Robust PHP library for batch document processing. Extracts content from PDFs/text and generates structured JSON according to user-defined schemas. Now with semantic structuring, OCR support for scanned PDFs, text normalization, and alias-driven field matching. Production-ready, secure, zero unnecess

版本:1.5.0 下载:17 Stars:1 点击:15

时间:2026-04-19 15:27

techulus/capture

Official PHP SDK for Capture (capture.page). Capture screenshots, generate PDFs, extract content and metadata from web pages.

版本:未知版本 下载:15 Stars:0 点击:15

时间:2026-04-05 22:26

toolcenter/sdk

Official PHP SDK for ToolCenter API - Web automation tools including screenshots, PDFs, QR codes, metadata extraction, and more.

版本:1.0.0 下载:6 Stars:0 点击:9

时间:2026-02-25 12:39

jcfrane/pdf-text-extractor

A Laravel PDF text extraction package with multiple strategies (PdfParser, XObject, AWS Textract, Tesseract OCR). Handles Canva-generated PDFs, scanned documents, and other edge cases with automatic fallback.

版本:v0.0.3 下载:144 Stars:2 点击:9

时间:2026-02-11 09:00

rhysleesltd/laravel-camelot

PHP Wrapper library for interfacing with the Camelot PDF table extraction library built in Python

版本:v1.0 下载:123 Stars:0 点击:6

时间:2026-01-31 14:54

daniel-jorg-schuppelius/php-pdf-toolkit

PHP 8.2+ library for PDF text extraction with automatic reader selection. Supports embedded text and scanned documents via OCR.

版本:v0.12.2 下载:260 Stars:0 点击:10

时间:2026-01-26 11:55

renamed-to/renamed-php

Official PHP SDK for the renamed.to API - AI-powered file renaming, PDF splitting, and data extraction

版本:v0.1.6 下载:8 Stars:1 点击:5

时间:2026-01-11 00:21

silverstripe/textextraction

Text Extraction API for SilverStripe CMS (mostly used with 'fulltextsearch' module)

版本:5.0.1 下载:177.81k Stars:9 点击:7

时间:2026-01-04 03:05

keyvan/german-ocr

High-performance German document OCR - Local & Cloud API

版本:未知版本 下载:0 Stars:107 点击:6

时间:2026-01-02 21:53

kreuzberg/kreuzberg

High-performance document intelligence for PHP. Extract text, metadata, and structured information from PDFs, Office documents, images, and 75 formats. Powered by Rust core for 10-50x speed improvements.

版本:4.10.0-rc.15 下载:157 Stars:8.25k 点击:5

时间:2025-12-27 13:40

shammaa/laravel-smart-scraper

Advanced intelligent web scraper for Laravel with caching, rate limiting, middleware, monitoring, and much more. Built on Puppeteer with smart features.

版本:v1.0.0 下载:2 Stars:1 点击:7

时间:2025-12-16 21:41

shibashish/pdf-reader

A comprehensive Laravel package for extracting text, HTML, images, and metadata from PDF files using Poppler utilities.

版本:v1.0.2 下载:0 Stars:0 点击:4

时间:2025-12-09 09:46

kayukoff/camelot-php

PHP Wrapper library for interfacing with the Camelot PDF table extraction library built in Python

版本:v1.1.0 下载:948 Stars:1 点击:3

时间:2023-12-01 12:30

joest8/pdfinterpreter

This class is designed to convert multiple PDF files, whether image-based or text-based, into an array of data.The class uses user-defined templates containing regular expressions to control the data extraction process, allowing for customized and flexible output.

版本:v1.0 下载:6 Stars:1 点击:7

时间:2023-11-05 19:09