搜索关键词:text-extraction,共找到 20 个结果
排序方式:
当前按 更新时间 降序 排列

content-extract/content-processor

Robust PHP library for batch document processing. Extracts content from PDFs/text and generates structured JSON according to user-defined schemas. Now with semantic structuring, OCR support for scanned PDFs, text normalization, and alias-driven field matching. Production-ready, secure, zero unnecess

版本:1.5.0 下载:17 Stars:1 点击:14

时间:2026-04-19 15:27

onstage2426/fuzor

Dependency-free full-text search for PHP. BM25 ranking, fuzzy and boolean modes, search-as-you-type prefix matching, stopword filtering and Snowball stemming for 62 languages, snippet extraction and result highlighting — one SQLite file, zero infrastructure.

版本:3.4.0 下载:12 Stars:1 点击:11

时间:2026-03-21 13:20

jcfrane/pdf-text-extractor

A Laravel PDF text extraction package with multiple strategies (PdfParser, XObject, AWS Textract, Tesseract OCR). Handles Canva-generated PDFs, scanned documents, and other edge cases with automatic fallback.

版本:v0.0.3 下载:144 Stars:2 点击:9

时间:2026-02-11 09:00

daniel-jorg-schuppelius/php-pdf-toolkit

PHP 8.2+ library for PDF text extraction with automatic reader selection. Supports embedded text and scanned documents via OCR.

版本:v0.12.2 下载:260 Stars:0 点击:9

时间:2026-01-26 11:55

apache-solr-for-typo3/tika

Apache Tika for TYPO3

版本:13.1.0 下载:579.39k Stars:9 点击:10

时间:2026-01-04 19:00

nojimage/twitter-text-php

A library of PHP classes that provide auto-linking and extraction of usernames, lists, hashtags and URLs from tweets.

版本:v3.4.0 下载:1.86M Stars:118 点击:7

时间:2026-01-04 18:09

silverstripe/textextraction

Text Extraction API for SilverStripe CMS (mostly used with 'fulltextsearch' module)

版本:5.0.1 下载:177.81k Stars:9 点击:5

时间:2026-01-04 03:05

keyvan/german-ocr

High-performance German document OCR - Local & Cloud API

版本:未知版本 下载:0 Stars:107 点击:4

时间:2026-01-02 21:53

kreuzberg/kreuzberg

High-performance document intelligence for PHP. Extract text, metadata, and structured information from PDFs, Office documents, images, and 75 formats. Powered by Rust core for 10-50x speed improvements.

版本:4.10.0-rc.15 下载:157 Stars:8.25k 点击:5

时间:2025-12-27 13:40

shibashish/pdf-reader

A comprehensive Laravel package for extracting text, HTML, images, and metadata from PDF files using Poppler utilities.

版本:v1.0.2 下载:0 Stars:0 点击:3

时间:2025-12-09 09:46

sharpapi/laravel-content-detect-emails

AI Email Detection for Laravel powered by SharpAPI.com

版本:v1.0.3 下载:1 Stars:1 点击:4

时间:2025-06-16 10:51

puma/libreria

Librería reconoce palabras que comienzan con mayusculas y te devuelve como palabras correctas y tambien extrae numeros del un texto.

版本:未知版本 下载:3 Stars:0 点击:4

时间:2025-05-12 00:44

fathkoc/php-textmagic

A lightweight PHP library for basic text analysis operations like summarization, sentiment analysis, keyword extraction, and classification.

版本:未知版本 下载:2 Stars:0 点击:6

时间:2024-10-04 13:49

joest8/pdfinterpreter

This class is designed to convert multiple PDF files, whether image-based or text-based, into an array of data.The class uses user-defined templates containing regular expressions to control the data extraction process, allowing for customized and flexible output.

版本:v1.0 下载:6 Stars:1 点击:5

时间:2023-11-05 19:09

kalimeromk/rssfeed

Full-Text RSS extraction package for Laravel - converts partial RSS feeds to full content

版本:v4.0.5 下载:843 Stars:3 点击:7

时间:2023-06-11 14:08