搜索关键词:text-extract,共找到 45 个结果
排序方式:
当前按 更新时间 降序 排列

content-extract/content-processor

Robust PHP library for batch document processing. Extracts content from PDFs/text and generates structured JSON according to user-defined schemas. Now with semantic structuring, OCR support for scanned PDFs, text normalization, and alias-driven field matching. Production-ready, secure, zero unnecess

版本:1.5.0 下载:17 Stars:1 点击:14

时间:2026-04-19 15:27

rembish/text-at-any-cost

Extract plain text from common document formats: DOC, PDF, PPT, RTF, DOCX, ODT, RAR

版本:v1.0.0 下载:5 Stars:70 点击:6

时间:2026-02-17 14:30

nojimage/twitter-text-php

A library of PHP classes that provide auto-linking and extraction of usernames, lists, hashtags and URLs from tweets.

版本:v3.4.0 下载:1.86M Stars:118 点击:8

时间:2026-01-04 18:09

spatie/pdf-to-text

Extract text from a pdf

版本:1.55.0 下载:5.85M Stars:992 点击:5

时间:2026-01-04 10:27

smalot/pdfparser

Pdf parser library. Can read and extract information from pdf file.

版本:v2.12.2 下载:30.6M Stars:2.63k 点击:7

时间:2026-01-04 10:04

ipwsystems/rtftools

Library used to extract raw text from an RTF file

版本:1.0.2 下载:43.04k Stars:0 点击:8

时间:2026-01-04 08:25

hello-solucoes/pdf-to-text

Extract text from a pdf

版本:1.1.1 下载:23.52k Stars:0 点击:4

时间:2026-01-04 05:19

kreuzberg/kreuzberg

High-performance document intelligence for PHP. Extract text, metadata, and structured information from PDFs, Office documents, images, and 75 formats. Powered by Rust core for 10-50x speed improvements.

版本:4.10.0-rc.15 下载:157 Stars:8.25k 点击:5

时间:2025-12-27 13:40

ediazaro/receipt-scanner

Use OpenAI to extract structured receipt and invoice data from Text, Html, Images and PDFs.

版本:v4.0.1 下载:187 Stars:0 点击:5

时间:2025-11-13 22:53

denisdeejay/pdfparser

(fork of smalot/pdfparser) Pdf parser library. Can read and extract information from pdf file.

版本:v2.12.2 下载:18 Stars:0 点击:3

时间:2025-09-16 07:44

ottosmops/office2text

Extract text from Microsoft Office (docx, pptx, xlsx) and LibreOffice (odt, odp, ods) documents using PHP and ZipArchive.

版本:0.9.0 下载:72 Stars:0 点击:9

时间:2025-09-01 11:22

mostlyserious/craft-text-extractor

A tool to extract text from documents.

版本:1.0.1 下载:15 Stars:0 点击:5

时间:2025-04-30 20:48

aleksanm/excel2txt

Extract text from MS Excel xlsx file using builtin tool: /usr/bin/ssconvert

版本:0.0.1 下载:2 Stars:0 点击:6

时间:2025-04-10 12:58

aleksanm/docx2txt

Extract text from docx using docx2txt

版本:0.0.17 下载:8 Stars:0 点击:1

时间:2025-04-09 12:49

ledsquare/pdfparser

Pdf parser library. Can read and extract information from pdf file.

版本:v1.0.0 下载:2 Stars:0 点击:4

时间:2025-01-17 15:22