ottosmops/pdftotext
最新稳定版本:2.0.1
Composer 安装命令:
composer require ottosmops/pdftotext
包简介
Extract text from PDF
README 文档
README
This package provides a class to extract text from a pdf.
\Ottosmops\Pdftotext\Extract::getText('/path/to/file.pdf') //returns the text from the pdf
Requirements
The Package uses pdftotext. Make sure that this is installed: which pdftotext
For Installation see: poppler-utils
If the installed binary is not found ("The command "which pdftotext" failed.") you can pass the full path to the _constructor (see below) or use putenv('PATH=$PATH:/usr/local/bin/:/usr/bin') (with the dir where pdftotext lives) before you call the class Extract.
Installation
composer require ottosmops/pdftotext
Usage
Extracting text from a pdf:
$text = (new Extract()) ->pdf('file.pdf') ->text();
Security note: If you pass user input as options or filenames to the library, make sure to validate or escape them to avoid shell injection. The library uses symfony/process, which provides basic protection, but unsafe options could still cause issues.
You can set the binary and you can specify options:
$text = (new Extract('/path/to/pdftotext')) ->pdf('path/to/file.pdf') ->options('-layout') ->text();
Default options are: -eol unix -enc UTF-8 -raw
License
The MIT License (MIT). Please see License File for more information.
统计信息
- 总下载量: 138.72k
- 月度下载量: 0
- 日度下载量: 0
- 收藏数: 6
- 点击次数: 1
- 依赖项目数: 0
- 推荐数: 0
其他信息
- 授权协议: MIT
- 更新时间: 2016-11-09