定制 xparse/parser 二次开发

按需修改功能、优化性能、对接业务系统,提供一站式技术支持

邮箱:yvsm@zunyunkeji.com | QQ:316430983 | 微信:yvsm316

xparse/parser

最新稳定版本:3.0.0

Composer 安装命令:

composer require xparse/parser

包简介

Parser client

README 文档

README

GitHub tag Software License Coverage Status Quality Score Total Downloads

Parser client

Install

Via Composer

$ composer require xparse/parser

Usage

 $parser = new \Xparse\Parser\Parser(); $title = $parser->get('http://funivan.com')->content('//*[@class="entry-title"]/a'); print_r($title);

Using with custom Middleware

If you are using custom Guzzle Middleware and it doesn't send real requests, in order to get last effective url you need to set it to response X-GUZZLE-EFFECTIVE-URL header manually.

Here is an example of __invoke() method in your custom Middleware

 public function __invoke(callable $handler) : \Closure { return function (RequestInterface $request, array $options) use ($handler) { # some code return $handler($request, $options)->then(function (ResponseInterface $response) use ($request) { $response = $response->withHeader('X-GUZZLE-EFFECTIVE-URL', $request->getUri()); # some code return $response; }); }; }

Recursive Parser

Recursive Parser allows you to parse website pages recursively. You need to pass link from where to start and set next page expression (xPath, css, etc).

Basic Usage

Try to find all links to the repositories on github. Our query will be xparse. With recursive pagination we can traverse all pagination links and process each resulting page to fetch repositories links.

 use Xparse\Parser\Parser; use Xparse\Parser\RecursiveParser; # init Parser $parser = new Parser(); # set expression to pagination links and initial page url $pages = new RecursiveParser( $parser, ["//*[@class='pagination']//a/@href"], ['https://github.com/search?q=xparse'] ); $allLinks = []; foreach($pages as $page){ # set expression to fetch repository links $adsList = $page->value("//*[@class='repo-list-name']//a/@href")->all(); # merge and remove duplicates $allLinks = array_values(array_unique(array_merge($allLinks, $adsList))); } print_r($allLinks);

Testing

 ./vendor/bin/phpunit

Contributing

Please see CONTRIBUTING for details.

Credits

License

The MIT License (MIT). Please see License File for more information.

统计信息

  • 总下载量: 40.57k
  • 月度下载量: 0
  • 日度下载量: 0
  • 收藏数: 3
  • 点击次数: 3
  • 依赖项目数: 1
  • 推荐数: 0

GitHub 信息

  • Stars: 2
  • Watchers: 3
  • Forks: 7
  • 开发语言: PHP

其他信息

  • 授权协议: MIT
  • 更新时间: 2026-01-04