
webhubworks/site-crawler

Latest stable version: 3.1.3

Composer install command:

composer create-project webhubworks/site-crawler

Package description

A straightforward site crawler

README

Use this site crawler as a quick way to crawl any website. It is useful for detecting slow pages and pages that return HTTP errors.

Please use this crawler responsibly. Do not use it to crawl websites that you do not own or have permission to crawl.

Installation

composer global require webhubworks/site-crawler

Development

  • To run the crawler locally (instead of using the globally installed version): php site-crawler app:crawl URL
  • To build the standalone app, run php site-crawler app:build site-crawler and specify the next version.

Usage

For the full list of options, run: site-crawler --help

Example: site-crawler https://example.com --limit=50 --basic-auth=user:pass --exclude=action,imprint
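
The example above can be assembled by a small shell helper so that the options live in one place. This is only a sketch: the function name build_crawl_cmd and its argument order are hypothetical, it prints the command line rather than running it, and it uses only the --limit and --exclude options shown in the example. Check site-crawler --help for the exact meaning of each option.

```shell
# Hypothetical helper: prints a site-crawler command line for a URL.
# Arguments: URL, page limit (defaults to 50), comma-separated exclude list (optional).
build_crawl_cmd() {
  url="$1"
  limit="${2:-50}"
  exclude="${3:-}"
  cmd="site-crawler $url --limit=$limit"
  # Append --exclude only when a list was supplied.
  if [ -n "$exclude" ]; then
    cmd="$cmd --exclude=$exclude"
  fi
  printf '%s\n' "$cmd"
}

# Print the command for review before running it.
build_crawl_cmd https://example.com 50 action,imprint
```

Printing the command first makes it easy to review which site and which limits will be used before any requests are sent.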

Roadmap

  • Add support for websites containing links in JS generated markup
  • Run requests in parallel

Statistics

  • Total downloads: 39
  • Monthly downloads: 0
  • Daily downloads: 0
  • Favorites: 1
  • Clicks: 4
  • Dependents: 0
  • Suggesters: 0

GitHub information

  • Stars: 1
  • Watchers: 1
  • Forks: 0
  • Language: PHP

Other information

  • License: MIT
  • Last updated: 2024-11-12