vipnytt/useragentparser
最新稳定版本:v1.0.7
Composer 安装命令:
composer require vipnytt/useragentparser
包简介
User-Agent parser for robot rule sets
关键字:
README 文档
README
User-Agent parser for robot rule sets
Parser and group determiner optimized for robots.txt, X-Robots-tag and Robots-meta-tag usage cases.
Requirements:
- PHP 5.5+, 7.0+ or 8.0+
Installation
The library is available for install via Composer. Just add this to your composer.json file:
{
"require": {
"vipnytt/useragentparser": "^1.0"
}
}
Then run php composer update.
Features
- Stripping of the version tag.
- List any rule groups the User-Agent belongs to.
- Determine the correct group of records by finding the group with the most specific User-agent that still matches.
When to use it?
- When parsing
robots.txtrule sets, for robots online. - When parsing the
X-Robots-TagHTTP header. - When parsing
Robots meta tagsin HTML / XHTML documents.
Note: Full User-agent strings, like them sent by eg. web-browsers, is not compatible, this is by design.
Supported User-agent string formats are UserAgentName/version with or without the version tag. Eg. MyWebCrawler/2.0 or just MyWebCrawler.
Getting Started
Strip the version tag.
use vipnytt\UserAgentParser; $parser = new UserAgentParser('googlebot/2.1'); $product = $parser->getProduct()); // googlebot
List different groups the User-agent belongs to
use vipnytt\UserAgentParser; $parser = new UserAgentParser('googlebot-news/2.1'); $userAgents = $parser->getUserAgents()); array( 'googlebot-news/2.1', 'googlebot-news/2', 'googlebot-news', 'googlebotnews', 'googlebot' );
Determine the correct group
Determine the correct group of records by finding the group with the most specific User-agent that still matches your rule sets.
use vipnytt\UserAgentParser; $parser = new UserAgentParser('googlebot-news'); $match = $parser->getMostSpecific(['googlebot/2.1', 'googlebot-images', 'googlebot'])); // googlebot
Cheat sheet
$parser = new UserAgentParser('MyCustomCrawler/1.2'); // Determine the correct rule set (robots.txt / robots meta tag / x-robots-tag) $parser->getMostSpecific($array); // string // Parse $parser->getUserAgent(); // string 'MyCustomCrawler/1.2' $parser->getProduct(); // string 'MyCustomCrawler' $parser->getVersion(); // string '1.2' // Crunch the data into groups, from most to less specific $parser->getUserAgents(); // array $parser->getProducts(); // array $parser->getVersions(); // array
Specifications
统计信息
- 总下载量: 910.27k
- 月度下载量: 0
- 日度下载量: 0
- 收藏数: 2
- 点击次数: 1
- 依赖项目数: 3
- 推荐数: 0
其他信息
- 授权协议: MIT
- 更新时间: 2016-04-08