joegreen0991/hyperloglog

Latest stable version: v1.0.0

Composer install command:

composer require joegreen0991/hyperloglog

Package description

A HyperLogLog and MinHash data structure library for counting cardinalities, with support for unions and intersections.

README

PHP implementation of the HyperLogLog algorithm. Based on Antirez/Redis implementation.

Resources

Note!

This version has been tuned to work with a P value of 14. This gives a register array of 2^14 bytes = 16 KB.

A large bias, visible in the graphs below, begins when the set cardinality reaches around 2.5 * 2^P. Polynomial regression has been used to calculate bias offsets, BUT ONLY FOR P = 14. You are free to change the P value, but the bias offsets will then not be applied. Check out the code for more information.
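As a worked check of where the bias should appear, the threshold can be evaluated for the P values shown in the graphs. This assumes the onset sits at exactly 2.5 * 2^P, whereas the text only says "around", so treat the results as rough markers rather than precise cut-offs:

```latex
\begin{align*}
P = 14:&\quad 2.5 \cdot 2^{14} = 2.5 \cdot 16{,}384 = 40{,}960 \\
P = 16:&\quad 2.5 \cdot 2^{16} = 2.5 \cdot 65{,}536 = 163{,}840 \\
P = 20:&\quad 2.5 \cdot 2^{20} = 2.5 \cdot 1{,}048{,}576 = 2{,}621{,}440
\end{align*}
```

Only the P = 14 case has regression-based offsets applied, so the bias near 40,960 is the one the library corrects for.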

Some Professional Looking Graphs

#### HyperLogLog

P=14 [Graph: HyperLogLog, P = 14]

P=16 Note the offset bias around 2.5 * 2^16 ~= 164,000 [Graph: HyperLogLog, P = 16]

P=20 Note the offset bias around 2.5 * 2^20 ~= 2,600,000 [Graph: HyperLogLog, P = 20]

#### MinHash

K=8192 [Graph: MinHash, K = 8192]

Statistics

  • Total downloads: 16.14k
  • Monthly downloads: 0
  • Daily downloads: 0
  • Favorites: 21
  • Clicks: 1
  • Dependent projects: 0
  • Recommendations: 0

GitHub information

  • Stars: 21
  • Watchers: 1
  • Forks: 5
  • Language: PHP

Other information

  • License: MIT
  • Last updated: 2014-07-07