承接 juanantoniodelgado/stopwords 相关项目开发

从需求分析到上线部署,全程专人跟进,保证项目质量与交付效率

邮箱:yvsm@zunyunkeji.com | QQ:316430983 | 微信:yvsm316

juanantoniodelgado/stopwords

最新稳定版本:1.4

Composer 安装命令:

composer require juanantoniodelgado/stopwords

包简介

A stopwords removal library

README 文档

README

Codacy Badge

PHP StopWords removal library with support for multiple languages.

Installation

composer require juanantoniodelgado/stopwords

Usage

use StopWords/StopWords;

$stopwords = new StopWords('en');
$stopwords->clean('your text to clean');

Supported languages

Arabic, Armenian, Basque, Bulgarian, Catalan, Cebuano, Chinese, Czech, Danish, Dutch, English, Estonian, Finnish, French, German, Greek, Gujarati, Hebrew, Hindi, Hungarian, Indonesian, Italian, Japanese, Latvian, Malay, Norwegian, Persian, Portuguese, Romanian, Russian, Slovak, Spanish, Swedish, Thai, Turkish, Ukrainian, and Vietnamese.

Notes

Language files are set according to ISO 639-2.

Sources

Language Source
Arabic https://github.com/Alir3z4/stop-words/blob/master/arabic.txt
Armenian https://docs.google.com/viewer?a=v&pid=sites&srcid=ZGVmYXVsdGRvbWFpbnxrZXZpbmJvdWdlfGd4OmM1NzgyOTk3NDA4NGJhZQ
Basque http://www.ranks.nl/stopwords/basque
Bulgarian https://github.com/Alir3z4/stop-words/blob/master/bulgarian.txt
Catalan http://www.ranks.nl/stopwords/catalan http://latel.upf.edu/morgana/altres/pub/ca_stop.htm
Cebuano https://github.com/digitalheir/cebuano-dictionary-js/blob/master/stop-words/stop-words.ts
Chinese https://docs.google.com/viewer?a=v&pid=sites&srcid=ZGVmYXVsdGRvbWFpbnxrZXZpbmJvdWdlfGd4OjRmZjlhYTNkNWZjMTc3NWI
Czech https://github.com/Alir3z4/stop-words/blob/master/czech.txt
Danish https://github.com/Alir3z4/stop-words/blob/master/danish.txt
Dutch https://github.com/Alir3z4/stop-words/blob/master/dutch.txt
English http://www.ranks.nl/stopwords
Estonian https://github.com/stopwords-iso/stopwords-et
Finnish https://github.com/Alir3z4/stop-words/blob/master/finnish.txt
French http://www.ranks.nl/stopwords/french https://github.com/Alir3z4/stop-words/blob/master/french.txt
German https://github.com/Alir3z4/stop-words/blob/master/german.txt
Greek https://docs.google.com/viewer?a=v&pid=sites&srcid=ZGVmYXVsdGRvbWFpbnxrZXZpbmJvdWdlfGd4OjI5MzI4MDRjMzk3M2Y2OWU
Gujarati https://github.com/Alir3z4/stop-words/blob/master/gujarati.txt
Hebrew https://github.com/Alir3z4/stop-words/blob/master/hebrew.txt
Hindi https://github.com/Alir3z4/stop-words/blob/master/hindi.txt
Hungarian https://github.com/Alir3z4/stop-words/blob/master/hungarian.txt
Indonesian https://github.com/Alir3z4/stop-words/blob/master/indonesian.txt
Italian https://raw.githubusercontent.com/Alir3z4/stop-words/master/italian.txt
Japanese https://docs.google.com/viewer?a=v&pid=sites&srcid=ZGVmYXVsdGRvbWFpbnxrZXZpbmJvdWdlfGd4OjdhNWQxZGQwOTE3ZjVkY2M
Latvian https://docs.google.com/viewer?a=v&pid=sites&srcid=ZGVmYXVsdGRvbWFpbnxrZXZpbmJvdWdlfGd4OjNiNGI5YTVmYjkxOWEwYmQ
Malay https://github.com/Alir3z4/stop-words/blob/master/malaysian.txt
Norwegian https://raw.githubusercontent.com/Alir3z4/stop-words/master/norwegian.txt
Persian https://docs.google.com/viewer?a=v&pid=sites&srcid=ZGVmYXVsdGRvbWFpbnxrZXZpbmJvdWdlfGd4OjY0MWMxMDBjZTc2Y2ZmZjk
Portuguese https://raw.githubusercontent.com/Alir3z4/stop-words/master/portuguese.txt
Romanian https://raw.githubusercontent.com/Alir3z4/stop-words/master/romanian.txt
Russian https://raw.githubusercontent.com/Alir3z4/stop-words/master/russian.txt
Slovak https://github.com/Alir3z4/stop-words/blob/master/slovak.txt
Spanish http://www.ranks.nl/stopwords/spanish http://snowball.tartarus.org/algorithms/spanish/stop.txt https://github.com/Alir3z4/stop-words/blob/master/spanish.txt
Swedish https://raw.githubusercontent.com/Alir3z4/stop-words/master/swedish.txt
Tagalog https://github.com/stopwords-iso/stopwords-tl
Thai https://github.com/stopwords-iso/stopwords-th
Turkish https://raw.githubusercontent.com/Alir3z4/stop-words/master/turkish.txt
Ukrainian https://raw.githubusercontent.com/Alir3z4/stop-words/master/ukrainian.txt
Vietnamese https://github.com/Alir3z4/stop-words/blob/master/vietnamese.txt

License

Contents of this repository are available under Attribution 4.0 International (CC BY 4.0).

统计信息

  • 总下载量: 53.02k
  • 月度下载量: 0
  • 日度下载量: 0
  • 收藏数: 10
  • 点击次数: 1
  • 依赖项目数: 1
  • 推荐数: 0

GitHub 信息

  • Stars: 10
  • Watchers: 1
  • Forks: 9
  • 开发语言: PHP

其他信息

  • 授权协议: CC-BY-4.0
  • 更新时间: 2021-05-29