
RAKE: A Fast, Simple Keyword Extraction Algorithm

Date: 2020-07-02 21:42:07

A Python implementation of the Rapid Automatic Keyword Extraction (RAKE) algorithm as described in: Rose, S., Engel, D., Cramer, N., & Cowley, W. (2010). Automatic Keyword Extraction from Individual Documents. In M. W. Berry & J. Kogan (Eds.), Text Mining: Theory and Applications: John Wiley & Sons.

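Understanding the design thinking behind this algorithm, and what it emphasizes when extracting keywords, is very helpful for later text-processing tasks.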

Open-source repository: https://github.com/zelandiya/RAKE-tutorial

http://www.hlt.utdallas.edu/~saidul/code.html

https://pypi.org/project/yake/

Example use

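from nlp_rake import rake

# 1. initialize RAKE with the path to a stopword list
stoppath = "data/stoplists/SmartStoplist.txt"

# In the RAKE-tutorial code the three numbers are: minimum characters per word (5),
# maximum words per phrase (3), minimum occurrences of a keyword in the text (4)
rake_object = rake.Rake(stoppath, 5, 3, 4)

# 2. read the sample document and run RAKE on it
with open("data/docs/fao_test/w2167e.txt", "r", encoding="iso-8859-1") as sample_file:
    text = sample_file.read()

keywords = rake_object.run(text)

# 3. print results
print("Keywords:", keywords)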
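
To make the design of the algorithm more concrete, here is a minimal sketch of the scoring idea described in the Rose et al. paper: split the text into candidate phrases at stopwords and punctuation, score each word as degree / frequency, and score each phrase as the sum of its word scores. This is not the code from the repository above. The function names, the tiny stopword set, and the tokenizing regexes are all assumptions made for illustration, and the length and frequency filters from the example above are left out.

```python
import re
from collections import defaultdict

# Tiny illustrative stopword list (an assumption; real use relies on a full
# list such as SmartStoplist.txt).
STOPWORDS = {"a", "an", "and", "are", "for", "in", "is", "of", "on", "the", "to", "with"}


def candidate_phrases(text, stopwords):
    """Split text into candidate phrases at punctuation and stopwords."""
    phrases = []
    # Punctuation always ends a phrase.
    for fragment in re.split(r'[.,;:!?()\[\]"\n]+', text.lower()):
        current = []
        for word in re.findall(r"[a-z0-9][a-z0-9'-]*", fragment):
            if word in stopwords:
                # A stopword closes the phrase collected so far.
                if current:
                    phrases.append(current)
                current = []
            else:
                current.append(word)
        if current:
            phrases.append(current)
    return phrases


def word_scores(phrases):
    """Score each word as degree / frequency."""
    freq = defaultdict(int)
    degree = defaultdict(int)
    for phrase in phrases:
        for word in phrase:
            freq[word] += 1
            # Degree counts co-occurrences inside the phrase, the word itself included.
            degree[word] += len(phrase)
    return {word: degree[word] / freq[word] for word in freq}


def rake_sketch(text, stopwords=STOPWORDS):
    """Return (phrase, score) pairs, highest score first."""
    phrases = candidate_phrases(text, stopwords)
    scores = word_scores(phrases)
    ranked = {" ".join(p): sum(scores[w] for w in p) for p in phrases}
    return sorted(ranked.items(), key=lambda item: item[1], reverse=True)
```

For the input "Keyword extraction is fast. Automatic keyword extraction of documents is simple.", this sketch ranks "automatic keyword extraction" first with a score of 8.0, ahead of "keyword extraction" at 5.0. Words that appear inside longer phrases accumulate a higher degree, which is why RAKE tends to favor multi-word keywords.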

 


Original post: https://www.cnblogs.com/demo-deng/p/13227253.html
