
Final Comprehensive Assignment: Word Frequency Count

Date: 2018-06-20 22:04:46
#1. Read the source text.
theFile = open("the.txt", mode="r", encoding="utf-8")
theText = theFile.read()
theFile.close()
print(theText)

#2. Replace punctuation and newlines with spaces.
replaceList = [",", ".", '"', "\n"]
for c in replaceList:
    theText = theText.replace(c, " ")
print(theText)

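#3. Split the text into words.
# split() with no argument skips the empty strings that runs of spaces would otherwise produce.
theList = theText.split()
print(theList)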
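Steps 2 and 3 replace a fixed list of characters one at a time. As a possible alternative, the sketch below uses str.translate to map every ASCII punctuation character and newline to a space in one pass, and also lowercases the text so that "The" and "the" count as the same word. The function name clean_text and the sample string are illustrative assumptions, not part of the original assignment, and lowercasing is an added choice the original script does not make.

```python
import string

def clean_text(text):
    """Lowercase the text and replace ASCII punctuation and newlines with spaces."""
    # Hypothetical helper: build a translation table mapping each character to a space.
    table = str.maketrans({ch: " " for ch in string.punctuation + "\n"})
    return text.lower().translate(table)

# Illustrative sample input standing in for the contents of the.txt
words = clean_text('Hello, world.\n"Hello" again').split()
print(words)
```

Because string.punctuation covers only ASCII characters, text with full-width or typographic punctuation would need those characters added to the table.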

#4. Count how often each distinct word occurs.
theSet = set(theList)
print(theSet)

theDict = {}
for word in theSet:
    theDict[word] = theList.count(word)

print(theDict)
for d in theDict:
    print(d,theDict[d])
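The loop in step 4 calls theList.count(word) once per distinct word, so the whole list is rescanned each time. A minimal sketch of an alternative, assuming the standard library's collections.Counter is acceptable for the assignment, builds the same word-to-count mapping in a single pass. The name count_words and the sample sentence are hypothetical.

```python
from collections import Counter

def count_words(words):
    """Return a dict mapping each word to its number of occurrences."""
    return dict(Counter(words))

# Illustrative sample input standing in for theList
sample = "the cat and the dog and the bird".split()
counts = count_words(sample)
print(counts)

# most_common(n) returns the n highest counts, already sorted in descending order
top = Counter(sample).most_common(2)
print(top)
```

Counter.most_common also covers the sorting and top-N printing that steps 5 and 6 do by hand.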

#5. Sort the (word, count) pairs by count, highest first.
wordCountList = list(theDict.items())
print(wordCountList)
wordCountList.sort(key=lambda x:x[1],reverse=True)
print(wordCountList)

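#6. Print the 20 most frequent words (or fewer if the text has fewer distinct words).
for i in range(min(20, len(wordCountList))):
    print(wordCountList[i])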

#7. Append the results to a file, one "count word" pair per line.
theCountFile = open("theCount.txt", mode="a", encoding="utf-8")
for i in range(len(wordCountList)):
    theCountFile.write(str(wordCountList[i][1]) + " " + wordCountList[i][0] + "\n")
theCountFile.close()
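Step 7 closes the file by hand, so if a write raises an exception the close() call is never reached. The sketch below shows one way to write the same "count word" lines with a with statement, which closes the file automatically. The function name write_counts, the sample pairs, and the temporary directory are assumptions made for illustration. It also opens the file in "w" mode rather than the append mode used in step 7, so rerunning it does not duplicate earlier results.

```python
import os
import tempfile

def write_counts(pairs, path):
    """Write one 'count word' line per (word, count) pair."""
    # The with block closes the file even if a write fails partway through.
    with open(path, mode="w", encoding="utf-8") as f:
        for word, count in pairs:
            f.write(f"{count} {word}\n")

# Illustrative data standing in for wordCountList
pairs = [("the", 3), ("and", 2)]
out_path = os.path.join(tempfile.mkdtemp(), "theCount.txt")
write_counts(pairs, out_path)

# Read the file back to check what was written
with open(out_path, encoding="utf-8") as f:
    written = f.read()
print(written)
```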

 

 


Original: https://www.cnblogs.com/lvchenhui/p/9206217.html
