Word count with Hive

Date: 2019-08-27 10:16:17

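1. Create a table to hold the file's contents, using the newline character as the delimiter so that each line of the file becomes one row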

    create table file_data(content string)
    row format delimited fields terminated by '\n';

2. Load the prepared data (/home/hadoop/wordcount.tx) into the file_data table

    load data local inpath '/home/hadoop/wordcount.tx' into table file_data;
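
Before moving on, it can help to confirm that the load worked. The following is only a quick sanity-check sketch against the file_data table created above; the limit of 5 is an arbitrary choice.

```sql
-- Number of rows loaded; this should match the number of lines in the input file.
select count(*) from file_data;

-- Peek at the first few lines (the limit of 5 is arbitrary).
select content from file_data limit 5;
```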

3. Split the data on " " (a single space) and write each resulting word as one row of a result table.

    (1) Create the result table and insert each word produced by the split as its own row

        create table words(word string);

        -- file_data has a single column named content, so that is what gets split
        insert into table words select explode(split(content, " ")) from file_data;
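
Splitting on a single space yields empty strings wherever the input has consecutive spaces, and it does not split on tabs. As a possible variant of the insert above (not part of the original post), the sketch below splits on runs of whitespace and drops empty tokens. The aliases t and w are illustrative names, and insert overwrite is used on the assumption that any rows already in words should be replaced rather than duplicated.

```sql
-- Assumes the file_data(content string) and words(word string) tables from the steps above.
-- split() takes a regular expression; '\\s+' matches one or more whitespace characters.
insert overwrite table words
select w
from file_data
lateral view explode(split(content, '\\s+')) t as w   -- t and w are illustrative aliases
where w <> '';                                        -- drop empty tokens
```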

    (2) Use the aggregate function count to count the occurrences of each word

        select word, count(word)
        from words
        group by word;

        (You can give count(word) the alias count and then sort the result with order by count.)
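
The sorting tip can be sketched as follows. The alias cnt is used here instead of count purely to avoid confusion with the function name; descending order and the second, single-pass query (which skips the intermediate words table) are assumptions added for illustration, not part of the original post.

```sql
-- Sorted word count over the words table, most frequent first.
select word, count(word) as cnt
from words
group by word
order by cnt desc;

-- Possible single-query variant that reads file_data directly via a subquery.
select word, count(word) as cnt
from (select explode(split(content, ' ')) as word from file_data) w
group by word
order by cnt desc;
```

Adding a limit clause (for example limit 10) to either query returns only the most frequent words.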

Source: https://www.cnblogs.com/hdc520/p/11416382.html
