首页 > 其他 > 详细

nutch相关异常

时间:2014-08-09 00:18:16      阅读:380      评论:0      收藏:0      [点我收藏+]


1、在任务一开始运行,注入Url时即出现以下错误。

InjectorJob: Injecting urlDir: urls 
InjectorJob: Using class org.apache.gora.hbase.store.HBaseStore as the Gora storage class. 
InjectorJob: java.lang.RuntimeException: job failed: name=[20140000]inject urls, jobid=job_local1629320149_0001 
at org.apache.nutch.util.NutchJob.waitForCompletion(NutchJob.java:54) 
at org.apache.nutch.crawl.InjectorJob.run(InjectorJob.java:233) 
at org.apache.nutch.crawl.InjectorJob.inject(InjectorJob.java:251) 
at org.apache.nutch.crawl.InjectorJob.run(InjectorJob.java:273) 
at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:65) 
at org.apache.nutch.crawl.InjectorJob.main(InjectorJob.java:282)

原因是regex-urlfilter.txt配置错误

nutch相关异常,布布扣,bubuko.com

nutch相关异常

原文:http://blog.csdn.net/jediael_lu/article/details/38445733

(0)
(0)
   
举报
评论 一句话评论(0
关于我们 - 联系我们 - 留言反馈 - 联系我们:wmxa8@hotmail.com
© 2014 bubuko.com 版权所有
打开技术之扣,分享程序人生!