Handling garbled HTML spaces with Jsoup

Posted: 2014-06-20 10:16:57

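In HTML, a non-breaking space is written with the entity &nbsp;. After fetching an HTML page with Jsoup and parsing it, Java does not render this character correctly, and it shows up as garbled text when printed to the console. I searched through a lot of material online without finding a good solution, until a forum post suggested that string replacement could solve it. So I handle the problem with simple string replacement: replace the entity in the HTML file first, then parse the file. The implementation is as follows: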

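import java.io.BufferedReader;
import java.io.BufferedWriter;
import java.io.File;
import java.io.FileReader;
import java.io.FileWriter;
import java.io.IOException;

// Parameters: oldFile is the original file to process; newFile is the file that receives the result;
// oldString is the string to be replaced (for example "&nbsp;"); newString is its replacement.
	public static void replaceAllFileString(File oldFile, File newFile, String oldString, String newString) {
		// try-with-resources closes both streams even if an exception is thrown
		try (BufferedReader reader = new BufferedReader(new FileReader(oldFile));
		     BufferedWriter writer = new BufferedWriter(new FileWriter(newFile))) {
			String line;
			while ((line = reader.readLine()) != null) {
				// replace() treats oldString literally; replaceAll() would interpret it as a regex
				writer.write(line.replace(oldString, newString));
				// readLine() strips the line terminator, so write one back
				writer.newLine();
			}
		} catch (IOException e) {
			e.printStackTrace();
		}
	}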
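
The same idea can also be applied to strings in memory. The sketch below is not part of the original post: the class name NbspCleaner and the methods normalizeSpaces and cleanFile are hypothetical, and it assumes the HTML file is encoded as UTF-8. It replaces both the literal entity &nbsp; and the character U+00A0 (the non-breaking space that the entity decodes to), since text that has already been parsed will likely contain the character rather than the entity.

```java
import java.io.IOException;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Path;

public class NbspCleaner {
    // U+00A0 is the non-breaking space character that the &nbsp; entity decodes to.
    private static final char NBSP = '\u00A0';

    /** Replaces the literal entity "&nbsp;" and the decoded U+00A0 character with a plain space. */
    public static String normalizeSpaces(String text) {
        return text.replace("&nbsp;", " ").replace(NBSP, ' ');
    }

    /** Reads source as UTF-8 (an assumption), normalizes spaces, and writes the result to target. */
    public static void cleanFile(Path source, Path target) throws IOException {
        String html = new String(Files.readAllBytes(source), StandardCharsets.UTF_8);
        Files.write(target, normalizeSpaces(html).getBytes(StandardCharsets.UTF_8));
    }

    public static void main(String[] args) throws IOException {
        // Temporary files stand in for a real downloaded page.
        Path in = Files.createTempFile("page", ".html");
        Path out = Files.createTempFile("page-clean", ".html");
        Files.write(in, "<p>a&nbsp;b\u00A0c</p>\n".getBytes(StandardCharsets.UTF_8));
        cleanFile(in, out);
        System.out.print(new String(Files.readAllBytes(out), StandardCharsets.UTF_8));
        // prints <p>a b c</p>
    }
}
```

Naming the charset explicitly avoids depending on the platform default, which is a common source of garbled console output. If the page uses a different encoding, substitute that charset for UTF-8.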

Original: http://blog.csdn.net/winnerspring/article/details/28603843
