
[Crawler Study] Getting Past Lagou's Anti-Crawler Mechanism

Time: 2020-02-18 22:35:13

While practicing the requests library on Lagou (拉勾网), I hit a bottleneck: my requests were blocked by the site's anti-crawler mechanism.


Original code:

import requests

# Original attempt: POST straight to the Ajax endpoint
url = "https://www.lagou.com/jobs/positionAjax.json?city=%E5%8C%97%E4%BA%AC&needAddtionalResult=false"
headers = {
    "User-Agent": "Mozilla/5.0 (Windows NT 10.0; Win64; x64; rv:72.0) Gecko/20100101 Firefox/72.0",
    "Referer": "https://www.lagou.com/jobs/list_php?labelWords=&fromSearch=true&suginput=",
}
data = {"first": "true", "pn": 1, "kd": "python"}

resp = requests.post(url, data=data, headers=headers)
# Save the raw response body so it can be inspected
with open(r"F:\Python\爬虫\网络请求\php.html", "w", encoding="utf-8") as fp:
    fp.write(resp.content.decode("utf-8"))

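Output:

{"status":false,"msg":"您操作太频繁,请稍后再访问","clientIp":"119.17.1.200","state":2402}

The message translates to "You are operating too frequently, please try again later". This error clearly means we have hit the anti-crawler mechanism, so we need to rethink where things went wrong.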
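As a minimal sketch of how a script could notice this kind of rejection instead of quietly saving it to disk, the response body can be parsed and checked before it is written out. The function name `is_blocked` is hypothetical, and the check relies only on the two response shapes that appear in this post (a body with `"status": false`, and a body with `"success": true`); Lagou may well return other shapes.

```python
import json


def is_blocked(body):
    """Return True if the body does not look like a successful result."""
    try:
        payload = json.loads(body)
    except json.JSONDecodeError:
        # Not JSON at all (for example an HTML page): treat it as a failure too.
        return True
    if not isinstance(payload, dict):
        return True
    # Assumption: a rejected request carries "status": false, while a
    # successful one carries "success": true, as in the outputs in this post.
    return payload.get("status") is False or payload.get("success") is not True


blocked_body = '{"status":false,"msg":"您操作太频繁,请稍后再访问","clientIp":"119.17.1.200","state":2402}'
print(is_blocked(blocked_body))
```

In the script above, this check could be applied to `resp.text` right after the POST, before the file is opened.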

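After some analysis, the cause turned out to be that I was not sending the Cookie issued after login; sending a Cookie from a session that is not logged in fails as well. The root of the problem is that I had tried to crawl Lagou earlier, probably without sending the Referer header, so my IP was flagged as a crawler.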

 

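We can make the request look as if we reached this endpoint from the page that normally precedes it. To do that we need to obtain that page's cookies, which show that we navigated step by step rather than jumping straight to the data endpoint.

In the requests library a Session object handles this kind of thing, so the code becomes: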

import requests

# Search-results page, visited first so the session picks up its cookies
url1 = "https://www.lagou.com/jobs/list_php?labelWords=&fromSearch=true&suginput="
# Ajax endpoint that returns the job listings as JSON
url = "https://www.lagou.com/jobs/positionAjax.json?city=%E5%8C%97%E4%BA%AC&needAddtionalResult=false"

header = {
    "User-Agent": "Mozilla/5.0 (Windows NT 10.0; Win64; x64; rv:72.0) Gecko/20100101 Firefox/72.0",
    "Referer": "https://www.lagou.com/jobs/list_php?labelWords=&fromSearch=true&suginput=",
}
data = {"first": "true", "pn": 1, "kd": "php"}

session = requests.Session()
# Simulate arriving from the list page: this GET stores the page's cookies in the session
session.get(url1, headers=header)

# The session sends the stored cookies along with this POST
resp = session.post(url, data=data, headers=header)
with open(r"F:\Python\爬虫\网络请求\php.html", "w", encoding="utf-8") as fp:
    fp.write(resp.content.decode("utf-8"))

Output:

1 {"success":true,"msg":null,"code":0,"content":{"showId":"675a076deade451f8ca73f927870d070","hrInfoMap":{"6816233":{"userId":4014063,"portrait":"i/image3/M01/67/8B/Cgq2xl5Llr2AEne9AAHG7NWSLGc508.jpg","realName":"Nicky","positionName":"HRBP","phone":null,"receiveEmail":null,"userLevel":"G1","canTalk":true},"6568672":{"userId":6092581,"portrait":"i/image/M00/58/14/CgqKkVfSIu6AKrX0AAA9z3Y7K7A145.png","realName":"colinwu","positionName":"HRBP","phone":null,"receiveEmail":null,"userLevel":"G1","canTalk":true},"5939455":{"userId":3309774,"portrait":"i/image2/M01/73/1D/CgotOVteuM-AbTGhAAAQzbW9OrU752.png","realName":"腾讯招聘","positionName":"招聘经理","phone":null,"receiveEmail":null,"userLevel":"G1","canTalk":true},"6815105":{"userId":15179647,"portrait":"i/image2/M01/0E/AC/CgotOVyhgc-AFnq_AACAHyZqbrs619.png","realName":"谭伟","positionName":"技术总监","phone":null,"receiveEmail":null,"userLevel":"G1","canTalk":true},"6813354":{"userId":13629054,"portrait":"i/image2/M01/0E/8C/CgoB5lyhgdiAN-4AAACeGEp-ay0931.png","realName":"卢女士","positionName":"招聘经理","phone":null,"receiveEmail":null,"userLevel":"G1","canTalk":true},"6815671":{"userId":16050296,"portrait":null,"realName":"","positionName":null,"phone":null,"receiveEmail":null,"userLevel":"G1","canTalk":true},"6596149":{"userId":94038,"portrait":"i/image2/M01/9A/CE/CgotOV2mwDOAC8Q5AACCRlS7bWw485.png","realName":"HRD","positionName":"人力资源部负责人","phone":null,"receiveEmail":null,"userLevel":"G1","canTalk":true},"6815636":{"userId":9398353,"portrait":"i/image2/M01/0E/8C/CgoB5lyhgdiAN-4AAACeGEp-ay0931.png","realName":"廖丽群","positionName":"人事部","phone":null,"receiveEmail":null,"userLevel":"G1","canTalk":true},"6814303":{"userId":5517794,"portrait":"i/image2/M01/A0/F3/CgoB5l23rCqAUBD7AAr0sMuSPv8070.jpg","realName":"HRM","positionName":"人力资源经理","phone":null,"receiveEmail":null,"userLevel":"G1","canTalk":true},"5998339":{"userId":4782743,"portrait":null,"realName":"赵志杰","positionName":"高级招聘经理","phone":null,"receiveEmail":null,"userLevel":"G1","can
Talk":true},"6037413":{"userId":3735589,"portrait":"i/image2/M01/A0/A1/CgoB5l22rmeAZfw-AABZfDd39QI492.png","realName":"郑阳","positionName":"HRM","phone":null,"receiveEmail":null,"userLevel":"G1","canTalk":true},"6814301":{"userId":5517794,"portrait":"i/image2/M01/A0/F3/CgoB5l23rCqAUBD7AAr0sMuSPv8070.jpg","realName":"HRM","positionName":"人力资源经理","phone":null,"receiveEmail":null,"userLevel":"G1","canTalk":true},"4130564":{"userId":3557386,"portrait":"i/image2/M01/AB/57/CgoB5l3WOriAVsKXAACy4_kg6zM136.png","realName":"hrziroom","positionName":"HR","phone":null,"receiveEmail":null,"userLevel":"G1","canTalk":true},"6037634":{"userId":3735589,"portrait":"i/image2/M01/A0/A1/CgoB5l22rmeAZfw-AABZfDd39QI492.png","realName":"郑阳","positionName":"HRM","phone":null,"receiveEmail":null,"userLevel":"G1","canTalk":true},"6813144":{"userId":10344288,"portrait":"images/myresume/default_headpic.png","realName":"张策飞","positionName":"招聘经理","phone":null,"receiveEmail":null,"userLevel":"G1","canTalk":true}},"pageNo":1,"positionResult":{"resultSize":15,"result":[{"positionId":6037413,"positionName":"PHP","companyId":515740,"companyFullName":"西藏嗨娱网络科技有限公司","companyShortName":"西藏嗨娱","companyLogo":"i/image2/M01/46/3D/CgoB5l0A4sCAQ3n2AAAMfWv0964618.png","companySize":"50-150人","industryField":"移动互联网","financeStage":"不需要融资","companyLabelList":["绩效奖金","带薪年假","定期体检","午餐补助"],"firstType":"开发|测试|运维类","secondType":"后端开发","thirdType":"PHP","skillLables":["Python","PHP"],"positionLables":["游戏","Python"],"industryLables":["游戏","Python"],"createTime":"2020-02-18 17:53:40","formatCreateTime":"17:53发布","city":"北京","district":"朝阳区","businessZones":["望京","来广营"],"salary":"10k-20k","workYear":"1-3年","jobNature":"全职","education":"本科","positionAdvantage":"五险一金14薪 补充医疗 大牛团队","imState":"today","lastLogin":"2020-02-18 
19:45:58","publisherId":3735589,"approve":1,"subwayline":"15号线","stationname":"东湖渠","linestaion":"14号线东段_东湖渠;15号线_望京东","latitude":"40.009879","longitude":"116.484758","hitags":null,"resumeProcessRate":0,"resumeProcessDay":0,"score":495,"newScore":0.0,"matchScore":98.06985,"matchScoreExplain":null,"query":null,"explain":null,"isSchoolJob":0,"adWord":0,"plus":null,"pcShow":0,"appShow":0,"deliver":0,"gradeDescription":null,"promotionScoreExplain":null,"isHotHire":0,"count":0,"aggregatePositionIds":[],"famousCompany":false},{"positionId":6814303,"positionName":"PHP开发工程师","companyId":433964,"companyFullName":"北京钛氪新媒体科技有限公司","companyShortName":"钛氪新媒体科技","companyLogo":"i/image2/M01/79/A7/CgotOVttLOiATJ0TAABZGq1v4uA368.jpg","companySize":"50-150人","industryField":"移动互联网","financeStage":"未融资","companyLabelList":[],"firstType":"开发|测试|运维类","secondType":"后端开发","thirdType":"PHP","skillLables":["客户端"],"positionLables":["移动互联网","客户端"],"industryLables":["移动互联网","客户端"],"createTime":"2020-02-18 20:35:38","formatCreateTime":"20:35发布","city":"北京","district":"朝阳区","businessZones":["红庙","呼家楼"],"salary":"10k-20k","workYear":"1-3年","jobNature":"全职","education":"本科","positionAdvantage":"互联网,PHP,架构","imState":"today","lastLogin":"2020-02-18 
20:35:33","publisherId":5517794,"approve":1,"subwayline":"1号线","stationname":"国贸","linestaion":"1号线_国贸;1号线_大望路;6号线_金台路;6号线_呼家楼;10号线_呼家楼;10号线_金台夕照;10号线_国贸;14号线东段_金台路;14号线东段_大望路","latitude":"39.919103","longitude":"116.473057","hitags":null,"resumeProcessRate":5,"resumeProcessDay":1,"score":102,"newScore":0.0,"matchScore":6.328693,"matchScoreExplain":null,"query":null,"explain":null,"isSchoolJob":0,"adWord":0,"plus":null,"pcShow":0,"appShow":0,"deliver":0,"gradeDescription":null,"promotionScoreExplain":null,"isHotHire":0,"count":0,"aggregatePositionIds":[],"famousCompany":false},{"positionId":6814301,"positionName":"PHP高级开发工程师","companyId":433964,"companyFullName":"北京钛氪新媒体科技有限公司","companyShortName":"钛氪新媒体科技","companyLogo":"i/image2/M01/79/A7/CgotOVttLOiATJ0TAABZGq1v4uA368.jpg","companySize":"50-150人","industryField":"移动互联网","financeStage":"未融资","companyLabelList":[],"firstType":"开发|测试|运维类","secondType":"后端开发","thirdType":"PHP","skillLables":["后端"],"positionLables":["后端"],"industryLables":[],"createTime":"2020-02-18 20:35:38","formatCreateTime":"20:35发布","city":"北京","district":"朝阳区","businessZones":["红庙","呼家楼"],"salary":"15k-30k","workYear":"3-5年","jobNature":"全职","education":"本科","positionAdvantage":"互联网,php,架构,服务端","imState":"today","lastLogin":"2020-02-18 
20:35:33","publisherId":5517794,"approve":1,"subwayline":"1号线","stationname":"国贸","linestaion":"1号线_国贸;1号线_大望路;6号线_金台路;6号线_呼家楼;10号线_呼家楼;10号线_金台夕照;10号线_国贸;14号线东段_金台路;14号线东段_大望路","latitude":"39.919103","longitude":"116.473057","hitags":null,"resumeProcessRate":5,"resumeProcessDay":1,"score":94,"newScore":0.0,"matchScore":5.8698626,"matchScoreExplain":null,"query":null,"explain":null,"isSchoolJob":0,"adWord":0,"plus":null,"pcShow":0,"appShow":0,"deliver":0,"gradeDescription":null,"promotionScoreExplain":null,"isHotHire":0,"count":0,"aggregatePositionIds":[],"famousCompany":false},{"positionId":6596149,"positionName":"php","companyId":7924,"companyFullName":"北京智启蓝墨信息技术有限公司","companyShortName":"蓝墨科技","companyLogo":"image1/M00/00/10/CgYXBlTUWDCABgk3AAB4Bbg7p8U523.jpg","companySize":"50-150人","industryField":"移动互联网","financeStage":"A轮","companyLabelList":["绩效奖金 ","年终分红 ","五险一金 ","带薪年假 "],"firstType":"开发|测试|运维类","secondType":"后端开发","thirdType":"PHP","skillLables":["后端","PHP"],"positionLables":["后端"],"industryLables":[],"createTime":"2020-02-17 14:50:53","formatCreateTime":"1天前发布","city":"北京","district":"海淀区","businessZones":null,"salary":"15k-20k","workYear":"1-3年","jobNature":"全职","education":"本科","positionAdvantage":"工资绩效奖金补助积分落户……","imState":"threeDays","lastLogin":"2020-02-17 
14:50:48","publisherId":94038,"approve":1,"subwayline":"13号线","stationname":"上地","linestaion":"13号线_上地","latitude":"40.036028","longitude":"116.313915","hitags":null,"resumeProcessRate":0,"resumeProcessDay":0,"score":85,"newScore":0.0,"matchScore":42.837605,"matchScoreExplain":null,"query":null,"explain":null,"isSchoolJob":0,"adWord":0,"plus":null,"pcShow":0,"appShow":0,"deliver":0,"gradeDescription":null,"promotionScoreExplain":null,"isHotHire":0,"count":0,"aggregatePositionIds":[],"famousCompany":false},{"positionId":6813354,"positionName":"php开发工程师","companyId":556728,"companyFullName":"安芯网盾(北京)科技有限公司","companyShortName":"安芯网盾","companyLogo":"i/image3/M01/62/04/Cgq2xl4hLo2ANyLoAAFmAzOTxAA242.jpg","companySize":"15-50人","industryField":"企业服务,信息安全","financeStage":"A轮","companyLabelList":["绩效奖金","年终分红","定期体检","交通补助"],"firstType":"开发|测试|运维类","secondType":"后端开发","thirdType":"PHP","skillLables":["Laravel","后端"],"positionLables":["信息安全","Laravel","后端"],"industryLables":["信息安全","Laravel","后端"],"createTime":"2020-02-18 16:33:09","formatCreateTime":"16:33发布","city":"北京","district":"海淀区","businessZones":null,"salary":"18k-35k","workYear":"1-3年","jobNature":"全职","education":"本科","positionAdvantage":"大牛云集","imState":"today","lastLogin":"2020-02-18 
19:52:18","publisherId":13629054,"approve":1,"subwayline":"昌平线","stationname":"西二旗","linestaion":"13号线_西二旗;昌平线_西二旗","latitude":"40.059039","longitude":"116.292155","hitags":null,"resumeProcessRate":33,"resumeProcessDay":2,"score":82,"newScore":0.0,"matchScore":5.616715,"matchScoreExplain":null,"query":null,"explain":null,"isSchoolJob":0,"adWord":0,"plus":null,"pcShow":0,"appShow":0,"deliver":0,"gradeDescription":null,"promotionScoreExplain":null,"isHotHire":0,"count":0,"aggregatePositionIds":[],"famousCompany":false},{"positionId":6815105,"positionName":"php架构师","companyId":498054,"companyFullName":"北京亿客科创信息技术有限公司","companyShortName":"爱股票","companyLogo":"i/image2/M01/D1/2A/CgotOVw_AQOAXDDbAAC3g_B-gwU238.png","companySize":"150-500人","industryField":"移动互联网","financeStage":"C轮","companyLabelList":["五险一金","绩效奖金","带薪年假","节日礼物"],"firstType":"开发|测试|运维类","secondType":"高端技术职位","thirdType":"架构师","skillLables":[],"positionLables":["移动互联网","金融"],"industryLables":["移动互联网","金融"],"createTime":"2020-02-18 11:47:27","formatCreateTime":"11:47发布","city":"北京","district":"海淀区","businessZones":["八里庄"],"salary":"30k-40k","workYear":"5-10年","jobNature":"全职","education":"本科","positionAdvantage":"福利待遇优厚,完整晋升体系,股票期权奖励","imState":"today","lastLogin":"2020-02-18 
17:37:33","publisherId":15179647,"approve":1,"subwayline":"6号线","stationname":"海淀五路居","linestaion":"6号线_慈寿寺;6号线_海淀五路居;10号线_慈寿寺","latitude":"39.931142","longitude":"116.279543","hitags":null,"resumeProcessRate":0,"resumeProcessDay":0,"score":78,"newScore":0.0,"matchScore":5.9489713,"matchScoreExplain":null,"query":null,"explain":null,"isSchoolJob":0,"adWord":0,"plus":null,"pcShow":0,"appShow":0,"deliver":0,"gradeDescription":null,"promotionScoreExplain":null,"isHotHire":0,"count":0,"aggregatePositionIds":[],"famousCompany":false},{"positionId":6816233,"positionName":"PHP/Golang开发工程师","companyId":35361,"companyFullName":"北京百家互联科技有限公司","companyShortName":"跟谁学","companyLogo":"i/image2/M01/50/CF/CgotOV0Ru5CAZIIwAAAdQkIlneg727.png","companySize":"2000人以上","industryField":"移动互联网,教育","financeStage":"上市公司","companyLabelList":["节日礼物","技能培训","股票期权","精英团队"],"firstType":"开发|测试|运维类","secondType":"后端开发","thirdType":"PHP","skillLables":["PHP","GO"],"positionLables":["PHP","GO"],"industryLables":[],"createTime":"2020-02-18 16:50:03","formatCreateTime":"16:50发布","city":"北京","district":"海淀区","businessZones":["西北旺","上地","马连洼"],"salary":"25k-40k","workYear":"3-5年","jobNature":"全职","education":"本科","positionAdvantage":"六险一金年底奖金每日饭补免费班车","imState":"today","lastLogin":"2020-02-18 
16:43:01","publisherId":4014063,"approve":1,"subwayline":null,"stationname":null,"linestaion":null,"latitude":"40.044084","longitude":"116.283833","hitags":null,"resumeProcessRate":100,"resumeProcessDay":2,"score":73,"newScore":0.0,"matchScore":5.0787764,"matchScoreExplain":null,"query":null,"explain":null,"isSchoolJob":0,"adWord":0,"plus":null,"pcShow":0,"appShow":0,"deliver":0,"gradeDescription":null,"promotionScoreExplain":null,"isHotHire":0,"count":0,"aggregatePositionIds":[],"famousCompany":true},{"positionId":6813144,"positionName":"php开发工程师","companyId":133327,"companyFullName":"北京山香时代教育科技股份有限公司","companyShortName":"山香教育","companyLogo":"i/image/M00/34/8C/Cgp3O1dWHpWAIyTNAADS4vk0O4k567.JPG","companySize":"500-2000人","industryField":"移动互联网,教育","financeStage":"不需要融资","companyLabelList":[],"firstType":"开发|测试|运维类","secondType":"后端开发","thirdType":"PHP","skillLables":["PHP","Javascript","后端"],"positionLables":["电商","教育","PHP","Javascript","后端"],"industryLables":["电商","教育","PHP","Javascript","后端"],"createTime":"2020-02-18 14:41:02","formatCreateTime":"14:41发布","city":"北京","district":"海淀区","businessZones":null,"salary":"15k-30k","workYear":"3-5年","jobNature":"全职","education":"本科","positionAdvantage":"新三板上市公司,上升期赛道","imState":"today","lastLogin":"2020-02-18 
17:45:13","publisherId":10344288,"approve":1,"subwayline":"10号线","stationname":"知春路","linestaion":"10号线_知春路;13号线_大钟寺;13号线_知春路","latitude":"39.965964","longitude":"116.340968","hitags":null,"resumeProcessRate":67,"resumeProcessDay":1,"score":73,"newScore":0.0,"matchScore":5.2369933,"matchScoreExplain":null,"query":null,"explain":null,"isSchoolJob":0,"adWord":0,"plus":null,"pcShow":0,"appShow":0,"deliver":0,"gradeDescription":null,"promotionScoreExplain":null,"isHotHire":0,"count":0,"aggregatePositionIds":[],"famousCompany":false},{"positionId":6815671,"positionName":"PHP工程师实习生","companyId":181825,"companyFullName":"北京多氪信息科技有限公司","companyShortName":"36氪","companyLogo":"i/image2/M00/1E/5D/CgotOVoKxFWAHg7fAABBnhqwVzg328.jpg","companySize":"500-2000人","industryField":"移动互联网,电商","financeStage":"上市公司","companyLabelList":["帅哥多","美女多","岗位晋升","五险一金"],"firstType":"开发|测试|运维类","secondType":"后端开发","thirdType":"后端开发","skillLables":["后端","Linux/Unix","Laravel","Phalcon"],"positionLables":["后端","Linux/Unix","Laravel","Phalcon"],"industryLables":[],"createTime":"2020-02-18 14:46:43","formatCreateTime":"14:46发布","city":"北京","district":"朝阳区","businessZones":["甜水园","水碓子","朝阳公园"],"salary":"1k-2k","workYear":"不限","jobNature":"兼职","education":"本科","positionAdvantage":"发展前景好","imState":"today","lastLogin":"2020-02-18 16:52:43","publisherId":16050296,"approve":1,"subwayline":"6号线","stationname":"团结湖","linestaion":"6号线_金台路;10号线_团结湖;14号线东段_枣营;14号线东段_朝阳公园;14号线东段_金台路","latitude":"39.93226","longitude":"116.477087","hitags":null,"resumeProcessRate":0,"resumeProcessDay":0,"score":67,"newScore":0.0,"matchScore":4.8098063,"matchScoreExplain":null,"query":null,"explain":null,"isSchoolJob":0,"adWord":0,"plus":null,"pcShow":0,"appShow":0,"deliver":0,"gradeDescription":null,"promotionScoreExplain":null,"isHotHire":0,"count":0,"aggregatePositionIds":[],"famousCompany":true},{"positionId":6568672,"positionName":"PHP专家","companyId":119153,"companyFullName":"北京大生在线科技有限公司","companyShortName":"51Talk 
无忧英语","companyLogo":"i/image3/M00/3C/9A/CgpOIFqvdy2AXqxqAAC1icI3g90370.png","companySize":"2000人以上","industryField":"移动互联网,教育","financeStage":"上市公司","companyLabelList":["带薪年假","股票期权","绩效奖金","定期体检"],"firstType":"开发|测试|运维类","secondType":"后端开发","thirdType":"PHP","skillLables":[],"positionLables":["教育"],"industryLables":["教育"],"createTime":"2020-02-18 18:53:33","formatCreateTime":"18:53发布","city":"北京","district":"海淀区","businessZones":["上地","西二旗","清河"],"salary":"35k-45k","workYear":"5-10年","jobNature":"全职","education":"本科","positionAdvantage":"团队氛围好 成长快 美国上市 在线教育","imState":"today","lastLogin":"2020-02-18 19:56:36","publisherId":6092581,"approve":1,"subwayline":"昌平线","stationname":"西二旗","linestaion":"13号线_西二旗;昌平线_西二旗","latitude":"40.045415","longitude":"116.308144","hitags":null,"resumeProcessRate":1,"resumeProcessDay":1,"score":54,"newScore":0.0,"matchScore":10.379056,"matchScoreExplain":null,"query":null,"explain":null,"isSchoolJob":0,"adWord":0,"plus":null,"pcShow":0,"appShow":0,"deliver":0,"gradeDescription":null,"promotionScoreExplain":null,"isHotHire":0,"count":0,"aggregatePositionIds":[],"famousCompany":false},{"positionId":6037634,"positionName":"初级PHP","companyId":515740,"companyFullName":"西藏嗨娱网络科技有限公司","companyShortName":"西藏嗨娱","companyLogo":"i/image2/M01/46/3D/CgoB5l0A4sCAQ3n2AAAMfWv0964618.png","companySize":"50-150人","industryField":"移动互联网","financeStage":"不需要融资","companyLabelList":["绩效奖金","带薪年假","定期体检","午餐补助"],"firstType":"开发|测试|运维类","secondType":"后端开发","thirdType":"PHP","skillLables":["PHP","Python"],"positionLables":["PHP","Python"],"industryLables":[],"createTime":"2020-02-18 17:53:41","formatCreateTime":"17:53发布","city":"北京","district":"朝阳区","businessZones":["望京","来广营"],"salary":"10k-20k","workYear":"1-3年","jobNature":"全职","education":"本科","positionAdvantage":"五险一金14薪 补充医疗 大牛团队","imState":"today","lastLogin":"2020-02-18 
19:45:58","publisherId":3735589,"approve":1,"subwayline":"15号线","stationname":"东湖渠","linestaion":"14号线东段_东湖渠;15号线_望京东","latitude":"40.009879","longitude":"116.484758","hitags":null,"resumeProcessRate":0,"resumeProcessDay":0,"score":50,"newScore":0.0,"matchScore":10.078444,"matchScoreExplain":null,"query":null,"explain":null,"isSchoolJob":0,"adWord":0,"plus":null,"pcShow":0,"appShow":0,"deliver":0,"gradeDescription":null,"promotionScoreExplain":null,"isHotHire":0,"count":0,"aggregatePositionIds":[],"famousCompany":false},{"positionId":6815636,"positionName":"PHP开发工程师(YII2框架)","companyId":84447248,"companyFullName":"北京小杏科技有限公司","companyShortName":"小杏科技","companyLogo":"i/image2/M01/A4/3B/CgotOV3A49WAJbdSAAAKhBqkli0908.png","companySize":"少于15人","industryField":"软件开发","financeStage":"未融资","companyLabelList":[],"firstType":"开发|测试|运维类","secondType":"后端开发","thirdType":"PHP","skillLables":["Yii","HTML/CSS","PHP"],"positionLables":["Yii","HTML/CSS","PHP"],"industryLables":[],"createTime":"2020-02-18 14:40:31","formatCreateTime":"14:40发布","city":"北京","district":"昌平区","businessZones":["回龙观"],"salary":"9k-14k","workYear":"1-3年","jobNature":"全职","education":"本科","positionAdvantage":"提升空间","imState":"today","lastLogin":"2020-02-18 
17:25:25","publisherId":9398353,"approve":1,"subwayline":"昌平线","stationname":"生命科学园","linestaion":"8号线北段_育知路;8号线北段_朱辛庄;昌平线_朱辛庄;昌平线_生命科学园","latitude":"40.091482","longitude":"116.310499","hitags":null,"resumeProcessRate":0,"resumeProcessDay":0,"score":44,"newScore":0.0,"matchScore":4.0345416,"matchScoreExplain":null,"query":null,"explain":null,"isSchoolJob":0,"adWord":0,"plus":null,"pcShow":0,"appShow":0,"deliver":0,"gradeDescription":null,"promotionScoreExplain":null,"isHotHire":0,"count":0,"aggregatePositionIds":[],"famousCompany":false},{"positionId":4130564,"positionName":"PHP架构师","companyId":44091,"companyFullName":"北京自如生活企业管理有限公司","companyShortName":"自如网","companyLogo":"i/image3/M00/06/87/Cgq2xlpgCByAMD8nAACUbQvvrew382.png","companySize":"2000人以上","industryField":"房产家居","financeStage":"A轮","companyLabelList":["丰厚年薪","六险一金","健康体检","带薪年假"],"firstType":"开发|测试|运维类","secondType":"后端开发","thirdType":"其他后端开发","skillLables":["Java","MySQL","PHP"],"positionLables":["Java","MySQL","PHP"],"industryLables":[],"createTime":"2020-02-18 20:22:16","formatCreateTime":"20:22发布","city":"北京","district":"朝阳区","businessZones":["酒仙桥","将台路","大山子"],"salary":"25k-40k","workYear":"5-10年","jobNature":"全职","education":"本科","positionAdvantage":"年底多薪、绩效奖金、股权激励、餐补、房补","imState":"today","lastLogin":"2020-02-18 
20:40:59","publisherId":3557386,"approve":1,"subwayline":"14号线东段","stationname":"将台","linestaion":"14号线东段_将台","latitude":"39.97208007","longitude":"116.49605","hitags":null,"resumeProcessRate":100,"resumeProcessDay":2,"score":41,"newScore":0.0,"matchScore":7.7210054,"matchScoreExplain":null,"query":null,"explain":null,"isSchoolJob":0,"adWord":0,"plus":null,"pcShow":0,"appShow":0,"deliver":0,"gradeDescription":null,"promotionScoreExplain":null,"isHotHire":0,"count":0,"aggregatePositionIds":[],"famousCompany":false},{"positionId":5939455,"positionName":"PHP后台开发","companyId":451,"companyFullName":"腾讯科技(深圳)有限公司","companyShortName":"腾讯","companyLogo":"image1/M00/00/03/CgYXBlTUV_qALGv0AABEuOJDipU378.jpg","companySize":"2000人以上","industryField":"社交","financeStage":"上市公司","companyLabelList":["免费班车","成长空间","年度旅游","岗位晋升"],"firstType":"开发|测试|运维类","secondType":"后端开发","thirdType":"PHP","skillLables":["后端","PHP","Linux/Unix"],"positionLables":["后端","PHP","Linux/Unix"],"industryLables":[],"createTime":"2020-02-18 19:43:57","formatCreateTime":"19:43发布","city":"北京","district":"海淀区","businessZones":["学院路","中关村","知春路"],"salary":"30k-50k","workYear":"3-5年","jobNature":"全职","education":"本科","positionAdvantage":"大平台","imState":"today","lastLogin":"2020-02-18 
20:18:53","publisherId":3309774,"approve":1,"subwayline":"10号线","stationname":"知春路","linestaion":"10号线_知春里;10号线_知春路;10号线_西土城;13号线_大钟寺;13号线_知春路","latitude":"39.977273","longitude":"116.33701","hitags":["免费班车","年轻团队","学习机会","mac办公","定期团建","开工利是红包"],"resumeProcessRate":100,"resumeProcessDay":4,"score":40,"newScore":0.0,"matchScore":7.546966,"matchScoreExplain":null,"query":null,"explain":null,"isSchoolJob":0,"adWord":0,"plus":null,"pcShow":0,"appShow":0,"deliver":0,"gradeDescription":null,"promotionScoreExplain":null,"isHotHire":0,"count":0,"aggregatePositionIds":[],"famousCompany":true},{"positionId":5998339,"positionName":"php","companyId":81144,"companyFullName":"北京快松果科技有限公司","companyShortName":"松果出行","companyLogo":"i/image2/M01/78/55/CgotOVtqg0CAJGhNAAD_xRk0SWk667.png","companySize":"150-500人","industryField":"消费生活","financeStage":"C轮","companyLabelList":["年底双薪","股票期权","带薪年假","年度旅游"],"firstType":"开发|测试|运维类","secondType":"后端开发","thirdType":"PHP","skillLables":["后端","PHP"],"positionLables":["后端"],"industryLables":[],"createTime":"2020-02-15 21:01:53","formatCreateTime":"3天前发布","city":"北京","district":"朝阳区","businessZones":["望京","大山子","花家地"],"salary":"25k-50k","workYear":"5-10年","jobNature":"全职","education":"本科","positionAdvantage":"有竞争力的薪资,小而美,技术棒,福利好","imState":"threeDays","lastLogin":"2020-02-17 
11:18:19","publisherId":4782743,"approve":1,"subwayline":"14号线东段","stationname":"望京南","linestaion":"14号线东段_阜通;14号线东段_望京南","latitude":"39.986149","longitude":"116.481124","hitags":null,"resumeProcessRate":37,"resumeProcessDay":0,"score":39,"newScore":0.0,"matchScore":25.767796,"matchScoreExplain":null,"query":null,"explain":null,"isSchoolJob":0,"adWord":0,"plus":null,"pcShow":0,"appShow":0,"deliver":0,"gradeDescription":null,"promotionScoreExplain":null,"isHotHire":0,"count":0,"aggregatePositionIds":[],"famousCompany":false}],"locationInfo":{"city":"北京","district":null,"businessZone":null,"isAllhotBusinessZone":false,"locationCode":null,"queryByGisCode":false},"queryAnalysisInfo":{"positionName":"php","companyName":null,"industryName":null,"usefulCompany":false,"jobNature":null},"strategyProperty":{"name":"dm-csearch-experimentalPositionStrategy","id":0},"hotLabels":null,"hiTags":null,"industryField":null,"companySize":null,"positionName":null,"totalCount":380,"categoryTypeAndName":{"3":"PHP"}},"pageSize":15},"resubmitToken":null,"requestId":null}

This gets us past Lagou's anti-crawler mechanism.
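The same two-step pattern (GET the list page, then POST to the Ajax endpoint) could be wrapped in a function that requests several pages by changing `pn`. This is only a sketch under assumptions: the name `fetch_pages` and its `delay` parameter are hypothetical, the form keeps `first` and `kd` exactly as in the single-page request, and whether one warm-up GET is enough for several pages was not tested in this post.

```python
import time


def fetch_pages(session, list_url, ajax_url, headers, keyword, pages, delay=0.0):
    """Warm the session on the list page, then POST once per page number."""
    # Collect cookies first, as in the single-page script
    session.get(list_url, headers=headers)
    bodies = []
    for page in range(1, pages + 1):
        # Assumption: only "pn" changes between pages; "first" and "kd" are
        # kept exactly as in the single-page request.
        form = {"first": "true", "pn": page, "kd": keyword}
        resp = session.post(ajax_url, data=form, headers=headers)
        bodies.append(resp.text)
        # Pause between requests so the site is not hit in rapid succession
        time.sleep(delay)
    return bodies
```

With the variables from the script above, a call might look like `fetch_pages(requests.Session(), url1, url, header, "php", pages=3, delay=2)`. The session is passed in rather than created inside the function, so the function works with any object that offers `get` and `post`.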

With the data in hand, the follow-up analysis is straightforward!
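As a starting point for that analysis, here is a small sketch that pulls a few fields out of each job entry. It assumes the nesting seen in the successful response above (`content` → `positionResult` → `result`); the function name `extract_jobs` and the choice of fields are illustrative, and the sample is a trimmed entry copied from that response rather than a live request.

```python
import json


def extract_jobs(body):
    """Pull a few fields out of each entry in content.positionResult.result."""
    payload = json.loads(body)
    results = payload["content"]["positionResult"]["result"]
    return [
        {
            "position": job.get("positionName"),
            "company": job.get("companyShortName"),
            "salary": job.get("salary"),
            "district": job.get("district"),
        }
        for job in results
    ]


# A trimmed sample with the same nesting as the response above
sample = json.dumps({
    "success": True,
    "content": {"positionResult": {"result": [
        {"positionName": "PHP后台开发", "companyShortName": "腾讯",
         "salary": "30k-50k", "district": "海淀区"},
    ]}},
})
for job in extract_jobs(sample):
    print(job["position"], job["company"], job["salary"], job["district"])
```

The resulting list of dictionaries can be written to CSV or loaded into whatever analysis tool is preferred.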


Original: https://www.cnblogs.com/yaoyaoyao123/p/12328995.html
