34-1 智联招聘 (Zhilian Zhaopin) job search information - Figure 2

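The scraped results contain the following fields:

  • Job title
  • Company name
  • Salary
  • Location
  • Experience requirement
  • Education requirement
  • Company type
  • Company size (number of employees)
  • Company atmosphere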

    Sample results:

    34-1 智联招聘 (Zhilian Zhaopin) job search information - Figure 3

    Template:

    {"_id":"zhilian-search","startUrl":["https://sou.zhaopin.com/?p=[1-5]&jl=530&kw=%E4%BA%A7%E5%93%81%E7%BB%8F%E7%90%86&kt=3"],"selectors":[
    {"id":"info","type":"SelectorElement","parentSelectors":["_root"],"selector":"div.contentpile__content__wrapper","multiple":true,"delay":0},
    {"id":"title","type":"SelectorText","parentSelectors":["info"],"selector":"span.contentpile__content__wrapper__item__info__box__jobname__title","multiple":false,"regex":"","delay":0},
    {"id":"company","type":"SelectorText","parentSelectors":["info"],"selector":"a.contentpile__content__wrapper__item__info__box__cname__title","multiple":false,"regex":"","delay":0},
    {"id":"salary","type":"SelectorText","parentSelectors":["info"],"selector":"p.contentpile__content__wrapper__item__info__box__job__saray","multiple":false,"regex":"","delay":0},
    {"id":"address","type":"SelectorText","parentSelectors":["info"],"selector":"li:nth-of-type(1)","multiple":false,"regex":"","delay":0},
    {"id":"experience","type":"SelectorText","parentSelectors":["info"],"selector":"li:nth-of-type(2)","multiple":false,"regex":"","delay":0},
    {"id":"xueli","type":"SelectorText","parentSelectors":["info"],"selector":"li:nth-of-type(3)","multiple":false,"regex":"","delay":0},
    {"id":"company-intro1","type":"SelectorText","parentSelectors":["info"],"selector":"span.contentpile__content__wrapper__item__info__box__job__comdec__item:nth-of-type(1)","multiple":false,"regex":"","delay":0},
    {"id":"company-intro2","type":"SelectorText","parentSelectors":["info"],"selector":"span.contentpile__content__wrapper__item__info__box__job__comdec__item:nth-of-type(2)","multiple":false,"regex":"","delay":0},
    {"id":"tag","type":"SelectorText","parentSelectors":["info"],"selector":"div.job_welfare","multiple":false,"regex":"","delay":0}]}
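
A template copied from a web page can pick up stray characters or lose a bracket, and the import will then fail. The Python sketch below is one possible way to sanity-check a template before importing it: it parses the JSON and confirms that every `parentSelectors` entry refers to `_root` or to a selector defined in the same template. The `check_sitemap` helper is hypothetical (it is not part of the scraping tool), and `sitemap_text` is an abbreviated copy of the template above with only three of its selectors.

```python
import json

# Abbreviated copy of the template above (three selectors, same structure).
sitemap_text = """
{"_id":"zhilian-search",
 "startUrl":["https://sou.zhaopin.com/?p=[1-5]&jl=530&kw=%E4%BA%A7%E5%93%81%E7%BB%8F%E7%90%86&kt=3"],
 "selectors":[
  {"id":"info","type":"SelectorElement","parentSelectors":["_root"],"selector":"div.contentpile__content__wrapper","multiple":true,"delay":0},
  {"id":"title","type":"SelectorText","parentSelectors":["info"],"selector":"span.contentpile__content__wrapper__item__info__box__jobname__title","multiple":false,"regex":"","delay":0},
  {"id":"address","type":"SelectorText","parentSelectors":["info"],"selector":"li:nth-of-type(1)","multiple":false,"regex":"","delay":0}
 ]}
"""


def check_sitemap(text):
    """Hypothetical helper: parse a template and list dangling parent references."""
    sitemap = json.loads(text)  # raises json.JSONDecodeError if the text is not valid JSON
    known_ids = {"_root"} | {sel["id"] for sel in sitemap["selectors"]}
    problems = []
    for sel in sitemap["selectors"]:
        for parent in sel["parentSelectors"]:
            if parent not in known_ids:
                problems.append(f"{sel['id']}: unknown parent {parent!r}")
    return sitemap, problems


sitemap, problems = check_sitemap(sitemap_text)
selector_ids = [sel["id"] for sel in sitemap["selectors"]]
print("selectors:", selector_ids)
print("problems:", problems)
```

An empty `problems` list only means the template is structurally consistent; it does not confirm that the CSS selectors still match the live page.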

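    Steps for applying the template:

    (1) Open the search results page you want to scrape, for example: https://sou.zhaopin.com/?jl=530&kw=%E4%BA%A7%E5%93%81%E7%BB%8F%E7%90%86&kt=3
    (2) Import the template.
    (3) Replace the Start URL with the link of the page you want to scrape. To scrape multiple pages, change the page range in the Start URL (the `p=[1-5]` part of the template).
    (4) Start scraping.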
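
Step (3) amounts to editing two parts of the Start URL: the percent-encoded keyword in `kw` and the page range in `p`. The Python sketch below shows one way to produce such a URL and to preview the pages a range like `[1-5]` stands for. The function names `build_start_url` and `expand_pages` are hypothetical, the parameter values `jl=530` and `kt=3` are simply copied from the template, and the range expansion is an approximation of how the scraping tool is assumed to interpret a plain `[start-end]` range.

```python
import re
from urllib.parse import quote

BASE = "https://sou.zhaopin.com/"


def build_start_url(keyword, last_page, jl="530", kt="3"):
    """Hypothetical helper: Start URL covering pages 1..last_page for a keyword."""
    if last_page < 1:
        raise ValueError("last_page must be at least 1")
    # quote() percent-encodes the UTF-8 bytes of the keyword, as in the template URL.
    return f"{BASE}?p=[1-{last_page}]&jl={jl}&kw={quote(keyword)}&kt={kt}"


def expand_pages(start_url):
    """Expand the first [start-end] range into one concrete URL per page."""
    match = re.search(r"\[(\d+)-(\d+)\]", start_url)
    if match is None:
        return [start_url]
    first, last = int(match.group(1)), int(match.group(2))
    head, tail = start_url[:match.start()], start_url[match.end():]
    return [f"{head}{page}{tail}" for page in range(first, last + 1)]


start_url = build_start_url("产品经理", last_page=5)
pages = expand_pages(start_url)
print(start_url)
for page_url in pages:
    print(page_url)
```

With the keyword 产品经理 and five pages, `build_start_url` reproduces the Start URL used in the template, so the same call with a different keyword or page count gives a value that can be pasted into the Start URL field.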