1-9 知乎大v关注的所有问题 - 图1

抓取的结果信息包含:

  • 问题标题
  • 问题内容链接
  • 时间
  • 回答数
  • 关注人数

    结果示例图:

    1-9 知乎大v关注的所有问题 - 图2

    模板:

    1. {"_id":"zhihuproblem2","startUrl":["https://www.zhihu.com/people/wangdegnke/following/questions?page=[1-23]"],"selectors":[{"id":"outer","type":"SelectorElement","parentSelectors":
    2. ["_root"],"selector":"div.List-item","multiple":true,"delay":"2000"},{"id":"problem","type":"SelectorLink","parentSelectors":["outer"],"selector":"a","multiple":false,"delay":0},{"id":"time","type":"SelectorText","parentSelectors":["outer"],"selector":"span.ContentItem-statusItem:nth-of-type(1)","multiple":false,"regex":"","delay":0},{"id":"answer","type":"SelectorText","parentSelectors":["outer"],"selector":"span.ContentItem-statusItem:nth-of-type(2)","multiple":false,"regex":"","delay":0},{"id":"followers","type":"SelectorText","parentSelectors":["outer"],"selector":"span.ContentItem-statusItem:nth-of-type(3)","multiple":false,"regex":"","delay":0}]}

    模板套用步骤:

    (1)进入大V关注问题页面,例如:
    https://www.zhihu.com/people/wangdegnke/following/questions
    (2)导入模板
    (3)替换 Start URL为要抓取的网页链接,修改要抓取的页数,比如:”[1-5]“是 第 1 页到第 5 页所有问题。
    (4)开始抓取