image (3).png

抓取的结果信息包含:

  • 问答标题
  • 回答链接
  • 赞同数
  • 评论数

    结果示例图:

    image (4).png
    模板:
    1. {"_id":"zhihusearch","startUrl":["https://www.zhihu.com/search?type=content&q=%E5%A6%82%E4%BD%95"],"selectors":[{"id":"info","type":"SelectorElementScroll","parentSelectors":["_root"],"selector":"div.List-item","multiple":true,"delay":"3000"},{"id":"title","type":"SelectorLink","parentSelectors":
    2. ["info"],"selector":"h2 a","multiple":false,"delay":0},{"id":"likes","type":"SelectorText","parentSelectors":["info"],"selector":"button.VoteButton--up","multiple":false,"regex":"","delay":0},{"id":"comments","type":"SelectorText","parentSelectors":["info"],"selector":"button.ContentItem-action:nth-of-type(1)","multiple":false,"regex":"","delay":0}]}

    模板套用步骤:

    (1)进入知乎搜索结果综合页面,例如:https://www.zhihu.com/search?type=content&q=%E5%A6%82%E4%BD%95
    (2)导入模板
    (3)替换 Start URL为要抓取的网页链接
    (4)开始抓取

    注意事项

    (1)此模板只针对“知乎搜索结果的「综合」”栏目,请检查自己的网址正确
    (2)必须等弹出的抓取窗口下拉到最底部,加载完数据,自动关闭后,点击“refresh”,才能看到结果,否则无法保存数据。(下拉时间,取决于网页数据多少,请耐心等待)
    (3)如需提前中断,可以断网,等抓取窗口自动关闭,然后在点击“refresh”。

视频教程地址:https://m.lizhiweike.com/lecture2/17692220

下面的
1-4
1-5
1-6
1-7
步骤和这个相同,只是模板和start URL需要替换一下。
你可以对照一下。