23-4 当当图书评论 - 图1

抓取的结果信息包含:

  • 评论内容
  • 评论时间

    结果示例图:

    23-4 当当图书评论 - 图2

    模板:

    1. {"_id":"dangdang-comments","startUrl":["http://product.dangdang.com/27885398.html"],"selectors":[{"id":"info","type":"SelectorElementClick","parentSelectors":["_root"],"selector":"#comment_list div.comment_items","multiple":true,"delay":"2000","clickElementSelector":"a.btn.next","clickType":"clickMore","discardInitialElements":"do-not-discard","clickElementUniquenessType":"uniqueCSSSelector"},{"id":"content","type":"SelectorText","parentSelectors":
    2. ["info"],"selector":"span a","multiple":false,"regex":"","delay":0},{"id":"time","type":"SelectorText","parentSelectors":
    3. ["info"],"selector":".starline span:nth-of-type(1)","multiple":false,"regex":"","delay":0}]}

    模板套用步骤:

    (1)进入需要抓取的图书评论页面,例如:http://product.dangdang.com/27885398.html
    (2)导入模板
    (3)替换 Start URL为要抓取的网页链接,(抓取多页需修改 Start URL 里的页码数)
    (4)开始抓取