抓取的结果信息包含:
- 用户名字
- 评价分数
- 商品状态
- 时间
- 评价内容
结果示例图:
模板:
{"_id":"amazon-com-comments","startUrl":
["https://www.amazon.com/Apple-iPhone-Pro-Clear-Case/product-reviews/B07XQXZWVT/ref=cm_cr_arp_d_viewopt_sr?ie=UTF8&reviewerType=all_reviews&pageNumber=1&filterByStar=all_stars"],"selectors":[{"id":"info","type":"SelectorElementClick","parentSelectors":
["_root"],"selector":".a-row div.celwidget","multiple":true,"delay":"3000","clickElementSelector":".a-last a","clickType":"clickMore","discardInitialElements":"do-not-discard","clickElementUniquenessType":"uniqueCSSSelector"},{"id":"name","type":"SelectorText","parentSelectors":
["info"],"selector":"span.a-profile-name","multiple":false,"regex":"","delay":0},{"id":"score","type":"SelectorText","parentSelectors":
["info"],"selector":"> div:nth-of-type(2)","multiple":false,"regex":"","delay":0},{"id":"status","type":"SelectorText","parentSelectors":
["info"],"selector":"div.a-spacing-mini.review-data","multiple":false,"regex":"","delay":0},{"id":"time","type":"SelectorText","parentSelectors":
["info"],"selector":"span.a-color-secondary","multiple":false,"regex":"","delay":0},{"id":"content","type":"SelectorText","parentSelectors":
["info"],"selector":"div.a-spacing-small","multiple":false,"regex":"","delay":0}]}
模板套用步骤:
(1)进入需要抓取的商品评论页面,例如:https://www.amazon.com/Apple-iPhone-Pro-Clear-Case/product-reviews/B07XQXZWVT/ref=cm_cr_getr_d_paging_btm_next_3?ie=UTF8&reviewerType=all_reviews&pageNumber=3&filterByStar=all_stars
(2)导入模板
(3)替换 Start URL为要抓取的网页链接
(4)开始抓取
如需抓取不同类型,可以在抓取页面弹出后手动设置