The scraped results include:
- Book title
- Book link
- Image from the book's detail page
Example result screenshot:
Template:
{"_id":"dangdang-xiangqingye","startUrl":["http://category.dangdang.com/cp01.22.04.00.00.00.html"],"selectors":[
{"id":"info","type":"SelectorElement","parentSelectors":["_root"],"selector":".bigimg li","multiple":true,"delay":0},
{"id":"name","type":"SelectorLink","parentSelectors":["info"],"selector":".name a","multiple":false,"delay":0},
{"id":"img","type":"SelectorImage","parentSelectors":["name"],"selector":".descrip img","multiple":false,"delay":0}
]}
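To see how the three selectors in the template nest, it can help to print them as a tree before importing. The following is only a minimal Python sketch, not part of the scraping tool itself; the helper name selector_tree is made up for this illustration, and the template string is copied from the text above.

```python
import json

# Sitemap template copied verbatim from the tutorial text.
TEMPLATE = '''
{"_id":"dangdang-xiangqingye","startUrl":["http://category.dangdang.com/cp01.22.04.00.00.00.html"],"selectors":[
{"id":"info","type":"SelectorElement","parentSelectors":["_root"],"selector":".bigimg li","multiple":true,"delay":0},
{"id":"name","type":"SelectorLink","parentSelectors":["info"],"selector":".name a","multiple":false,"delay":0},
{"id":"img","type":"SelectorImage","parentSelectors":["name"],"selector":".descrip img","multiple":false,"delay":0}
]}
'''


def selector_tree(sitemap, parent="_root", depth=0):
    """Hypothetical helper: list the selectors under `parent`, indented by depth."""
    lines = []
    for sel in sitemap["selectors"]:
        if parent in sel["parentSelectors"]:
            lines.append("  " * depth + f'{sel["id"]} ({sel["type"]}): {sel["selector"]}')
            # Recurse to collect the selectors that use this one as their parent.
            lines.extend(selector_tree(sitemap, sel["id"], depth + 1))
    return lines


sitemap = json.loads(TEMPLATE)  # raises an error if the template is not valid JSON
tree = selector_tree(sitemap)
print("\n".join(tree))
```

Read top to bottom, the tree shows that "info" selects each book entry on the list page, "name" is the link inside each entry (this yields the book title and book link), and "img" is a child of that link, so it is evaluated on the detail page the link leads to.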
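Steps to apply the template:
(1) Open the book list page you want to scrape, for example: http://category.dangdang.com/cp01.22.04.00.00.00.html
(2) Import the template
(3) Replace the Start URL with the link of the page you want to scrape (to scrape multiple pages, change the page number in the Start URL)
(4) Start scraping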
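For step (3), editing the page number by hand for every page gets tedious, so the start URLs can also be generated. The sketch below is a rough illustration under one loudly stated assumption: that page n of a category lives at the same address with "pg<n>-" placed before the file name. That pattern is not given in this tutorial, so check it against the address your browser shows on the second page of the list before relying on it. The function name with_pages is hypothetical, and the selectors are left out for brevity.

```python
import json


def with_pages(sitemap, first, last):
    """Hypothetical helper: return a copy of the sitemap covering pages first..last."""
    base = sitemap["startUrl"][0]
    prefix, filename = base.rsplit("/", 1)
    urls = []
    for n in range(first, last + 1):
        # ASSUMPTION: page n is addressed as ".../pg<n>-<filename>".
        # Verify this in the browser; the tutorial does not state the pattern.
        urls.append(f"{prefix}/pg{n}-{filename}")
    updated = dict(sitemap)  # shallow copy, so the original sitemap keeps its startUrl
    updated["startUrl"] = urls
    return updated


# Template from the tutorial; paste the full "selectors" list here before importing.
sitemap = {
    "_id": "dangdang-xiangqingye",
    "startUrl": ["http://category.dangdang.com/cp01.22.04.00.00.00.html"],
    "selectors": [],
}

paged = with_pages(sitemap, 1, 3)
print(json.dumps(paged, ensure_ascii=False))
```

The printed JSON can then be pasted into the import dialog in place of the original template, provided the selectors have been filled back in.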