26-4 amazon.com 某类商品信息 - 图1

抓取的结果信息包含:

  • 商品名字
  • 商品链接
  • 商品评分
  • 评论数
  • 价格
  • 运费情况

    结果示例图:

    26-4 amazon.com 某类商品信息 - 图2

    模板:

    1. {"_id":"amazon-com-fenlei","startUrl":["https://www.amazon.com/s?i=specialty-aps&bbn=16225018011&rh=n%3A7141123011%2Cn%3A16225018011%2Cn%3A1040660&ref=nav_em_T1_0_4_NaN_1__nav_desktop_sa_intl_clothing"],"selectors":[{"id":"info","type":"SelectorElement","parentSelectors":["_root","panination"],"selector":"div.s-expand-height","multiple":true,"delay":0},{"id":"title","type":"SelectorLink","parentSelectors":
    2. ["info"],"selector":".a-size-mini a","multiple":false,"delay":0},{"id":"score","type":"SelectorText","parentSelectors":
    3. ["info"],"selector":"i.a-icon-star-small","multiple":false,"regex":"","delay":0},{"id":"reviews","type":"SelectorText","parentSelectors":
    4. ["info"],"selector":"span.a-size-base","multiple":false,"regex":"","delay":0},{"id":"price","type":"SelectorText","parentSelectors":
    5. ["info"],"selector":".a-row div","multiple":false,"regex":"","delay":0},{"id":"ship","type":"SelectorText","parentSelectors":
    6. ["info"],"selector":"div.a-color-secondary","multiple":false,"regex":"","delay":0},{"id":"panination","type":"SelectorLink","parentSelectors":["_root","panination"],"selector":".a-last a","multiple":true,"delay":0}]}

    模板套用步骤:

    (1)进入需要抓取的商品分类页面,例如:https://www.amazon.com/s?i=fashion-womens-intl-ship&bbn=16225018011&rh=n%3A7141123011%2Cn%3A16225018011%2Cn%3A1040660&page=3&qid=1581393357&ref=sr_pg_3
    (2)导入模板
    (3)替换 Start URL为要抓取的网页链接
    (4)开始抓取

需要停止时,可以断网。