29-3 速卖通某类商品列表信息 - 图2

抓取的结果信息包含:

  • 商品标题
  • 商品链接
  • 价格
  • 评分
  • sold
  • 店铺名字
  • 店铺链接

    结果示例图:

    29-3 速卖通某类商品列表信息 - 图3

    模板:

    1. {"_id":"aliexpress-fenlei","startUrl":["https://www.aliexpress.com/category/100003109/women-clothing.html?trafficChannel=main&catName=women-clothing&CatId=100003109&ltype=wholesale&SortType=default&page=[1-5]&isrefine=y"],"selectors":
    2. [{"id":"info","type":"SelectorElementScroll","parentSelectors":["_root"],"selector":"li.list-item","multiple":true,"delay":"2000"},{"id":"title","type":"SelectorLink","parentSelectors":
    3. ["info"],"selector":"a.item-title","multiple":false,"delay":0},{"id":"price","type":"SelectorText","parentSelectors":
    4. ["info"],"selector":"div.item-price-row","multiple":false,"regex":"","delay":0},{"id":"score","type":"SelectorText","parentSelectors":
    5. ["info"],"selector":"span.rating-value","multiple":false,"regex":"","delay":0},{"id":"sold","type":"SelectorText","parentSelectors":
    6. ["info"],"selector":"a.sale-value-link","multiple":false,"regex":"\\d+","delay":0},{"id":"store-name","type":"SelectorLink","parentSelectors":
    7. ["info"],"selector":"a.store-name","multiple":false,"delay":0}]}

    模板套用步骤:

    (1)进入需要抓取的商品分类页面,例如:https://www.aliexpress.com/category/100003109/women-clothing.html?trafficChannel=main&catName=women-clothing&CatId=100003109&ltype=wholesale&SortType=default&page=1&isrefine=y
    (2)导入模板
    (3)替换 Start URL为要抓取的网页链接,(抓取多页需修改 Start URL 里的页码数)
    (4)开始抓取