40-1 youtube 视频搜索结果信息 - 图2

抓取的结果信息包含:

  • 标题
  • 链接
  • 观看次数
  • 发布时间
  • 点赞数
  • 踩数
  • 作者
  • 作者链接
  • 粉丝数

    结果示例图:

    40-1 youtube 视频搜索结果信息 - 图3

    模板:

    1. {"_id":"youtube-search-video","startUrl":
    2. ["https://www.youtube.com/results?search_query=iphone"],"selectors":[{"id":"info","type":"SelectorElementScroll","parentSelectors":
    3. ["_root"],"selector":"ytd-video-renderer:nth-of-type(-n+100)","multiple":true,"delay":"2000"},{"id":"title","type":"SelectorLink","parentSelectors":
    4. ["info"],"selector":"a.ytd-video-renderer","multiple":false,"delay":0},{"id":"watches","type":"SelectorText","parentSelectors":
    5. ["title"],"selector":"span.view-count","multiple":false,"regex":"","delay":0},{"id":"time","type":"SelectorText","parentSelectors":
    6. ["title"],"selector":"#date yt-formatted-string","multiple":false,"regex":"","delay":0},{"id":"likes","type":"SelectorText","parentSelectors":
    7. ["title"],"selector":"ytd-toggle-button-renderer.style-scope:nth-of-type(1) yt-formatted-string.style-scope","multiple":false,"regex":"","delay":0},{"id":"unlikes","type":"SelectorText","parentSelectors":
    8. ["title"],"selector":"ytd-toggle-button-renderer.style-scope:nth-of-type(2) yt-formatted-string.style-scope","multiple":false,"regex":"","delay":0},{"id":"author","type":"SelectorLink","parentSelectors":
    9. ["title"],"selector":".ytd-video-owner-renderer a","multiple":false,"delay":0},{"id":"followers","type":"SelectorText","parentSelectors":
    10. ["title"],"selector":"yt-formatted-string.ytd-video-owner-renderer","multiple":false,"regex":"","delay":0}]}

    模板套用步骤:

    (1)进入需要抓取的搜索结果页面,例如:https://www.youtube.com/results?search_query=iphone&pbjreload=10
    (2)导入模板
    (3)替换 Start URL为要抓取的网页链接
    (4)开始抓取

小提示:由于 YouTube 比较复杂,开始抓取后只能等抓取完成才能停止,中途停止则没有数据。如果时间紧,可以设置抓取 n 条数据,具体方法,修改模板中的控制参数 n 为想要的数字,如果抓取的数据总量小于 n,则会抓取完成后自动停止,具体操作方法见视频教程。