The scraped results include:
- Title
- Link
- View count
- Publish date
- Likes
- Dislikes
- Author
- Author link
- Subscriber count
Example results (screenshot):
Template:
{"_id":"youtube-search-video","startUrl":
["https://www.youtube.com/results?search_query=iphone"],"selectors":[{"id":"info","type":"SelectorElementScroll","parentSelectors":
["_root"],"selector":"ytd-video-renderer:nth-of-type(-n+100)","multiple":true,"delay":"2000"},{"id":"title","type":"SelectorLink","parentSelectors":
["info"],"selector":"a.ytd-video-renderer","multiple":false,"delay":0},{"id":"watches","type":"SelectorText","parentSelectors":
["title"],"selector":"span.view-count","multiple":false,"regex":"","delay":0},{"id":"time","type":"SelectorText","parentSelectors":
["title"],"selector":"#date yt-formatted-string","multiple":false,"regex":"","delay":0},{"id":"likes","type":"SelectorText","parentSelectors":
["title"],"selector":"ytd-toggle-button-renderer.style-scope:nth-of-type(1) yt-formatted-string.style-scope","multiple":false,"regex":"","delay":0},{"id":"unlikes","type":"SelectorText","parentSelectors":
["title"],"selector":"ytd-toggle-button-renderer.style-scope:nth-of-type(2) yt-formatted-string.style-scope","multiple":false,"regex":"","delay":0},{"id":"author","type":"SelectorLink","parentSelectors":
["title"],"selector":".ytd-video-owner-renderer a","multiple":false,"delay":0},{"id":"followers","type":"SelectorText","parentSelectors":
["title"],"selector":"yt-formatted-string.ytd-video-owner-renderer","multiple":false,"regex":"","delay":0}]}
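Steps for applying the template:
(1) Open the search results page you want to scrape, for example: https://www.youtube.com/results?search_query=iphone&pbjreload=10
(2) Import the template
(3) Replace the Start URL with the link of the page you want to scrape
(4) Start scraping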
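Step (3) can also be done before importing, by editing the template JSON itself. The following is a minimal sketch, assuming the template has been loaded into a Python dict; `set_search_query` and `SEARCH_BASE` are hypothetical names chosen for illustration, and the selectors are left out of the example for brevity.

```python
import json
from urllib.parse import urlencode

# Base address of the search results page used in the template's startUrl.
SEARCH_BASE = "https://www.youtube.com/results"


def set_search_query(sitemap: dict, query: str) -> dict:
    """Return a copy of the template whose startUrl searches for `query`."""
    updated = dict(sitemap)  # shallow copy; only the top-level key is replaced
    query_string = urlencode({"search_query": query})  # escapes spaces etc.
    updated["startUrl"] = [SEARCH_BASE + "?" + query_string]
    return updated


# Trimmed-down stand-in for the template above (selectors omitted).
template = {
    "_id": "youtube-search-video",
    "startUrl": ["https://www.youtube.com/results?search_query=iphone"],
    "selectors": [],
}

updated = set_search_query(template, "android phone")
print(json.dumps(updated))  # paste this JSON into the import dialog
```

Returning a copy keeps the original template reusable for other searches.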
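Tip: Because YouTube pages are complex, a scrape that has started can only be stopped once it finishes; if you stop it partway through, no data is saved. If you are short on time, you can limit the scrape to n results by changing the control parameter n in the template (the number in `nth-of-type(-n+100)`, which is 100 in this template) to the number you want. If fewer than n results are available, the scrape stops automatically once all of them have been collected. See the video tutorial for the detailed steps.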
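The limit can also be changed programmatically. This is a minimal sketch, assuming the limit is the number inside the `:nth-of-type(-n+100)` part of the scroll selector, as in the template above; `set_result_limit` and `LIMIT_PATTERN` are hypothetical names, and the example uses only the first selector of the template.

```python
import copy
import re

# Matches the ":nth-of-type(-n+<number>)" fragment of a CSS selector.
LIMIT_PATTERN = re.compile(r":nth-of-type\(-n\+\d+\)")


def set_result_limit(sitemap: dict, limit: int) -> dict:
    """Return a copy of the template that keeps at most `limit` results."""
    if limit < 1:
        raise ValueError("limit must be a positive integer")
    updated = copy.deepcopy(sitemap)  # leave the caller's template untouched
    for selector in updated["selectors"]:
        # Only the scrolling element selector carries the limit.
        if selector["type"] == "SelectorElementScroll":
            selector["selector"] = LIMIT_PATTERN.sub(
                f":nth-of-type(-n+{limit})", selector["selector"]
            )
    return updated


# First selector of the template above; the others are omitted for brevity.
template = {
    "_id": "youtube-search-video",
    "startUrl": ["https://www.youtube.com/results?search_query=iphone"],
    "selectors": [
        {
            "id": "info",
            "type": "SelectorElementScroll",
            "parentSelectors": ["_root"],
            "selector": "ytd-video-renderer:nth-of-type(-n+100)",
            "multiple": True,
            "delay": "2000",
        }
    ],
}

limited = set_result_limit(template, 20)
print(limited["selectors"][0]["selector"])
```

A smaller limit shortens the run, which matters here because a scrape that is interrupted yields no data.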