26

网络爬虫之Splash负载均衡配置-Python学习者

 5 years ago
source link: https://blog.51cto.com/14246112/2373721
Go to the source link to view the article. You can view the picture content, updated content and better typesetting reading experience. If the link is broken, please click the button below to view the snapshot at that time.
neoserver,ios ssh client
如果我们用 Splash 来做 JavaScript 动态渲染的页面的抓取的话,如果爬取的量非常大,任务非常多,如果我们用一个 Splash 服务来处理的话未免压力太大了,所以我们可以考虑搭建一个负载均衡器来把压力分散到各个服务器上,这样相当于多台机器多个服务共同参与任务的处理,可以减小单个 Splash 服务的压力。1. 配置Splash服务要搭建 Splash 负载均衡首先我们需要有多个 Sp

About Joyk


Aggregate valuable and interesting links.
Joyk means Joy of geeK