2

[1805.07339] Scanner: Efficient Video Analysis at Scale

 2 years ago
source link: https://arxiv.org/abs/1805.07339
Go to the source link to view the article. You can view the picture content, updated content and better typesetting reading experience. If the link is broken, please click the button below to view the snapshot at that time.
neoserver,ios ssh client

Computer Science > Computer Vision and Pattern Recognition

[Submitted on 18 May 2018]

Scanner: Efficient Video Analysis at Scale

Download PDF

A growing number of visual computing applications depend on the analysis of large video collections. The challenge is that scaling applications to operate on these datasets requires efficient systems for pixel data access and parallel processing across large numbers of machines. Few programmers have the capability to operate efficiently at these scales, limiting the field's ability to explore new applications that leverage big video data. In response, we have created Scanner, a system for productive and efficient video analysis at scale. Scanner organizes video collections as tables in a data store optimized for sampling frames from compressed video, and executes pixel processing computations, expressed as dataflow graphs, on these frames. Scanner schedules video analysis applications expressed using these abstractions onto heterogeneous throughput computing hardware, such as multi-core CPUs, GPUs, and media processing ASICs, for high-throughput pixel processing. We demonstrate the productivity of Scanner by authoring a variety of video processing applications including the synthesis of stereo VR video streams from multi-camera rigs, markerless 3D human pose reconstruction from video, and data-mining big video datasets such as hundreds of feature-length films or over 70,000 hours of TV news. These applications achieve near-expert performance on a single machine and scale efficiently to hundreds of machines, enabling formerly long-running big video data analysis tasks to be carried out in minutes to hours.

Comments: 14 pages, 14 figuers Subjects: Computer Vision and Pattern Recognition (cs.CV); Distributed, Parallel, and Cluster Computing (cs.DC); Graphics (cs.GR) DOI: 10.1145/3197517.3201394 Cite as: arXiv:1805.07339 [cs.CV]   (or arXiv:1805.07339v1 [cs.CV] for this version)

Submission history

From: Alex Poms [view email]
[v1] Fri, 18 May 2018 17:43:55 UTC (4,799 KB)

About Joyk


Aggregate valuable and interesting links.
Joyk means Joy of geeK