How to Extract Key Frames from A Video with ffmpeg

2021-12-25

410 words 2 mins read 39 times read

In order to get enough information from videos, we often need to extract video frames. However, we do not want to extract every frame from a video due to:

information redundancy in successive video frames.
huge storage space needed by these video frames.

In this post, I want to share how to extract some key frames from video to reduce the storage, but still capture the information in the video.

Extract I-frame using ffmpeg

In video compression, they use the so called IPB frames to reduce the storage space:

I frames (Intra picture): a complete picture
P frames (predictive picture): p frames stores difference between current frame and previous frame.
B frames (Bidirectionally predicted picture): b-frame stores different between current frame and previous frame and later frame.

So we can extract I-frames only to get a fair amount of information from the video and reduce frame number.

The following command works:

ffmpeg -skip_frame nokey -i test.mp4 -vsync vfr -frame_pts true out-%02d.jpeg

-vsync vfr: discard the unused frames
-frame_pts true: use the frame index for image names, otherwise, the index starts from 1 and increments 1 each time

We can also use the filter syntax to extract keyframe:

ffmpeg -i test.mp4 -vf "select='eq(pict_type,I)'" -vsync vfr out-%02d.jpeg

The produced results are the same.

Extract scene-changing frames

If we only want to retain enough info from the video, extracting key frames may not be enough. The extracted key frames may still exhibit too much information redundancy. For example, if you have a slow-changing video, the difference between a frame and its subsequent frames will be negligible. To further reduce the number of images generated, we can also use scene to select frames which are likely to be a scene-changing frames.

It works like this:

ffmpeg -i input.mp4 -vf "select='gt(scene,0.4)'" -vsync vfr frame-%2d.jpg

The value for scene is between 0 and 1, which measures the pixel level difference between current frame and previous frame. We can tweak this value to generate different number of output images. Intuitively, the smaller the value, the larger number of images you will generate. Typically, a value of 0.3 or 0.4 may be enough.

In order to extract suitable number of frames from the video, for videos with fast-changing frames, you should set this value high, and for videos with mostly still frames, you should set this value low (maybe 0.1 or even less depending on the video content).

References

Author jdhao

LastMod 2021-12-28

License CC BY-NC-ND 4.0

Reward

Download Files to Local Storage in iPad

How to Extract Key Frames from A Video with ffmpeg

How to Extract Key Frames from A Video with ffmpeg

Extract I-frame using ffmpeg

Extract scene-changing frames

References

Recommend

The Quest to Trap Carbon in Stone—and Beat Climate Change

大家的日本語L23筆記

Download Files to Local Storage in iPad

3 Techniques to Improve Your Next Angular Project

A Clean Industrial Revolution Is the Only Way to Hit Net Zero

Hypothesis Mindset in Product Development - The Customer's Voice

Hypothesis Mindset in Product Development: Features That Get Used

Hypothesis Mindset in Product Development Centers the Customer

What 'The Matrix' Got Wrong About Cities of the Future | WIRED

An Atomic Accelerator Recruiting Retrospective for 2021

About Joyk