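Object Detection in FFmpeg Based on Deep Learning Models

The FFmpeg commit "lavfi: add filter dnn_detect for object detection" shows that FFmpeg now provides DNN-based object detection in the form of a filter.

The dnn_detect filter

In FFmpeg, DNN-based object detection is provided by the dnn_detect filter (vf_dnn_detect.c). The feature's author describes its status in the article "Object Detection: The First Deep-Learning-Based Video Analysis Feature in FFmpeg":

Object detection currently supports only the OpenVINO backend. More features will be added soon, such as object detection based on TensorFlow models, object recognition with the OpenVINO backend, and visualization of detection and recognition results.

According to the demo in that article, the current version outputs the detection results as log messages through the showinfo filter.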

root@9d26c3a57bc7:/workspace# ffmpeg -i cici.jpg -vf dnn_detect=dnn_backend=openvino:model=face-detection-adas-0001.xml:input=data:output=detection_out:confidence=0.6:labels=face-detection-adas-0001.label,showinfo -f null -
...
[Parsed_showinfo_1 @ 0x561cf20c1f40]   side data - detection bounding boxes:
[Parsed_showinfo_1 @ 0x561cf20c1f40] source: face-detection-adas-0001.xml
[Parsed_showinfo_1 @ 0x561cf20c1f40] index: 0,  region: (1005, 813) -> (1086, 905), label: face, confidence: 10000/10000.
[Parsed_showinfo_1 @ 0x561cf20c1f40] index: 1,  region: (888, 839) -> (967, 926), label: face, confidence: 6917/10000.
...
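
Log lines like these are easier to work with once they are turned into structured records. The following is a minimal sketch, not part of FFmpeg: the names `Detection` and `parse_detections` are hypothetical, the pattern is modeled only on the two lines quoted above, and it assumes the two points in `region` are the top-left and bottom-right corners of the box. Other FFmpeg versions may format these lines differently.

```python
import re
from fractions import Fraction
from typing import List, NamedTuple


class Detection(NamedTuple):
    index: int
    left: int
    top: int
    right: int
    bottom: int
    label: str
    confidence: Fraction  # showinfo prints it as a ratio, e.g. 6917/10000


# Modeled on the showinfo output quoted above (an assumption about the format).
_BBOX_RE = re.compile(
    r"index:\s*(\d+),\s*region:\s*\((\d+),\s*(\d+)\)\s*->\s*\((\d+),\s*(\d+)\),"
    r"\s*label:\s*(.+?),\s*confidence:\s*(\d+)/(\d+)"
)


def parse_detections(log_text: str) -> List[Detection]:
    """Collect every bounding-box line found in the given log text."""
    detections = []
    for line in log_text.splitlines():
        match = _BBOX_RE.search(line)
        if match is None:
            continue  # not a bounding-box line
        index, left, top, right, bottom = (int(match.group(i)) for i in range(1, 6))
        confidence = Fraction(int(match.group(7)), int(match.group(8)))
        detections.append(
            Detection(index, left, top, right, bottom, match.group(6), confidence)
        )
    return detections
```

Since ffmpeg writes its log to stderr, the text could be captured with something like `ffmpeg ... 2> detect.log` and then passed to `parse_detections`.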

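Detection results in log form are hard to analyze at a glance, so I made a small change to the filter that draws a box around each detected object, which makes it easier to evaluate how well dnn_detect performs.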
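
The patch itself is not reproduced here. As an alternative that needs no changes to FFmpeg, boxes that have already been read from the log can be drawn with the stock drawbox filter. The sketch below only builds the filter string; `drawbox_chain` is a hypothetical helper, and it assumes each box is given as (left, top, right, bottom) in pixels.

```python
from typing import Iterable, Tuple

Box = Tuple[int, int, int, int]  # (left, top, right, bottom) in pixels


def drawbox_chain(boxes: Iterable[Box], color: str = "red", thickness: int = 3) -> str:
    """Build a comma-separated chain of drawbox filters, one per box."""
    parts = []
    for left, top, right, bottom in boxes:
        # drawbox takes the top-left corner plus a width and a height.
        parts.append(
            f"drawbox=x={left}:y={top}:w={right - left}:h={bottom - top}"
            f":color={color}:t={thickness}"
        )
    return ",".join(parts)


if __name__ == "__main__":
    # The first box from the log output quoted earlier.
    print(drawbox_chain([(1005, 813, 1086, 905)]))
```

The resulting string can be passed to `-vf` in a second ffmpeg run on the same input. Unlike the patched filter, this is a two-pass approach and only suits still images or a fixed set of boxes.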

Installing the dnn_detect filter

Installing libtensorflow

Choose a suitable libtensorflow build from https://storage.googleapis.com/tensorflow/ and download it. Since I ran this experiment on macOS, I chose libtensorflow/libtensorflow-cpu-darwin-x86_64-2.4.1.tar.gz.

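cd workspace
wget https://storage.googleapis.com/tensorflow/libtensorflow/libtensorflow-cpu-darwin-x86_64-2.4.1.tar.gz --no-check-certificate
# The archive unpacks into ./include and ./lib, so extract it into tensorflow/
# to match the /workspace/tensorflow paths used in the build steps.
mkdir -p tensorflow
tar xzvf libtensorflow-cpu-darwin-x86_64-2.4.1.tar.gz -C tensorflow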

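Installing openvino_toolkit

Go to the openvino_toolkit download page, select the version that matches your system, then download and install it. Installing openvino_toolkit on macOS is straightforward: download the dmg file and click through the installer with the default options. By default, the libraries are installed under /opt/intel/openvino_2021.

Rebuilding FFmpeg

Check out the master branch of FFmpeg.
Modify the dnn_detect code according to vf_dnn_detect.7a6ea6ce2a.patch.

Set the required environment variables as follows: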

export LD_LIBRARY_PATH=$LD_LIBRARY_PATH:\
/opt/intel/openvino_2021/inference_engine/lib/intel64:\
/opt/intel/openvino_2021/inference_engine/external/tbb/lib:\
/opt/intel/openvino_2021/deployment_tools/ngraph/lib:\
/workspace/tensorflow/lib:\
/workspace/ffmpeg/outputs/lib

export DYLD_LIBRARY_PATH=$DYLD_LIBRARY_PATH:\
/workspace/ffmpeg/outputs/lib

Build FFmpeg:

./configure --prefix=./outputs \
... \
--enable-libtensorflow \
--enable-libopenvino \
--extra-cflags="-I/workspace/tensorflow/include -I/opt/intel/openvino_2021/inference_engine/include" \
--extra-ldflags="-L/workspace/tensorflow/lib -L/opt/intel/openvino_2021/inference_engine/lib/intel64"

make && make install
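
Before moving on, it can be worth confirming that the freshly built binary actually includes dnn_detect. This is a minimal sketch with hypothetical helper names. It assumes that `ffmpeg -filters` prints one filter per line with the filter name in the second column, and that the binary ended up at outputs/bin/ffmpeg (inferred from `--prefix=./outputs`; adjust the path if yours differs).

```python
import subprocess


def filter_listed(filters_output: str, name: str) -> bool:
    """Return True if `name` appears as a filter name in `ffmpeg -filters` output."""
    for line in filters_output.splitlines():
        fields = line.split()
        # Assumed column layout: flags, filter name, pads (e.g. V->V), description.
        if len(fields) >= 2 and fields[1] == name:
            return True
    return False


def ffmpeg_has_filter(ffmpeg_path: str, name: str) -> bool:
    """Run `<ffmpeg_path> -hide_banner -filters` and look for the named filter."""
    result = subprocess.run(
        [ffmpeg_path, "-hide_banner", "-filters"],
        capture_output=True,
        text=True,
        check=True,
    )
    return filter_listed(result.stdout, name)


if __name__ == "__main__":
    # Path is an assumption based on --prefix=./outputs.
    print(ffmpeg_has_filter("./outputs/bin/ffmpeg", "dnn_detect"))
```

If this prints False, the usual suspects are a missing `--enable-libopenvino` flag or include and library paths that configure could not resolve.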

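Download the model and related files listed in the demo from "Object Detection: The First Deep-Learning-Based Video Analysis Feature in FFmpeg", then play the video with ffplay.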

cd outputs && ./bin/ffplay -i test.mp4 \
-vf "dnn_detect=dnn_backend=openvino:model=model/face-detection-adas-0001.xml:input=data:output=detection_out:confidence=0.6:labels=model/face-detection-adas-0001.label"

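dnn_detect filter parameters

According to vf_dnn_detect.c, the main parameters of the dnn_detect filter are:

dnn_backend: selects the DNN backend; currently only openvino is supported
confidence: the confidence threshold for detection; set it according to your needs
labels: path to the label file that corresponds to the detection model
model: path to the model file used for detection
input: the model's input
output: the model's output
backend_configs, options: both of these set backend-specific configuration
async: whether to use the asynchronous DNN interface; asynchronous by default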
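
When the option list gets long, assembling the filter string in code is less error-prone than editing it by hand. The sketch below uses a hypothetical helper, `build_dnn_detect`, that joins options in the `key=value:key=value` form used by the commands in this article. It does not implement FFmpeg's escaping rules, so it simply rejects values containing characters that would need escaping.

```python
from typing import Mapping


def build_dnn_detect(options: Mapping[str, object]) -> str:
    """Join options into a dnn_detect filter string, keeping insertion order."""
    pairs = []
    for key, value in options.items():
        text = str(value)
        # ':' separates options and ',' separates filters; escaping is not handled here.
        if any(ch in text for ch in ":,'"):
            raise ValueError(f"value for {key!r} needs escaping: {text!r}")
        pairs.append(f"{key}={text}")
    return "dnn_detect=" + ":".join(pairs)


if __name__ == "__main__":
    print(build_dnn_detect({
        "dnn_backend": "openvino",
        "model": "model/face-detection-adas-0001.xml",
        "input": "data",
        "output": "detection_out",
        "confidence": 0.6,
        "labels": "model/face-detection-adas-0001.label",
    }))
```

The printed string can be passed directly as the argument to `-vf`.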

Set these parameters as your use case requires. For example, setting async=0 switches to the synchronous DNN interface:

./bin/ffplay -i test.mp4 \
    -vf "dnn_detect=dnn_backend=openvino:model=model/face-detection-adas-0001.xml:input=data:output=detection_out:confidence=0.6:labels=model/face-detection-adas-0001.label:async=0"
