Study｜Survey of Speaker Recognition System in NN Method


source link: https://zjcqn.github.io/posts/Speaker_recognition_system_survey/

Posted 2021-01-23T15:33:00+08:00 by 陈钱牛

Feature Extraction

MFCC, PLP, SDC, PNCC, GFCC, CQCC.
Bottleneck, Phoneme Posterior Probability.
LLD/openSMILE, speech attributes, acoustic-to-articulatory inversion, subglottal.
IMFCC, Modified Group Delay.

Variability Compensation

Backend Classification

Joint Bayesian
Cosine Similarity
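
As a rough illustration of the cosine similarity backend listed above, the sketch below scores an enrollment embedding against a test embedding and applies a decision threshold. The function names `cosine_score` and `verify`, the plain-list embedding format, and the default threshold of 0.5 are all assumptions made for this example; in practice the threshold would be tuned on held-out trials.

```python
import math


def cosine_score(enroll, test):
    """Cosine similarity between two speaker embeddings (lists of floats)."""
    if len(enroll) != len(test):
        raise ValueError("embeddings must have the same dimension")
    dot = sum(a * b for a, b in zip(enroll, test))
    norm_enroll = math.sqrt(sum(a * a for a in enroll))
    norm_test = math.sqrt(sum(b * b for b in test))
    if norm_enroll == 0.0 or norm_test == 0.0:
        raise ValueError("cosine similarity is undefined for a zero vector")
    return dot / (norm_enroll * norm_test)


def verify(enroll, test, threshold=0.5):
    """Accept the trial if the score reaches the (assumed) threshold."""
    return cosine_score(enroll, test) >= threshold
```

Because the score depends only on the angle between the two vectors, it is insensitive to embedding length, which is one reason it is often used as a simple backend that needs no training.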

Network Structure

Feed-forward DNN (FF-DNN): d-vector
RNN/LSTM
TDNN: x-vector
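
The structures above differ in how frame-level network outputs become one utterance-level embedding. A minimal sketch, assuming the frame-level outputs are already available as lists of floats: a d-vector is commonly formed by averaging frame-level activations, while an x-vector network pools the mean and standard deviation over frames before its segment-level layers. The names `average_pooling` and `stats_pooling` are hypothetical, and the population (divide-by-N) standard deviation is an assumption of this example.

```python
import math


def average_pooling(frames):
    """d-vector style pooling: average frame-level vectors over time."""
    if not frames:
        raise ValueError("need at least one frame")
    n = len(frames)
    dim = len(frames[0])
    return [sum(f[d] for f in frames) / n for d in range(dim)]


def stats_pooling(frames):
    """x-vector style pooling: concatenate per-dimension mean and std."""
    mean = average_pooling(frames)
    n = len(frames)
    # Population standard deviation (divide by n), an assumption of this sketch.
    std = [
        math.sqrt(sum((f[d] - mean[d]) ** 2 for f in frames) / n)
        for d in range(len(mean))
    ]
    return mean + std
```

For two frames `[1.0, 2.0]` and `[3.0, 2.0]`, `stats_pooling` yields a 4-dimensional vector: the two means followed by the two standard deviations. Either pooling maps a variable-length utterance to a fixed-size vector.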

Study

SRS

This post is licensed under CC BY 4.0 by the author.
