AAM-Sinc-MV1D: A Model of Short-term Speaker Recognition Based on the Raw Waveform

Authors：

Zhao, Yong ^[1] | Zhang, Liumei ^[2] | Liu, Ximeng ^[3] (Scholars：刘西蒙) | Hu, Yuanjiao ^[4]

Indexed by：

CPCI-S; EI; Scopus

Abstract：

Speaker recognition is a cutting-edge technology that focuses on identifying individuals based on their unique voice characteristics. To address the challenges associated with data collection, we have leveraged deep learning techniques to introduce two innovative and lightweight speaker recognition models: Sinc-MN1D and AAM-Sinc-MN1D. These models integrate the latest advancements in deep learning and speaker verification by utilizing a modified MobileNetV2 framework as the core module. To capture essential short-term speaker features effectively, we have meticulously replaced the initial convolutional layer of the backbone network with a positively modified convolutional layer inspired by the optimized SincNet. Furthermore, to enhance the extraction of critical frequency features, we have incorporated the AAM-softmax loss function, commonly used in face recognition, to enhance the models' capability in identifying challenging samples. Our method has been rigorously evaluated on the TIMIT dataset, demonstrating superior performance compared to the baseline approach.
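
Note: the abstract names the AAM-softmax (additive angular margin softmax) loss but does not state its form. As a rough illustration only, the sketch below computes the commonly used formulation for a single sample: a margin is added to the angle between the embedding and the true-class weight vector before a scaled softmax cross-entropy is applied. The function name, the `margin` and `scale` values, and the per-sample interface are assumptions made for this example and are not taken from the paper.

```python
import math

def aam_softmax_loss(cosines, target, margin=0.2, scale=30.0):
    """Additive angular margin softmax loss for one sample (illustrative sketch).

    cosines: cosine similarity between the L2-normalised embedding and each
             L2-normalised class weight vector, one value per class.
    target:  index of the true class.
    margin, scale: hypothetical hyperparameter values, not from the paper.
    """
    logits = []
    for j, c in enumerate(cosines):
        c = max(-1.0, min(1.0, c))        # keep acos in its valid domain
        if j == target:
            theta = math.acos(c)
            # Penalise the true class by enlarging its angle by `margin`.
            # This sketch does not special-case theta + margin > pi.
            c = math.cos(theta + margin)
        logits.append(scale * c)
    # Numerically stable cross-entropy: log-sum-exp minus the target logit.
    peak = max(logits)
    log_sum = peak + math.log(sum(math.exp(z - peak) for z in logits))
    return log_sum - logits[target]
```

With `margin=0.0` the function reduces to ordinary scaled softmax cross-entropy on the cosine scores; a positive margin lowers the true-class logit, so the same sample yields a larger loss and harder samples contribute more to training.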

Keyword：

component; MobileNet; raw waveform; short-duration; SincNet; speaker recognition

Affiliations：

[ 1 ] [Zhao, Yong]Xian Shiyou Univ, Sch Comp Sci, Xian, Peoples R China
[ 2 ] [Zhang, Liumei]Xian Shiyou Univ, Sch Comp Sci, Xian, Peoples R China
[ 3 ] [Hu, Yuanjiao]Xian Shiyou Univ, Sch Comp Sci, Xian, Peoples R China
[ 4 ] [Liu, Ximeng]Fuzhou Univ, Coll Comp & Data Sci, Fuzhou, Peoples R China

Reprint Address：

[Zhang, Liumei]Xian Shiyou Univ, Sch Comp Sci, Xian, Peoples R China

Email：

zy18735170655@163.com |
zhangliumei@xsyu.edu.cn |
snbnix@gmail.com |
sandra@xsyu.edu.cn

Version：

AAM-Sinc-MV1D:A Model of Short-term Speaker Recognition Based on the Raw Waveform
2024，Proceedings - 2024 International Conference on Networking and Network Applications, NaNA 2024

Source ：

2024 INTERNATIONAL CONFERENCE ON NETWORKING AND NETWORK APPLICATIONS, NANA 2024

Year： 2024

Page： 453-458

Affiliated Colleges：

College of Computer and Data Science / College of Software (data not explicitly attributed to a department within the college)
