• Complex
  • Title
  • Keyword
  • Abstract
  • Scholars
  • Journal
  • ISSN
  • Conference
成果搜索

author:

Zhao, Yong (Zhao, Yong.) [1] | Zhang, Liumei (Zhang, Liumei.) [2] | Liu, Ximeng (Liu, Ximeng.) [3] (Scholars:刘西蒙) | Hu, Yuanjiao (Hu, Yuanjiao.) [4]

Indexed by:

CPCI-S EI Scopus

Abstract:

Speaker recognition is a cutting-edge technology that focuses on identifying individuals based on their unique voice characteristics. To address the challenges associated with data collection, we have leveraged deep learning techniques to introduce two innovative and lightweight speaker recognition models: Sinc-MN1D and AAM-Sinc-MN1D. These models integrate the latest advancements in deep learning and speaker verification by utilizing a modified MobileNetV2 framework as the core module.To capture essential short-term speaker features effectively, we have meticulously replaced the initial convolutional layer of the backbone network with a positively modified convolutional layer inspired by the optimized SincNet. Furthermore, to enhance the extraction of critical frequency features, we have incorporated the AAM-softmax loss function, commonly used in face recognition, to enhance the models capability in identifying challenging samples. Our method has been rigorously evaluated on the TIMIT dataset, demonstrating superior performance compared to the baseline approach.

Keyword:

component MobileNet raw waveform short-duration SincNet speaker recognition

Community:

  • [ 1 ] [Zhao, Yong]Xian Shiyou Univ, Sch Comp Sci, Xian, Peoples R China
  • [ 2 ] [Zhang, Liumei]Xian Shiyou Univ, Sch Comp Sci, Xian, Peoples R China
  • [ 3 ] [Hu, Yuanjiao]Xian Shiyou Univ, Sch Comp Sci, Xian, Peoples R China
  • [ 4 ] [Liu, Ximeng]Fuzhou Univ, Coll Comp & Data Sci, Fuzhou, Peoples R China

Reprint 's Address:

  • [Zhang, Liumei]Xian Shiyou Univ, Sch Comp Sci, Xian, Peoples R China

Show more details

Version:

Related Keywords:

Related Article:

Source :

2024 INTERNATIONAL CONFERENCE ON NETWORKING AND NETWORK APPLICATIONS, NANA 2024

Year: 2024

Page: 453-458

Cited Count:

WoS CC Cited Count:

SCOPUS Cited Count:

ESI Highly Cited Papers on the List: 0 Unfold All

WanFang Cited Count:

Chinese Cited Count:

30 Days PV: 0

Online/Total:35/10071019
Address:FZU Library(No.2 Xuyuan Road, Fuzhou, Fujian, PRC Post Code:350116) Contact Us:0591-22865326
Copyright:FZU Library Technical Support:Beijing Aegean Software Co., Ltd. 闽ICP备05005463号-1