Resources

DATASET RELEASE

  • HI-MIA                        Xiaoyi Qin, Hui Bu, Ming Li, “HI-MIA: a far-field text-dependent speaker verification database and the baselines”, Proc. of ICASSP 2020, 7609-7613.  http://openslr.org/85/
  • FFSVC22                   Xiaoyi Qin, Ming Li, Hui Bu, Shrikanth Narayanan, Haizhou Li, “The 2022 Far-field Speaker Verification Challenge: Exploring domain mismatch and semi-supervised learning under the far-field scenario“, Proc. of FFSVC workshop. 
  • FFSVC20                     Xiaoyi Qin, Ming Li, Hui Bu, Wei Rao, Rohan Kumar Das, Shrikanth Narayanan, Haizhou Li, “The INTERSPEECH 2020 Far-Field Speaker Verification Challenge”, Proc. of INTERSPEECH 2020, 3456-3460. http://2020.ffsvc.org/DataDownload
  • AISHELL3                   Yao Shi, Hui Bu, Xin Xu, Shaoji Zhang, Ming Li, “AISHELL-3: A Multi-Speaker Mandarin TTS Corpus”, Proc. of INTERSPEECH 2021, 2756-2760. http://www.aishelltech.com/aishell_3
  • DKU-JNU-EMA           Zexin Cai, Xiaoyi Qin, Danwei Cai, Ming Li, Xinzhong Liu, “The DKU-JNU-EMA Electromagnetic Articulography Database on Mandarin and Chinese Dialects with Tandem Feature based Acoustic-to-Articulatory Inversion”, Proc. of ISCSLP 2018.  https://catalog.ldc.upenn.edu/LDC2019S14
  • RWF-2000          Ming Cheng, Kunjing Cai, Ming Li, “RWF-2000: An Open Large Scale Video Database for Violence Detection”, Proc. of ICPR 2020, 4183-4190. https://github.com/mchengny/RWF2000-Video-Database-for-Violence-Detection
  • Cross Age Speaker Verification Trials: Xiaoyi Qin, Na Li, Chao Weng, Dan Su, Ming Li, “Cross-Age Speaker Verification: Learning Age-Invariant Speaker Embeddings”, Interspeech 2022 https://github.com/qinxiaoyi/Cross-Age_Speaker_Verification
  • Slingua                     Xingming Wang, Hao Wu, Chen Ding, Chuanzeng Huang, Ming Li, “Exploring Universal Singing Speech Language Identification Using Self-Supervised Learning Based Front-End Features”, ICASSP 2023     https://github.com/Doctor-Do/Slingua
  • VoxBlink                  Yuke Lin, Xiaoyi Qin, Guoqing Zhao, Ming Cheng, Ning Jiang, Haiying Wu, Ming Li, “Voxblink: A Large Scale Speaker Verification Dataset on Camera”, ICASSP 2024.   VoxBlink: A Large Scale Speaker Verification Dataset on Camera
  • SlideSpeech            HaoxuWang, Fan Yu, Xian Shi, Yuezhang Wang, Shiliang Zhang, Ming Li, “Slidespeech: A Large Scale Slide-Enriched Audio-Visual Corpus”, ICASSP 2024. SlideSpeech-Corpus
  • KunquDB               Huali Zhou, Yuke Lin, Dong Liu, Ming Li, “KunquDB: An Attempt for Speaker Verification in the Chinese Opera Scenario”, submitted to ICPR 2024. GitHub – hualizhou167/KunquDB: the official site for KunquDB dataset

Co-Organized Challenges

SLT 2024: Source Speaker Tracing Challenge (SSTC2024) https://sstc-challenge.github.io/

SLT 2024: Stuttering Speech Challenge StutteringSpeech Challenge

SLT 2024: Low-Resource Dysarthria Wake-Up Word Spotting Challenge (LRDWWS Challenge) http://lrdwws.org/

INTERSPEECH 2022: Far Field Speaker Verification Challenge (FFSVC 22) https://ffsvc.github.io

ISCSLP 2021: Personalized Voice Trigger Challenge (PVTC) https://www.pvtc.org.cn/

INTERSPEECH 2020: Far Field Speaker Verification Challenge (FFSVC 20) http://2020.ffsvc.org/