ASVspoof logo 

 

Fake speech examples from ASVspoof 2019 LA

 

Attack A07 - Type TTS

Human perception score 0.32

Machine assessment score 0.99

Attack A10 - Type TTS

Human perception score 0.04

Machine assessment score 0.99

Attack A08 - Type TTS

Human perception score 0.59

Machine assessment score 0.99

Attack A10 - Type TTS

Human perception score 0.04

Machine assessment score 0.99

Attack A18 - Type VC

Human perception score 0.54

Machine assessment score 0.99

Attack A19 - Type VC

Human perception score 0.48

Machine assessment score 0.99

Attack A18 - Type VC

Human perception score 0.54

Machine assessment score 0.99

Attack A19 - Type VC

Human perception score 0.48

Machine assessment score 0.99

 

Significance of scores

     human speech                               machine-generated speech

0 . . . . . . . . . . 0.5 . . . . . . . . . . 1

 

References

ASVspoof2019 Article

Xin Wang, Junichi Yamagishi, Massimiliano Todisco, Héctor Delgado, Andreas Nautsch, Nicholas Evans, Md Sahidullah, Ville Vestman, Tomi Kinnunen, Kong Aik Lee, Lauri Juvela, Paavo Alku, Yu-Huai Peng, Hsin-Te Hwang, Yu Tsao, Hsin-Min Wang, Sébastien Le Maguer, Markus Becker, Fergus Henderson, Rob Clark, Yu Zhang, Quan Wang, Ye Jia, Kai Onuma, Koji Mushika, Takashi Kaneda, Yuan Jiang, Li-Juan Liu, Yi-Chiao Wu, Wen-Chin Huang, Tomoki Toda, Kou Tanaka, Hirokazu Kameoka, Ingmar Steiner, Driss Matrouf, Jean-François Bonastre, Avashna Govender, Srikanth Ronanki, Jing-Xuan Zhang, Zhen-Hua Ling, "ASVspoof 2019: A large-scale public database of synthesized, converted and replayed speech," Computer Speech & Language, Volume 64, 2020, ISSN 0885-2308. 

https://www.sciencedirect.com/science/article/pii/S0885230820300474.

ASVspoof2019 LA database

https://datashare.ed.ac.uk/handle/10283/3336

Human perceptual assessment

https://zenodo.org/record/4460906#.Y5yh9ezMLzf

Machine assessment

Jee-weon Jung, Hee-Soo Heo, Hemlata Tak, Hye-jin Shim, Joon Son Chung, Bong-Jin Lee, Ha-Jin Yu, Nicholas Evans, "AASIST: Audio Anti-Spoofing Using Integrated Spectro-Temporal Graph Attention Networks," ICASSP 2022: 6367-6371.

https://arxiv.org/abs/2110.01200

Hemlata Tak, Massimiliano Todisco, Xin Wang, Jee-weon Jung, Junichi Yamagishi, Nicholas Evans "Automatic Speaker Verification Spoofing and Deepfake Detection Using Wav2vec 2.0 and Data Augmentation," Odyssey 2022: 112-119.

https://arxiv.org/abs/2202.12233