Posts by Collection

publications

Range-Based Equal Error Rate for Spoof Localization

Published in Proc. Interspeech 2023

Citation: Lin Zhang, Xin Wang, Erica Cooper, Nicholas Evans, Junichi Yamagishi. "Range-Based Equal Error Rate for Spoof Localization," in Proc. Interspeech 2023, pp. 3212-3216.

Unsupervised Adaptive Speaker Recognition by Coupling-Regularized Optimal Transport

Published in IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2024

Citation: Ruiteng Zhang, Jianguo Wei, Xugang Lu, Wenhuan Lu, Di Jin, Lin Zhang, and Junhai Xu. (2024). "Unsupervised Adaptive Speaker Recognition by Coupling-Regularized Optimal Transport." IEEE/ACM Transactions on Audio, Speech, and Language Processing. vol. 32, pp. 3603-3617.

Analysis of ABC Frontend Audio Systems for the NIST-SRE24

Published in Proc. Interspeech 2025

Citation: Sara Barahona, Anna Silnova, Ladislav Mošner, Junyi Peng, Oldřich Plchot, Johan Rohdin, Lin Zhang, et al. "Analysis of ABC Frontend Audio Systems for the NIST-SRE24," in Proc. Interspeech 2025, pp. 5763-5767.

CodecFake+: A Large-Scale Neural Audio Codec-Based Deepfake Speech Dataset

Submitted to IEEE Transactions on Audio, Speech and Language Processing, 2025

Citation: Jiawei Du, Xuanjun Chen, Haibin Wu, Lin Zhang, I-Ming Lin, I-Hsiang Chiu, Wenze Ren, Yuan Tseng, Yu Tsao, Jyh-Shing Roger Jang, Hung-yi Lee. (2025) "CodecFake+: A Large-Scale Neural Audio Codec-Based Deepfake Speech Dataset" (submitted to IEEE Transactions on Audio, Speech and Language Processing)

Multi-Sinkhorn Teacher Knowledge Aggregation Framework for Adaptive Audio Anti-Spoofing

Published in IEEE Transactions on Audio, Speech and Language Processing, 2025

Citation: Ruiteng Zhang, Jianguo Wei, Xugang Lu, Lin Zhang, Di Jin, Wenhuan Lu, Junhai Xu. (2025) "Multi-Sinkhorn Teacher Knowledge Aggregation Framework for Adaptive Audio Anti-Spoofing" in IEEE Transactions on Audio, Speech and Language Processing, vol. 33, pp. 3850-3865.

SHDA: Sinkhorn Domain Attention for Cross-Domain Audio Anti-Spoofing

Published in IEEE Transactions on Information Forensics and Security, 2025

Citation: Ruiteng Zhang, Jianguo Wei, Xugang Lu, Lin Zhang, Di Jin, Junhai Xu, Wenhuan Lu. (2025) "SHDA: Sinkhorn Domain Attention for Cross-Domain Audio Anti-Spoofing" in IEEE Transactions on Information Forensics and Security, vol. 20, pp. 6474-6489, doi: 10.1109/TIFS.2025.3576576

ShiftySpeech: A Large-Scale Synthetic Speech Dataset with Distribution Shifts

Submitted to NeurIPS 2025 Datasets and Benchmarks Track

Citation: Ashi Garg, Zexin Cai, Lin Zhang, Henry Li Xinyuan, Leibny Paola Garcia Perera, Kevin Duh, Sanjeev Khudanpur, Matthew Wiesner, Nicholas Andrews. "ShiftySpeech: A Large-Scale Synthetic Speech Dataset with Distribution Shifts" (submitted to NeurIPS 2025 Datasets and Benchmarks Track)

WeDefense: A Toolkit to Defend Against Fake Audio

Workshop / Toolkit (in preparation)

Citation: Lin Zhang, Xin Wang, Johan Rohdin, Junyi Peng, Tianchi Liu, You Zhang, Hieu-Thi Luong, Shuai Wang, Anna Silnova, Chengdong Liang, Nicholas Evans. "WeDefense: A Toolkit to Defend Against Fake Audio" (in preparation)

talks

What’s Happening on Partial Spoof?

Published:

  • 2025-03-24: Invited talk at UEF, Finland, in person
  • 2025-02-25: Invited talk at EURECOM, France, in person
  • 2025-04-18: Speech Technologies reading group, JHU, USA, in person

An Overview of Partially Fake Speech

Published:

  • 2025-11-20, IEEE SPS Webinar, online
    • Title: “Minor Manipulations, Major Threat: An Overview of Partially Fake Speech”
    • Slides
  • 2025-11-10, CLSP Webinar, JHU, USA
    • Title: “Minor Manipulations, Major Threat: An Overview of Partially Fake Speech”
  • 2025-08-15, Invited talk, Reality Defender, online
    • Title: “Small Changes, Big Threat: A Story of Partial Spoof”