Sitemap

A list of all the posts and pages found on the site. For you robots out there, there is an XML version available for digesting as well.

Pages

Posts

Future Blog Post

less than 1 minute read

Published:

This post will show up by default. To disable scheduling of future posts, edit config.yml and set future: false.

Blog Post number 4

less than 1 minute read

Published:

This is a sample blog post. Lorem ipsum I can’t remember the rest of lorem ipsum and don’t have an internet connection right now. Testing testing testing this blog post. Blog posts are cool.

Blog Post number 3

less than 1 minute read

Published:

This is a sample blog post. Lorem ipsum I can’t remember the rest of lorem ipsum and don’t have an internet connection right now. Testing testing testing this blog post. Blog posts are cool.

Blog Post number 2

less than 1 minute read

Published:

This is a sample blog post. Lorem ipsum I can’t remember the rest of lorem ipsum and don’t have an internet connection right now. Testing testing testing this blog post. Blog posts are cool.

Blog Post number 1

less than 1 minute read

Published:

This is a sample blog post. Lorem ipsum I can’t remember the rest of lorem ipsum and don’t have an internet connection right now. Testing testing testing this blog post. Blog posts are cool.

publications

Range-Based Equal Error Rate for Spoof Localization

Published in Proc. Interspeech 2023, 1900

Citation: Lin Zhang, Xin Wang, Erica Cooper, Nicholas Evans, Junichi Yamagishi. "Range-Based Equal Error Rate for Spoof Localization," in Proc. Interspeech 2023, pp. 3212-3216.

Unsupervised Adaptive Speaker Recognition by Coupling-Regularized Optimal Transport

Published in IEEE/ACM Transactions on Audio, Speech, and Language Processing, 1900

Citation: Ruiteng Zhang, Jianguo Wei, Xugang Lu, Wenhuan Lu, Di Jin, Lin Zhang, and Junhai Xu. (2024). "Unsupervised Adaptive Speaker Recognition by Coupling-Regularized Optimal Transport." IEEE/ACM Transactions on Audio, Speech, and Language Processing. vol. 32, pp. 3603-3617.

Analysis of ABC Frontend Audio Systems for the NIST-SRE24

Published in Proc. Interspeech 2025, 1900

Citation: Sara Barahona, Anna Silnova, Ladislav Mošner, Junyi Peng, Oldřich Plchot, Johan Rohdin, Lin Zhang, et. al. "Analysis of ABC Frontend Audio Systems for the NIST-SRE24" in Proc. Interspeech 2025, 5763-5767.

CodecFake+: A Large-Scale Neural Audio Codec-Based Deepfake Speech Dataset

Published in IEEE Transactions on Audio, Speech and Language Processing, 1900

Citation: Jiawei Du, Xuanjun Chen, Haibin Wu, Lin Zhang, I-Ming Lin, I-Hsiang Chiu, Wenze Ren, Yuan Tseng, Yu Tsao, Jyh-Shing Roger Jang, Hung-yi Lee. (2025) "CodecFake+: A Large-Scale Neural Audio Codec-Based Deepfake Speech Dataset" (submitted to IEEE Transactions on Audio, Speech and Language Processing)

Multi-Sinkhorn Teacher Knowledge Aggregation Framework for Adaptive Audio Anti-Spoofing

Published in IEEE Transactions on Audio, Speech and Language Processing, 1900

Citation: Ruiteng Zhang, Jianguo Wei, Xugang Lu, Lin Zhang, Di Jin, Wenhuan Lu, Junhai Xu. (2025) "Multi-Sinkhorn Teacher Knowledge Aggregation Framework for Adaptive Audio Anti-Spoofing" in IEEE Transactions on Audio, Speech and Language Processing, vol. 33, pp. 3850-3865.

SHDA: Sinkhorn Domain Attention for Cross-Domain Audio Anti-Spoofing

Published in IEEE Transactions on Information Forensics and Security, 1900

Citation: Ruiteng Zhang, Jianguo Wei, Xugang Lu, Lin Zhang, Di Jin, Junhai Xu, Wenhuan Lu. (2025) "SHDA: Sinkhorn Domain Attention for Cross-Domain Audio Anti-Spoofing" IEEE Transactions on Information Forensics and Security, vol. 20, pp. 6474-6489, 2025, doi: 10.1109/TIFS.2025.3576576

ShiftySpeech: A Large-Scale Synthetic Speech Dataset with Distribution Shifts

Published in NeurIPS 2025 Datasets and Benchmarks Track (submitted), 1900

Citation: Ashi Garg, Zexin Cai, Lin Zhang, Henry Li Xinyuan, Leibny Paola Garcia Perera, Kevin Duh, Sanjeev Khudanpur, Matthew Wiesner, Nicholas Andrews "ShiftySpeech: A Large-Scale Synthetic Speech Dataset with Distribution Shifts" (Submitted to NeurIPS 2025 Datasets and Benchmarks Track)

WeDefense: A Toolkit to Defend Against Fake Audio

Published in Workshop / Toolkit (in preparation), 1900

Citation: Lin Zhang, Xin Wang, Johan Rohdin, Junyi Peng, Tianchi Liu, You Zhang, Hieu-Thi Luong, Shuai Wang, Anna Silnova, Chengdong Liang, Nicholas Evans. "WeDefense: A Toolkit to Defend Against Fake Audio" (in preparation)

talks

What’s Happening on Partial Spoof?

Published:

  • 2025-03-24: Invited talk at UEF, Finland, in person
  • 2025-02-25: Invited talk at EURECOM, France, in person
  • 2025-04-18: Speech Technologies reading group, JHU, USA, in person

An Overview of Partially Fake Speech

Published:

  • 2025-11-20, IEEE SPS Webinar, online.
    • Title: “Minor Manipulations, Major Threat: An Overview of Partially Fake Speech”
    • Slides
  • 2025-11-10, CLSP Webinar, JHU, USA.
    • Title: “Minor Manipulations, Major Threat: An Overview of Partially Fake Speech”.
  • 2025-08-15, Invited talk, Reality Defender, online
    • Title: “Small Changes, Big Threat: A Story of Partial Spoof”