U.S. flag

An official website of the United States government, Department of Justice.

NCJRS Virtual Library

The Virtual Library houses over 235,000 criminal justice resources, including all known OJP works.
Click here to search the NCJRS Virtual Library

SVDD 2024: The Inaugural Singing Voice Deepfake Detection Challenge

NCJ Number
310084
Author(s)
You Zhang; Yongyi Zang; Jiatong Shi; Ryuichi Yamamoto; Tomoki Toda; Zhiyao Duan
Date Published
January 2025
Length
6 pages
Annotation

This article reports on the findings from an examination of how to identify artificial intelligence-generated singing voices from voices of authentic, human singers.

Abstract

With the advancements in singing voice generation and the growing presence of AI singers on media platforms, the inaugural Singing Voice Deepfake Detection (SVDD) Challenge aims to advance research in identifying AI-generated singing voices from authentic singers. This challenge features two tracks: a controlled setting track (CtrSVDD) and an in-the-wild scenario track (WildSVDD). The CtrSVDD track utilizes publicly available singing vocal data to generate deepfakes using state-of-the-art singing voice synthesis and conversion systems. Meanwhile, the WildSVDD track expands upon the existing SingFake dataset, which includes data sourced from popular user-generated content websites. For the CtrSVDD track, we received submissions from 47 teams, with 37 surpassing our baselines and the top team achieving a 1.65 percent equal error rate. For the WildSVDD track, the authors benchmarked the baselines. This paper reviews these results, discusses key findings, and outlines future directions for SVDD research. (Published Abstract Provided)