PaperPulse logo
FeedTopicsAI Researcher FeedBlogPodcastAccount

Stay Updated

Get the latest research delivered to your inbox

Platform

  • Home
  • About Us
  • Search Papers
  • Research Topics
  • Researcher Feed

Resources

  • Newsletter
  • Blog
  • Podcast
PaperPulse•

AI-powered research discovery platform

© 2024 PaperPulse. All rights reserved.

YingMusic-SVC: Real-World Robust Zero-Shot Singing Voice Conversion with Flow-GRPO and Singing-Specific Inductive Biases

ArXivSource

Gongyu Chen, Xiaoyu Zhang, Zhenqiang Weng, Junjie Zheng, Da Shen, Chaofan Ding, Wei-Qiang Zhang, Zihao Chen

cs.SD
cs.AI
|
Dec 4, 2025
5 views

One-line Summary

YingMusic-SVC is a robust zero-shot singing voice conversion system that improves timbre similarity and naturalness in real-world conditions using innovative techniques like Flow-GRPO and singing-specific biases.

Plain-language Overview

The paper presents YingMusic-SVC, a new system designed to convert singing voices from one singer to another while maintaining the original melody and lyrics. Unlike previous systems, YingMusic-SVC is more resilient to challenges like background harmonies and pitch errors, which often disrupt voice conversion. The system uses advanced machine learning techniques to separate and adapt different aspects of the voice, ensuring the converted voice sounds natural and similar to the target singer. Tests show that YingMusic-SVC performs better than existing methods, especially in complex musical environments.

Technical Details