A Neural Parametric Singing Synthesizer – arXiv Vanity
Last updated 11 March 2025

We present a new model for singing synthesis based on a modified version of the WaveNet architecture. Instead of modeling raw waveform, we model features produced by a parametric vocoder that separates the influence of pitch and timbre. This allows conveniently modifying pitch to match any target melody, facilitates training on more modest dataset sizes, and significantly reduces training and generation times. Our model makes frame-wise predictions using mixture density outputs rather than categorical outputs in order to reduce the required parameter count. As we found overfitting to be an issue with the relatively small datasets used in our experiments, we propose a method to regularize the model and make the autoregressive generation process more robust to prediction errors. Using a simple multi-stream architecture, harmonic, aperiodic and voiced/unvoiced components can all be predicted in a coherent manner. We compare our method to existing parametric statistical and state-of-the-art concatenative methods using quantitative metrics and a listening test. While naive implementations of the autoregressive generation algorithm tend to be inefficient, using a smart algorithm we can greatly speed up the process and obtain a system that’s competitive in both speed and quality.
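The parameter-count argument for mixture density outputs can be made concrete with a small sketch. The code below is not the authors' implementation: it assumes a plain Gaussian mixture over a single scalar vocoder feature, and the names `mixture_nll`, `K`, and `BINS`, as well as the values 4 and 256, are illustrative assumptions. It computes the negative log-likelihood that such an output layer would be trained to minimize, and compares the number of network outputs needed per feature against a categorical output.

```python
import math

def mixture_nll(x, logits, means, log_stds):
    """Negative log-likelihood of scalar x under a K-component Gaussian mixture.

    logits, means, log_stds are length-K sequences (hypothetical network outputs).
    """
    # Log-softmax turns unnormalized logits into log mixture weights.
    m = max(logits)
    log_norm = m + math.log(sum(math.exp(l - m) for l in logits))
    terms = []
    for logit, mu, log_sd in zip(logits, means, log_stds):
        z = (x - mu) / math.exp(log_sd)
        log_pdf = -0.5 * z * z - log_sd - 0.5 * math.log(2.0 * math.pi)
        terms.append(logit - log_norm + log_pdf)
    # Log-sum-exp over components, computed in a numerically stable way.
    t = max(terms)
    return -(t + math.log(sum(math.exp(v - t) for v in terms)))

K = 4       # assumed number of mixture components (illustrative)
BINS = 256  # assumed number of categorical bins per feature (illustrative)

# Each component needs a weight, a mean, and a scale: 3 * K outputs per feature.
print("mixture outputs per feature:", 3 * K)
# A categorical output needs one logit per quantization bin.
print("categorical outputs per feature:", BINS)
```

Under these assumptions, the mixture needs 12 outputs per feature where the categorical output needs 256, which is the kind of saving the abstract refers to. The exact mixture parameterization used in the paper may differ.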

HiFiSinger: Towards High-Fidelity Neural Singing Voice Synthesis

Jukebox: A Generative Model for Music – arXiv Vanity

Singing Synthesis: with a little help from my attention – arXiv Vanity

A Tutorial on Deep Learning for Music Information Retrieval

Singing voice synthesis based on frame-level sequence-to-sequence

Conditioning Deep Generative Raw Audio Models for Structured

Learning Singing From Speech – arXiv Vanity

A Comparative Study of Voice Conversion Models with Large-Scale

Multimodal speech synthesis architecture for unsupervised speaker

Tacotron: Towards End-to-End Speech Synthesis – arXiv Vanity

DiffSinger: Singing Voice Synthesis via Shallow Diffusion
