Prafulla Dhariwal (OpenAI) - Jukebox: A Generative Model for Music

  Рет қаралды 9,986

Vector Institute

Vector Institute

3 жыл бұрын

Prafulla Dhariwal (OpenAI)
Jukebox: A Generative Model for Music
Presentation recorded June 19, 2020
Abstract: Music is an extremely challenging domain for generative modeling: it’s highly diverse, humans are perceptive to small errors, and it has extremely long range dependencies to learn if generated as raw audio. We show it’s possible to generate music with singing directly in the raw audio domain. We tackle the long sequence lengths of raw audio using a multi-scale VQ-VAE to compress it to discrete codes, and model those using autoregressive Transformers. We show that the combined model at scale can generate high-fidelity and diverse songs with coherence up to multiple minutes. We can condition on artist and genre to steer the musical and vocal style, and on unaligned lyrics to make the singing more controllable.
Bio: Prafulla Dhariwal is a research scientist at OpenAI leading work on generative models under the guidance of Ilya Sutskever. His work focuses on modeling high dimensional data while preserving fidelity and diversity, with prominent works being Glow, a normalizing flow generating high resolution images with fast sampling; and Variational Lossy Auto-encoder, a way to understand and prevent latent collapse with autoregressive decoders in VAE’s. In the past, he’s also worked on reinforcement learning, including PPO, a popular on-policy RL algorithm; and GamePad, an environment to make it easier to apply RL to formal theorem proving. He obtained his undergraduate degree from MIT in 2017 with a double major in Computer Science and Mathematics.

Пікірлер: 4
@masternobin
@masternobin Ай бұрын
Here after open AI launched chat gpt+4o
@AstronomywithManas
@AstronomywithManas 3 жыл бұрын
Prafull Dhariwal is from my School from INDIA, he is A Maths Prodigy and a very very Intelligent guy. After so many years he doing such a Great Work!!!!!!! Very informative Content.
@freshedits8408
@freshedits8408 Ай бұрын
It's really, which school
@Chadpritai
@Chadpritai Ай бұрын
👻which school bro?
Ming Yu Liu (NVIDIA) - Image and Video Synthesis with Conditional GANs
1:10:27
The Man Who Revolutionized Computer Science With Math
7:50
Quanta Magazine
Рет қаралды 2,8 МЛН
- А что в креме? - Это кАкАооо! #КондитерДети
00:24
Телеканал ПЯТНИЦА
Рет қаралды 7 МЛН
Heartwarming moment as priest rescues ceremony with kindness #shorts
00:33
Fabiosa Best Lifehacks
Рет қаралды 8 МЛН
🤔Какой Орган самый длинный ? #shorts
00:42
How Many Balloons Does It Take To Fly?
00:18
MrBeast
Рет қаралды 129 МЛН
Stuart Russell, "AI: What If We Succeed?" April 25, 2024
1:29:57
Neubauer Collegium
Рет қаралды 18 М.
An Observation on Generalization
57:21
Simons Institute
Рет қаралды 157 М.
Ilya: the AI scientist shaping the world
11:46
The Guardian
Рет қаралды 715 М.
How ChatGPT Works Technically For Beginners
33:11
Kurdiez
Рет қаралды 1 МЛН
The Turing Lectures: The future of generative AI
1:37:37
The Alan Turing Institute
Рет қаралды 574 М.
Generative AI in a Nutshell - how to survive and thrive in the age of AI
17:57
How I'd Learn AI (If I Had to Start Over)
15:04
Thu Vu data analytics
Рет қаралды 743 М.
AI Vocals: The Music Revolution Begins
16:05
Doctor Mix
Рет қаралды 1 МЛН
Did AI Just End Music? Ft. Rick Beato
25:46
ColdFusion
Рет қаралды 694 М.
- А что в креме? - Это кАкАооо! #КондитерДети
00:24
Телеканал ПЯТНИЦА
Рет қаралды 7 МЛН