Prafulla Dhariwal (OpenAI) - Jukebox: A Generative Model for Music

  9,432 views

Vector Institute

3 years ago

Prafulla Dhariwal (OpenAI)
Jukebox: A Generative Model for Music
Presentation recorded June 19, 2020
Abstract: Music is an extremely challenging domain for generative modeling: it’s highly diverse, humans are sensitive to small errors, and it has extremely long-range dependencies to learn if generated as raw audio. We show it’s possible to generate music with singing directly in the raw audio domain. We tackle the long sequence lengths of raw audio using a multi-scale VQ-VAE to compress it to discrete codes, and model those codes using autoregressive Transformers. We show that the combined model at scale can generate high-fidelity and diverse songs with coherence up to multiple minutes. We can condition on artist and genre to steer the musical and vocal style, and on unaligned lyrics to make the singing more controllable.
Bio: Prafulla Dhariwal is a research scientist at OpenAI leading work on generative models under the guidance of Ilya Sutskever. His work focuses on modeling high-dimensional data while preserving fidelity and diversity, with prominent works including Glow, a normalizing flow generating high-resolution images with fast sampling; and Variational Lossy Autoencoder, a way to understand and prevent latent collapse with autoregressive decoders in VAEs. In the past, he’s also worked on reinforcement learning, including PPO, a popular on-policy RL algorithm; and GamePad, an environment to make it easier to apply RL to formal theorem proving. He obtained his undergraduate degree from MIT in 2017 with a double major in Computer Science and Mathematics.
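
The abstract's compression step, turning raw audio into discrete codes at more than one time scale, can be illustrated with a toy sketch. This is not the Jukebox implementation: the real encoder is a learned network, whereas here a plain window average stands in for it, and the function names, the one-dimensional codebook, and the hop sizes are all hypothetical values chosen for illustration.

```python
from typing import List, Sequence


def nearest_code(vector: Sequence[float], codebook: Sequence[Sequence[float]]) -> int:
    """Return the index of the codebook entry closest to `vector` (squared L2 distance)."""
    best_index, best_dist = 0, float("inf")
    for index, code in enumerate(codebook):
        dist = sum((v - c) ** 2 for v, c in zip(vector, code))
        if dist < best_dist:
            best_index, best_dist = index, dist
    return best_index


def quantize(latents: Sequence[Sequence[float]], codebook: Sequence[Sequence[float]]) -> List[int]:
    """Map each latent vector to the index of its nearest codebook entry."""
    return [nearest_code(z, codebook) for z in latents]


def downsample(signal: Sequence[float], hop: int) -> List[List[float]]:
    """Stand-in 'encoder' (an assumption): average each non-overlapping window of `hop` samples."""
    return [
        [sum(signal[i:i + hop]) / hop]
        for i in range(0, len(signal) - hop + 1, hop)
    ]


# Toy data: eight "audio samples" and a three-entry codebook (both hypothetical).
signal = [0.0, 0.1, 0.9, 1.0, 0.1, 0.0, 1.0, 0.9]
codebook = [[0.0], [0.5], [1.0]]

# Two scales: a small hop keeps more detail, a large hop gives a shorter sequence.
fine_codes = quantize(downsample(signal, hop=2), codebook)
coarse_codes = quantize(downsample(signal, hop=4), codebook)
```

The coarser level produces fewer codes for the same signal, which is what makes long sequences tractable for an autoregressive model; the finer level retains more local detail.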

Comments: 4
@masternobin, 27 days ago
Here after OpenAI launched ChatGPT + GPT-4o
@AstronomywithManas, 3 years ago
Prafulla Dhariwal is from my school in India. He is a maths prodigy and a very, very intelligent guy. After so many years he is doing such great work! Very informative content.
@freshedits8408, 27 days ago
Really? Which school?
@Chadpritai, 21 days ago
👻 Which school, bro?