
NEW Multi-Modal AI by APPLE

  2,672 views

code_your_own_AI

days ago

Apple published new machine learning (ML) models on its GitHub repo: 4M-21. 4M stands for Massively Multimodal Masked Modeling.
All rights with the authors:
4M-21: An Any-to-Any Vision Model for Tens of Tasks and Modalities
arxiv.org/pdf/...
Video from Apple and EPFL (Lausanne):
storage.google...
#appleai
#apple
#multimodalai

Comments: 11
@MeinDeutschkurs 1 month ago
Great! I watched both your video and the one by EPFL. I hope the community will create a dataset that is not based on synthetic data, to increase the quality. I was impressed by the video-frame demo. I hope that some day audio and video/animation will be included. That's so exciting!
@tomw4688 2 months ago
Great catch! Thanks for reviewing this.
@mshonle 2 months ago
It's about time we went back to encoder/decoder architectures!
@user-zd8ub3ww3h 2 months ago
This is a very good introduction to Any-to-Any.
@thesimplicitylifestyle 2 months ago
Very useful! Thank you! 😎🤖
@fontenbleau 2 months ago
I don't understand why they keep releasing such minuscule, useless models all year long. In my experience, decent models start at 30 billion parameters (yes, I have 128 GB of RAM). Only that size provides quality beyond mere function (a glimpse of intelligence, especially uncensored) in squeezed, quantized versions.
@code4AI 2 months ago
Now, I could explain to you that current phones have on-board compute limitations, or that research projects start at lower complexity to document a proof of concept, but would you understand it?
@fontenbleau 2 months ago
@code4AI It's hard to tell their real motives; Apple is the most closed tech group. Yes, phones today are as incapable as robots, with no good chip anywhere. I understand perfectly; that's just my opinion. Apple will never release big models publicly, since they are a valuable asset. Llama 7B is good, but only as a dictionary/translator, and anything smaller is even more primitive. For spyware like Recall, such a small model is perfect.
@falklumo 1 month ago
You seem to be confused. This work is not about an LLM, so your parameter-count intuition does not apply. It is better compared with Stable Diffusion, which DOES an OK job on 8 GB GPUs.
@fontenbleau 1 month ago
@falklumo That's a weird reply. Why are you referencing Stable Diffusion at all, an image generator? It's kind of long to explain, but first, Apple's stylus writing recognition (a grandfather of today's AI) was horrible; they bought a patent license to use a better one, made by others, in the Newton device.