NEW Multi-Modal AI by APPLE

2,316 views

code_your_own_AI

26 days ago

Apple published new machine learning (ML) models on its GitHub repo: 4M-21. 4M stands for Massively Multimodal Masked Modeling.
All rights with the authors:
4M-21: An Any-to-Any Vision Model for Tens of Tasks and Modalities
arxiv.org/pdf/2406.09406
Video from Apple and Lausanne:
storage.googleapis.com/four_m...
#appleai
#apple
#multimodalai

Comments: 10
@mshonle 24 days ago
It’s about time that we went back to encoder/decoder architectures again!
@tomw4688 22 days ago
Great catch! Thanks for reviewing this.
@user-zd8ub3ww3h 24 days ago
This is a very good introduction to Any-to-Any.
@thesimplicitylifestyle 23 days ago
Very useful! Thank you! 😎🤖
@fontende 24 days ago
I don't understand why they keep releasing such minuscule, useless models all year long. In my experience, decent models start at 30 billion parameters (yes, I have 128 GB of RAM). Only that size provides some quality beyond mere function (a glimpse of intelligence, especially uncensored) in squeezed, quantized versions.
@code4AI 23 days ago
Now, I could explain to you that current phones have on-board compute limitations, or I could explain that research projects start with lower complexity to document a proof of concept, but would you understand it?
@fontende 23 days ago
@code4AI It's hard to tell their real motives; Apple is the most closed tech group. Yes, today's phones are as incapable as robots, with no good chip anywhere. I understand perfectly. That's just my opinion, and Apple will never release big models publicly, since they are a valuable asset. Llama 7B is good, but only as a dictionary/translator; anything smaller is even more primitive. For spyware like Recall, this small model is perfect.
@falklumo 1 day ago
You seem to be confused. This work is not about an LLM, so your parameter-count intuition does not apply. It is better compared with Stable Diffusion, which DOES an OK job on 8 GB GPUs.
@fontende 1 day ago
@falklumo That's a weird reply. Why are you referencing Stable Diffusion, an image generator, at all? It's kind of long to explain, but Apple's first stylus writing recognition (a grandfather of current AI) was horrible; they bought a patent license to use a better one, made by others, in the Newton device.