Warning GPT-4o: DON'T translate to Chinese (MIT)

  Рет қаралды 1,749

code_your_own_AI

code_your_own_AI

2 ай бұрын

MIT (Massachusetts Institute of Technology) Tech review reports on massive problems with GPT-4o regarding the Chinese language, discovering heavy tokenizer "pollution".
Warning if you use this AI to translate business correspondence into Chinese, since MIT reports on a heavy data pollution with Chinese tokens.
Currently double check the translation results of GPT-4o with an independent source, especially your business communication to your Chinese partners. Otherwise you might find your company and yourself in a strange business situation ....
All rights w/ authors:
GPT-4o’s Chinese token-training data is polluted by spam and porn websites
www.technologyreview.com/2024...
#airesearch
#gpt4o

Пікірлер: 8
@Quaintcy
@Quaintcy 2 ай бұрын
I suspect teaming up with microsoft and using their search data is a mistake
@mshonle
@mshonle Ай бұрын
It’s time we move to something more advanced and curated than BPE for tokenizers. The “just add more data, scale!” crowd seems to have a major blind spot here. The “bitter lesson” applies to neural architectures and generalizing it to other big data tasks is, to put it nicely, a failed experiment.
@justindressler5992
@justindressler5992 Ай бұрын
So the new model is meant to be better at multilingual it was well of the bigger gains. But I fails to translate for 1.2 billion people, classic.
@propeacemindfortress
@propeacemindfortress 2 ай бұрын
crap in crap out...
@dragonbone1020
@dragonbone1020 Ай бұрын
I just asked it to write a business letter in Chinese and it was all fine.
@code4AI
@code4AI Ай бұрын
Thanks for the update. Running multiple generations of models in parallel on multiple clusters helps with internal switching.
GraphRAG or SpeculativeRAG ?
25:51
code_your_own_AI
Рет қаралды 3,7 М.
Why AI doesn't speak every language
10:15
Vox
Рет қаралды 564 М.
Happy 4th of July 😂
00:12
Pink Shirt Girl
Рет қаралды 61 МЛН
THEY made a RAINBOW M&M 🤩😳 LeoNata family #shorts
00:49
LeoNata Family
Рет қаралды 38 МЛН
Two GPT-4os interacting and singing
5:55
OpenAI
Рет қаралды 2,8 МЛН
The ARM chip race is getting wild… Apple M4 unveiled
4:07
Fireship
Рет қаралды 1,2 МЛН
Funny Moments You Missed From the G7 Summit in Italy
4:34
On Demand News
Рет қаралды 1,4 МЛН
Has Generative AI Already Peaked? - Computerphile
12:48
Computerphile
Рет қаралды 871 М.
Adversarial Questions Test Multimodal MED AI sys
21:08
code_your_own_AI
Рет қаралды 1,3 М.
26 Incredible Use Cases for the New GPT-4o
21:58
The AI Advantage
Рет қаралды 772 М.
What Is an AI Anyway? | Mustafa Suleyman | TED
22:02
TED
Рет қаралды 1,2 МЛН
Q* explained: Complex Multi-Step AI Reasoning
55:11
code_your_own_AI
Рет қаралды 7 М.
GPT-4o is WAY More Powerful than Open AI is Telling us...
28:18
MattVidPro AI
Рет қаралды 269 М.
Hisense Official Flagship Store Hisense is the champion What is going on?
0:11
Special Effects Funny 44
Рет қаралды 3,1 МЛН
После ввода кода - протирайте панель
0:18
Up Your Brains
Рет қаралды 1,3 МЛН
КРУТОЙ ТЕЛЕФОН
0:16
KINO KAIF
Рет қаралды 5 МЛН
iPhone socket cleaning #Fixit
0:30
Tamar DB (mt)
Рет қаралды 12 МЛН
Игровой Комп с Авито за 4500р
1:00
ЖЕЛЕЗНЫЙ КОРОЛЬ
Рет қаралды 2,2 МЛН