GPT-4 Turbo vs GPT-4o in Reasoning TEST

  1,578 views

code_your_own_AI

23 days ago

The new GPT-4o (GPT-4 Omni) brings new video and voice functionality, but what about its classical reasoning performance?
Tested with my personal test suite: you may want to hold on to your trusted GPT-4-Turbo for causal reasoning and logical deduction.
These are not statistically significant results, since the test was not run 10,000 times on multiple machines over multiple days. They are just my personal test impressions.
They help me get a feel for the new GPT-4o: when to use it, and when to switch back to Turbo.
#airesearch
#newtech
#gpt4o

Comments: 10
@pin65371 21 days ago
One thing I'm finding with GPT-4 is that it doesn't seem to expand much on topics when I try to dig in deeper. It basically just rewords what it already said.
@tijendersingh5363 21 days ago
Your energy is always amazing, love it
@dasistdiewahrheit9585 21 days ago
Reminder: Don't give ClosedAI power!
@propeacemindfortress 21 days ago
I bet both will argue that marketing is a vital component of adoption 😂
@Charles-Darwin 19 days ago
I think the 'o' actually stands for orbitofrontal cortex. To mimic our own structure, it could be a smaller, narrow receptive input network that doesn't really retain or memorize beyond simple and critical pathways, plus a much larger network that assesses the weighted inputs, for a bottom-up, top-down approach. Because of this, I think 4o is a double-ended model whose two parts work in tandem to distill input and assess it. This region of the brain is multimodal, but just as in our organic build, vision is the primary input, and the other modalities also largely map to visual representations (hear a garbage truck outside, visualize what that truck looks like in your head). This region is also extremely low latency by necessity, as visual input needs near-automatic responses (driving a car, walking). All things considered, I think this is the analogue of our orbitofrontal cortex, and perhaps the applicability extends far further and wider than was theorized before the solution was implemented. Short of having the equivalent biological function to survive, I think this is AGI and we've only seen the baby brother. I don't think we'll get the whole enchilada this year or the next; rather, as they've been saying, an agentic version of Jr. to do the bidding of paid subscribers will come this year, and next year will bring an incremental improvement and a presence in robotics for sure. They'll keep the big one running privately to bolster its abilities, and maybe as a precaution. This kind of breakthrough also aligns with the primary scientists (the alignment-conscious ones) taking their leave as management has turned against the primary objective, allocating infrastructure resources to press forward with the model's expansion over creating safety for it. I think those scientists are starting a company dedicated to alignment.
@hl236 20 days ago
Perfect. Just what I needed. A reasoning comparison that's not coding related.
@RoulDukeGonzo 21 days ago
Have you tried react with an inference engine? Just curious
@IdPreferNot1 21 days ago
Purely anecdotal, but when I run some random work "think in steps" prompts on the new model, ones I would have assumed the old model had no problem with in past workflow examples, GPT-4o... sucked! Other than the greatly improved speed (at apparent real costs), I'm not sure what actual prompts it's IMPROVING for!?
@lighteningrod36 21 days ago
Omni is an omni-channel patch :( probably for training GPT-5, and I'm paying for their benefit.
@adinsoftic 21 days ago
Can you please elaborate?