Рет қаралды 55
gpt-4o claims to outperform claude opus.
I don't trust benchmarks.
So I ran my own little test:
test #1 → copy the UI of a website
As you can see, same old problems for claude opus.
Their AI ethics are through the roof.
It's sometimes impossible to cover the simplest task.
OG chatgpt does not care though.