Announcing ARC-AGI-3 - A benchmark that tests if AI can explore, learn, and adapt in unfamiliar situations. Humans score 100%. Frontier AI scores 0.26%.

brianpeiris@lemmy.ca · edit-2 3 days ago

Announcing ARC-AGI-3 - A benchmark that tests if AI can explore, learn, and adapt in unfamiliar situations. Humans score 100%. Frontier AI scores 0.26%.

hitmyspot@aussie.zone · 23 hours ago

No, the analogy is about not understanding but regurgitating data. It’s more complex than that but the gist is that they don’t understand or have knowledge of the data being presented.

They are statistical models for what is desirable output. They don’t understand what they give as an answer. That is why they halluncinate information that sounds plausible and confident.

We’re not refuting your point about how the technology works, but rather that the person you replied to provided a poor analogy. They didn’t. It served the purpose it was designed to do. If you don’t understand that, that’s on you, not them. Maybe ask an ai to explain. ;)

PhoenixDog@lemmy.world · 18 hours ago

Nailed it.

mechoman444@lemmy.world · edit-2 15 hours ago

Someone else in the comments said it perfectly. Al is just data regurgitation. It’s like calling me highly intelligent because I read you a paragraph from Wikipedia. I didn’t know anything. I just read a thing and said it out loud.

Christ on a stick.

The original analogy literally states “AI is just data regurgitation” now you’re what? Saying it’s more complex? Ever heard of a motte and Bailey. Cuz that’s what you’re doing now.

Once again, for the people in the back, the analogy is a failure. It does not work. Llms are not regurgitation machines.

Motte and bailey so it’s faster for you to look up.

hitmyspot@aussie.zone · 3 hours ago

They simplified it, and also used hyperbole. That’s not the same as motte and bailey.

You’re being too literal, while still being imprecise. It’s likely why you’re struggling with what the analogy is for.

mechoman444@lemmy.world · 1 hour ago

He is claiming the analogy works, then retreating to a more defensible position by admitting the system is more complex.

I am not being overly simplistic or imprecise. I am stating plainly that the analogy fails. LLMs do not regurgitate stored information. They generate novel outputs by statistically modeling and interpreting patterns in their training data. I supported that position with objective facts, and no one has attempted to directly refute them. Instead, the responses rely on vague arguments about “precision” and “simplicity,” which do not address the core claim.

Announcing ARC-AGI-3 - A benchmark that tests if AI can explore, learn, and adapt in unfamiliar situations. Humans score 100%. Frontier AI scores 0.26%.

Announcing ARC-AGI-3 - A benchmark that tests if AI can explore, learn, and adapt in unfamiliar situations. Humans score 100%. Frontier AI scores 0.26%.

Announcing ARC-AGI-3 | ARC Prize