Researchers just mathematically proved that AI can't recursively self-improve its way to superintelligence.

? Offline

The fact that we learn before acquiring language is itself a demonstration of the fact that mammalian thought and reasoning, and human thought and reasoning, is fundamentally not based on language. Your argument is disproving your point.

? Offline

@resuna

You're debating a strawman. Try debating something I've actually written instead?

@devsimsek

? Offline

@troed @devsimsek what you have actually written is based on a category error. You're confusing the platform, neurons, with the software.

? Offline

@resuna

Wouldn't it be prudent if you learnt anything about the subject first?

Here - I'll help: One of the better books on the subject is "Consciousness: An Introduction" by Susan Blackmore.

I read it 15 years ago. That you believe there's a "platform" and "software" means you have no idea how human cognition works.

@devsimsek

? Offline

@troed @devsimsek

I have been following this approach since the first steps in the '80s. I'm pretty clear on how it works.

Software is a metaphor, the connectome is obviously a different kind of construct than procedural code, but it is the connections, not the fact that it is built out of neurons, that determines the kind of reasoning and model construction that the human brain performs. You are looking at the implementation and ignoring the big picture.

? Offline

@resuna

Here - read a scientific paper:

"Our findings reveal reasoning-like mechanisms within the LLM's layers that operate across structurally similar tasks. Crucially, these mechanisms remain stable despite variations in input and output data, suggesting the existence of internal processes that transcend basic language processing."

There's no hardware and software in humans. The hardware and the software are one and the same.

https://www.sciencedirect.com/science/article/pii/S2949882126000010

@devsimsek

? Offline

@troed @devsimsek

All the "internal processes" are linguistic, and not even sophisticated linguistic processing, anything else is hallucinated by the researchers fooled by the "clever hans" effect.

All mammals have basically the same hardware. All behavioral differences are due to differences in the size and arrangement of the connections between the neurons.

? Offline

@resuna

Your argument is "when my gut feeling doesn't agree with science I trust my gut feeling"?

I mean. That's a choice.

@devsimsek

? Offline

@troed @devsimsek

Ah, abuse. It always comes down to abuse.

? Offline

@resuna

I'm sorry, did I hurt your feels by pointing out that you're choosing emotions over facts?

@devsimsek

? Offline

@troed @devsimsek

I'm not going to respond to abuse with my own abuse, no matter how tempted I am.

? Offline

@devsimsek Hi,
I just wanted to ask what the difference between this and fitting a simple regression model on predicted outcomes? I feel the conclusions of this study is pretty obvious even from a simple regression case - or is it not? Why would we expect something different from LLMs?

? Offline

@resuna

This is "abuse":

"anything else is hallucinated by the researchers fooled by [...]"

Don't debate subjects you don't understand. Your initial post to me is laughably ignorant and directly contradicted by science:

"[LLMs] do not build models or reason about them"

@devsimsek

? Offline

@troed @devsimsek

What? I didn't even mention or refer to you in the text you quoted.

? Offline

@resuna

Why do you think abuse of others is ok?

@devsimsek

? Offline

@resuna

Yeah sorry, that doesn't qualify as a loaded question. If you want to play Debate then you need to understand the rules.

@devsimsek

devsimsek@universeodon.com

@drmambobob hi
Before I reply completely you must know that I'm not an expert on this topic but just a curious guy.

You made a great remark, it is essentially the same problem scaled up. The obvi part is that you cannot get something from nothing. The main reason I assume the paper authors are talking about it differently with llms is that while a regression model just gets stale, olm goes probabilistic hell. It basically actively deletes its output diversity until it produces a single acceptable response.

I hope this answers your question.

? Offline

@devsimsek

? Offline

@devsimsek I am the farthest thing from a mathematician but just from a general lay persons perspective the thesis seems a little rigid. Assuming all training data outside the internet is corrupt, assuming that synthetic data only comes from recursive souring, etc. I don't really want to cheerlead AGI as I assume it will bring about a bunch of existential risk but I'm still in the "wouldn't bet against it" camp.

? Offline

@paulschoe

other.li Forum

Researchers just mathematically proved that AI can't recursively self-improve its way to superintelligence.