Discussion about this post

User's avatar
IA & Educação's avatar

Very interesting and carefully welll constructed! Logically exposed why the language models are not intelligent and why passing a benchmark that the models memorizes the answer for is not enough. I would suggest researching about neural networks, that some specialists like Gary Marcus (also here on substack) propose as a path for the large language models.

But congratulations on your essay! Keep writing!

Michael Coleman, Ph.D.'s avatar

Well argued. I was curious about the ability of current frontier models to handle Counterfactuals. I was quite surprised when Gemini Flash easily handled my first test question. Perhaps someone had written down the question and answer for its training but I can't shake the impression that the parrot has some kind of comprehension. Here it is:

Prompt: Assume that human blood is blue not red. Imagine we could change physical laws. If Rayleigh scattering was proportional to the wavelength to the 4th power, would it still make sense to call the moon during an eclipse a Blood Moon?

Gemini gave a long, detailed correct answer with this summary: "Yes, in this beautifully inverted hypothetical universe, it would still make perfect sense to call it a Blood Moon! To understand why, we have to look at how your proposed change to physical laws alters the color of the eclipse, and then match it to your alternate biology."

Possibly there is some higher structure being added to this model - time will tell

No posts

Ready for more?