>> RE Chomsky: You can see it's like epicycles: with enough parameters, an LLM is like a numerical method for curve fitting, that doesn't explain the data (any more than a fourier transform does). Curiously, they do seem to predict very accurately... yet also generalize strangely ("hallucinate"). What to think?
Well, that's the fundamental problem of modelling: that for any set of observations there's an arbitrary number of models that fit the data with great accuracy and even predict future observations well; and we don't know which one is the best in the long term.
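To make the underdetermination concrete, here's a toy sketch (mine, not from the thread; plain Python, invented example): two models that agree exactly on every observation, yet diverge wildly on the very next prediction.

```python
# Five noiseless observations of an underlying "true" law y = x**2.
xs = [0.0, 1.0, 2.0, 3.0, 4.0]

def model_a(x):
    # The simple law.
    return x ** 2

def model_b(x):
    # The same law plus a "wiggle" term that vanishes at every sample point,
    # so the two models are indistinguishable on the data we have.
    bump = x * (x - 1) * (x - 2) * (x - 3) * (x - 4)
    return x ** 2 + 0.5 * bump

# Both models fit all observations exactly...
assert all(model_a(x) == model_b(x) for x in xs)

# ...but disagree badly on the next point.
print(model_a(5.0))  # 25.0
print(model_b(5.0))  # 85.0
```

With enough free parameters you can always manufacture more such models, and the data alone cannot tell you which one to trust.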
The answer is that we should prefer not predictive models, but explanatory theories, that not only predict future observations but also explain why those observations should be expected to be made.
For example, the epicyclical model did not explain anything: it said nothing about why the planets should move on circular orbits with epicycles. Kepler's laws didn't explain anything because they didn't say why the planets should move on elliptical orbits. Newton's law of universal gravitation explained it all in one stroke: because gravity. And that's why we consider Newton the greatest scientist of his era, not Kepler, not Copernicus, not Galileo, but Newton, because he explained the world and didn't just describe it.
Ultimately the advantage is, like you say, that when an explanatory theory fails, we can better know why. When a predictive model fails, we have no clue.
>> BTW Chomsky's point E (which I'd never heard of), the last and most minor, was based on Gold's work.
Gold's negative learnability result was a huge upheaval that led directly to the current paradigm of machine learning. Chomsky used it to support his argument about the poverty of the stimulus, but linguistics was only one of the two fields that Gold's result turned upside down.
And it was a negative result. As I say in another comment, science gives you the tools to know when you're wrong and that's how progress is made, when we find out where we were wrong before.
With epicycles, it took almost two thousand years before we figured out where the model was wrong. Let's hope it doesn't take that long with LLMs and neural nets as well, because I doubt we have another couple of thousand years to spare on a wild goose chase.
>> (I want to stress that the idea of epicycles, the mechanical craftsmanship, and actual prediction of the planets are all amazing genius.)
The epicyclical model persisted for so long because it was so good, and because there was nothing better. It is common for people who don't understand science to look at scientists of the past with derision and think they weren't even scientists, but for almost two thousand years, astronomers did exactly what a scientist must do: they accepted the best available theory, even if many of them hated it with a burning passion (and they did!). If it wasn't for the ancients stumbling and fumbling in the dark for millennia, we wouldn't today be enlightened and we owe them every respect.
Maybe an advantage of an explanatory theory is in revealing more of the "black box", giving more ways to check the theory. (But I'm not sure how this could apply to Newton's gravity, since the only observations were outcomes. And no plausible way to "experiment".)
> If it wasn't for the ancients stumbling and fumbling in the dark for millennia
Is there any evidence that the epicyclic models helped scientific understanding, even indirectly? Later theories didn't seem to build on it. I wonder if it actually detoured understanding, with its misleadingly impressive accuracy, so that understanding would have progressed more quickly without it.
Thinking of pg's "great work" (https://news.ycombinator.com/item?id=36550615): to be the Newton of neural nets would seem the most ambitious aspiration of our times. But it took a bunch of geniuses just to get to Newton... and it seems an even harder problem than planetary motion.
Though a difference is neural nets are based on actual neurons (loosely!).
It's looking like working human-level AI will precede understanding... perhaps by those 2000 years?
The gravity model is similar though: we posit a force that pulls things together, but we don't know /why/ that force seems to exist, any more than the ancients knew /why/ the planets seemed to move in smaller circles along their circular paths. We're really not /that/ enlightened, after all.
I think that's right, but ultimately all explanations we have are based on prior knowledge that is itself not necessarily complete. It's explanations all the way down, until we hit some primary observations or axiomatic assumptions that are the hardest to get rid of.
"Enlightened" was my bad choice of a word. I get overexcited when I think of how much we have learned in the past couple thousand years and I forget that we mainly learned how little we know. Or can explain!
I'd like to think explaining means giving a model simpler than the observations. But this also can be true of a purely predictive model, that offers no "why". Another commenter pointed out that epicycles do simplify - so they do "explain" in this sense.
What defines an "explanation"? What makes something a "why"?
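The epicycles-as-compression point can be made concrete (a toy sketch of mine, stdlib Python only, with an invented example orbit): any periodic path decomposes into a sum of circular motions via its Fourier coefficients, i.e. literally into epicycles, and the fit improves as you add terms. That's exactly what makes the model so predictive while explaining nothing.

```python
import math
import cmath

# Sample a closed orbit in the complex plane: an off-centre ellipse,
# a crude stand-in for a planet's apparent path.
N = 256
ts = [2 * math.pi * n / N for n in range(N)]
orbit = [complex(2 * math.cos(t) + 0.5, math.sin(t)) for t in ts]

def coeff(k):
    # k-th Fourier coefficient = radius and phase of the k-th epicycle.
    return sum(z * cmath.exp(-1j * k * t) for z, t in zip(orbit, ts)) / N

def reconstruct(t, ks):
    # Position at time t using only the epicycles listed in ks.
    return sum(coeff(k) * cmath.exp(1j * k * t) for k in ks)

for ks in ([0], [0, 1], [0, 1, -1]):
    err = max(abs(reconstruct(t, ks) - z) for t, z in zip(ts, orbit))
    print(len(ks), "epicycles, max error:", round(err, 6))
# Errors shrink as terms are added: 2.0, then 0.5, then ~0.
```

So epicycles do "explain" in the compression sense: a few circles summarise hundreds of observations. What they never supply is a reason why those particular circles and no others.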
No, I'm sorry. I'm not a linguist so I only know the relation between Gold's result and linguistics second-hand. I'm more interested in it from the point of view of inductive generalisation in machine learning; that's my schtick.
Just to make sure I didn't hallucinate all that, I did an admittedly perfunctory search online and found this paper:
Whose introduction describes how Gold's result is considered to support the arguments for linguistic nativism from the poverty of the stimulus. Then again, the author doesn't seem to be a linguist himself and he doesn't give any more specific references, so I'm now a little worried; and your question remains unanswered.
Have you tried wading through Chomsky's early work on linguistics? I don't have the courage to. The closest I've got is a friend who has read a couple of Chomsky's linguistics books. My friend is making a living as an astrologist now, so maybe that's a bit of a warning there :P
(not who you asked) I thought this would be in the linked transcript, but it's not. Norvig must be getting it from elsewhere (maybe in the 404ed video?), but it seems like misrepresentation.