Humanist Discussion Group

Humanist Archives: July 20, 2025, 8:55 a.m. Humanist 39.87 - repetition vs intelligence: on LLMs

				
              Humanist Discussion Group, Vol. 39, No. 87.
        Department of Digital Humanities, University of Cologne
                      Hosted by DH-Cologne
                       www.dhhumanist.org
                Submit to: humanist@dhhumanist.org


    [1]    From: Mcgann, Jerome (jjm2f) <jjm2f@virginia.edu>
           Subject: Re: [Humanist] 39.84: repetition vs intelligence: on LLMs (284)

    [2]    From: Tim Smithers <tim.smithers@cantab.net>
           Subject: Re: [Humanist] 39.84: repetition vs intelligence: on LLMs (7)

    [3]    From: Tim Smithers <tim.smithers@cantab.net>
           Subject: Re: [Humanist] 39.84: repetition vs intelligence: on LLMs (34)

    [4]    From: Willard McCarty <willard.mccarty@mccarty.org.uk>
           Subject: 'repetition vs intelligence': a lengthy discussion (24)


--[1]------------------------------------------------------------------------
        Date: 2025-07-19 10:52:03+00:00
        From: Mcgann, Jerome (jjm2f) <jjm2f@virginia.edu>
        Subject: Re: [Humanist] 39.84: repetition vs intelligence: on LLMs

Jim's criticism of Tim's opening passages is correct only in a very narrow frame
of reference.  All the senses are developed and neurologically connected in
utero, and smell taste touch and hearing develop very early.    Visual sense is
late and quite undeveloped at birth.  It rapidily develops soon thereafter.  Of
course the neural network in some "born deaf" babies may have been so
undeveloped in utero that the sense of hearing never develops at all.  But
there's no doubt that many "born deaf" babies have developed neural network
hearing.  Keats's famous passage is relevant here:

Heard melodies are sweet, but those unheard
       Are sweeter; therefore, ye soft pipes, play on;
Not to the sensual ear, but, more endear'd,
       Pipe to the spirit ditties of no tone:

(The lovely game with the offrhyme words "unheard" and "endeared" is a brief
snatch of a "spirit ditty" that has always ravished me; and the phrase "spirit
ditties" is a performance Keats makes for us to hear too, briefly.)
      Babies have been listening to their bodies and their uterine environment,
most especially to their fully developed mother, for quite a while.
      More importantly, they been developing their other sensory-based language
skills  as well in that "prehistory", where they've been communicating in utero
with all their senses, though least of all with sight.  That's important to
recognize when we think about human language, especially when we think about the
so-called "language learning" of LLMs.  Tim's larger point about LLM language
learning and human language learning seems to be obviously correct.  Spirit
ditties are altogether beyond the capacity of machines that have not undergone
primary biochemical development.  Even if, like some postmodern Dr Frankenstein,
you "listened in" to that uterine history and reported out in data, you would
not have had the experience AND you would not have gotten to the meaning (pace
TS Eliot).
      All of that, for me, explains why linguists take spoken language as their
point of departure (rather than written language).  But we'd get further along
with our understanding of language if we began where, for instance, Robert
Bringhurst began, or Jaime de Angulo: with the communication networks operating
in so-called paleolithic and neolithic contexts, particularly those that
survived long enough to interact with our anthropocene.   Language is a
networked multisensory gift.  Levi-Strauss was onto something when he suggested
that human civilizations across the globe were most fully developed/integrated
in the neolithic period.

Jerry

PS. I've said nothing about Tim's excellent set of cautionary tales he has told
after those opening passages.


-----

From: Humanist <humanist@dhhumanist.org>
Date: Friday, July 18, 2025 at 3:53 AM
To: Mcgann, Jerome (jjm2f) <jjm2f@virginia.edu>
Subject: [Humanist] 39.84: repetition vs intelligence: on LLMs


              Humanist Discussion Group, Vol. 39, No. 84.
        Department of Digital Humanities, University of Cologne
                      Hosted by DH-Cologne
                       www.dhhumanist.org<http://www.dhhumanist.org>
                Submit to: humanist@dhhumanist.org


    [1]    From: Gabriel Egan <mail@gabrielegan.com>
           Subject: Re: [Humanist] 39.82: repetition vs intelligence: on LLMs
(157)

    [2]    From: James Rovira <jamesrovira@gmail.com>
           Subject: Re: [Humanist] 39.82: repetition vs intelligence: on LLMs
(43)


--[1]------------------------------------------------------------------------
        Date: 2025-07-17 12:34:01+00:00
        From: Gabriel Egan <mail@gabrielegan.com>
        Subject: Re: [Humanist] 39.82: repetition vs intelligence: on LLMs

Dear Humanists

There's rather a lot to respond to in
Tim Smithers's latest 9,750-word
posting on why Large Language Models
are not realy modelling human meaning.

I suspect that if anybody here wants
to know my response to Smithers's
posting, they'll want something
shorter than that. I believe that
an important part of where Smithers
goes wrong occurs in the first
2000 words of his posting, in
sections 1 and 2 about how human
language works. So, for this reply
I'll confine myself to just that
part.

Smithers writes:

 > It's important in what follows to remember
 > that the letters of the alphabets we use
 > to write with are symbols for sounds.

No, people who are deaf from birth have
no trouble understanding letters and
alphabets, without ever having heard
a sound.

 > They [the letters of the alphabet] are
 > not symbols for bits of meaning: the
 > meanings, to us as readers and writers
 > of words, are not built from the
 > letters we use to write them with.
 > It is the sound of words that is
 > built from the letters we write
 > them with, and read them from.

If written meaning were primarily a
matter of sound then people who have
never heard a sound would not find it
so easy to make meaning using the
same written system as those who
can hear sounds.

 > Writing systems are not semantic
 > representation building systems.
 > Writing systems are made of multiple
 > systems of symbols needed to
 > present well enough in written
 > form the sound of spoken
 > languaging.

Again, this insistence that sound is
primary in the construction of meaning
runs counter to the everyday experiences
of people who have never heard a sound.

 > To build a LLM . . . we must remove
 > [from the training data] all non-text
 > material, images, pictures, photographs,
 > diagrams, etc, and rip out all font
 > selection and typographical formatting
 > instructions; all the setting of
 > the text on the page or screen;
 > everything we put there to make
 > the text readable and understandable
 > by us readers.

This account is giving too much credit
to the font selection and typographical
formatting in the creation of meaning.
An LLM is perfectly capable of knowing
that "Just do it" is the slogan of the
Nike shoe brand despite not having the
benefit of learning from the accompanying
'checkmark' logo, the font, the all-caps
typographical styling, and so on. We
might note that the Humanist list postings
I receive have no font selection or
typographical formatting, unless we
count spaces and line-endings used to
make paragraph breaks as formatting.
By Smithers's model, he and I should
not be able to understand each other
in this styling-free medium.

 > Good typography is not just decoration.
 > It's needed for good accurate reading
 > and understanding of what was written
 > . . . .

This is demonstrably untrue. For all I
know Smithers could be reading my words
in a monospaced typeface printout from
a teleprinter, and for all he knows I
could be reading his in Braille.

In fact, I once supervised a student
whose only access to Shakespeare's
writing was in the form of Braille. I
found her accuracy of reading and
understanding of the play to be well
above the class average. She would
be insulted by Smithers's remarks.

 > If LLMs really deal in words and
 > meanings, why is all this needed
 > aid [the typography] to reading
 > the text first removed [in the
 > training of an LLM]?

The premise here is mistaken: what
Smithers calls the "needed aid" is
not needed. Any deaf or blind
student of language could easily
debunk the model of language
presented by Smithers.

 > Or, are we all now to believe
 > that good typographical design
 > plays no part in real language
 > understanding from writing?

Yes, we should believe that. All
the evidence from people who have
no access to that typographical
design is against Smithers's model
of language.

To summarize then. Smithers claims:

i) that meaning is essentially a matter
of sound not writing. Classicists
and post-structuralists will notice
that this is Plato's attitude in
'Phaedrus', which as Jacques Derrida
pointed out is a thereby self-defeating
piece of written argument.

ii) that typography and layout are
essential in the construction of
written meaning.

I have not yet worked through the rest
of Smithers's posting, but since he
spends 2000 words on the above I assume
that points (i) and (ii) are important
to his overall view of the limitations
of Large Language Models. My aim in
these remarks is to persuade Humanists
that Smithers is wrong about (i) and (ii).

Regards

Gabriel Egan

--[2]------------------------------------------------------------------------
        Date: 2025-07-17 11:44:58+00:00
        From: James Rovira <jamesrovira@gmail.com>
        Subject: Re: [Humanist] 39.82: repetition vs intelligence: on LLMs

Much appreciation for Tim's most recent post. That's exactly the point at which
I wanted to respond to Gabriel myself - the assumption, without any support,
that computer data of any kind holds "meaning" to the computer itself,
especially text display.

Tim's post could be a great jumping off point for a discussion of Derrida and
his deconstruction of the primacy of speech over writing. but there's no need to
go there right now.

My response is much simpler.

Words have meaning to human beings because they are associated with sounds and
feelings and experiences and memory and printed text. They are visceral, or at
least can be. They are also associated with relationships in different social
contexts, and so on, and all of this together combines to associate words with
personal identity.

And that is the point, at least one point and possibly the most important point,
at which words have meaning for human beings. And while we don't think only in
words, we think a great deal using words. Our thoughts are often comprised of
words.

Of course, different kinds of meaning are possible and we should discuss that
too, but I'm not going to right now. However, much of what I just said even
applies to working out math problems or conducting scientific experiments or
writing a computer program.

Words to a computer are just pixels. They are a display option. They don't have
any of those associations at all. As a result, words have no meaning to
computers. They don't understand them in any way comparable to human beings.
Computers do not think in words. They think, if we can use that term, in high
and low voltage states that are transformed into display patterns that human
beings ascribe meaning to. But that doesn't mean the computer itself does.

So using the language of meaning and understanding in relationship to words and
computers is a very bad form of equivocation. Yes, it is a fantasy. You have to
not be thinking in any detail at all about how computers work and how the human
mind works to think that they're equivalent or even comparable in any meaningful
way. It's the kind of claim you make to people who think that pro wrestling is
real, or if you want to write some ridiculous movie script along the lines of
the most recent Mission Impossible movie or The Matrix

Jim R




--[2]------------------------------------------------------------------------
        Date: 2025-07-18 09:12:36+00:00
        From: Tim Smithers <tim.smithers@cantab.net>
        Subject: Re: [Humanist] 39.84: repetition vs intelligence: on LLMs

Thank you, Jim!

If only I could write as many words as you do.

-- Tim

[...]

--[3]------------------------------------------------------------------------
        Date: 2025-07-18 09:12:28+00:00
        From: Tim Smithers <tim.smithers@cantab.net>
        Subject: Re: [Humanist] 39.84: repetition vs intelligence: on LLMs

Gabriel,

From what you appear to have understood of my previous post,
it would seem I wrote it in Swahili, a language I could say a
few things in when I was four but have hardly used since.

Try this ...

    Emily M Bender and Alexander Koller, 2020.  Climbing
    towards NLU: On Meaning, Form, and Understanding, in the
    Age of Data, Proceedings of the 58th Annual Meeting of the
    Association for Computational Linguistics, pp 5185–5198.
    <https://aclanthology.org/2020.acl-main.463.pdf>

and this ...

    Emily M Bender, Timnit Gebru, Angelina McMillan-Major, and
    Margaret Mitchell, 2021.  On the Dangers of Stochastic
    Parrots: Can Language Models Be Too Big?.  Proceedings of
    the 2021 ACM Conference on Fairness, Accountability, and
    Transparency.  doi:10.1145/3442188.3445922.
    <https://dl.acm.org/doi/10.1145/3442188.3445922>

instead.  They are both written in English.

-- Tim


PS: You seem to have lost 37 words from my post.  It had
58,311 characters, 9,787 words (not just 9,750), 1,141 lines,
and an average word length of 5.  These statistics are
essential for getting the intended meaning from the text.

[...]

--[4]------------------------------------------------------------------------
        Date: 2025-07-18 08:25:13+00:00
        From: Willard McCarty <willard.mccarty@mccarty.org.uk>
        Subject: 'repetition vs intelligence': a lengthy discussion

By 'lengthy' I'm not referring to Tim Smithers' recent contribution to
the discussion on LLMs, rather to the persistence of this topic.
Anything that awakens Humanist to a consequential issue on which we are
obviously qualified to comment is in my view to be celebrated. Some may
disagree with Tim's note in this or that respect, but his intention is
exactly right for Humanist. The whole of the humanities and social
sciences needs clear and critical explanation of how these AI gizmos
work. Gleick's recent NYRB article is one, Tim's another.

Many of us, I suspect, are not activists who (for the benefit of all) go
door to door or out onto the streets--may they continue, peaceably--or
tend gardens and orchards to provide examples as well as fulfil a
passion. Many are theorists who act by means of the word, scholars,
journalists, public intellectuals & al. Humanist sometimes, as now,
provides a venue.

All best for more of the same,
WM
--
Willard McCarty,
Professor emeritus, King's College London;
Editor, Interdisciplinary Research in the Arts,
   Sciences and Humanities (Berghahn); Humanist
www.mccarty.org.uk


_______________________________________________
Unsubscribe at: http://dhhumanist.org/Restricted
List posts to: humanist@dhhumanist.org
List info and archives at at: http://dhhumanist.org
Listmember interface at: http://dhhumanist.org/Restricted/
Subscribe at: http://dhhumanist.org/membership_form.php