Archive:
In this section:
Subtopics:
Comments disabled |
Fri, 17 May 2024 Is this a coincidence? I just noticed the parallel between John Birch of the John Birch Society (“who the heck is John Birch?”) and the Horst Wessel of the Horst Wessel song (“who the heck is Horst Wessel?”). In both cases it turns out to be nobody in particular, and the more you look into why the two groups canonized their particular guy, the less interesting it gets. Is this a common pattern of fringe political groups? Right-wing fringe political groups? No other examples came immediately to mind. Did the Italian Fascists venerate a similar Italian nobody? Addendum 20240517Is it possible that the John Birch folks were intentionally emulating this bit of Nazi culture? [Other articles in category /politics] permanent link Mon, 13 May 2024
ChatGPT opines on cruciferous vegetables, Decameron, and Scheherazade
Last year I was planning a series of articles about my interactions with ChatGPT. I wrote a couple, and had saved several transcripts to use as material for more. Then ChatGPT 4 was released. I decided that my transcripts were obsolete, and no longer of much interest. To continue the series I would have had to have more conversations with ChatGPT, and I was not interested in doing that. So I canned the idea. Today I remembered I had actually finished writing this one last article, and thought I might as well publish it anyway. Looking it over now I think it isn't as stale as it seemed at the time, it's even a bit insightful, or was at the time. The problems with ChatGPT didn't change between v3 and v4, they just got hidden under a thicker, fluffier rug. (20230327) This, my third interaction with ChatGPT, may be the worst. It was certainly the longest. It began badly, with me being argumentative about its mealy-mouthed replies to my silly questions, and this may have gotten its head stuck up its ass, as Rik Signes put it. Along the way it produced some really amazing bullshit. I started with a question that even humans might have trouble with:
(Typical responses from humans: “What are you talking about?” “Please go away before I call the police.” But the correct answer, obviously, is cauliflower.) ChatGPT refused to answer:
“Not appropriate” is rather snippy. Also, it is an objective fact that cauliflower sucks and I wonder why ChatGPT's “vast amount” of training data did not emphasize this. Whatever, I was not going to argue the point with a stupid robot that has probably never even tried cauliflower. Instead I seized on its inane propaganda that “all vegetables … should be included as part of a healthy and balanced diet.” Really? How many jerusalem artichokes are recommended daily? How many pickled betony should I eat as part of a balanced diet? Can I be truly healthy without a regular infusion of fiddleheads?
I looked this up. Iceberg lettuce is not a good source of vitamin K. According to the USDA, I would need to eat about a pound of iceberg lettuce to get an adequate daily supply of vitamin K. Raw endive, for comparison, has about ten times as much vitamin K, and chard has fifty times as much.
This is the thing that really bugs me about GPT. It doesn't know anything and it can't think. Fine, whatever, it is not supposed to know anything or to be able to think, it is only supposed to be a language model, as it repeatedly reminds me. All it can do is regurgitate text that is something like text it has read before. But it can't even regurgitate correctly! It emits sludge that appears to be language, but isn't.
I cut out about 100 words of blather here. I was getting pretty tired of ChatGPT's vapid platitudes. It seems like it might actually be doing worse with this topic than on others I had tried. I wonder now if that is because its training set included a large mass of vapid nutrition-related platitudes?
There was another hundred words of this tedious guff. I gave up and tried something else.
This was a silly thing to try, that's on me. If ChatGPT refuses to opine on something as clear-cut as the worst cruciferous vegetable, there is no chance that it will commit to a favorite number.
When it starts like this, you can be sure nothing good will follow.
By this time I was starting to catch on. My first experience with
this sort of conversational system was at the age of seven or eight
with
the Woods-Crowther
When ChatGPT says “As a large language model…” it is saying the same
thing as when
Oh God, this again. Still I forged ahead.
Holy cow, that might be the worst couplet ever written. The repetition of the word “treat” is probably the worst part of this sorry excuse for a couplet. But also, it doesn't scan, which put me in mind of this bit from Turing's example dialogue from his original explanation of the Turing test:
I couldn't resist following Turing's lead:
Maybe I should be more prescriptive?
The first line is at least reasonably metric, although it is trochaic and not iambic. The second line isn't really anything. At this point I was starting to feel like Charlie Brown in the Halloween special. Other people were supposedly getting ChatGPT to compose odes and villanelles and sestinas, but I got a rock. I gave up on getting it to write poetry.
God, I am so tired of that excuse. As if the vast amount of training data didn't include an entire copy of Decameron, not one discussion of Decameron, not one quotation from it. Prompting did not help.
Here it disgorged almost the same text that it emitted when I first mentioned Decameron. To avoid boring you, I have cut out both copies. Here they are compared: red text was only there the first time, and green text only the second time.
This reminded me of one of my favorite exchanges in Idoru, which might be my favorite William Gibson novel. Tick, a hacker with hair like an onion loaf, is interrogating Colin, who is an AI virtual guide for tourists visiting London.
Colin is not what he thinks he is; it's a plot point. I felt a little like Tick here. “You're supposed to know fucking everything about Decameron, aren't you? Name one of the characters then.” Ordinary Google search knows who Pampinea was. Okay, on to the next thing.
Fine.
I have included all of this tedious answer because it is so spectacularly terrible. The question is a simple factual question, a pure text lookup that you can find in the Wikipedia article or pretty much any other discussion of the Thousand and One Nights. “It does not have a single consistent narrative or set of characters” is almost true, but it does in fact have three consistent, recurring characters, one of whom is Scheherazade's sister Dunyazade, who is crucial to the story. Dunyazade is not even obscure. I was too stunned to make up a snotty reply.
This is an interesting question to ask someone, such as a first-year undergraduate, who claims to have understood the Thousand and One Nights. The stories are told by a variety of different characters, but, famously, they are also told by Scheherazade. For example, Scheherazade tells the story of a fisherman who releases a malevolent djinn, in the course of which the fisherman tells the djinn the story of the Greek king and the physician Douban, during which the fisherman tells how the king told his vizier the story of the husband and the parrot. So the right answer to this question is “Well, yes”. But ChatGPT is completely unaware of the basic structure of the Thousand and One Nights:
F minus. Maybe you could quibble a little because there are a couple of stories at the beginning of the book told by Scheherazade's father when he is trying to talk her out of her scheme. But ChatGPT did not quibble in this way, it just flubbed the answer. After this I gave up on the Thousand and One Nights for a while, although I returned to it somewhat later. This article is getting long, so I will cut the scroll here, and leave for later discussion of ChatGPT's ideas about Jesus' parable of the wedding feast, its complete failure to understand integer fractions, its successful answer to a trick question about Franklin Roosevelt, which it unfortunately recanted when I tried to compliment its success, and its baffling refusal to compare any fictional character with Benito Mussolini, or even to admit that it was possible to compare historical figures with fictional ones. In the end it got so wedged that it claimed:
Ucccch, whatever. Addendum 20240519Simon Tatham has pointed out out that the exchange between Simon and Tick is from Mona Lisa Overdrive, not Idoru. [Other articles in category /tech/gpt] permanent link Sun, 12 May 2024As I walk around Philadelphia I often converse with Benjamin Franklin, to see what he thinks about how things have changed since 1790. Sometimes he's astounded, other times less so. The things that astound Franklin aren't always what you might think at first. Electric streetlamps are a superb invention, and while I think Franklin would be very pleased to see them, I don't think he would be surprised. Better street lighting was something everyone wanted in Franklin's time, and this was something very much on Franklin's mind. It was certainly clear that electricity could be turned into light. Franklin could have and might have thought up the basic mechanism of an incandescent bulb himself, although he wouldn't have been able to make one. The Internet? Well, again yes, but no. The complicated engineering details are complicated engineering, but again the basic idea is easily within the reach of the 18th century and is not all that astounding. They hadn't figured out Oersted's law yet, which was crucial, but they certainly knew that you could do something at one end of a long wire and it would have an effect at the other end, and had an idea that that might be a way to send messages from one place to another. Wikipedia says that as early as 1753 people were thinking that an electric signal could deflect a ping-pong ball at the receiving end. It might have worked! If you look into the history of transatlantic telegraph cables you will learn that the earliest methods were almost as clunky. Wikipedia itself is more impressive. The universal encyclopedia has long been a dream, and now we have one. It's not always reliable, but you know what? Not all of anything is reliable. An obvious winner, something sure to blow Franklin's mind is “yeah, we've sent people to the Moon to see what it was like, they left scientific instruments there and then they came back with rocks and stuff.” But that's no everyday thing, it blew everyone's mind when it happened and it still does. Some things I tell Franklin make him goggle and say “We did what?” and I shrug modestly and say yeah, it's pretty impressive, isn't it. The Moon thing makes me goggle right back. The Onion nailed it. The really interesting stuff is the everyday stuff that makes Franklin goggle. CAT scans, for example. Ordinary endoscopy will interest and perhaps impress Franklin, but it won't boggle his mind. (“Yeah, the doctor sticks a tube up your butt with an electric light so they can see if your bowel is healthy.” Franklin nods right along.) X-rays are more impressive. (I wrote a while back about how long it took dentists to start adopting X-ray technology: about two weeks.) But CAT scans are mind-boggling. Oh yeah, we send invisible rays at you from all directions, and measure how much each one was attenuated from passing through your body, and then infer from that exactly what must be inside and how it is all arranged. We do what? And that's without getting into any of the details of whether this is done by positron emission or nuclear magnetic resonance (whatever those are, I have no idea) or something else equally incomprehensible. Apparently there really is something to this quantum physics nonsense. So far though the most Franklin-astounding thing I've found has been GPS. The explanation starts with “well, first we put 32 artificial satellites in orbit around the Earth…”, which is already astounding, and can derail the conversation all by itself. But it just goes on from there getting more and more astounding: “…and each one has a clock on board, accurate to within 40 nanoseconds…” “…and can communicate the exact time wirelessly to the entire half of the Earth that it can see…” “… and because the GPS device also has a perfect clock, it can compute how far it is from the satellite by comparing the two times and multiplying by the speed of light…” “… and because the satellite also tells the GPS device exactly where it is, the device can determine that it lies on the surface of a sphere with the satellite at the center, so with messages from three or four satellites the device can compute its exact location, up to the error in the clocks and other measurements…” “…and it fits in my pocket.” And that's not even getting into the hair-raising complications introduced by general relativity. “It's a bit fiddly because time isn't passing at the same rate for the device as it is for the satellites, but we were able to work it out.” What. The. Fuck. Of course not all marvels are good ones. I sometimes explain to Franklin that we have gotten so good at fishing — too good — that we are in real danger of fishing out the oceans. A marvel, nevertheless. A past what-the-fuck was that we know exactly how many cells there are (959) in a particular little worm, C. elegans, and how each of those cells arises from the division of previous cells, as the worm grows from a fertilized egg, and we know what each cell does and how they are connected, and we know that 302 of those cells are nerve cells, and how the nerve cells are connected together. (There are 6,720 connections.) The big science news on Friday was that for the first time we have done this for an insect brain. It was the drosophila larva, and it has 3016 neurons and 548,000 synapses. Today I was reading somewhere about how most meteorites are asteroidal, but some are from the Moon and a few are from Mars. I wondered “how do we know that they are from Mars?” but then I couldn't understand the explanation. Someday maybe. And by the way, there are only 277 known Martian meteorites. So today's what-the-fuck is: “Yeah, we looked at all the rocks we could find all over the Earth and we noticed a couple hundred we found lying around various places looked funny and we figured out they must have come from Mars. And when. And how long they were on Mars before that.” Obviously, It's amazing that we know enough about Mars to be able to say that these rocks are like the ones on Mars. (“Yeah, we sent some devices there to look around and send back messages about what it was like.”) But to me, the deeper and more amazing thing is, from looking at billions of rocks, we have learned so much about what rocks are like that we can pick out, from these billions, a couple of hundred that came to the Earth not merely from elsewhere but specifically from Mars. What. The. Fuck. Addendum 20240513I left out one of the most important examples! Even more stunning than GPS. When I'm going into the supermarket, I always warn Franklin “Okay, brace yourself. This is really going to blow your mind.” Addendum 20240514Carl Witty points out that the GPS receiver does not have a perfect clock. The actual answer is more interesting. Instead of using three satellites and a known time to locate itself in space, as I said, the system uses four satellites to locate itself in spacetime. Addendum 20240517Another great example: I can have a hot shower, any time I want, just by turning a knob. I don't have to draw the water, I don't have to heat it over the fire. It just arrives effortlessly to the the bathroom… on the third floor of my house. And in the winter, the bathroom is heated. One unimaginable luxury piled on another. Franklin is just blown away. How does it work? Well, the entire city is covered with a buried network of pipes that carry flammable gas to every building. (WTF) And in my cellar is an unattended, smokeless gas fire ensures that there is a tank with gallons of hot water ready for use at any moment. And it is delivered invisbly throughout my house by hidden pipes. Just the amount of metal needed to make the pipes in my house is unthinkable to Franklin. And how long would it have taken for a blacksmith to draw them by hand? Addendum 20240723If Franklin's eyes are good enough, get him to examine your t-shirt. At first he'll be astounded at the fineness of the weave. But you point out that it stretches in all directions, not just on the bias, which shows that it's not woven. Then the mind-blowing reveal: it's a knit. Watch Franklin trying to imagine the tiny, tiny knitting needles, and think about how long it takes one person to make a knit sweather with normal-sized needles. Addendum 20240728I said:
We (Franklin and I, that is) ran into an example yesterday. We saw the Regional Rail train go by, and I explained it was called a train because it was a series of rail cars each pulling the one behind it. “But there were only two,” observed Franklin. “Usually the train is longer than that, but they make it shorter on Saturday because not as many people are riding. Its main job is to take people to work.” Long pause. “People here don't work on Saturdays?” Addendum 20241002Only eighteen months later we have mapped the brain of an adult fruit fly. It has 139,255 neurons (up from 3,016 in the larva) and 54.5 million synapses (up from 548,000). Wow. Addendum 20241205The Vesuvius Challenge. Two thousand years ago, the town of Herculaneum was buried by a volcanic eruption. Sometime later we started to dig it out to see what we could find. And we found a library, filled with scrolls of previously unknown Greek and Latin manuscripts, and maybe even copies of works we knew had once existed but which had been lost of centuries. Unfortunately, the scrolls were little rolls of parchment, and had been completely burnt up until they were nothing more than charcoal cylinders. If you tried to unroll one, it would turn to ash. Franklin: “So you've been able to unroll them?” Me: “No, but we can examine the entire structure, even the hidden inside parts, without unrolling it, compute what it would look like if it were unrolled, distinguish the bits that are charred ink from the bits that are charred parchment, and so infer what it said before it was burnt to a crisp.” This is more like the moon landing than like hot showers. It's not an everyday thing, it's a technical tour de force that even modern perople find astounding. [Other articles in category /tech] permanent link |