Werewolf wrote: ↑Mon Apr 15, 2024 12:25 am Your Robot Overlord has just received an update.
ChatGPT 4 (paid version) has been updated (4.x Turbo?) and is noticeably better at chess.
It played 25 moves without making a single mistake (i.e no illegal moves) and played some decent chess. I would say positionally it was 1700 FIDE Elo but there’s no calculation so it blundered eventually in a sequence of exchanges.
For those who haven’t been following, this may not seem very impressive. But to me it looks like a large leap forward for a LLM.
ChatGPT 5 rumoured to arrive in a few months…
It is astonishing to me that a chatbot with no chess programming can play chess.
I use LLMs on a daily basis: they're not perfect, but they're more than good enough to be very useful. Here's my ordering of the free ones I've used:
1. Gemini. Easily the most intelligent and useful of the free LLMs. When extra intelligence is needed, this is the one I turn to.
2. Claude.ai. When I want a long political document summarised, this is the one I turn to. It doesn't do as good a job as Gemini, but I have been surprised by just how irritating Gemini's political correctness (PC) is! I would have thought that I, personally, would be OK with it - but no: for summarising political documents, Claude's relative freedom (even Claude has its limits) from PC more than makes up for its lower intelligence
3. Pi.ai. Enjoyed it when I used to use it, but dropped it when I discovered how intelligent Gemini is. Haven't used it for a while
4. ChatGPT. A revelation when it first came out - but the free version no longer looks clever in comparison to the free alternatives above
Copilot has been added to my Windows taskbar whether I want it or not. I don't really use it, but when I've tried it, it seems to be roughly comparable to Gemini in intelligence.
I can clearly see that an intelligent assistant is of great value, so I would consider paying - but I haven't yet reached the point where I'm ready to pay. Also, at this time, free versions are still improving dramatically: the current version of Gemini was launched 40 days ago, and for me it was a big uptick in intelligence compared to all the free LLMs that went before.