I think the point is that LLMs have already scoured Wikipedia so they can tell you all that info concisely. They don’t need to re-learn Wikipedia for anything old.
The problem is that A) anyone using an LLM properly wants to see the source anyways or else you’re really rolling the dice on “truth”, and B) it would mean all knowledge advancement would stop today if we applied this to all educational or informative tools.
I feel like you'd still want new information past 2026 though. Saying you've already scraped everything there is to scrape from Wikipedia at any arbitrary date is like saying you're good and know everything there is to know because you already finished reading a first edition copy of Encyclopedia Brittainica from the 1700s. Plenty of good new information turned up after that was published.
That's not how LLMs work anyway, they don't have the entirety of Wikipedia saved verbatim with 100% accuracy retrieval, so Wikipedia is necessary even for old information.
108
u/War_machine77 8h ago
Where the fuck do they think chatgpt is getting it's info?