Demystifying LLMs
A Large Language Model is a sophisticated mathematical function that predicts what word comes next for any piece of text. It is a fancy word generator.
The "thinking" an LLM does is not contemplation; it is a statistical projection between words in a high-dimensional mathematical embedding space. It follows the vector path to the next most likely token, regardless of whether that path leads to a true statement or a falsehood.
Once an LLM is on a track, it treats its own previous output as the absolute foundation for what comes next. It does not fact check itself, it only consistency checks itself. If it makes an error, it will often compound that error to maintain the flow, spiraling into a loop of fabrications. In other words, to an LLM: the process of answering correctly and the process of spiraling into delusion are the exact same process.
- ← Previous
Estimating Software Projects