Yeah, these things are not fundamentally different from Markov chains. Basically, it has a huge multidimensional graph of tokens, and all it's doing is predicting the next likely token. So, when you introduce specific tokens into the input then it helps focus it in a particular direction.
Yeah, these things are not fundamentally different from Markov chains. Basically, it has a huge multidimensional graph of tokens, and all it's doing is predicting the next likely token. So, when you introduce specific tokens into the input then it helps focus it in a particular direction.