• ☆ Yσɠƚԋσʂ ☆@lemmygrad.ml
    ·
    6 months ago

    Yeah, these things are not fundamentally different from Markov chains. Basically, it has a huge multidimensional graph of tokens, and all it's doing is predicting the next likely token. So, when you introduce specific tokens into the input then it helps focus it in a particular direction.