Which statement about large language models (LLMs) is true?
LLM stands for Large ______ Model.
In LLM training, what does self-supervised learning mean?
During training, an LLM often tries to predict the next ______ in a sentence.
In the context of LLMs, what is a token?
Converting raw text into tokens is called ______.
What feature of transformer models helps them understand context across an entire sentence?
Transformers can process tokens ______ at once, unlike older recurrent models that read one token at a time.
What does it mean if an LLM has an emergent ability?
An emergent ability often appears only after a model reaches a certain ______.
How do LLMs differ from traditional AI programs?
What is the context window of an LLM?
Why do transformers use positional encodings?
Which statement about training large language models is true?