Large Language Models - The Low Down On
FutureCIO, Monday, August 14,2023
Gartner defines a large language model (LLM) as a specialised type of artificial intelligence (AI) that has been trained on vast amounts of text to understand existing content and generate original content.
Cem Dilmegani, a principal analyst at AIMultiple, says LLMs are foundation models that utilise deep learning in natural language processing (NLP) and natural language generation (NLG) tasks. Using techniques such as fine-tuning, in-context learning, and zero-/one-/few-shot learning, these models can be adapted for downstream (specific) tasks such s question answering, sentiment analysis, object recognition and following instructions.