LLM
A large language model (LLM) is a type of artificial intelligence trained on vast amounts of text data. It learns patterns in language, enabling it to generate human-like responses, follow instructions, summarise information, and solve problems expressed in natural language.
LLMs work by repeatedly predicting the next token (a word or word fragment) given the text so far, using that context to keep the output coherent and relevant. This allows them to handle a wide range of tasks across domains, from technical reasoning to creative writing.
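The prediction loop can be illustrated with a deliberately tiny stand-in. The sketch below is not how a real LLM is built: it assumes a toy bigram model that only counts which token follows which, whereas an LLM uses a neural network conditioned on a much longer context. The function names (`train_bigram`, `predict_next`, `generate`) and the example corpus are hypothetical, chosen only for illustration. What carries over is the generation loop: predict one token, append it, and repeat.

```python
from collections import Counter, defaultdict


def train_bigram(tokens):
    """Count how often each token follows each other token."""
    counts = defaultdict(Counter)
    for prev, nxt in zip(tokens, tokens[1:]):
        counts[prev][nxt] += 1
    return counts


def predict_next(counts, context):
    """Return the most frequent successor of the last context token, or None."""
    followers = counts.get(context[-1])
    if not followers:
        return None
    return followers.most_common(1)[0][0]


def generate(counts, prompt, max_new_tokens=5):
    """Greedy generation: append the predicted token, then predict again."""
    out = list(prompt)
    for _ in range(max_new_tokens):
        nxt = predict_next(counts, out)
        if nxt is None:  # no known continuation, so stop early
            break
        out.append(nxt)
    return out


# Toy corpus (an assumption for this example, not real training data).
corpus = "the robot moves the arm and the robot stops".split()
model = train_bigram(corpus)
print(generate(model, ["and"], max_new_tokens=2))
```

Here the "context" is only the previous token. A real model looks at many preceding tokens at once and usually samples from a probability distribution instead of always taking the most frequent continuation.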
Traditionally, LLMs have required significant compute resources and have been hosted in the cloud. However, recent advancements in model architecture, quantisation, and hardware acceleration have enabled smaller, optimised versions to run on local devices. These local LLMs can operate offline, preserve user privacy by keeping data on the device, and reduce latency by avoiding network round trips, making them well suited to modular robotics, IoT systems, and edge environments.
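As a rough illustration of why quantisation helps models fit on local devices, the sketch below assumes the simplest scheme, symmetric per-tensor 8-bit quantisation, applied to a handful of made-up weights. The helper names `quantize_int8` and `dequantize` are hypothetical, and production toolchains typically use more refined variants (per-channel or per-group scales, 4-bit formats). Storing each weight as an 8-bit integer instead of a 32-bit float cuts weight memory to roughly a quarter, at the cost of a small rounding error.

```python
def quantize_int8(weights):
    """Symmetric per-tensor quantisation: map floats to integers in [-127, 127]."""
    max_abs = max(abs(w) for w in weights)
    scale = max_abs / 127 if max_abs > 0 else 1.0
    q = [round(w / scale) for w in weights]
    return q, scale


def dequantize(q, scale):
    """Recover approximate float weights from the integers and the shared scale."""
    return [v * scale for v in q]


# Made-up example weights (an assumption, not taken from any real model).
weights = [0.50, -1.27, 0.03, 0.90]
q, scale = quantize_int8(weights)
restored = dequantize(q, scale)

# The rounding error per weight is bounded by half a quantisation step.
max_error = max(abs(a - b) for a, b in zip(weights, restored))
print(q, max_error <= scale / 2 + 1e-12)
```

Only the integer list and a single scale factor need to be stored. The original floats are recovered approximately at inference time.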