Large Language Models (LLMs) [Part 1]

Pioneering Future of Artificial Intelligence As a Game-Changer

Kubilay Tuna
11 min readMar 27, 2024
Photo was created & produced by author with GenAI

Thanks to Large Language Models (or LLMs for short), Artificial Intelligence (AI) has now caught the attention of pretty much everyone. GPT3, possibly the most famous LLM, has immediately skyrocketed in popularity due to the fact that natural language is such a, well, natural interface that has made the recent breakthroughs in AI accessible to everyone. Nevertheless, how LLMs work is still less commonly understood, unless you are a Data Scientist or in another AI-related role. In this article, we will try to explore LLMs together from the beginning by sharing very general information.

If you’ve fastened your seat belt, let’s start exploring LLMs! 🚀

Before delving into LLMs, let’s explore Language Models (LMs). A LM is a statistical model used to predict the likelihood of a sequence of words occurring in a given context. Essentially, it serves as a tool that aids computers in understanding and generating human language.

These are the building block of the LLMs. Basically, they are language models that learn patterns and relationships between words in a set of text data (token) and accordingly make predictions about the next token in the sequence, given the previous words. For example, let’s take into…

--

--

Kubilay Tuna

Machine Learning Engineer || Passionate about machine learning, data science, and programming