Transformer model
A transformer model is a neural network architecture designed to handle sequential data, such as text or time series, more efficiently than older models like recurrent neural networks (RNNs).
Transformers use an attention mechanism to weigh the importance of different parts of the input data. This allows them to capture context and relationships across long sequences, making them effective for tasks such as translation, summarization, and question answering.
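To make this weighting concrete, here is a minimal NumPy sketch of scaled dot-product attention, the core operation behind the mechanism described above. The function name, the toy shapes, and the self-attention usage at the end are illustrative assumptions, not a reference implementation.

```python
import numpy as np

def scaled_dot_product_attention(Q, K, V):
    # Each row of Q asks "which positions matter to me?", each row of K
    # answers, and V carries the information that is actually passed along.
    d_k = Q.shape[-1]
    scores = Q @ K.T / np.sqrt(d_k)               # query-key similarity, scaled
    scores -= scores.max(axis=-1, keepdims=True)  # subtract max for stability
    weights = np.exp(scores)
    weights /= weights.sum(axis=-1, keepdims=True)  # softmax over positions
    return weights @ V                            # weighted average of values

# Toy example: a sequence of 4 positions, each an 8-dimensional vector.
rng = np.random.default_rng(0)
x = rng.normal(size=(4, 8))
out = scaled_dot_product_attention(x, x, x)       # self-attention: Q = K = V
print(out.shape)                                  # (4, 8)
```

Each output row is a mixture of all the value vectors, with the mixing weights determined by how strongly that position attends to every other position; this is how the model relates distant parts of a sequence in a single step.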
During training, transformers process all positions of a sequence in parallel rather than step by step, which speeds up computation on modern parallel hardware such as GPUs. They form the basis for many modern AI systems, including large language models (LLMs).
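The following sketch contrasts the two styles of computation. The RNN-style recurrence must process positions one after another because each step depends on the previous hidden state, while the transformer-style attention covers every position with a few matrix products. All weights and shapes here are random toy values chosen for illustration.

```python
import numpy as np

rng = np.random.default_rng(0)
seq_len, d = 6, 4
x = rng.normal(size=(seq_len, d))

# RNN-style recurrence: step t needs the hidden state from step t-1,
# so the loop over positions is inherently sequential.
W_h = rng.normal(size=(d, d))
W_x = rng.normal(size=(d, d))
h = np.zeros(d)
for t in range(seq_len):
    h = np.tanh(h @ W_h + x[t] @ W_x)

# Transformer-style self-attention: matrix products over the whole
# sequence at once, so the work parallelizes across positions.
W_q, W_k, W_v = rng.normal(size=(3, d, d))
Q, K, V = x @ W_q, x @ W_k, x @ W_v
scores = Q @ K.T / np.sqrt(d)
weights = np.exp(scores) / np.exp(scores).sum(axis=-1, keepdims=True)
out = weights @ V        # all positions computed together
```

Both paths produce one vector per position, but only the second can hand the entire sequence to the hardware as a single batched operation, which is the source of the training speedup.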