Timeline of AI and language models

Alan’s work guides official AI docs for Microsoft, Harvard, MIT, UN, G7...^†
Get The Memo.

Next in 2026

Selected major milestones in AI development of post-2020 large language and multimodal models (less focus on text-to-image models). Western models mostly (less focus on China).

See a more comprehensive view of all model highlights—including counts for parameters and tokens—in the Models Table.

1947

Turing lecture

First public lecture (London, 1947) to mention computer intelligence. Turing said: ‘What we want is a machine that can learn from experience… the possibility of letting the machine alter its own instructions provides the mechanism for this.’ A few months later, he introduced many of the central concepts of AI in an unpublished paper: Intelligent Machinery. Britannica

[bold_timeline_item_button title=”Expand” style=”” shape=”” color=”” size=”inline” url=”#” el_class=”bold_timeline_group_button”]

1950

Turing test (paper)

Read the paper

[bold_timeline_item_button title=”Expand” style=”” shape=”” color=”” size=”inline” url=”#” el_class=”bold_timeline_group_button”]

1956

‘Artificial intelligence’ coined by Minsky et al

Read the article by Dartmouth, USA

[bold_timeline_item_button title=”Expand” style=”” shape=”” color=”” size=”inline” url=”#” el_class=”bold_timeline_group_button”]

1966

ELIZA (chatbot)

MIT

Read the Wiki article

[bold_timeline_item_button title=”Expand” style=”” shape=”” color=”” size=”inline” url=”#” el_class=”bold_timeline_group_button”]

2011

February

Watson (system)

IBM

Appeared on Jeopardy! against champions Brad Rutter and Ken Jennings, winning the first place prize of $1m.

Read my comparison with GPT-3.

[bold_timeline_item_button title=”Expand” style=”” shape=”” color=”” size=”inline” url=”#” el_class=”bold_timeline_group_button”]

2017

August

Transformer (architecture)

Google

Read the Google blog

[bold_timeline_item_button title=”Expand” style=”” shape=”” color=”” size=”inline” url=”#” el_class=”bold_timeline_group_button”]

2018

January

ULMFit 34M (model)

fast.ai

Read the paper

June

GPT-1 117M (model)

OpenAI

Read the paper

October

BERT 340M (model)

Google

Read the paper

[bold_timeline_item_button title=”Expand” style=”” shape=”” color=”” size=”inline” url=”#” el_class=”bold_timeline_group_button”]

2019

February

GPT-2 1.5B (model)

OpenAI

Read the paper

October

BERT used for search

Google

Read the Google blog

[bold_timeline_item_button title=”Expand” style=”” shape=”” color=”” size=”inline” url=”#” el_class=”bold_timeline_group_button”]

2020

January

Meena 2.6B (chatbot model)

Google

Read the Google blog

April

BlenderBot 1.0 (chatbot model)

Facebook

Read the Facebook blog

May

GPT-3 175B (model)

OpenAI

Read the paper
Alan’s analysis

September

GPT-3 writes a newspaper column

The Guardian/OpenAI

Read the article

[bold_timeline_item_button title=”Expand” style=”” shape=”” color=”” size=”inline” url=”#” el_class=”bold_timeline_group_button”]

2021

January

The Pile v1 (dataset)

EleutherAI

Read the EleutherAI blog

March

Wudao 1.0 (model)

BAAI

Read the paper

April

The GPT-3 Leta AI video series

LifeArchitect.ai

LifeArchitect.ai/Leta
Archive.org/details/leta-ai

June

GPT-J-6B (model)

EleutherAI

See the GitHub repo

June

LaMDA 137B (chatbot model)

Google

Read the Google blog
Alan’s analysis

June

Wudao 2.0 (model)

BAAI

Read the paper

June

M6 1T – MultiModality-to-MultiModality Multitask Mega-transformer (sparse model)

Alibaba Dharma Academy

Read the release (Chinese)

August

Jurassic-1 178B (model)

AI21

Read the paper

October

Megatron-Turing NLG 530B (model)

NVIDIA + Microsoft

November

M6 10T – MultiModality-to-MultiModality Multitask Mega-transformer (sparse model)

Alibaba Dharma Academy

Read the release (Chinese)

November

BERT 480B & 200B (model)

Google

Read the release, 2

December

52B (model)

Anthropic

Read the paper

December

GLaM 1.1T (model)

Google inc

Read the Google blog

December

Gopher 280B (model)

Google AI

Read the paper

December

ERNIE 3.0 Titan 260B (model)

Baidu

Read the paper

[bold_timeline_item_button title=”Expand” style=”” shape=”” color=”” size=”inline” url=”#” el_class=”bold_timeline_group_button”]

2022

March

Chinchilla 70B (model)

DeepMind

Read the paper
Alan’s analysis

March

BLOOM – tr11-176B-ml (model)

BigScience

See the repo

April

PaLM 540B (model)

Google Inc

Read the Google blog
Alan’s analysis

April

Flamingo (Chinchilla 70B + 10B visual model)

DeepMind

Read the blog + paper

May

OPT-175B (model)

Meta AI

Read the paper

May

LaMDA 2 137B (chatbot model)

Google AI

Watch the launch video

May

Gato (Cat) 1.18B (general model)

DeepMind

Read the paper

November

GPT-3.5 – text-davinci-003 (model)

OpenAI

Alan’s analysis

November

ChatGPT (model)

OpenAI

Read the blog
Alan’s analysis

December

RT-1 35M (general model)

Google

Read the paper

December

RL-CAI 52B (model)

Anthropic

Read the paper

December

OPT-IML 175B (model)

Meta AI

Read the paper

[bold_timeline_item_button title=”Expand” style=”” shape=”” color=”” size=”inline” url=”#” el_class=”bold_timeline_group_button”]

2023

February

LLaMA-65B (model)

Meta AI

Read the paper

March

Alpaca 7B (model)

Stanford

Read the release

March

GPT-4 1.76T (model)

OpenAI

Read the paper,
Alan’s analysis

May

PaLM 2 340B (model)

Google

Read the paper

June

phi-1 1.3B (model)

Microsoft

Read the paper

June

Inflection-1 (model)

Inflection AI

Read the paper

July

Claude 2 (model)

Anthropic

Model	Months since last release
GPT-4 Mar/2023	14m	14 months
GPT-3 2022 text-davinci-002 Jan/2022	20m	20 months
GPT-3 May/2020	15m	15 months
GPT-2 Feb/2019	8m	8 months
GPT-1 Jun/2018	Baseline

Next in 2026

1947

Turing lecture

1950

Turing test (paper)

1956

‘Artificial intelligence’ coined by Minsky et al

1966

ELIZA (chatbot)

2011

Watson (system)

2017

Transformer (architecture)

2018

ULMFit 34M (model)

GPT-1 117M (model)

BERT 340M (model)

2019

GPT-2 1.5B (model)

BERT used for search

2020

Meena 2.6B (chatbot model)

BlenderBot 1.0 (chatbot model)

GPT-3 175B (model)

GPT-3 writes a newspaper column

2021

The Pile v1 (dataset)

Wudao 1.0 (model)

The GPT-3 Leta AI video series

GPT-J-6B (model)

LaMDA 137B (chatbot model)

Wudao 2.0 (model)

M6 1T – MultiModality-to-MultiModality Multitask Mega-transformer (sparse model)

Jurassic-1 178B (model)

Megatron-Turing NLG 530B (model)

M6 10T – MultiModality-to-MultiModality Multitask Mega-transformer (sparse model)

BERT 480B & 200B (model)

52B (model)

GLaM 1.1T (model)

Gopher 280B (model)

ERNIE 3.0 Titan 260B (model)

2022

Chinchilla 70B (model)

BLOOM – tr11-176B-ml (model)

PaLM 540B (model)

Flamingo (Chinchilla 70B + 10B visual model)

OPT-175B (model)

LaMDA 2 137B (chatbot model)

Gato (Cat) 1.18B (general model)

GPT-3.5 – text-davinci-003 (model)

ChatGPT (model)

RT-1 35M (general model)

RL-CAI 52B (model)

OPT-IML 175B (model)

2023

LLaMA-65B (model)

Alpaca 7B (model)

GPT-4 1.76T (model)

PaLM 2 340B (model)

phi-1 1.3B (model)

Inflection-1 (model)

Claude 2 (model)

Llama 2 70B (model)

Falcon 180B (model)

ERNIE 4.0 (model)

Grok-1 314B (model)

Gemini (model)

2024

Sora (world model)

Gemini 1.5 (model)

Claude 3 Opus (model)

Llama 3 70B (model)

phi-3 14B (model)

Nemotron-4-340B (model)

Claude 3.5 Sonnet (model)

Llama 3.1 405B (model)

Grok-2 (model)

o1 (model)

Claude with computer use (model)

Quantity of AI-generated articles surpasses human-written articles (finding Nov/2024, published Nov/2025)