AI Report Card

Download annotated papers (latest first)

| Model | Download annotated paper |
| --- | --- |
| Google DeepMind Gemini | Download paper (PDF). Available exclusively to full subscribers of The Memo. |
| Google Med-PaLM 2 | Download paper (PDF). Available exclusively to full subscribers of The Memo. |
| OpenAI GPT-4 | Download paper (PDF). Available exclusively to full subscribers of The Memo. |

Download report cards (latest first)

Designed in 2022, the LifeArchitect.ai report card provides a standard template for assessing new large language models and multimodal models.

| Model | Lab | Date | Grade/ALScore | Download |
| --- | --- | --- | --- | --- |
| Gemini 1.5T | Google DeepMind | Dec/2023 | A- (22.4) | Gemini report card (PDF) |
| Llama 2 70B | Meta AI | Jul/2023 | B+ (1.2) | Llama 2 report card (PDF) |
| PaLM 2 340B | Google | May/2023 | PX (3.7) | PaLM 2 report card (PDF). Updated to show poor performance on SuperGLUE=86.4% |
| GPT-4 1T | OpenAI | Mar/2023 | PX (14.9) | GPT-4 report card (PDF) |
| LLaMA-65B | Meta AI | Feb/2023 | B- (1.0) | LLaMA-65B report card (PDF) |
| Galactica 120B | Meta AI | Nov/2022 | B- (0.8) | GAL 120B report card (PDF) |
| AlexaTM 20B | Amazon Alexa AI | Aug/2022 | C (0.5) | Amazon AlexaTM 20B report card (PDF) |
| PaLM 540B | Google | Jul/2022 | A (2.2) | Google PaLM report card (PDF) |
| InstructGPT | OpenAI | Jul/2022 | B- (-) | OpenAI InstructGPT report card (PDF) |

Template

Report Card Template (PDF)

The report card template is open source and available for download.



Dr Alan D. Thompson is an AI expert and consultant, advising Fortune 500s and governments on post-2020 large language models. His work on artificial intelligence has been featured at NYU, with Microsoft AI and Google AI teams, at the University of Oxford’s 2021 debate on AI Ethics, and in the Leta AI (GPT-3) experiments viewed more than 4.5 million times. A contributor to the fields of human intelligence and peak performance, he has held positions as chairman for Mensa International, consultant to GE and Warner Bros, and memberships with the IEEE and IET.

This page last updated: 7/Feb/2024. https://lifearchitect.ai/report-card/