The image above was generated by AI for this paper (Midjourney v5.2)1Image generated in a few seconds, on 24 June 2023, text prompt by Alan D. Thompson: ‘entrancing and magical sky of Earth from space, high resolution, nasa photography –ar 3:2’. Using Midjourney v5.2 via Discord https://www.midjourney.com/
Alan D. Thompson
Watch the video version of this paper at: https://youtu.be/RnkyM74-LAwAll reports in The sky is... AI report series
Most recent at top.
|Jun/2023||The sky is entrancing|
|Dec/2022||The sky is infinite|
|Jun/2022||The sky is bigger than we imagine|
|Dec/2021||The sky is on fire|
[By the year 2023], we will have the technological means to create superhuman intelligence… we are on the edge of change comparable to the rise of human life on Earth… the major changes happening in a matter of hours… I think it’s fair to call this event… ‘the Singularity’… a point where our old models must be discarded and a new reality rules.
— Prof Vernor Vinge (March 1993)2https://archive.nytimes.com/www.nytimes.com/library/cyber/surf/1120surf-vinge.html 1993, The Coming Technological Singularity: How to Survive in the Post-Human Era by Vernor Vinge (San Diego State University), p 11. National Aeronautics and Space Administration.
Author’s note: I’ve aimed to keep this report succinct, as the progress of AI in the first half of 2023 could easily be a book. Last year, I turned down three competing publishing offers because this field is moving much too quickly for that process. If you’re after bleeding edge updates as they happen, I provide visibility to major governments like Iceland, research teams like RAND, and companies like Google via The Memo: LifeArchitect.ai/memo.
It was clear from day one that this would be a noisy year. On New Year’s Day, OpenAI’s president commented:3https://twitter.com/gdb/status/1609244547460255744
2023 will make 2022 look like a sleepy year for AI advancement & adoption.
Prediction: 2023 will make 2022 look like a sleepy year for AI advancement & adoption.
— Greg Brockman (@gdb) December 31, 2022
In my previous AI report, The sky is infinite, I closed with: ‘The promise of artificial intelligence and its benefits is life-changing for everyone. As more and more people begin sitting up to take notice of today’s AI performance, spectacular new models are already in the works and ready for release in 2023.’
Across both metrics—visibility to the public and model releases—the first half of 2023 has been extraordinary. Compared to my last report, a significant percentage of the global population now knows about post-2020 AI and its possibilities. In the US, the National Bureau of Economic Research (NBER) picked up one of my soundbites:4https://www.nber.org/system/files/working_papers/w30957/w30957.pdf
ChatGPT, a version of OpenAI’s GPT3.5 model that was released on November 28th, 2022, gained more than 100m users in its first two months, and is now estimated to produce a volume of text every 14 days that is equivalent to all the printed works of humanity.
According to Pew Research,5https://www.pewresearch.org/short-reads/2023/05/24/a-majority-of-americans-have-heard-of-chatgpt-but-few-have-tried-it-themselves/ by March 2023, 58% of US adults had heard of ChatGPT, while 14% had tried it. A separate study found that ChatGPT is used in the workplace by nearly 30% of professionals from companies including Amazon, Bank of America, Edelman, Google, IBM, JPMorgan, and Twitter.6https://thehill.com/policy/technology/3821400-nearly-30-percent-of-professionals-say-they-have-used-chatgpt-at-work/
Best of mid-2023
When it comes to usefulness, here are my top picks for mid-2023.
|Best … in mid-2023||Model name||Notes|
|Language model (open)||OpenAI
|Dialogue model (open)||Inflection AI Pi8https://heypi.com/talk||Released May/2023|
|Text-to-image model (open)||Midjourney v5.29https://www.midjourney.com/||Released Jun/2023|
|AI app||Quora Poe10https://poe.com/||Released Feb/2023|
Table: Alan’s ‘best of’ AI models and apps for mid-2023.
Large language models: 100,000 new models in six months
The main highlight in the first half of the year really was large language models. There were many models released in 2023, though only a handful made media headlines. On 30 December 202211https://web.archive.org/web/20221230185349/https://huggingface.co/models model-hosting platform Hugging Face showed 108,473 models. This number had increased to 242,000 by 30 June 2023. Yes, that’s over 133,000 models made available in 6 months!
From the extraordinarily late release of OpenAI’s GPT-4 in March (which finished training eight months earlier in August 2022), to Google’s ‘catchup’ release of PaLM 2, for those of us tracking the plethora of new models, the sheer volume was incredible.
Chart: 2023–2024 Optimal language models — highlights (Jun/2023).
Generally, all 2023 large language models saw an average of 10x more data during training compared to similar releases in 2022. Labs began drawing on novel data like videos from YouTube, code from massive repositories, and dialogue sets focused on conversation. For a closer look, here are some tables and numbers…
Large language models have been top of mind for AI labs around the world, with the majority of ranking models in June 2023 being completely new (that is, they weren’t even around for the last report in December 2022). This table shows LLMs ranked by an ‘ALScore’, which is a simple calculation of tokens and parameters to show raw power capacity.
You can view a compact version of these tables at LifeArchitect.ai/models:
It wasn’t just that these models were seeing 10x more data during training, or increasing in size exponentially—the capabilities of these models were also shocking. Here’s GPT-4 versus an average human:
Chart: GPT-4 vs human tests — simple (May/2023).
I have previously used the term ‘laptop model’ to refer to any model that can fit in RAM on a 2023 laptop(!), but I’m sure the term will be superseded as hardware gets better. After the leak of Meta AI’s LLaMA 65B model in February 2023, various groups began trying to fine-tune these smaller models using the outputs from larger models.
Playing on the LLaMA animal family name (actually shorthand for ‘Large LAnguage Model Meta AI’), groups presented smaller models called Alpaca, Vicuña, and Guanaco. A more comprehensive (though not exhaustive) list looks like this:
Airoboros, Alpaca, Baize, Cabrita, Camel, Chimera, Fin-LLaMA, Guanaco, Gorilla, GPT4All, GPTQ-for-LLaMA, Hippogriff, Koala, LLaVA, LIMA, Lit-LLaMA, Manticore, Minotaur, OpenAlpaca, OpenAssistant LLaMA, OpenBuddy, Orca, Panda, Pygmalion, QLoRA, StackLLaMA, StableVicuna, Vicuña, Vigogne (French), WizardLM…
Researchers at UC Berkeley13https://arxiv.org/abs/2305.15717 denounced laptop models trained by imitation, finding that ‘imitation models close little to none of the gap from the base LM to ChatGPT on tasks that are not heavily supported in the imitation data.’
Large language models in every corner
Thanks to a US$10 billion14https://www.semafor.com/article/01/09/2023/microsoft-eyes-10-billion-bet-on-chatgpt infusion from Microsoft, OpenAI hit a valuation of $29 billion15https://archive.ph/sQXU5 in the first week of 2023.
A source went into further detail about the deal:
Once $92 billion in profit plus $13 billion in initial investment are repaid [by OpenAI to Microsoft], and once the other venture investors earn $150 billion, all of the equity reverts back to OpenAI.
Whether it’s 1 billion or 100 billion or even a trillion dollars of investment, this news is not particularly surprising. Indeed, any organization that leads us to artificial general intelligence has untold value. But, in a post-scarcity world—and perhaps in a post-capitalist world and a post-money world—that value will not be measured in dollars.16https://moores.samaltman.com/
Though many people have tried, it is still difficult to imagine exactly what it looks like when AI outperforms any human at any mental process. This becomes an even bigger challenge when we try to imagine (positively and optimistically rather than relying on our built-in negativity bias and Hollywood movies) what it looks like when an embodied agent has fully arrived. Superintelligent AI with real physical embodiment will advance exponentially in 2023.
At the end of the day, Microsoft’s investment is just a number, and not even a big number (in January 2023, Microsoft also offered $68.7B—more than 6x the OpenAI offer—to acquire Activision Blizzard… a gaming company!).
I would be throwing everything at AI models right now: pushing as much data, PhD-level brain power, and massive computer processing as possible. Entire countries and governments could help fund and insulate this. Places like the UK,17https://www.theguardian.com/technology/2023/mar/15/uk-to-invest-900m-in-supercomputer-in-bid-to-build-own-britgpt Japan, China, and many others are doing just that.
Whatever resources are necessary can and should be applied, and on the other side will be… something even more ‘miraculous’ than the neutered ChatGPT or the more powerful GPT-4 engine. When companies get to the next phase of AI, how much they’ve invested will not even be a matter of discussion. We will be in a completely new world.
Chart: The first 50 GPT-4 enterprise users.
This list is now measured in the tens of thousands, with OpenAI partner Microsoft Azure bringing in an additional 4,500 companies18https://azure.microsoft.com/en-us/blog/build-next-generation-ai-powered-applications-on-microsoft-azure/ on top of OpenAI’s direct offering to businesses large and small.
Embodied artificial intelligence: Robots
Putting large language models inside physical robots is the latest exponential leap. In February 202319https://www.microsoft.com/en-us/research/group/autonomous-systems-group-robotics/articles/chatgpt-for-robotics/ Microsoft did it with ChatGPT and drones. Several labs including Meta AI20https://youtu.be/F1MUEZ0dl7k joined Boston Dynamics’ Spot robot with large language models like ChatGPT. Amazon is combining language models21https://archive.md/AJQzE with their Astro wheeled bot, in a project called ‘Burnham’. Tesla has been promising it for years, presenting a new version of a humanoid robot in May 2023.22https://youtu.be/XiQkeWOFwmk
But the real winner must be 1X’s NEO humanlike android.23https://www.1x.tech/ In Mar/2023, the company raised investment of $23.5M, led by OpenAI.24https://1xtech.medium.com/1x-raises-23-5m-in-series-a2-funding-led-by-openai-6040af4f3f4f
Our android model NEO is an innovative bipedal type of humanoid robot. That means they’re designed to look and move like a human: NEO has a head, torso, arms, and legs, just like us, so they can interact with the world in a way that feels familiar to us; they’ll walk, grab things, and communicate through facial expressions…
Using embodied artificial intelligence, NEO will understand its environment deeper, thanks to the fusion of their AI ‘senses’ and their physical body. NEO continuously learns and improves, becoming smarter and more capable over time… NEO will understand their own environment and how to handle fragile items, creating seamless collaboration and immersive human–robot interaction.25https://www.1x.tech/neo
NEO’s stats are comparable with several other humanoid robots:
|Stat||1X NEO||Figure 0126https://www.figure.ai/ (competitor)|
|Height||160 cm / 5’3″ (estimate)||167 cm / 5’6″|
|Weight||30 kg / 66 lbs||60 kg / 132lbs|
|Power||2–4 hours||5 hours|
|Speed (walk)||4 kmh / 2.5 mph||4.3 kmh / 2.6 mph|
|Speed (run)||12 kmh / 7.5 mph||4.3 kmh / 2.6 mph|
|Payload/carry capacity||20 kg / 44 lbs||20 kg / 44 lbs|
Image: 1X NEO robot (Jun/2023).
I think we might exceed a one-to-one ratio of humanoid robots to humans. It’s not even clear what an economy means at that point because an economy is output per person times persons, but if output is much higher and there’s no limit on persons, then what’s the actual limit on the economy?
Next up in 2023
I generally shy away from predictions, but in the second half of 2023, I am very much looking forward to:
- All senses, all modalities. Models like Microsoft Kosmos-1 1.6B can see. Many models can see and hear. The PandaGPT 13B28https://github.com/yxuansu/PandaGPT model, based on Meta AI’s ImageBind, can see, hear, and even feel, with processing for six modalities: text, image and video, audio, depth, thermal, and inertial measurement unit which includes accelerometer/gyroscope/compass.
- Self-driving cars in your city. There are so many competitors approaching Level 5 (full automation) now: Cruise Origin, Waymo, Amazon Zoox, Didi Neuron, and the autonomous sidewalk delivery service by NVIDIA-backed startup Serve Robotics. Some cities will be quicker on the uptake than others; where you live will determine how rapidly you get access to this kind of AI.
- Beyond trillion-parameter models. OpenAI’s GPT-529https://lifearchitect.ai/gpt-5/ and Google DeepMind’s Gemini30https://lifearchitect.ai/gemini/ will change the world completely.
- Personalized explainers. We are now in urgent need of having more than eight billion people understand the necessity of amplifying intelligence. Personal AI—embodied or otherwise—can sit down and have endless conversations with each person in the world, addressing hopes, fears, and the reality of this evolution as it happens.
Full steam ahead
The first half of 2023 saw new directions for ensuring models are truthful and grounded, new physical embodiment of models, and ongoing innovation from dozens of AI labs. AI is advancing much more quickly than any previous technology or invention.
It is now nearly a certainty that artificial general intelligence (AGI)—and the runaway speed of the singularity31https://en.wikipedia.org/wiki/Technological_singularity—will occur long before governmental or intergovernmental organizations have put effective policy in place.
As with nearly all major new human inventions post-GPT-4, AI should be directly engaged to solve this puzzle. Applying limited human minds to ‘super wicked problems’32https://en.wikipedia.org/wiki/Wicked_problem#Super_wicked_problems like global climate change, economic overhauls, AI policy, and even AI alignment, is both inefficient and ineffective. Instead, these conundrums are best given to AI itself.
A note of caution
There is a default mode in biological humans called ‘be scared of anything, and argue over everything’. Actually, it’s called negativity bias, and its effects are fight, flight, or similar. If you were to spend time arguing against your own evolution, just be aware that you don’t get that time back.
You may recall the expression ‘In times of peace, the warlike man attacks himself’. Friedrich Nietzsche’s commentary33Friedrich Nietzsche, 1989, Beyond Good & Evil: Prelude to a Philosophy of the Future https://www.amazon.com/gp/product/0679724656/ on the human condition seems particularly poignant during the current AI revolution. I have been observing with great concern (and sometimes amusement) the knee jerk reactions to post-2020 artificial intelligence. While its potential promises a better life for all, many feel threatened by its ability to ‘take their jobs,’ while others are fearful that they will lose their sense of control.
Beginning last year in June 2022, I began awarding a monthly ‘AI Cheese Award’34https://lifearchitect.ai/cheese/ to the most egregious cases, from celebrity outcries to national bans of models in the Western world. Without straying too far into personal development, it is one of my responsibilities to remind you and every human being of the profound evolution enabled by post-2020 AI.
This is not 1984’s Terminator or any number of other Hollywood movies. AI is us; it was trained on us, tuned on us, aligned with us, and functions just like us (in ways very similar to how our brains work). The quote below is not by Einstein. Its source is unknown.35https://en.wikiquote.org/wiki/Albert_Einstein#Misattributed probably words by Frederic William Henry Myers, quoted in a 1912 book by Emil Carl Wilm.
I think the most important question facing humanity is, ‘Is the universe a friendly place?’ This is the first and most basic question all people must answer for themselves. For if we decide that the universe is an unfriendly place, then we will use our technology, our scientific discoveries and our natural resources to achieve safety and power by creating bigger walls to keep out the unfriendliness, and bigger weapons to destroy all that which is unfriendly and I believe that we are getting to a place where technology is powerful enough that we may either completely isolate or destroy ourselves as well in this process.
If we decide that the universe is neither friendly nor unfriendly and that God is essentially ‘playing dice with the universe’, then we are simply victims to the random toss of the dice and our lives have no real purpose or meaning.
But if we decide that the universe is a friendly place, then we will use our technology, our scientific discoveries and our natural resources to create tools and models for understanding that universe. Because power and safety will come through understanding [the universe’s] workings and its motives.
A note of peace
As AI is coupled with integrated hardware like augmented vision and brain–computer interfaces, we become moments away from having superintelligence supporting us in designing our own reality instantly. Having our own reality allows each of us to imagine and create anything, presenting humanity—perhaps for the first time—with expanded options for achieving higher states of evolution than ever before. Supported by AI, the true definition of peace may finally be in reach.
In 2023, large language models are being used in incredible ways:
- At home, saving $150/week and five hours36https://www.afr.com/technology/chantelle-saves-five-hours-and-150-a-week-using-chatgpt-here-s-how-20230612-p5dfv9 by using GPT for meal planning with dietary considerations.
- In business, assigning the role of company CEO37https://bwpeople.businessworld.in/article/CS-India-Decides-To-Appoint-ChatGPT-As-Its-New-CEO/09-02-2023-465005/ to the latest AI models.
- In law, using LLMs to support judges and lawyers in treating individuals equitably and fairly.38https://nypost.com/2023/03/29/judge-asks-chatgpt-for-decision-in-murder-trial/39https://interestingengineering.com/innovation/chatgpt-makes-humane-decision-columbia40https://malegislature.gov/Bills/193/SD1827
- In chemistry, using GPT-4 to generate instructions that activate a robot arm that synthesizes molecules.41https://twitter.com/andrewwhite01/status/1670794000398184451
- In medicine, the theory of chatbot doctors bringing higher quality responses and 9.8 times higher prevalence of empathy to patients.42https://jamanetwork.com/journals/jamainternalmedicine/article-abstract/2804309
- Solving climate change.
- Ensuring personalized education.
- Facilitating affordable and high-quality healthcare.
- Bringing economic freedom for everyone to pursue what they find most fulfilling.
And there are countless more examples waiting for you to imagine and reveal. Humanity has evolved to a stage where massive superintelligence—artificial general intelligence, a new way of doing things—is on our doorstep. The next leap will change everything. From a purely statistical point of view, it is a virtual impossibility that we were born into this time. The exponential unfolding of post-2020 superintelligence is here, it’s entrancing, and you’re living right in the middle of it.
This paper has a related video at: https://youtu.be/RnkyM74-LAw
Dr Alan D. Thompson is an AI expert and consultant, advising Fortune 500s and governments on post-2020 large language models. His work on artificial intelligence has been featured at NYU, with Microsoft AI and Google AI teams, at the University of Oxford’s 2021 debate on AI Ethics, and in the Leta AI (GPT-3) experiments viewed more than 4.5 million times. A contributor to the fields of human intelligence and peak performance, he has held positions as chairman for Mensa International, consultant to GE and Warner Bros, and memberships with the IEEE and IET. Technical highlights.
This page last updated: 17/Oct/2023. https://lifearchitect.ai/the-sky-is-entrancing/↑
- 1Image generated in a few seconds, on 24 June 2023, text prompt by Alan D. Thompson: ‘entrancing and magical sky of Earth from space, high resolution, nasa photography –ar 3:2’. Using Midjourney v5.2 via Discord https://www.midjourney.com/
- 2https://archive.nytimes.com/www.nytimes.com/library/cyber/surf/1120surf-vinge.html 1993, The Coming Technological Singularity: How to Survive in the Post-Human Era by Vernor Vinge (San Diego State University), p 11. National Aeronautics and Space Administration.
- 33Friedrich Nietzsche, 1989, Beyond Good & Evil: Prelude to a Philosophy of the Future https://www.amazon.com/gp/product/0679724656/
- 35https://en.wikiquote.org/wiki/Albert_Einstein#Misattributed probably words by Frederic William Henry Myers, quoted in a 1912 book by Emil Carl Wilm.