ChatGPT launches
Dialogue-tuned AI goes mainstream overnight, resetting expectations for search, productivity and customer service in one stroke.
The Story of AI So Far
On 30 November 2022, a chatbot demo changed the trajectory of software. This is the journey from that moment to today's multimodal, agentic, regulation-shaped AI: the milestones that mattered, why they mattered, and what they mean for your business.
Chapter zero
AI research had been accelerating for a decade, but it lived in papers and demos. ChatGPT did something different: it put a frontier model behind a text box anyone could use. One hundred million people arrived within months, the fastest adoption of any consumer product in history.
What followed wasn't just bigger models. It was a category change: AI became a platform, woven into search, browsers, code editors, phones and business workflows. The story below tracks that transformation in four chapters.
2022 – 2023
On 30 November 2022, ChatGPT turned a research curiosity into the fastest-adopted consumer product in history. Within a year, AI had moved from the lab into search engines, browsers and boardrooms, and the race was on.
Dialogue-tuned AI goes mainstream overnight, resetting expectations for search, productivity and customer service in one stroke.
The first major search and browser integration signals that AI will live inside existing software, not just separate chat apps.
The first widely deployed multimodal model raises the ceiling on reasoning and reliability, and accepts images, not just text.
AI learns to use tools. The design pattern behind every modern agent, automation and structured workflow starts here.
A serious open-weight model family becomes free for commercial use, igniting the open-source AI ecosystem.
A compact European model outperforms its size class, shifting the conversation to efficiency and deployability.
Twenty-eight governments put frontier AI safety on the geopolitical agenda for the first time.
Google repositions its entire model stack around a natively multimodal family: Ultra, Pro and Nano.
2024
Models stopped being chatbots. They learned to see, hear, speak in real time, read entire books in one go. And the law finally caught up, with the EU AI Act becoming the first binding framework in a major market.
Million-token context windows turn models from short-form assistants into document and codebase workers.
The Haiku / Sonnet / Opus tiering arrives: pick your balance of speed, cost and intelligence.
Proof the AI wave is bigger than chat: model advances begin reshaping drug discovery and biology itself.
Live voice and vision interaction makes AI feel less like typing into a box and more like talking to a colleague.
The first large-scale, binding legal framework for AI. Governance shifts from principles to obligations.
AI moves from describing interfaces to operating them: clicking, typing and completing tasks on screen.
Source-linked web search lands inside the chat box, and the line between chatbot and search engine dissolves.
2025
The market stopped asking "what can it write?" and started asking "what can it do?" Reasoning models, coding agents and open challengers from unexpected places made AI that plans, acts and verifies the new normal.
A fully open reasoning model at a fraction of expected cost forces every lab to rethink its economics.
Coding agents enter real developer workflows; models learn to decide when to think hard and when to answer fast.
Agent development gets standard building blocks: unified tools, state and web search out of the box.
Open-weight, natively multimodal mixture-of-experts models push context length and efficiency higher.
Long-running autonomous tasks and agent workflows become the headline capability, not the demo.
General-purpose model rules take effect: compliance becomes a real product constraint, not a future worry.
2026
Agentic AI is no longer a frontier feature. It is the product. Million-token context, computer use and any-input-to-any-output generation are becoming standard, and the question for businesses has changed from "should we?" to "how, exactly?"
Agentic coding, planning and million-token context windows become standard frontier features, not headlines.
"Agentic Gemini": stronger action-taking and any-input-to-any-output media generation, announced to 8.5 million monthly developers.
Command A+ and Mistral Medium 3.5 make multilingual, self-hostable agentic AI practical without hyperscaler compute.
The bigger picture
Models stopped being text-only. Today's systems see images, hear speech, hold live conversations and generate any output from any input. The interface barrier between humans and software is dissolving.
Context windows grew from a few pages to over a million tokens. AI went from answering questions to reading your entire contract pile, codebase or case history in one sitting.
Tool use, function calling and computer use turned models from talkers into doers: planning, searching, clicking, executing and reporting back. This is the shift that makes business automation real.
Efficiency breakthroughs (sparse architectures, small high-performance models, prompt caching) collapsed the cost of serving powerful AI. Capability that cost pounds per query in 2023 now costs fractions of a penny.
Open-weight models from Meta, Mistral, DeepSeek and others created real choices: on-premise deployment, data residency, customisation and freedom from any single vendor. Picking well is now a strategic decision.
So what?
The winners of this era aren't the companies with the cleverest prompts. They're the ones who turn fast-moving capability into reliable systems: lead capture that never sleeps, documents that process themselves, follow-up that never slips.
The technology is ready, and every milestone above proves it. The hard part is choosing the right combination of models, tools and guardrails for a real workflow, and building it so it stays reliable. That's the work we do.
Questions
Within months, AI moved from a standalone chatbot into search engines, browsers and office software. GPT-4 brought multimodality, function calling brought tool use, and by 2024–2026 the focus had shifted to agentic AI (systems that plan and act, not just answer) alongside the first binding regulation such as the EU AI Act.
GPT-4 accepted images in March 2023, but the mainstream moment came across 2024: Google's Gemini family was natively multimodal, Claude 3 added vision, and GPT-4o made real-time voice and vision interaction a standard consumer experience.
Meta's Llama family, Mistral's models, Google's Gemma and DeepSeek's R1 are the best-known open-weight releases. "Open-weight" is the precise term: the weights are downloadable, but licence terms vary, which matters when choosing models for commercial deployment.
The Act entered into force in August 2024, with prohibited-practice rules applying from February 2025 and general-purpose AI obligations from August 2025. Most UK businesses using AI tools face indirect effects through suppliers, but anyone deploying AI in or into the EU should understand their risk category.
Manchester PC
AI assistant. A human confirms every booking
AI assistant, so it can make mistakes. Anything important: 0161 660 0356