,

The History of Open Source: From Linux to Llama

history-of-open-source-featured

Learn Our Proven AI Frameworks

Beginners in AI created 6 branded frameworks to help you master AI: STACK for prompting, BUILD for business, ADAPT for learning, THINK for decisions, CRAFT for content, and CRON for automation.

The Deeper Context: Why AI History Matters for Understanding Today’s Technology

Understanding the history of artificial intelligence is not just an academic exercise. The patterns, breakthroughs, and failures of AI’s past directly shape the tools, debates, and opportunities you encounter today. When you understand where AI came from, you understand why it works the way it does, why certain problems remain unsolved, and why experts make the predictions they do about where this technology is heading.

The Recurring Pattern: Hype, Winter, and Breakthrough

One of the most striking patterns in AI history is the cycle of excitement and disappointment. In the 1950s and 1960s, early AI pioneers made bold predictions that human-level AI was just around the corner. By the 1970s, progress had stalled, funding dried up, and the first “AI winter” set in. The pattern repeated in the 1980s, when expert systems generated enormous enthusiasm, followed by another crash in the early 1990s when these systems proved too brittle and expensive to maintain at scale.

Each winter ended with a genuine breakthrough that changed what was possible. The deep learning revolution that began gaining momentum around 2012 with AlexNet’s dramatic win at the ImageNet competition was one such breakthrough. The release of GPT-3 in 2020 and ChatGPT in late 2022 represent another step change. Understanding this history helps calibrate your expectations: the current wave of AI enthusiasm is backed by real capability improvements, but history also teaches us that not every promised application will materialize on schedule.

Key Figures Who Shaped Modern AI

The development of AI has been shaped by a relatively small number of visionary researchers whose ideas, often dismissed at the time, eventually proved transformative:

  • Alan Turing (1912-1954): Defined the philosophical foundations of machine intelligence with his 1950 paper “Computing Machinery and Intelligence” and the famous Turing Test
  • John McCarthy (1927-2011): Coined the term “artificial intelligence” in 1956 and organized the Dartmouth Conference that launched AI as a formal research field
  • Marvin Minsky (1927-2016): Co-founder of MIT’s AI Lab and pioneering researcher in neural networks, robotics, and cognitive science
  • Geoffrey Hinton (born 1947): Often called the “Godfather of Deep Learning,” his decades of work on neural networks laid the groundwork for modern AI; notably left Google in 2023 to speak freely about AI risks
  • Yann LeCun (born 1960): Pioneer of convolutional neural networks, which became foundational for image recognition and many modern AI systems
  • Sam Altman (born 1985): CEO of OpenAI, whose decisions about product releases like ChatGPT have shaped how billions of people first encountered modern AI

The Paradigm Shifts That Define AI Progress

AI history can be organized around a series of fundamental paradigm shifts, each representing a completely different approach to building intelligent systems. The first era was defined by rule-based systems: programmers tried to encode human knowledge as explicit logical rules. This approach had real successes, particularly in narrow domains like chess and medical diagnosis, but could not scale to the messiness of real-world environments.

The second major paradigm was statistical machine learning, which shifted the focus from hand-crafted rules to learning patterns from data. Instead of telling a spam filter what spam looks like, you showed it millions of examples of spam and let it figure out the patterns. This approach scaled much better and produced the recommendation engines, search algorithms, and fraud detection systems that quietly powered the internet through the 2000s and 2010s.

The current paradigm is deep learning and foundation models. Rather than building separate models for each task, researchers discovered that training very large neural networks on enormous amounts of data produces systems with surprisingly general capabilities. The transformer architecture, introduced in 2017, proved especially powerful for language, and the scale of modern large language models like GPT-4 and Claude represents a qualitative change from anything that came before.

What History Tells Us About the Future

The history of AI does not give us a crystal ball, but it does offer some useful lessons. First, the problems that seemed hardest to AI researchers in the early days, like playing chess or solving calculus problems, turned out to be relatively tractable once the right methods were found. Meanwhile, the things that seemed trivially easy, like understanding a sarcastic joke or navigating a crowded room, have proven remarkably difficult to solve in general ways.

This pattern, sometimes called Moravec’s Paradox, suggests we should be humble about predicting which AI capabilities will come easily and which will remain elusive. It also reinforces why the current generation of large language models, which have made surprising progress on tasks that seemed distinctly human, feels so historically significant. Whether we are at another inflection point or approaching a new period of slower progress is the central debate in AI research today, and understanding the historical precedents is essential for engaging with that debate intelligently.

The Deeper Context: Why AI History Matters for Understanding Today’s Technology

Understanding the history of artificial intelligence is not just an academic exercise. The patterns, breakthroughs, and failures of AI’s past directly shape the tools, debates, and opportunities you encounter today. When you understand where AI came from, you understand why it works the way it does, why certain problems remain unsolved, and why experts make the predictions they do about where this technology is heading.

The Recurring Pattern: Hype, Winter, and Breakthrough

One of the most striking patterns in AI history is the cycle of excitement and disappointment. In the 1950s and 1960s, early AI pioneers made bold predictions that human-level AI was just around the corner. By the 1970s, progress had stalled, funding dried up, and the first “AI winter” set in. The pattern repeated in the 1980s, when expert systems generated enormous enthusiasm, followed by another crash in the early 1990s when these systems proved too brittle and expensive to maintain at scale.

Each winter ended with a genuine breakthrough that changed what was possible. The deep learning revolution that began gaining momentum around 2012 with AlexNet’s dramatic win at the ImageNet competition was one such breakthrough. The release of GPT-3 in 2020 and ChatGPT in late 2022 represent another step change. Understanding this history helps calibrate your expectations: the current wave of AI enthusiasm is backed by real capability improvements, but history also teaches us that not every promised application will materialize on schedule.

Key Figures Who Shaped Modern AI

The development of AI has been shaped by a relatively small number of visionary researchers whose ideas, often dismissed at the time, eventually proved transformative:

  • Alan Turing (1912-1954): Defined the philosophical foundations of machine intelligence with his 1950 paper “Computing Machinery and Intelligence” and the famous Turing Test
  • John McCarthy (1927-2011): Coined the term “artificial intelligence” in 1956 and organized the Dartmouth Conference that launched AI as a formal research field
  • Marvin Minsky (1927-2016): Co-founder of MIT’s AI Lab and pioneering researcher in neural networks, robotics, and cognitive science
  • Geoffrey Hinton (born 1947): Often called the “Godfather of Deep Learning,” his decades of work on neural networks laid the groundwork for modern AI; notably left Google in 2023 to speak freely about AI risks
  • Yann LeCun (born 1960): Pioneer of convolutional neural networks, which became foundational for image recognition and many modern AI systems
  • Sam Altman (born 1985): CEO of OpenAI, whose decisions about product releases like ChatGPT have shaped how billions of people first encountered modern AI

The Paradigm Shifts That Define AI Progress

AI history can be organized around a series of fundamental paradigm shifts, each representing a completely different approach to building intelligent systems. The first era was defined by rule-based systems: programmers tried to encode human knowledge as explicit logical rules. This approach had real successes, particularly in narrow domains like chess and medical diagnosis, but could not scale to the messiness of real-world environments.

The second major paradigm was statistical machine learning, which shifted the focus from hand-crafted rules to learning patterns from data. Instead of telling a spam filter what spam looks like, you showed it millions of examples of spam and let it figure out the patterns. This approach scaled much better and produced the recommendation engines, search algorithms, and fraud detection systems that quietly powered the internet through the 2000s and 2010s.

The current paradigm is deep learning and foundation models. Rather than building separate models for each task, researchers discovered that training very large neural networks on enormous amounts of data produces systems with surprisingly general capabilities. The transformer architecture, introduced in 2017, proved especially powerful for language, and the scale of modern large language models like GPT-4 and Claude represents a qualitative change from anything that came before.

What History Tells Us About the Future

The history of AI does not give us a crystal ball, but it does offer some useful lessons. First, the problems that seemed hardest to AI researchers in the early days, like playing chess or solving calculus problems, turned out to be relatively tractable once the right methods were found. Meanwhile, the things that seemed trivially easy, like understanding a sarcastic joke or navigating a crowded room, have proven remarkably difficult to solve in general ways.

This pattern, sometimes called Moravec’s Paradox, suggests we should be humble about predicting which AI capabilities will come easily and which will remain elusive. It also reinforces why the current generation of large language models, which have made surprising progress on tasks that seemed distinctly human, feels so historically significant. Whether we are at another inflection point or approaching a new period of slower progress is the central debate in AI research today, and understanding the historical precedents is essential for engaging with that debate intelligently.

Before Open Source Had a Name

In the early days of computing, software was free — not as in “free beer,” but because it was essentially worthless without the hardware it ran on. IBM mainframes in the 1960s shipped with source code. Universities shared programmes on magnetic tapes. Researchers collaborated freely because the culture was academic, not commercial.

That changed when computers became products. In 1976, a young Bill Gates wrote an “Open Letter to Hobbyists” complaining that people were copying his BASIC interpreter without paying for it. Software was becoming property. The era of proprietary software had begun.

Not everyone accepted this new order. Richard Stallman, a brilliant and combative programmer at MIT’s Artificial Intelligence Lab, watched his colleagues’ collaborations fragment as proprietary restrictions multiplied. In 1983, he announced the GNU Project — an attempt to build a complete free Unix-compatible operating system from scratch. In 1985, he published the GNU Manifesto, arguing that software freedom was a moral imperative, not just a technical preference.

The GPL: Viral Freedom

Stallman’s most lasting contribution wasn’t the software he wrote — it was a legal document. The GNU General Public License (GPL), first published in 1989, was an act of legal judo. Copyright law normally restricts copying; the GPL used copyright to mandate it. Any software incorporating GPL-licensed code had to be released under the same terms. Proprietary companies couldn’t absorb free software and sell it closed.

Critics called this “copyleft” a virus. Advocates called it brilliance. Either way, the GPL created an expanding ecosystem of freely available software that no single company could capture. The Free Software Foundation that Stallman founded still maintains the GPL and advocates for software freedom today.

By 1991, the GNU Project had most of the tools needed for a complete operating system — compilers, editors, utilities — but lacked one crucial component: a kernel, the core that manages hardware resources. That gap was about to be filled by an annoyed Finnish student.

Linux: The Accidental Revolution

On August 25, 1991, Linus Torvalds posted to a Usenet newsgroup: “I’m doing a (free) operating system (just a hobby, won’t be big and professional like gnu) for 386(486) AT clones.” The post was one of the great understatements in technology history. The Linux kernel, released under the GPL in 1992, combined with GNU’s tools to create a complete, free, and powerful operating system.

Linux’s development model was itself revolutionary. Torvalds didn’t manage a team — he managed contributions from thousands of developers worldwide, accepting patches via email, publicly critiquing bad code, and releasing early and often. In 1997, Eric Raymond documented this approach in an influential essay called “The Cathedral and the Bazaar,” arguing that Linux’s “bazaar” model of open development produced better software than the closed “cathedral” model of traditional companies.

Today, Linux runs approximately 96% of the world’s top web servers, all of the world’s top 500 supercomputers, the Android operating system on billions of smartphones, and the cloud infrastructure of AWS, Google Cloud, and Azure. The hobby project became the most important software infrastructure on Earth.

Get Smarter About AI Every Morning

Free daily newsletter — one story, one tool, one tip. Plain English, no jargon.

Free forever. Unsubscribe anytime.

Apache, MySQL, and the LAMP Stack

Linux didn’t succeed alone. The Apache HTTP Server, released in 1995, powered the majority of websites for over a decade. MySQL provided a free database. PHP and Perl offered scripting languages. Together, these formed the LAMP stack (Linux, Apache, MySQL, PHP/Perl/Python) — the foundation on which most of the early web was built.

Wikipedia runs on LAMP. WordPress (which this very site uses) runs on LAMP. The entire blogging revolution of the 2000s was built on free, open-source tools that any developer could download, modify, and deploy without paying a cent. Open source had moved from ideology to infrastructure.

For a history of AI development that parallels this story — including how open-source values shaped AI research culture — see our history of AI.

GitHub: Social Coding Changes Everything

Linus Torvalds created a second transformative technology in 2005: Git, a distributed version control system that let developers track changes to code independently, branch freely, and merge contributions from anywhere. Git solved the collaboration problem at scale.

In 2008, GitHub launched as a hosting platform for Git repositories with a social layer: developers could “fork” any project, make changes, and submit “pull requests” proposing their changes be merged back. GitHub made contributing to open source dramatically easier and turned software development into a social activity with profiles, followers, and stars.

By the time Microsoft acquired GitHub for $7.5 billion in 2018, it hosted over 85 million repositories and had become the de facto home of open-source software. The acquisition proved that open source had won: the world’s most prominent closed-source software company had to own the platform to stay relevant.

10 Open-Source AI Lessons That Aged Well

  • Open weights compounded the AI ecosystem. Llama, Mistral, Qwen released weights. Downstream work multiplied. The compounding is real.
  • Open does not equal free of governance. Open-weight models still require governance for deployment. Open is a license decision, not a hands-off policy.
  • License gradients matter more than binary. Apache, MIT, Llama Community, restricted commercial — each license shapes downstream possibility.
  • Hugging Face became the GitHub of models. The distribution platform shapes adoption more than the technology itself. Platform effects are real.
  • Open-source AI follows similar arc to open-source software. Initial skepticism, then quality parity for many use cases, then dominance for some niches.
  • Self-hosting is the privacy moat. For sensitive data and regulated industries, self-hosted open-source AI is the only option. The category will grow.
  • Fine-tuning becomes accessible at the open layer. LoRA, QLoRA on open models democratize customization. Closed APIs cannot match this.
  • Open-source ecosystem requires sustained investment. Meta, Mistral, others fund open AI. The funding model is not settled long-term.
  • Frontier vs open gap is narrowing but persists. Open models close on benchmarks but lag on agentic and reasoning. The gap pattern echoes earlier tech.
  • The next decade of open AI is uncertain. Cost structure, regulatory pressure, geopolitical dynamics all shape the future. Open AI as a force is not guaranteed to persist.

Open Source Meets AI: TensorFlow, PyTorch, and Hugging Face

The deep learning revolution that began around 2012 was accelerated enormously by open-source tools. Google released TensorFlow in 2015, making its internal machine learning framework freely available. Facebook AI Research released PyTorch in 2016, which became the preferred framework for researchers due to its intuitive design.

Hugging Face, founded in 2016 as a chatbot company, pivoted to become the GitHub of AI models — a platform for sharing pre-trained models, datasets, and tools. By 2024, Hugging Face hosted over 500,000 models and 100,000 datasets. Our Hugging Face guide explains how to use this remarkable resource even if you have no coding background.

These tools democratised AI research. A graduate student in Lagos or Lisbon could now access the same machine learning infrastructure as researchers at Google or Meta. The only barrier was computing power — and that barrier was about to be challenged.

The Llama Moment: Meta Opens the Frontier

In February 2023, Meta AI released Llama — a series of large language models “for research.” The initial release required an application, but within days a torrent file containing the model weights appeared online. Within weeks, a community had formed around running, fine-tuning, and adapting Llama on consumer hardware.

Meta leaned into this. Llama 2 in July 2023 came with a commercial licence. Llama 3 in April 2024 was released with even fewer restrictions. Our Meta Llama 4 guide covers the latest developments in this rapidly evolving family of models.

The Llama releases changed the AI landscape profoundly. Suddenly, companies and individuals could run capable AI systems on their own hardware, without sending data to OpenAI or Anthropic, without paying API fees, and with the ability to fine-tune models on their own data. For regulated industries like healthcare and finance, local deployment is often required — Llama made it possible.

Chinese AI lab DeepSeek added another chapter to the open-source story. Our DeepSeek AI guide explains how a Chinese team trained competitive models at a fraction of Western costs and released weights openly, sending shockwaves through the industry.

The Open vs. Closed AI Debate

The AI industry is split on openness in ways that parallel the proprietary vs. free software debates of the 1990s. OpenAI began as a non-profit with open ambitions — it’s in the name — but GPT-4 and later models have not had weights released publicly. Anthropic, maker of Claude, has not open-sourced its frontier models.

Proponents of open AI argue that scrutiny improves safety, democratisation is inherently good, and that the benefits of AI should not be locked behind API paywalls. Critics argue that releasing powerful model weights creates unacceptable risks — that the same model that helps a developer build a product can help a bad actor build a weapon.

This is not a simple debate, and its resolution will shape who benefits from AI. The open source AI guide on this site offers a comprehensive breakdown of the current landscape, including which models are genuinely open and which are “open-ish.”

Frequently Asked Questions

What is the difference between free software and open source?

Richard Stallman’s Free Software Foundation emphasises the ethical dimension — software freedom as a right. The “open source” term, coined in 1998 by Christine Peterson and promoted by Eric Raymond and others, focuses on practical benefits: better software through community collaboration. Most software that qualifies as one qualifies as both, but the philosophies differ in emphasis.

What does GPL mean?

The GNU General Public License (GPL) is a copyleft licence that grants users freedom to use, study, share, and modify software, with the requirement that any distributed modifications also carry the GPL. This “viral” property prevents proprietary incorporation. The MIT and Apache licences are “permissive” alternatives that allow proprietary use.

Is Llama truly open source?

It depends on your definition. Llama model weights are freely downloadable and modifiable, but Meta’s licence restricts use for applications serving over 700 million monthly active users and prohibits certain uses. Purists argue it’s not truly “open source” under the Open Source Initiative definition. Practically speaking, it’s open enough for the vast majority of developers and researchers.

Why did companies start releasing open-source software?

Various motivations: IBM and others found supporting Linux cheaper than maintaining proprietary systems. Google released TensorFlow to attract talent and establish its approach as the standard. Meta releases Llama partly to commoditise the model layer so that AI applications — where Meta competes — become more valuable. Open source can serve commercial interests even when it appears altruistic.

How do I get started with open-source AI tools?

Hugging Face is the best starting point — you can try thousands of models directly in your browser without installing anything. For running models locally, Ollama provides a simple interface for downloading and running open models on a Mac, Windows, or Linux computer. Our open source AI guide walks through both options in detail.

Get free AI tips delivered dailySubscribe to Beginners in AI

Continue exploring: Open Source AI Guide | Hugging Face Explained | Meta Llama 4 | History of AI | DeepSeek AI

Sources

This article draws on official documentation, product pages, and industry reporting. Specific sources are linked inline throughout the text.

Last reviewed: April 2026

You May Also Like

Discover more from Beginners in AI

Subscribe now to keep reading and get access to the full archive.

Continue reading