NVIDIA CEO Jensen Huang's Latest Article: The "Five Layers of AI"
Original Title: AI Is a Five‑Layer Cake
Original Author: Nvidia
Translation: Peggy, BlockBeats
Editor's Note: Artificial intelligence is transforming from a cutting-edge technology to the infrastructure that underpins the modern economy. In its first in-depth article published on its official account, Nvidia attempts to systematically examine the industry structure of AI from first principles, covering a complete five-layer technology stack from energy and chips to data center infrastructure, models, and applications.
The article points out that AI is not just about the competition of software or models, but a global industrial construction involving energy, computing power, manufacturing, and applications. Its scale may become one of the largest infrastructure expansions in human history. Through the lens of this "five-layer cake," Nvidia seeks to illustrate that the true meaning of AI is not just smarter software but a foundational infrastructure revolution comparable in scale to the advent of electricity and the internet.
Below is the original text:
Artificial intelligence is one of the most powerful forces shaping the world today. It is not a clever application, nor is it a single model; it is an infrastructure as essential as electricity and the internet.
AI runs on real hardware, real energy, and a real economy. It turns raw materials into "intelligence" at scale. Every company will use it, and every nation will build it.
To understand why AI unfolds in this way, it is helpful to start from first principles and examine the fundamental changes that have occurred in the computing field.
From "Pre-fabricated Software" to "Real-time Generated Intelligence"
For most of the history of computing development, software has been "pre-fabricated." Humans first describe an algorithm, and then the computer follows the instructions. Data must be carefully structured, placed into tables, and retrieved by precise queries. SQL is indispensable because it makes this whole system work.
AI breaks this pattern.
For the first time, we have a computer that can understand unstructured information. It can see images, read text, listen to sounds, and understand their meaning; it can infer context and intent. More importantly, it can generate intelligence in real-time.
Each response is newly generated. Every answer depends on the context you provide. It is no longer software retrieving pre-existing instructions from a database; it is software reasoning in real-time and generating intelligence on demand.
Because intelligence is generated in real time, the entire computing technology stack that supports it must also be reinvented.
AI as Infrastructure
Looking at AI from an industrial perspective, it can actually be broken down into a five-layer structure.
Energy
At the bottom is energy.
Real-time generated intelligence requires real-time generated power. The generation of each token means electrons in motion, heat being managed, and energy being transformed into computational capability.
Below this layer, there is no abstraction. Energy is the first principle of AI infrastructure and the fundamental constraint on how much intelligence a system can produce.
Chips
Above energy is chips. The design goal of these processors is to convert energy into computational power with extreme efficiency at scale.
AI workloads require massive parallel computing, high-bandwidth memory, and fast interconnects. Progress at the chip level determines the speed of AI expansion and ultimately how cheap "intelligence" will become.
Infrastructure
Above chips is infrastructure. This includes land, power delivery, cooling systems, construction, network systems, and scheduling systems that organize tens of thousands of processors into one machine.
These systems are essentially AI factories. They are not designed to store information but to manufacture intelligence.
Models
Above infrastructure are models. AI models can understand various types of information: language, biology, chemistry, physics, finance, medicine, and the real world itself.
Language models are just one type. One of the most transformative works is happening in the following areas: protein AI, chemistry AI, physical simulation, robotics, autonomous systems.
Applications
At the top is the application layer, where economic value is truly generated. For example, a drug discovery platform, industrial robots, a legal copilot, self-driving cars.
A self-driving car is essentially a "machine-hosted AI application"; a humanoid robot is a "body-hosted AI application." The underlying technology stack is the same; only the final form of presentation is different.
So, this is the five-layer structure of AI: Energy → Chip → Infrastructure → Model → Application. Every successful application will pull all layers down to the bottom, all the way to the power plant that powers it.
Infrastructure Development Still in Its Early Days
We are just starting this development. The current investment scale is only a few hundred billion dollars, and in the future, we still need to build infrastructure at the level of tens of trillions of dollars.
On a global scale, we are seeing: chip factories, computer assembly plants, AI factories.
Infrastructure is being built at an unprecedented scale. This is becoming one of the largest infrastructure projects in human history.
Labor Demand in the AI Era
The workforce needed to support this development is very large.
AI factories need: electricians, plumbers, pipeline installers, structural steel workers, network technicians, equipment installers, operations and maintenance personnel.
All of these are highly technical, well-paid positions, and currently in extremely short supply. Engaging in this transformation does not necessarily require a computer science Ph.D.
At the same time, AI is boosting productivity in the knowledge economy. Take radiology, for example. AI has already started to assist in medical image interpretation, but the demand for radiologists is still growing.
This is not contradictory.
The true responsibility of a radiologist is to care for patients, and reading scans is just one part of the job. As AI takes over more and more repetitive tasks, doctors can spend more time on diagnosis, communication, and treatment.
The hospital's efficiency improves, allowing more patients to be served, thereby requiring more manpower. Productivity creates capacity, and capacity creates growth.
What Has Changed in the Past Year?
In the past year, AI has crossed a key threshold.
The models are now good enough to truly make an impact in large-scale scenarios.
· Significant improvement in reasoning ability
· Significant reduction in hallucinations
· Substantial enhancement in "grounding" with the real world
For the first time, AI-based applications are starting to create real economic value.
A clear product-market fit has emerged in the following areas: drug discovery, logistics, customer service, software development, manufacturing
These applications are driving the entire technology stack.
The Role of the Open-Source Model
The open-source model plays a key role in this. The vast majority of AI models worldwide are free. Researchers, startups, businesses, and even entire countries rely on open-source models to participate in the cutting-edge AI competition.
When open-source models reach the technological frontier, they not only transform software but also activate demand across the entire technology stack.
DeepSeek-R1 is a prime example. By making a powerful inference model widely available, it has driven rapid growth at the application layer while also increasing demand for training power, infrastructure, chips, and energy.
What Does This Mean?
When you view AI as infrastructure, everything becomes clear. AI may have started with Transformers and large language models, but it is much more than that.
It is an industrial-scale transformation that will reshape:
· how energy is produced and consumed
· how factories are built
· how work is organized
· the pattern of economic growth
AI factories are being built because intelligence can now be generated in real-time. Chips are being redesigned because efficiency determines the speed of intelligent expansion. Energy has become central because it determines how much intelligence a system can produce. Applications are exploding because models have finally crossed the threshold of "scale available."
Each layer reinforces the others.
That's why this construction is on such a massive scale, why it will simultaneously impact so many industries, and why it will not be limited to a single country or a single field.
Every company will use AI.
Every country will build AI.
We are still in the early stages.
Many infrastructures have yet to be built, much labor has yet to be trained, and many opportunities have yet to be realized.
But the direction is already very clear.
Artificial intelligence is becoming the foundational infrastructure of the modern world.
And the choices we make today, the speed of our build, the breadth of our engagement, the responsibilities we assume, will determine what kind of age we live in.
You may also like

ZachXBT: Humanity private key leak and abnormal surge in H token should be viewed separately
On June 9, according to related disclosures, on-chain investigator ZachXBT posted an update on Humanity’s roughly $31 million security incident, saying that after further analyzing fund flows, he currently tends to believe the project team was not involved in an “inside job” or a self-staged attack. According to him, the official explanation about the private key leak was broadly accurate, but before the token unlock, the price of H had been artificially pushed higher, and the hacker later took advantage of that market environment; therefore, the private key leak and the earlier abnormal price pumping should be regarded as two separate and independent events. This reframing has shifted the market’s understanding of the nature of the incident. Earlier discussion around Humanity had focused on whether the team directly participated in the attack or used the security incident to cover up internal operations. ZachXBT’s latest remarks shift the focus from “whether it was self-theft” to “whether there were pre-unlock market structure issues.” He also questioned whether the team may have.

Morning Report | OpenAI has submitted an S-1 registration statement draft to the U.S. SEC; Morpho completes $175 million financing

Morning Report | BitMine increased its holdings by 126,971 ETH last week; trader Eugene announced his exit from the crypto market

Wang Chuan: How can one not feel anxious after the neighbor Old Wang made thirty times profit by investing in storage stocks? (Seven) - A quarter-century cycle

Cryptocurrency CEXs are flocking to sell US stocks, and traditional brokerages are facing an "uninvited guest."

$75 billion in foreign capital has fled, and South Korean retail investors have absorbed it all using leverage

Japan’s Three Megabanks Plan Joint Stablecoin Issuance in Fiscal 2026
MUFG, SMBC, and Mizuho reportedly plan to jointly issue fiat-pegged stablecoins in fiscal 2026, signaling Japan’s growing push into bank-led digital payment infrastructure.

Humanity Discloses H Token Dual-Chain Attack Details, With Losses on Ethereum and BSC Exceeding $36 Million
Humanity said the H token attack across Ethereum and BSC caused more than $36 million in losses after leaked ProxyAdmin keys enabled malicious contract upgrades and token minting.

White House Discusses CLARITY Act With Law Enforcement Ahead of Senate Vote
The White House discussed the CLARITY Act with law enforcement ahead of a Senate vote, focusing on illicit finance risks and developer protections.

Bitcoin Trading Guide 2026: Strategies for Experienced Traders

What Is XAUT and PAXG? Why Tokenized Gold Is Booming in 2026

Will the SpaceX IPO Hurt Bitcoin? Here's What Traders Are Watching

Foreign selling in the South Korean stock market accelerates, with cumulative net sales reportedly reaching $75 billion this year
On June 9, The Kobeissi Letter, citing Goldman Sachs data, reported that global investors are selling South Korean stocks at an unusually rapid pace. In the latest trading session, foreign investors sold about $801 million worth of Kospi constituent stocks again; total foreign outflows last week reached about $10 billion, and the market has been in net foreign selling on nearly every trading day over the past month. According to the data cited in the report, foreign investors have sold about $75 billion worth of South Korean stocks so far this year. Meanwhile, South Korean retail and institutional investors together recorded roughly $69 billion in net buying over the same period, suggesting that the market’s main buying support has come from domestic capital rather than returning overseas funds. The information currently disclosed still mainly comes from The Kobeissi Letter’s retelling and Goldman Sachs data summaries, while public details on the statistical period and the specific definition of “selling” remain relatively limited.

Fortune Warns of Strategy’s Financing Structure Risks as Bitcoin Premium Narrows
Fortune warned that Strategy’s Bitcoin treasury model faces growing financing risks as MSTR’s net asset premium narrows and preferred stock dividend pressure increases.

Ferrari Challenge Le Mans: Carl Moon to Dominate in WEEX Livery

Sahara AI Responds to SAHARA’s Sharp Drop: No Contract or Product Security Issues Found, Internal Investigation Underway
Sahara AI responded to SAHARA’s 60% price drop, saying no token contract or product security issues have been found and an internal investigation is underway.

WEEX Deposit/Withdrawal Dynamic Island: Your Asset Status, Always in Sight

Scaling Crypto Derivatives: The Digital Asset Infrastructure Behind High-Volume Trading
In the fast-moving digital asset ecosystem, derivatives platforms face an extreme architectural test. High-leverage futures markets demand more than just standard security—they require absolute operational precision, zero-latency matching engines, and ironclad structural scalability, all while navigating intense market volatility.
As global platforms scale to meet these demands, the industry is shifting away from rigid, monolithic setups toward a more agile, "decoupled" infrastructure philosophy.
The Blueprint for High-Volume Copy TradingFor elite global exchanges like WEEX (founded in 2018), this architectural choice becomes critical when scaling high-volume retail features like social copy trading. When thousands of users automatically mirror the real-time strategies of elite traders simultaneously, it triggers sudden, monumental spikes in concurrent transactional volume.
To prevent execution latency or settlement bottlenecks during these peak volatility events, a platform's primary engine must remain entirely dedicated to risk management, copy-trade synchronization, and order matching.
The Architectural Rule: New-generation platforms must separate front-end user execution engines from heavy backend infrastructural overhead to eliminate operational friction.
By separating these layers, platforms can maintain complete sovereignty over their trading environments and user experiences while strategically aligning with institutional-grade infrastructure ecosystems. This strategic framework allows modern exchanges to leverage advanced Digital Asset Custody infrastructure such as Cobo’s behind the scenes, ensuring that backend wallet management scales elastically alongside trading spikes.
Capitalizing on Market Momentum and 400× LeverageIn a derivatives arena where platforms offer up to 400× leverage on perpetual contracts, capital efficiency and market agility are core business metrics. To capture market momentum, an exchange needs the ability to rapidly expand its asset offerings, supporting everything from legacy crypto assets to sudden, trending altcoins across a massive library of trading pairs.
Adopting a flexible, scalable Wallet-as-a-Service (WaaS) solution such as Cobo’s could completely rewrite the development timeline for high-growth exchanges. Instead of spending months of engineering capital building out custom backend wallet architectures for every new blockchain network, platforms can deploy localized infrastructure in days.
This agility allows platforms to instantly scale their listings to over a thousand trading pairs without compromising security or delaying time-to-market. It mirrors the exact operational advantages seen during high-velocity market events, similar to how advanced wallet infrastructure empowers platforms during sudden asset surges; allowing exchanges to pass that speed and liquidity directly to their global user base.
A Mature Foundation for GrowthThe synergy between trusted infrastructure ecosystems and global trading platforms represents the natural evolution of a maturing crypto market. As WEEX continues to scale its global spot and derivatives offerings for over 6 million users, adopting robust backend paradigms proves that platforms no longer have to compromise between cutting-edge trading velocity and uncompromised structural security.
ZachXBT: Humanity private key leak and abnormal surge in H token should be viewed separately
On June 9, according to related disclosures, on-chain investigator ZachXBT posted an update on Humanity’s roughly $31 million security incident, saying that after further analyzing fund flows, he currently tends to believe the project team was not involved in an “inside job” or a self-staged attack. According to him, the official explanation about the private key leak was broadly accurate, but before the token unlock, the price of H had been artificially pushed higher, and the hacker later took advantage of that market environment; therefore, the private key leak and the earlier abnormal price pumping should be regarded as two separate and independent events. This reframing has shifted the market’s understanding of the nature of the incident. Earlier discussion around Humanity had focused on whether the team directly participated in the attack or used the security incident to cover up internal operations. ZachXBT’s latest remarks shift the focus from “whether it was self-theft” to “whether there were pre-unlock market structure issues.” He also questioned whether the team may have.
