06/10/2026 | Press release | Distributed by Public on 06/10/2026 08:34
Imagine you're visiting a new city and want to find a restaurant that impresses your vegan in-laws. Using voice conversations on the Meta AI app, you ask, "Hey Meta, what are the best vegan options around?"
Within seconds, Meta AI - powered by Muse Spark - responds with a list of local vegan restaurants, a short description of each restaurant's vibe, and a map showing you exactly where the restaurants are. It's a quick and seamless interaction that feels effortless, but behind that brief exchange are layers of calculations enabled by compute power.
What Is Compute Power?
Simply put, compute power is the measure of how much work a computer chip can do and how fast it can do it - like horsepower in a car engine. Compute power is measured in FLOPS: floating-point operations per second, or the number of calculations that a chip can perform in one second. FLOPS measure the speed of compute and gigawatts measure the scale of it, or how many chips you can keep running at once.
When you ask Meta AI to find a vegan restaurant, it runs billions of calculations in just a few seconds. Your voice is captured, converted from sound waves into text, and routed to computers or servers inside a data center . From there, a large language model (LLM) , and the result is delivered right to your ear.
Even simple actions, searching for a local barbershop on Instagram, require layers of computation: understanding language, processing your query, scanning an index, generating results, and delivering it back to you, all before your thumb leaves the screen. All of this processing power is made possible by processing chips inside the servers inside our data centers.
So why does the future of AI depend on compute? Here's a closer look.
The Building Blocks of Compute
Compute is an abstract concept, but it's delivered by physical chips. Different chips are designed to handle different types of calculations and workloads.
How Does Compute Power Meta's AI?
At Meta, we're building a global network of AI-optimized data centers, each designed with the flexibility to support both our AI workloads and the other workloads that are central to our apps and services. We believe that building at this scale requires a diversified approach to infrastructure. T hat's why we're sourcing silicon from a range of partners to ensure the right chips are matched with the right workload, allowing us to build and deliver new AI experiences at a faster pace .
Our custom MTIA silicon is an essential part of our efforts. We're developing and deploying four new generations of chips within the next two years to support ranking, recommendations, and generative AI workloads. In April, we announced an expanded partnership with Broadcom to co-develop multiple generations of MTIA chips.
And earlier this year, we announced a partnership with Arm to co-develop the Arm AGI CPU, - the first data center processor specifically designed to handle the massive amount of data movement demanded by AI workloads. We've also announced partnerships with industry leaders AWS , AMD , and NVIDIA to supply chips for our compute portfolio.
These partnerships will enable us to continue innovating and building AI tools for the future. We recently announced Muse Spark , our most advanced AI model to date and the first LLM built by Meta Superintelligence Labs. Muse Spark is natively multimodal, processing voice, text, and images together. What makes it possible is compute at every level - from training models across thousands of GPUs to supporting billions of inferences each day on custom MTIA chips - and all of it running through efficient networks of servers at data centers around the world.
The demand for more powerful and efficient compute will only accelerate. As AI continues to become more capable, personal, and integrated into people's lives, we'll keep building the infrastructure needed to power it.