June 13, 2025 - Sam Altman Warns AI is Growing Faster than Infrastructure Can Handle
Speaking at AMD’s Advancing AI event, OpenAI CEO Sam Altman claimed that the only way AI can continue not just its frenetic growth pace, but deliver fancy new capabilities is with tons more GPUs and memory. But I think we knew this.
June 13, 2025 - CoreWeave to Supply GCP Who’ll Supply OpenAI
NVIDIA will sell GPUs to CoreWeave who’ll in turn host them from Google Cloud who’ll in turn supply then to OpenAI, who already has an $11.9 billion, 5 year deal with CoreWeave. So why do this? Google wants to compete with Microsoft, while OpenAI and CoreWeave want to make themselves less dependent on Microsoft.
June 13, 2025 - AMD Introduces New GPUs at Advancing AI Event
AMD introduced its Blackwell competitors - the MI350x and Mi355x GPUs - at its Advancing AI event this week. They are manufactured with the same fab provider, TSMC, and using the same Chip-on-Wafer-on-Substrate (CoWoS) technology as NVIDIA’s Blackwells. AMD is trying to get a leg up on NVIDIA by offering 50% more memory and over 100% more memory bandwidth than its big competitor. Oracle/OCI has already begun deploying the chips, and AMD-funded TensorWave will likely follow. The Oracle press release went to town playing up “agentic” hype.
June 11, 2025 - NVIDIA CEO Jensen Huang Now Says Quantum Computing is Reaching an “Inflection Point”
After saying quantum computing could be 15 years away, Jensen Huang is saying it’s getting closer to solving real problems. Quantum computing is often cited as the number one threat to NVIDIA’s business, and its $3 trillion market cap, so it makes sense that he’d start addressing it head on. There are a growing number of quantum startups, as well as Google’s “Quantum AI” initiative, which includes its own “Willow” chip design, but by the time these things are ready for mass production, the hype about all things agentic will be in the history books and they will be solving other problems.
June 11, 2025 - Amazon to Invest $20 billion in Pennsylvania to expand AI and Cloud Infrastructure
Where there are politicians, there are press releases - lots of them. If you’re looking to track or estimate actual capital investment in AI infrastructure the more relevant number than this announcement is that AWS is currently spending close to 70% of its revenue on capex.
June 10, 2025 - WEKA and Nebius Announce GPU-Storage Partnership
Parallel file systems are required to provide the high I/O needed for model training. WEKA, a leading provider of such systems, is joining forces with Nebius to provide its hardware to customers renting GPU services.
June 10, 2025 - Oracle Seeking 5 GW of US Data Center Capacity
TD Cowen has put out a note suggesting Oracle is looking to spend approximately $160 billion over the next year and a half on 5 GW of data center capacity and related GPUs. They project just under 60% of this will go to GPUs (and networks), with the balance on facility infrastructure. After languishing for years as a 4th place cloud, Oracle/OCI is taking full advantage of the open competition for GPU clouds.
June 9, 2025 - Nebius Announces B300 Clusters will be available in the UK in Q4
You can expect a lot of B300 announcements in the second half of this year as NVIDIA unveils its latest Blackwell chip that brings 50% more FLOPs, 50% more memory, and 50% more memory bandwidth than the B200. Getting out ahead of other cloud providers who’ll also have B300 news, Nebius will deploy the chips in the UK by the end of 2025.
June 5, 2025 - Broadcom Reports $15 billion of revenue, including $4.4 billion in AI hardware
Broadcom reported 20% revenue growth today for its the second quarters of FY2025. AI revenue came in at $4.4 billion with the company projecting AI semiconductor revenue of $5.1 billion next quarter “due to hyperscalers”. Much of this revenue comes from its agreement with Google to co-design the cloud provider’s Tensor Processing Units, or TPUs. In particular, the company provides the designs for I/O, SerDes, and other peripheral features where it holds extensive expertise and most cloud providers don’t.
June 5, 2025 - Meta Signs Nuclear Power Deal with Constellation Energy
Nuclear power is making a big comeback due to its ability to power AI data centers. Google has already pledged support for the technology now Meta is buying a Power Purchase Agreement of over 1 Gigawatt to support a plant that was going to close. Note that as a PPA, this agreement is to support clean power financially to offset carbon burning natural gas and coal elsewhere, not a direct purchase for Meta data centers.
June 4, 2025 - Amazon to Invest $10 billion in North Carolina AI Infrastructure
Another day, another multi-billion dollar investment in AI data centers. In this case Amazon will be committing $10 billion to a facility near Charlotte. Like most of these announcements, there is a broad number without specifying where it’s actually going. That said, a rough rule of thumb is about $15,000 of facility capex per kW of capacity…although in this case they’re really vague and don’t mention how many MW or GW of capacity they’re building out.
June 4, 2025 - Vertiv unveils trio of liquid cooling CDUs for AI data centers
CDUs, or Coolant Distribution Units, provide direct-to-chip cooling at either a rack or row level within a data center. Vertiv is adding three models that provide cooling capacity from 70 to 600 kW of cooling capacity.
June 2, 2025 - Applied Digital Announces 250MW AI Data Center Lease With CoreWeave in North Dakota
CoreWeave has signed a 15 year deal worth up to $7 billion for 250 MW of capacity with Allied Digital at its HPC Data Center facility in Ellendale, North Dakota, with an option for 150 MW of additional capacity. Why Ellendale? In addition to government incentives, Applied Digital chose the town for its access to energy resources, particularly its location in the middle of a dense wind power region in the Plains.
June 1, 2025 - New Startup Sygaldry Aims to Rethink AI Infrastructure With Quantum Hardware
Sygaldry is joining the chorus of companies claiming Quantum Computing can deal with AI’s energy consumption, especially for image processing and low latency inferencing. Sygaldry is backed by Y Combinator and led by compute hardware veterans.
May 29, 2025 - Blackwell Now GA on AWS in US-West-2
AWS is now offering Blackwell-based B200 GPUs in its Oregon region, in its “p6” series of processors. At this point they are only available by purchasing Capacity Blocks - reservations from 1 to 26 weeks. Effective hourly costs for the 8 GPU p6-b200.48xl are $65.12.
May 28, 2025 - Atlas Cloud Announces Inference Service to Boost GPU Throughput
GPU provider Atlas Cloud has announced an inference service that promises greater throughput and lower cost through its load balancing and compute-memory segregation technologies. Unlike other neoclouds, the company is targeting traditional business users who need packaged solutions and often have lower budgets than research-intensive tech companies or HPC labs.
May 27, 2025 - Intel Unveils New Xeon 6 CPUs to Maximize GPU-Accelerated AI Performance
Intel announced new Xeon CPUs which will be integrated into NVIDIA’s upcoming DGX B300 systems. While NVIDIA tends to promote its own Arm-based Grace CPUs more heavily, the Xeons are still used by many enterprise users who need x86 compatibility, and can tolerate the lower transfer speeds of PCIe connections between the CPU and GPU.
May 26, 2025 - NVIDIA to Launch Cheaper Blackwell Chip for China to Get Around Export Curbs
NVIDIA will be launching the B20, a far less powerful version of its flagship Blackwell chip that will meet requirements for export to China. A key difference will be the use of GDDR7 memory, a major drop in memory bandwidth from the HBM memory that’s been in all of its data center chips going back to its pre-Tensor Core Pascal chip which was released in 2016, but cannot be exported to China under current law. The B20 is expected to sell for nearly 90% less than the B100. NVIDIA Is keen to keep a foothold in China to prevent Huawei from completely taking over the market.
May 23, 2025 - CoreWeave and Flexential - Scaling AI with High Density Data Centers
CoreWeave does not own any of data centers, but is pushing towards 1 GW of total leased capacity across 33 facilities. It has recently added 260 MW in the Panhandle of Texas with Galaxy, and announced today it’s adding 13 MW with Flexential in Plano. The company announced an $11.2 billion, 5 year deal to supply GPUs and related infrastructure to OpenAI in March, and filed with the SEC recently that it captured $4 billion contract, which is also believed to be from OpenAI.
May 22, 2025 - OpenAI Expanding Stargate to UAE with 1 GW Cluster
As mentioned yesterday, Stargate is OpenAI ‘s joint venture to invest $500 billion in infrastructure. Each Stargate campus is being built out to approximately 1 GW of capacity, with up to 400,000 NVIDIA chips each. The first facility is currently under construction in Abilene, Texas, and the company has now announced its first international facility in Abu Dhabi as part of its OpenAI for Countries initiative. The Abilene facility is being built out by Crusoe per yesterday’s update.
May 21, 2025 - Crusoe secures $11.6 billion for Texas data center
AI data center builder Crusoe has secured over $11 billion in funding to build out a facility in Abilene, Texas. The company has a contract with Oracle, who in turn provides service to OpenAI, and will use the facility as part of its Stargate joint venture, a $500 billion investment in data centers and AI infrastructure. Crusoe holds patents in energy technologies that are intended to reduce the carbon footprint, and ultimately costs, of supporting GPUs and AI.
May 20, 2025 - NVIDIA provides Omniverse Blueprint for AI factory digital twins
Omniverse is NVIDIA’s plaform for building 3D industrial platforms, and modeling physical environments. Data centers use it for Computational Fluid Dynamics (CFD) analysis to optimize airflows and cooling. Today at Computex it announced it’s added new partners for Omniverse Blueprint, its platform for AI digital twins, expanding the ability of manufacturers to create AI models of their products as part of the design and build process.
May 18, 2025 - NVIDIA Unveils NVLink Fusion
NVLink is NVIDIA’s proprietary chip-to-chip interconnect that the company has aggressively marketed as faster than PCIe. Furthering its battle against that standard, NVIDIA has announced NVLink Fusion, opening its interconnect to CPU makers like Qualcomm and Fujitsu. NVLink runs out of the PCIe interface, but uses its own silicon technology.
May 14, 2025 - AI infrastructure firm TensorWave raises $100 million
TensorWave announced it has raised $100 million to build out its GPU infrastructure service. Unlike other providers, TensorWave does not provide any NVIDIA products, focusing instead on AMD hardware, notably the MI300X and MI325X [latforms.