Infinigence Unveils Next-Gen AI Infrastructure Suite, Aims to Lead China’s AI Deployment Revolution

Infinigence Unveils Next-Gen AI Infrastructure Suite, Aims to Lead China’s AI Deployment Revolution

Xia Lixue, Co-founder and CEO of Infinigence

AsianFin -- Infinigence, an AI infrastructure startup backed by Tsinghua University, introduced a sweeping portfolio of performance-optimized computing platforms targeting the full spectrum of AI deployment at this year’s World Artificial Intelligence Conference (WAIC 2025) .

The company officially launched three flagship products under its integrated solution suite: Infinicloud, a global-scale AI cloud platform for clusters of up to 100,000 GPUs; InfiniCore, a high-performance intelligent computing platform designed for multi-thousand-GPU clusters; and InfiniEdge, a lean, edge computing solution optimized for terminal deployments with as few as one GPU.

Together, the platforms represent what CEO Xia Lixue calls a “software-hardware co-designed infrastructure system for the AI 2.0 era.” Built for compatibility across heterogeneous computing environments, the Infinigence stack offers full lifecycle support—from model scheduling and performance optimization to large-scale application deployment.

“We’re addressing a core bottleneck in China’s AI industry: fragmentation in compute infrastructure,” Xia said. “With InfiniCloud, InfiniCore, and InfiniEdge, we’re enabling AI developers to move seamlessly between different chips, architectures, and workloads—unlocking intelligent performance at scale.”

In a fast-evolving AI landscape dominated by open-source large language models such as DeepSeek, GLM-4.5, and MiniMax M1, Chinese infra startups are racing to build the backbone that powers model deployment and inference.

Early on July 29, Infinigence announced that InfiniCloud now supports Zhipu AI’s latest GLM-4.5 and GLM-4.5-air models, which currently rank third globally in performance. The move signals Infinigence’s ambition to anchor the growing synergy between Chinese model developers and domestic chipmakers.

Xia likened the trio of newly launched platforms to “three bundled boxes” that can be matched to AI workloads of any scale. “From a single smartphone to clusters of 100,000 GPUs—our system is designed to ensure resource efficiency and intelligent elasticity,” he said.

Infinigence’s platforms are already powering Shanghai ModelSpeed Space, the world’s largest AI incubator. The facility sees daily token call volumes exceed 10 billion, supports over 100 AI use cases, and reaches tens of millions of monthly active users across its applications.

A key challenge for China’s AI infrastructure sector is hardware heterogeneity. With dozens of domestic chip vendors and proprietary architectures, developers often struggle to port models across systems.

Xia emphasized that Infinigence has developed a “universal compute language” that bridges chips with disparate instruction sets. “We treat computing resources like supermarket goods—plug-and-play, interoperable, and composable,” he said.

The company’s infrastructure has already achieved full-stack adaptation for more than a dozen domestic chips, delivering 50%–200% performance gains through algorithm and compiler optimization. It also supports unified scheduling and mixed-precision computing, enabling cost-performance ratios that beat many international offerings.

“What’s missing in China’s ecosystem is a feedback loop,” Xia said. “In the U.S., NVIDIA and OpenAI form a tight cycle: model developers know what chips are coming, and chipmakers know what models are being built. We’re building that loop domestically.”

Infinigence is also targeting AI democratization with a first-of-its-kind cross-regional federated reinforcement learning system. The system links idle GPU resources from different regional AIDC centers into a unified compute cluster—allowing SMEs to build and fine-tune domain-specific inference models using consumer-grade cards.

To support this, Infinigence launched the “AIDC Joint Operations Innovation Ecosystem Initiative” in partnership with China’s three major telecom providers and 20+ AIDC institutions.

Xia noted that while training still depends heavily on NVIDIA hardware, inference workloads are rapidly migrating to domestic accelerators. “Users often start with international chips on our platform, but we help them transition to Chinese cards—many of which now deliver strong commercial value,” he said.

Infinigence has also rolled out a series of on-device and edge inference engines under its Infini-Ask line. These include:

  • Infini-Megrez2.0, co-developed with the Shanghai Institute of Creative Intelligence, the world’s first on-device intrinsic model.

  • Infini-Mizar2.0, built with Lenovo, which enables heterogeneous computing across AI PCs, boosting local model capacity from 7B to 30B parameters.

  • A low-cost FPGA-based large model inference engine, jointly developed with Suzhou Yige Technology.

Founded in May 2023, Infinigence has raised more than RMB 1 billion in just two years, including a record-setting RMB 500 million Series A round in 2024—the largest to date in China’s AI infrastructure sector.

Its product portfolio now spans everything from model hosting and cloud management to edge optimization and model migration—serving clients across intelligent computing centers, model providers, and industrial sectors.

The company’s broader mission, Xia said, is to balance scale, performance, and resource availability. “Our vision is to deliver ‘boundless intelligence and flawless computing’—wherever there's compute, we want Infinigence to be the intelligence that flows through it.”

特别声明:[Infinigence Unveils Next-Gen AI Infrastructure Suite, Aims to Lead China’s AI Deployment Revolution] 该文观点仅代表作者本人,今日霍州系信息发布平台,霍州网仅提供信息存储空间服务。

猜你喜欢

TVB收视|受台风韦帕袭港影响收视激增,《东张西望》创下年度新高(tvb收视率)

根据最新公布的收视数据(14日至20日),无线电视的节目表现亮眼。特别是在周日(20日),由于台风韦帕袭港,10号风球高挂,许多观众选择在家观看电视。此外,周日晚上的两档综艺节目同样收视可观,清谈音乐节目《今…

TVB收视|受台风韦帕袭港影响收视激增,《东张西望》创下年度新高(tvb收视率)

工程防护大会开幕 瑞利光测携核心设备亮相(工程防护大会开场白)

高速光纤光栅解调设备,原理基于光纤光栅解调技术,解调速率高达 100kHz,测量可实现对 FBG 传感器中心波长的高速精准解调。 基于OFDR 技术的高精度监测设备,最高空间分辨率达 0.64mm,应变重…

工程防护大会开幕 瑞利光测携核心设备亮相(工程防护大会开场白)

SCRM平台的电商集成是什么?(scrm线上商店)

现在很多像快鲸这样的SCRM工具,对接主流电商平台(比如天猫、有赞)和企业微信都做了标准化流程,上手其实挺快的,关键看业务需求是否清晰。即使规模小,只要你在多个平台(比如微信+淘宝)做生意,想更高效地服务客户…

SCRM平台的电商集成是什么?(scrm线上商店)

送妈妈母亲紧致抗皱面霜哪个牌子好?适合夏天用抗皱面霜推荐测评(送妈妈的贴心礼物)

优势2:它的淡纹紧致效果表现稳定,长期使用后,细纹和表情纹略有减轻,肌肤看起来更加平滑。 优势1:这款面霜质地柔软细腻,吸收快也不油腻,能够为肌肤提供充足水分,使用后肌肤柔软富有弹性,尤其适合干燥肌肤,还…

送妈妈母亲紧致抗皱面霜哪个牌子好?适合夏天用抗皱面霜推荐测评(送妈妈的贴心礼物)

领益智造:公司的人形机器人业务已逐步产生收入(领益智造公司怎样)

证券日报网讯领益智造7月31日在互动平台回答投资者提问时表示,公司的人形机器人业务已逐步产生收入,涉及各类机加工结构件、核心零部件、关节等模组、整机组装等全链路环节。(编辑 袁冠琳)…

领益智造:公司的人形机器人业务已逐步产生收入(领益智造公司怎样)