Infinigence Unveils Next-Gen AI Infrastructure Suite, Aims to Lead China’s AI Deployment Revolution

Infinigence Unveils Next-Gen AI Infrastructure Suite, Aims to Lead China’s AI Deployment Revolution

Xia Lixue, Co-founder and CEO of Infinigence

AsianFin -- Infinigence, an AI infrastructure startup backed by Tsinghua University, introduced a sweeping portfolio of performance-optimized computing platforms targeting the full spectrum of AI deployment at this year’s World Artificial Intelligence Conference (WAIC 2025) .

The company officially launched three flagship products under its integrated solution suite: Infinicloud, a global-scale AI cloud platform for clusters of up to 100,000 GPUs; InfiniCore, a high-performance intelligent computing platform designed for multi-thousand-GPU clusters; and InfiniEdge, a lean, edge computing solution optimized for terminal deployments with as few as one GPU.

Together, the platforms represent what CEO Xia Lixue calls a “software-hardware co-designed infrastructure system for the AI 2.0 era.” Built for compatibility across heterogeneous computing environments, the Infinigence stack offers full lifecycle support—from model scheduling and performance optimization to large-scale application deployment.

“We’re addressing a core bottleneck in China’s AI industry: fragmentation in compute infrastructure,” Xia said. “With InfiniCloud, InfiniCore, and InfiniEdge, we’re enabling AI developers to move seamlessly between different chips, architectures, and workloads—unlocking intelligent performance at scale.”

In a fast-evolving AI landscape dominated by open-source large language models such as 『DeepSeek』, GLM-4.5, and MiniMax M1, Chinese infra startups are racing to build the backbone that powers model deployment and inference.

Early on July 29, Infinigence announced that InfiniCloud now supports Zhipu AI’s latest GLM-4.5 and GLM-4.5-air models, which currently rank third globally in performance. The move signals Infinigence’s ambition to anchor the growing synergy between Chinese model developers and domestic chipmakers.

Xia likened the trio of newly launched platforms to “three bundled boxes” that can be matched to AI workloads of any scale. “From a single smartphone to clusters of 100,000 GPUs—our system is designed to ensure resource efficiency and intelligent elasticity,” he said.

Infinigence’s platforms are already powering Shanghai ModelSpeed Space, the world’s largest AI incubator. The facility sees daily token call volumes exceed 10 billion, supports over 100 AI use cases, and reaches tens of millions of monthly active users across its applications.

A key challenge for China’s AI infrastructure sector is hardware heterogeneity. With dozens of domestic chip vendors and proprietary architectures, developers often struggle to port models across systems.

Xia emphasized that Infinigence has developed a “universal compute language” that bridges chips with disparate instruction sets. “We treat computing resources like supermarket goods—plug-and-play, interoperable, and composable,” he said.

The company’s infrastructure has already achieved full-stack adaptation for more than a dozen domestic chips, delivering 50%–200% performance gains through algorithm and compiler optimization. It also supports unified scheduling and mixed-precision computing, enabling cost-performance ratios that beat many international offerings.

“What’s missing in China’s ecosystem is a feedback loop,” Xia said. “In the U.S., NVIDIA and OpenAI form a tight cycle: model developers know what chips are coming, and chipmakers know what models are being built. We’re building that loop domestically.”

Infinigence is also targeting AI democratization with a first-of-its-kind cross-regional federated reinforcement learning system. The system links idle GPU resources from different regional AIDC centers into a unified compute cluster—allowing SMEs to build and fine-tune domain-specific inference models using consumer-grade cards.

To support this, Infinigence launched the “AIDC Joint Operations Innovation Ecosystem Initiative” in partnership with China’s three major telecom providers and 20+ AIDC institutions.

Xia noted that while training still depends heavily on NVIDIA hardware, inference workloads are rapidly migrating to domestic accelerators. “Users often start with international chips on our platform, but we help them transition to Chinese cards—many of which now deliver strong commercial value,” he said.

Infinigence has also rolled out a series of on-device and edge inference engines under its Infini-Ask line. These include:

  • Infini-Megrez2.0, co-developed with the Shanghai Institute of Creative Intelligence, the world’s first on-device intrinsic model.

  • Infini-Mizar2.0, built with Lenovo, which enables heterogeneous computing across AI PCs, boosting local model capacity from 7B to 30B parameters.

  • A low-cost FPGA-based large model inference engine, jointly developed with Suzhou Yige Technology.

Founded in May 2023, Infinigence has raised more than RMB 1 billion in just two years, including a record-setting RMB 500 million Series A round in 2024—the largest to date in China’s AI infrastructure sector.

Its product portfolio now spans everything from model hosting and cloud management to edge optimization and model migration—serving clients across intelligent computing centers, model providers, and industrial sectors.

The company’s broader mission, Xia said, is to balance scale, performance, and resource availability. “Our vision is to deliver ‘boundless intelligence and flawless computing’—wherever there's compute, we want Infinigence to be the intelligence that flows through it.”

特别声明:[Infinigence Unveils Next-Gen AI Infrastructure Suite, Aims to Lead China’s AI Deployment Revolution] 该文观点仅代表作者本人,今日霍州系信息发布平台,霍州网仅提供信息存储空间服务。

猜你喜欢

不返工!A、B、C群脑膜炎球菌试剂新加坡注册检测报告+说明书规范(不返工的好处和坏处)

在进行A、B、C群脑膜炎球菌检测试剂的新加坡注册过程中,检测报告和说明书的规范准备是确保注册顺利的关键环节。 为了避免返工,企业可采取系统性措施:首先,提前研究新加坡的具体法规要求,并参照类似产品的成功案例…

不返工!A、B、C群脑膜炎球菌试剂新加坡注册检测报告+说明书规范(不返工的好处和坏处)

出演庆余年2人世间大爆剧,演员吴幸键回应“资源咖”质(庆余年演员扮演者)

他深知在这个行业里,真正的成功不是靠资源和人脉堆砌起来的,而是靠自己的努力和实力赢得的。 在网友的评论中,有人对吴幸键的演技表示赞赏,认为他是一位有潜力的演员;也有人对他是否是“资源咖”表示质疑,认为他的成功…

出演庆余年2人世间大爆剧,演员吴幸键回应“资源咖”质(庆余年演员扮演者)

一场全运盛会,如何不止于体育?宝安的回答是……(一场全运盛会作文)

宝安区人民政府副区长练聪、区文化广电旅游体育局局长刘晓曦、副局长杨春曦出席发布会,介绍宝安分赛区赛事组织、保障措施与城市联动等相关情况。 记者从发布会上了解到,航海和车辆模型项目决赛作为宝安首场赛事将在9月…

一场全运盛会,如何不止于体育?宝安的回答是……(一场全运盛会作文)

菱角湖:武汉的秘密绿洲,你真的了解吗?(武汉菱角湖在哪)

菱角湖,位于武汉洪山区核心地带,是市民日常散步、跑步、观鸟的“城市后花园”。它不仅名字有趣,还藏着一座集生态、文化与休闲于一体的湿地公园。本文带你深度揭秘它的起源、功能、游玩攻略及避坑指南,让你不再只是路过——而是真正走进这片武汉人的私藏绿

菱角湖:武汉的秘密绿洲,你真的了解吗?(武汉菱角湖在哪)

304不锈钢免打孔圆形底座纸巾架,卫生间🚻卷纸架怎么选?(免打孔不锈钢胶 还能拆下来么)

想要卫生间🚻既整洁又高级?304不锈钢免打孔圆形底座立式纸巾架,是现代家居的颜值担当!无需钻孔、安装简单,防锈耐腐蚀,适合瓷砖玻璃墙面多种场景。本文带你搞懂它的材质优势、使用原理、选购要点与真实体验,还附上高性价比推荐方案,轻松避坑,让家

304不锈钢免打孔圆形底座纸巾架,卫生间🚻卷纸架怎么选?(免打孔不锈钢胶 还能拆下来么)