NEWS TAGGED AI INFERENCE
Wednesday 20 August 2025
Huawei turns to software to ease pain from China's scarce AI memory
The global race to build ever-larger AI models is intensifying, and the battle is no longer confined to Nvidia's powerful GPUs. Another crucial, though less visible, component has...
Thursday 14 August 2025
Inspur's Metabrain SD200 takes on trillion-parameter AI with four top Chinese models
Following Huawei's recent AI inference milestone, China's server leader Inspur has introduced the Metabrain SD200, a next-generation AI supernode server designed for trillion-parameter...
Thursday 14 August 2025
Tencent widens AI chip sourcing as Beijing puts H20 under the microscope
Tencent reported second-quarter 2025 revenue of CNY184.5 billion (approx. US$26 billion), a 15% increase from a year earlier, with net profit rising 18% to CNY69.2 billion. On the...
Wednesday 13 August 2025
Alif Semiconductor vying for market dominance in battery-powered generative AI applications and Edge AI
The trend of integrating generative artificial intelligence (Gen AI) applications into edge AI is gaining momentum. More Edge AI systems are providing intelligent applications...
Wednesday 30 July 2025
Rebellions announces collaboration with Marvell to deliver custom AI infrastructure for sovereign-scale deployments
Rebellions Inc., a leading AI semiconductor company based in South Korea, today announced a collaboration with Marvell Technology, Inc. (NASDAQ: MRVL) to offer high-performance, energy-efficient...
Tuesday 29 July 2025
Intellifusion's stealth mission: AI chips in every device by 2030
Ahead of the 2025 World Artificial Intelligence Conference (WAIC), Shenzhen Intellifusion Technologies Co. launched a comprehensive portfolio of AI inference products, including the...
Tuesday 8 July 2025
Groq sets up shop in Finland as sovereign AI demands surge
Groq, the US-based AI chipmaker, has launched its first European data center in Helsinki, accelerating its global expansion and tapping...
Thursday 5 June 2025
AMD acquires AI software firm Brium to strengthen AI stack, expand industry reach
According to a press release, AMD announced the acquisition of Brium, a software company specializing in compilers and AI inference optimization, to enhance its end-to-end AI capabilities...
Tuesday 27 May 2025
AI inference drives storage's critical role; Silicon Motion enters Nvidia ecosystem
AI computing is accelerating the deployment of data center infrastructure, with Nvidia placing greater emphasis on inference computing models and expanding its ecosystem. This trend...
Tuesday 20 May 2025
Retronix launches 2 AI edge platforms: Sparrow Hawk SBC and Raptor SoM, powered by Renesas R-Car V4H SoC
Retronix Technologies Inc. announced the launch of two cutting-edge AI edge computing platforms, developed in collaboration with Renesas Electronics Corporation.
Monday 5 May 2025
Silicon Motion optimistic about consumer recovery in second half of 2025 amid strong AI inference demand
Silicon Motion, a NAND controller provider, reported that its first-quarter revenue and profit almost met the high end of prior forecasts, largely due to increased demand for AI inference...
Thursday 10 April 2025
Google unveils seventh-generation TPU for AI inference era
At the Google Cloud Next '25 conference, the company introduced the seventh-generation Tensor Processing Unit (TPU), Ironwood, designed for AI inference. This chip highlights Google's...
Friday 21 March 2025
Nvidia CEO dismisses ASIC threat as "noncompetitive" — but AI inference competition is heating up
At a GTC media roundtable, when pressed on whether application-specific integrated circuits (ASICs) threaten Nvidia's AI dominance, CEO Jensen Huang didn't mince words.
Thursday 20 March 2025
Nvidia launches Blackwell Ultra AI inference platform, Taiwanese manufacturers eye 2H25 deployment
Nvidia CEO Jensen Huang officially unveiled the Blackwell artificial intelligence (AI) factory platform "Blackwell Ultra" during his keynote speech at GTC. This new platform enhances...
Thursday 20 March 2025
China scales up DeepSeek AI inference clusters; Huawei Ascend takes lead
Since emerging in mid-January 2025, DeepSeek has rapidly reshaped the AI landscape. As its presence grows into a second month, OpenAI has escalated its opposition—first threatening...