Ant Group, the fintech affiliate of Alibaba, said its Ling-Plus-Base model can be ‘effectively trained on lower-performance devices’.
Alibaba affiliate Ant Group is reportedly using a mix of U.S.- and Chinese-made semiconductors to enhance the efficiency of ...
Ant used domestic chips to train models with the Mixture-of-Experts machine learning approach: the Jack Ma-backed Ant Group Co. used Chinese-made semiconductors to develop techniques for training AI ...
Chain-of-experts chains LLM experts in a sequence, outperforming mixture-of-experts (MoE) with lower memory and compute costs.
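The contrast between the two patterns is easiest to see in code. Below is a loose, illustrative PyTorch sketch: a standard MoE layer that gates each token to its top-k experts in parallel, versus a chain that passes activations through experts one after another. The `Expert` module, the sizes, the top-k gating, and the residual chaining are all assumptions made for illustration, not the architecture from the chain-of-experts work itself.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F


class Expert(nn.Module):
    """A toy feed-forward expert (purely illustrative)."""
    def __init__(self, dim: int):
        super().__init__()
        self.ff = nn.Sequential(
            nn.Linear(dim, 4 * dim), nn.GELU(), nn.Linear(4 * dim, dim)
        )

    def forward(self, x):
        return self.ff(x)


class MoELayer(nn.Module):
    """Mixture-of-experts: a learned gate routes each token to its
    top-k experts, whose outputs are mixed in parallel."""
    def __init__(self, dim: int, n_experts: int, k: int = 2):
        super().__init__()
        self.experts = nn.ModuleList(Expert(dim) for _ in range(n_experts))
        self.gate = nn.Linear(dim, n_experts)
        self.k = k

    def forward(self, x):                            # x: (tokens, dim)
        weights, idx = self.gate(x).topk(self.k, dim=-1)
        weights = F.softmax(weights, dim=-1)         # (tokens, k)
        out = torch.zeros_like(x)
        for slot in range(self.k):
            for e, expert in enumerate(self.experts):
                mask = idx[:, slot] == e             # tokens routed to expert e
                if mask.any():
                    out[mask] += weights[mask, slot, None] * expert(x[mask])
        return out


class ChainOfExperts(nn.Module):
    """Chain-of-experts, loosely: experts run in sequence, each one
    refining the previous output instead of mixing parallel branches."""
    def __init__(self, dim: int, n_experts: int):
        super().__init__()
        self.experts = nn.ModuleList(Expert(dim) for _ in range(n_experts))

    def forward(self, x):
        for expert in self.experts:                  # sequential refinement
            x = x + expert(x)
        return x


tokens = torch.randn(8, 64)
print(MoELayer(64, n_experts=4)(tokens).shape)        # torch.Size([8, 64])
print(ChainOfExperts(64, n_experts=4)(tokens).shape)  # torch.Size([8, 64])
```

Even in this toy form, the memory intuition is visible: the chain reuses a single activation stream, while the MoE layer must hold k parallel expert outputs per token before mixing them. The actual chain-of-experts design is more involved than this sketch.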
Chinese AI startup DeepSeek upgrades its V3 model with the V3-0324 update, enhancing programming capabilities and shifting to ...
TikTok owner ByteDance said it has achieved a 1.71 times efficiency improvement in large language model (LLM) training through an optimised Mixture-of-Experts (MoE) system, according to a recently published paper, making it the latest Chinese tech company to claim a breakthrough that could potentially ...
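The report above doesn't detail the optimisation, but a common lever in MoE training systems is overlapping the communication that shuffles tokens between experts with the expert computation itself. The sketch below illustrates only that general pattern, using host-to-GPU copies as a stand-in for MoE all-to-all traffic; every name and size here is an assumption, not ByteDance's method.

```python
import torch
import torch.nn as nn

# Toy "expert" and token chunks; the sizes are arbitrary.
device = "cuda" if torch.cuda.is_available() else "cpu"
expert = nn.Linear(1024, 1024).to(device)
chunks = [torch.randn(4096, 1024, pin_memory=(device == "cuda"))
          for _ in range(8)]

if device == "cuda":
    copy_stream = torch.cuda.Stream()
    current = chunks[0].to(device, non_blocking=True)
    outputs = []
    for i in range(len(chunks)):
        nxt = None
        if i + 1 < len(chunks):
            # Prefetch the next chunk on a side stream while the
            # default stream runs the expert on the current chunk.
            with torch.cuda.stream(copy_stream):
                nxt = chunks[i + 1].to(device, non_blocking=True)
        outputs.append(expert(current))              # compute overlaps copy
        if nxt is not None:
            torch.cuda.current_stream().wait_stream(copy_stream)
            current = nxt
    torch.cuda.synchronize()
else:
    outputs = [expert(c) for c in chunks]            # no overlap without a GPU
```

In real MoE training the transfers are inter-GPU all-to-all collectives rather than host copies, and fine-grained scheduling of that overlap is where efficiency gains of this kind typically come from.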
Aardvark Weather, developed at the University of Cambridge, uses AI to deliver fast, accurate forecasts in minutes with minimal ...
Alibaba has upgraded its Quark AI assistant with the Qwen reasoning model, enhancing its ability to process complex queries ...