Chain-of-experts chains LLM experts in a sequence, outperforming mixture-of-experts (MoE) with lower memory and compute costs.
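The snippet above only gestures at the idea, so here is a minimal, hypothetical PyTorch sketch of sequential expert processing: each token is routed to one expert per step, and the result is folded back into the representation before the next routing decision. The layer sizes, number of steps, single-expert-per-step routing, and residual update are illustrative assumptions for this example, not the published chain-of-experts design.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class ChainOfExperts(nn.Module):
    """Toy chain-of-experts block (illustrative only): rather than mixing
    experts in parallel, the token representation is refined over several
    sequential steps, with a fresh routing decision at each step."""
    def __init__(self, d_model=64, d_hidden=256, n_experts=8, n_steps=3):
        super().__init__()
        self.router = nn.Linear(d_model, n_experts)
        self.experts = nn.ModuleList([
            nn.Sequential(nn.Linear(d_model, d_hidden), nn.GELU(),
                          nn.Linear(d_hidden, d_model))
            for _ in range(n_experts)
        ])
        self.n_steps = n_steps

    def forward(self, x):                          # x: (tokens, d_model)
        h = x
        for _ in range(self.n_steps):
            probs = F.softmax(self.router(h), dim=-1)
            idx = probs.argmax(dim=-1)             # one expert per token per step (assumption)
            step_out = torch.zeros_like(h)
            for e, expert in enumerate(self.experts):
                mask = idx == e                    # tokens routed to expert e this step
                if mask.any():
                    step_out[mask] = probs[mask, e].unsqueeze(-1) * expert(h[mask])
            h = h + step_out                       # residual: the next step sees earlier experts' work
        return h

x = torch.randn(16, 64)
print(ChainOfExperts()(x).shape)                   # torch.Size([16, 64])
```

The point of the sequential structure is that later experts can condition on what earlier experts produced, which is what distinguishes the chained arrangement from the parallel mixture sketched further below.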
Ant Group, the fintech affiliate of Alibaba, said its Ling-Plus-Base model can be "effectively trained on lower-performance devices".
Mixture of experts: The method behind DeepSeek's frugal success. The approach relies on a method called "mixture of experts." Traditional AI models try to learn everything in one giant neural network. That's like stuffing all knowledge into a single brain: inefficient and power-hungry.
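As a rough illustration of the idea described above, the following hypothetical PyTorch sketch shows a sparse top-k mixture-of-experts layer: a small router scores the experts and only the top two run per token, so most parameters stay idle on any given forward pass. The dimensions, expert count, and top-k weighting here are arbitrary choices for the example, not DeepSeek's actual architecture.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class TopKMoE(nn.Module):
    """Toy sparse mixture-of-experts layer (illustrative only): a router
    picks the top-k experts per token, so only a fraction of the total
    parameters is active for any single token."""
    def __init__(self, d_model=64, d_hidden=256, n_experts=8, k=2):
        super().__init__()
        self.router = nn.Linear(d_model, n_experts)
        self.experts = nn.ModuleList([
            nn.Sequential(nn.Linear(d_model, d_hidden), nn.GELU(),
                          nn.Linear(d_hidden, d_model))
            for _ in range(n_experts)
        ])
        self.k = k

    def forward(self, x):                      # x: (tokens, d_model)
        scores = self.router(x)                # (tokens, n_experts)
        topk_val, topk_idx = scores.topk(self.k, dim=-1)
        weights = F.softmax(topk_val, dim=-1)  # mix only the selected experts
        out = torch.zeros_like(x)
        for slot in range(self.k):
            for e, expert in enumerate(self.experts):
                mask = topk_idx[:, slot] == e  # tokens whose slot-th choice is expert e
                if mask.any():
                    out[mask] += weights[mask, slot].unsqueeze(-1) * expert(x[mask])
        return out

x = torch.randn(16, 64)
print(TopKMoE()(x).shape)                      # torch.Size([16, 64])
```

The frugality comes from the sparsity: with 8 experts and k=2, roughly three quarters of the expert parameters are untouched per token, which is the general mechanism the articles credit for DeepSeek's lower training and inference costs.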
TikTok owner ByteDance said it has achieved a 1.71 times efficiency improvement in large language model (LLM) training, the latest Chinese tech company to report a breakthrough that could potentially ...
Ant used domestic chips to train models using Mixture of Experts machine learning approach. Jack Ma-backed Ant Group Co. used Chinese-made semiconductors to develop techniques for training AI ...