DeepSeek AI Model Efficiency

DeepSeek V4 points to growing use of Huawei chips in AI models

DeepSeek’s V4 may run on Huawei chips instead of NVIDIA hardware, reflecting a shift toward domestic AI infrastructure in ...

10d

DeepSeek R1 Benchmarks: $80 Raspberry Pi vs $250 Jetson vs $1000 Mac

Performance varied significantly, with the MacBook Air M3 achieving the fastest speed (72 tokens/second), followed by the ...

12d on MSN

What Google's TurboQuant can and can't do for AI's spiraling cost

The Chosun Ilbo on MSN

NVIDIA, DeepSeek, Huawei compete in AI memory efficiency race

“The current biggest bottleneck in AI infrastructure expansion is a shortage of memory chips,” said Brad Lightcap, OpenAI Chief Operating Officer (COO), at a recent forum. “The AI industry has overcome ...

Forbes

The Jevons Paradox: Flawed Consensus View On Efficiency

When DeepSeek released its R1 model in late January 2025, claiming ...

10d

Google’s TurboQuant Marks A Turning Point In AI’s Evolution

Google’s TurboQuant could cut LLM memory use sixfold, signaling a shift from brute-force scaling to efficiency and broader AI ...

Hosted on MSN

DeepSeek touts new training method as China pushes AI efficiency

(Bloomberg) -- DeepSeek published a paper outlining a more efficient approach to developing AI, illustrating the Chinese artificial intelligence industry’s effort to compete with the likes of OpenAI ...

15d

IndexCache, a new sparse attention optimizer, delivers 1.82x faster inference on long-context AI models

Researchers at Tsinghua University and Z.ai built IndexCache to eliminate redundant computation in sparse attention models ...
