Deepseek LLM Advanced Language Model

DeepSeek’s new model sees text differently, opening new possibilities for enterprise AI

Hello and welcome to Eye on AI. In this edition: DeepSeek defies AI convention (again)…Meta’s AI layoffs…More legal trouble for OpenAI…and what AI gets wrong about the news. Hi, Beatrice Nolan here, ...

Nature

‘Another DeepSeek moment’: Chinese AI model Kimi K2 stirs excitement

Excitement is growing among researchers about another powerful artificial intelligence (AI) model to emerge from China, after DeepSeek shocked the world with its launch of R1 in January. The ...

Digi Times

Model wars escalate: Baidu, Alibaba, DeepSeek race to dominate China's LLM frontier

In the lead-up to China's Labor Day Golden Week, the country's AI sector is experiencing a flurry of large language model (LLM) upgrades. Baidu and Alibaba have rolled out new flagship models, while ...

CNBC

DeepSeek hints latest model will be compatible with China’s ‘next generation’ homegrown AI chips

DeepSeek hinted that China will have homegrown "next generation" chips to support its AI models. Its mention of China's coming next-generation chips may signal plans to work more closely with China's ...

VentureBeat

DeepSeek-V3.1-Terminus launches with improved agentic tool use and reduced language mixing errors

DeepSeek, the Chinese AI startup spun off of Hong Kong high-frequency trading firm High Flyer Capital Management (and which uses a whale icon for its logo), is back today with a new large language ...

NextBigFuture

Qwen 2.5 Coder and Qwen 3 Lead in Open Source LLM Over DeepSeek and Meta

Qwen 2.5 Coder/Max is currently the top open-source model for coding, with the highest HumanEval (~70–72%), LiveCodeBench (70.7), and Elo (2056) scores among open models. DeepSeek V3/Coder V2 remains ...

Seeking Alpha

DeepSeek shows off new agentic AI model that surpasses R1 in key areas

On Thursday, Chinese AI startup DeepSeek (DEEPSEEK) officially launched its updated DeepSeek-V3.1 AI model, which surpasses its R1 model on key benchmarks. The company unveiled V3.1 earlier this week.

Gizmodo

We Finally Know How Much It Cost to Train China’s Astonishing DeepSeek Model

Remember when DeepSeek briefly shook up the entire artificial intelligence industry by launching its large language model, R1, that was trained for a fraction of the money that OpenAI and other big ...

VentureBeat

Nvidia's new Llama-3.1 Nemotron Ultra outperforms DeepSeek R1 at half the size

Even as Meta fends off questions and criticisms of its new Llama 4 model family, graphics processing unit (GPU) master Nvidia has released a new, fully open source large language model (LLM) based on ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results