Hello and welcome to Eye on AI. In this edition: DeepSeek defies AI convention (again)…Meta’s AI layoffs…More legal trouble for OpenAI…and what AI gets wrong about the news. Hi, Beatrice Nolan here, ...
Excitement is growing among researchers about another powerful artificial intelligence (AI) model to emerge from China, after DeepSeek shocked the world with its launch of R1 in January. The ...
In the lead-up to China's Labor Day Golden Week, the country's AI sector is experiencing a flurry of large language model (LLM) upgrades. Baidu and Alibaba have rolled out new flagship models, while ...
DeepSeek hinted that China will have homegrown "next generation" chips to support its AI models. Its mention of China's coming next-generation chips may signal plans to work more closely with China's ...
DeepSeek, the Chinese AI startup spun off from Hong Kong high-frequency trading firm High-Flyer Capital Management (and which uses a whale icon as its logo), is back today with a new large language ...
Qwen 2.5 Coder/Max is currently the top open-source model for coding, with the highest HumanEval (~70–72%), LiveCodeBench (70.7), and Elo (2056) scores among open models. DeepSeek V3/Coder V2 remains ...
On Thursday, Chinese AI startup DeepSeek officially launched its updated DeepSeek-V3.1 AI model, which surpasses its R1 model on key benchmarks. The company unveiled V3.1 earlier this week.
Remember when DeepSeek briefly shook up the entire artificial intelligence industry by launching its large language model, R1, which was trained for a fraction of the money that OpenAI and other big ...
Even as Meta fends off questions and criticisms of its new Llama 4 model family, graphics processing unit (GPU) master Nvidia has released a new, fully open source large language model (LLM) based on ...