Brianna Wu

@briannawu@mstdn.social

Doing media today. Do we have any evidence that DeepSeek is computationally less expensive beyond China's claims?

January 29, 2025 at 11:55:13 AM

Well, you can download and run R1 locally, where you could presumably benchmark it.
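To make "benchmark it" concrete: here's a minimal sketch of timing a locally served distilled R1 model. It assumes Ollama is running on its default port and that you've already pulled a deepseek-r1 tag (the tag name, endpoint, and response fields below are my assumptions about Ollama's /api/generate API, not anything claimed in this thread):

```python
# Rough tokens-per-second check against a locally served distilled R1 model.
# Assumes Ollama is running locally and a deepseek-r1 tag has been pulled
# beforehand (e.g. a 7B distill); tag name and response fields are assumptions.
import requests

OLLAMA_URL = "http://localhost:11434/api/generate"
MODEL = "deepseek-r1:7b"  # hypothetical tag; use whichever distill you pulled

resp = requests.post(
    OLLAMA_URL,
    json={"model": MODEL, "prompt": "Explain KV caching in one paragraph.", "stream": False},
    timeout=600,
)
data = resp.json()

# Ollama reports eval_count (generated tokens) and eval_duration (nanoseconds).
tokens = data.get("eval_count", 0)
duration_s = (data.get("eval_duration") or 1) / 1e9
print(f"{tokens} tokens in {duration_s:.1f}s -> {tokens / duration_s:.1f} tok/s")
print(data.get("response", "")[:200])
```

Of course, this only tells you how cheap the model is to run on your own hardware, not what it cost to train.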

But how much computational power went into the model they're working with?

I think the only evidence that it took less computing power is (1) their supposedly lower budget, and (2) the fact that they aren't allowed to buy the nicer NVIDIA chips.

So yeah there’s some “trust us bro” in there for sure.

I literally just watched this. The presenter is a long-time reviewer of the Raspberry Pi and similar hardware. (I have not tried any of it myself, but I used to know him years ago and I trust him to be honest about his findings.)

youtube.com/watch?v=o1sN1lB76E

"OpenAI's nightmare: Deepseek R1 on a Raspberry Pi" (YouTube)

You have to differentiate between training costs and running costs. Running costs can be independently verified since the model is open source; training costs cannot.
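To put a number on why the training side is the "trust us" part: the widely quoted dollar figure is just self-reported GPU-hours multiplied by an assumed rental rate. A minimal sketch of that arithmetic, using the figures DeepSeek published for the V3 base model; both inputs are their own claims, not independent measurements, which is exactly the point:

```python
# Back-of-the-envelope training-cost arithmetic. Both numbers below are
# DeepSeek's own published figures for the V3 base model, not independently
# verified -- the resulting dollar amount is only as trustworthy as they are.
reported_gpu_hours = 2.788e6   # H800 GPU-hours claimed in the V3 report
assumed_rate_usd = 2.0         # $/GPU-hour rental rate assumed in that report

estimated_cost = reported_gpu_hours * assumed_rate_usd
print(f"Claimed training cost: ${estimated_cost / 1e6:.2f}M")  # roughly $5.6M
```

Verifying the running costs (as in the Raspberry Pi video) tells you nothing about whether those reported GPU-hours are accurate.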
