Skip to content

Main menu

Search
Search

Search for:

Open-source DeepSeek-R1 uses pure reinforcement learning to match OpenAI o1 — at 95% less cost

January 20, 2025 By admin

The company developed DeepSeek-R1 by using pure reinforcement learning on top of DeepSeek-V3-Base, and matched or beat o1 on some benchmarks.

Post navigation

← DeepSeek claims its reasoning model beats OpenAI's o1 on certain benchmarks

Quantum Blockchain’s AI breakthrough in real-time mining →

Search

Recent Posts

Judge Blasts Lawyer Caught Using ChatGPT in Divorce Court, Orders Him to Take Remedial Law Classes
Extra: America's Anxiety Over Artificial Intelligence
Is Wall Street losing faith in AI?
‘Breaking Bad’ creator’s new show ‘Pluribus’ was emphatically ‘made by humans,’ not AI
Some people love AI, others hate it. Here's why.

Recent Comments

No comments to show.

Made with ❤ in Jordan