Reinforcement Learning and LLM Post-Training: How Does DeepSeek R1 Gain Reasoning Capabilities?

Published: May 04, 2025

This is a very detailed document on RL and LLM reasoning. I forked this page from Wei-Min Lu.

Link to the blog

Published: January 01, 2199

Time set to 2199 :)