Aionda

Tag: grpo

1 articles available

View all tags View all posts

CommunityJan 31, 20262026-01-31

DeepSeek-R1: Enhancing Reasoning Efficiency Through Reinforcement Learning and GRPO

Explore how DeepSeek-R1 achieves self-correction through RL and optimizes reasoning efficiency using the GRPO algorithm.