Efficient Reinforcement Learning with Semantic and Token Entropy for LLM Reasoning
Hongye Cao, Zhixin Bai et al.
TLDR: This paper presents a novel reinforcement learning framework that uses semantic and token entropy to improve reasoning in large language models, outperforming existing methods across multiple benchmarks.