What if You Could Control How Long a Reasoning Model “Thinks”? CMU Researchers Introduce L1-1.5B: Reinforcement Learning Optimizes AI Thought Process

Reasoning language models can improve their performance by generating longer chain-of-thought sequences during inference, effectively leveraging increased […]