Scaling up Test-Time Compute with Latent Reasoning: A Recurrent Depth Approach
A novel language model architecture scales test-time computation by reasoning in latent space: it iterates a recurrent block to arbitrary depth at inference, improving performance on reasoning benchmarks up to a compute load equivalent to a 50B-parameter model, without specialized training data or large context windows.
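The recurrent block idea can be illustrated with a toy sketch: embed the input once, apply one shared block repeatedly to a latent state, then decode. This is not the paper's implementation. The class name `RecurrentDepthToy`, the dense random weights, the `tanh` update, and the zero initial state are all assumptions chosen to keep the example small and dependency-free.

```python
import math
import random


def make_matrix(rows, cols, rng, scale=0.5):
    """Random dense weights that stand in for trained parameters."""
    return [[rng.uniform(-scale, scale) for _ in range(cols)] for _ in range(rows)]


def matvec(matrix, vector):
    """Plain matrix-vector product on Python lists."""
    return [sum(w * x for w, x in zip(row, vector)) for row in matrix]


class RecurrentDepthToy:
    """Hypothetical toy model: embed once, iterate one shared block, decode once."""

    def __init__(self, d_in, d_latent, d_out, seed=0):
        rng = random.Random(seed)
        self.d_latent = d_latent
        self.w_embed = make_matrix(d_latent, d_in, rng)       # input -> latent
        self.w_state = make_matrix(d_latent, d_latent, rng)   # shared recurrent weights
        self.w_inject = make_matrix(d_latent, d_latent, rng)  # re-injects the embedded input
        self.w_out = make_matrix(d_out, d_latent, rng)        # latent -> output

    def forward(self, x, num_iterations):
        embedded = matvec(self.w_embed, x)   # computed once per input
        state = [0.0] * self.d_latent        # assumed initial latent state
        for _ in range(num_iterations):
            # The same weights are reused at every step, so extra iterations
            # add compute without adding parameters.
            from_state = matvec(self.w_state, state)
            from_input = matvec(self.w_inject, embedded)
            state = [math.tanh(a + b) for a, b in zip(from_state, from_input)]
        return matvec(self.w_out, state)


model = RecurrentDepthToy(d_in=3, d_latent=4, d_out=2)
x = [1.0, -0.5, 0.25]
shallow = model.forward(x, num_iterations=1)   # little test-time compute
deep = model.forward(x, num_iterations=16)     # more compute, same parameters
```

The only knob that changes between the two calls is `num_iterations`, which is the sense in which compute scales at test time: depth is chosen per call rather than fixed by the parameter count.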