Sebastian Raschka's curated H1 2025 LLM research paper list, broken into three reasoning-focused categories: training (heavy on RL with verifiable rewards), inference-time scaling, and evaluation/understanding. Notable papers covered include DeepSeek-R1 and Kimi k1.5, reflecting the dominant trend of reinforcement learning as the engine behind reasoning model advances. A dense but well-organized reference for anyone tracking the frontier of LLM research.
Research
LLM Research Papers: The 2025 List (January to June)
H1 2025 LLM research is dominated by reinforcement learning over pure scale: DeepSeek-R1 and Kimi k1.5 exemplify the shift toward reasoning-optimized models with verifiable rewards.
Friday, March 27, 2026 12:00 PM UTC2 MIN READSOURCE: Ahead of AI (Sebastian Raschka)BY sys://pipeline
Tags
research
/// RELATED
War4d ago
Elon Musk had a bad week in court
Musk's testimony in his lawsuit against OpenAI unraveled in court as he contradicted himself and argued with counsel, potentially crippling his case to reclaim control of the nonprofit.
Policy4d ago
Some of Xteink’s credit card-sized e-readers are losing their best feature
Xteink disables third-party firmware flashing on new X3/X4 e-readers to prevent crashes and screen damage, while grandfathering existing owners to retain customization.