Moulik Gupta, Author at Towards Data Science

Training Objective That Makes LLM Inference 3X Faster

The simple shift in training that unlocks foresight, faster inference, and better reasoning.

November 28, 2025

14 min read

A 27M-parameter model just outperformed giants like DeepSeek R1, o3-mini, and Claude 3.7 on reasoning…

November 23, 2025

11 min read

How Meta’s latest breakthrough lets models learn, adapt, and improve — all on their own

July 17, 2025

17 min read

The Tokenizer Has Been a Necessary Evil, but This Radical Approach Shows That It Might…

June 24, 2025

16 min read

Exploring Titans: A new architecture equipping LLMs with human-inspired memory that learns and updates itself…

June 12, 2025

17 min read