Deep Learning | Towards Data Science

Decentralized Computation: The Hidden Principle Behind Deep Learning

Xiaocong Yang — Fri, 12 Dec 2025 15:47:00 +0000

Most breakthroughs in deep learning — from simple neural networks to large language models — are built upon a principle that is much older than AI itself: decentralization. Instead of relying on a powerful “central planner” coordinating and commanding the behaviors of other components, modern deep-learning-based AI models succeed because many simple units interact locally […]

The post Decentralized Computation: The Hidden Principle Behind Deep Learning appeared first on Towards Data Science.

Optimizing PyTorch Model Inference on AWS Graviton

Chaim Rand — Wed, 10 Dec 2025 12:00:00 +0000

Tips for accelerating AI/ML on CPU — Part 2

The post Optimizing PyTorch Model Inference on AWS Graviton appeared first on Towards Data Science.

Optimizing PyTorch Model Inference on CPU

Chaim Rand — Mon, 08 Dec 2025 12:00:00 +0000

Flyin’ Like a Lion on Intel Xeon

The post Optimizing PyTorch Model Inference on CPU appeared first on Towards Data Science.

On the Challenge of Converting TensorFlow Models to PyTorch

Chaim Rand — Fri, 05 Dec 2025 12:30:00 +0000

How to upgrade and optimize legacy AI/ML models

The post On the Challenge of Converting TensorFlow Models to PyTorch appeared first on Towards Data Science.

Do Labels Make AI Blind? Self-Supervision Solves the Age-Old Binding Problem

Jonathan Williford — Thu, 04 Dec 2025 17:30:00 +0000

A new NeurIPS 2025 paper shows how self-supervised learning imbues ViT with better image understanding than supervised learning

The post Do Labels Make AI Blind? Self-Supervision Solves the Age-Old Binding Problem appeared first on Towards Data Science.

Overcoming the Hidden Performance Traps of Variable-Shaped Tensors: Efficient Data Sampling in PyTorch

Chaim Rand — Wed, 03 Dec 2025 17:00:00 +0000

PyTorch Model Performance Analysis and Optimization — Part 11

The post Overcoming the Hidden Performance Traps of Variable-Shaped Tensors: Efficient Data Sampling in PyTorch appeared first on Towards Data Science.

Neural Networks Are Blurry, Symbolic Systems Are Fragmented. Sparse Autoencoders Help Us Combine Them.

Xiaocong Yang — Thu, 27 Nov 2025 17:24:06 +0000

Neural and symbolic models compress the world in fundamentally different ways, and Sparse Autoencoders (SAEs) offer a bridge to connect them.

The post Neural Networks Are Blurry, Symbolic Systems Are Fragmented. Sparse Autoencoders Help Us Combine Them. appeared first on Towards Data Science.

PyTorch Tutorial for Beginners: Build a Multiple Regression Model from Scratch

Gustavo Santos — Wed, 19 Nov 2025 14:00:00 +0000

Hands-on PyTorch: Building a 3-layer neural network for multiple regression

The post PyTorch Tutorial for Beginners: Build a Multiple Regression Model from Scratch appeared first on Towards Data Science.

Understanding Convolutional Neural Networks (CNNs) Through Excel

angela shi — Mon, 17 Nov 2025 19:54:54 +0000

Deep learning is often seen as a black box. We know that it learns from data, but we rarely stop to ask how it truly learns.
What if we could open that box and watch each step happen right before our eyes?
With Excel, we can do exactly that, see how numbers turn into patterns, and how simple calculations become the foundation of what we call “deep learning.”
In this article, we will build a tiny Convolutional Neural Network (CNN) directly in Excel to understand, step by step, how machines detect shapes, patterns, and meaning in images.

The post Understanding Convolutional Neural Networks (CNNs) Through Excel appeared first on Towards Data Science.

AI Papers to Read in 2025

Ygor Serpa — Wed, 05 Nov 2025 21:47:58 +0000

And Why They Matter for Anyone Working With AI

The post AI Papers to Read in 2025 appeared first on Towards Data Science.

RF-DETR Under the Hood: The Insights of a Real-Time Transformer Detection

David Redó Nieto — Fri, 31 Oct 2025 12:30:00 +0000

From rigid grids to adaptive attention, this is the evolutionary path that made detection transformers fast, flexible, and formidable.

The post RF-DETR Under the Hood: The Insights of a Real-Time Transformer Detection appeared first on Towards Data Science.

How to Classify Lung Cancer Subtype from DNA Copy Numbers Using PyTorch

Adam Streck — Fri, 17 Oct 2025 17:02:09 +0000

A step-by-step introduction to understanding cancer from the perspective of a data scientist.

The post How to Classify Lung Cancer Subtype from DNA Copy Numbers Using PyTorch appeared first on Towards Data Science.

MobileNetV2 Paper Walkthrough: The Smarter Tiny Giant

Muhammad Ardi — Fri, 03 Oct 2025 12:30:00 +0000

Understanding and implementing MobileNetV2 with PyTorch — the next generation of MobileNetV1

The post MobileNetV2 Paper Walkthrough: The Smarter Tiny Giant appeared first on Towards Data Science.

How to Improve the Efficiency of Your PyTorch Training Loop

Andrea D'Agostino — Wed, 01 Oct 2025 19:16:04 +0000

Learn how to diagnose and resolve bottlenecks in PyTorch using the num_workers, pin_memory, and profiler parameters to maximize training performance.

The post How to Improve the Efficiency of Your PyTorch Training Loop appeared first on Towards Data Science.

Learning Triton One Kernel At a Time: Vector Addition

Ryan Pégoud — Sat, 27 Sep 2025 16:00:00 +0000

The basics of GPU programming, optimisation, and your first Triton kernel

The post Learning Triton One Kernel At a Time: Vector Addition appeared first on Towards Data Science.

PyTorch Explained: From Automatic Differentiation to Training Custom Neural Networks

Avishek Biswas — Wed, 24 Sep 2025 18:19:59 +0000

Deep learning is shaping our world as we speak. In fact, it has been slowly revolutionizing software since the early 2010s. In 2025, PyTorch is at the forefront of this revolution, emerging as one of the most important libraries to train neural networks. Whether you are working with computer vision, building large language models (LLMs), […]

The post PyTorch Explained: From Automatic Differentiation to Training Custom Neural Networks appeared first on Towards Data Science.

The SyncNet Research Paper, Clearly Explained

Aman Agrawal — Sat, 20 Sep 2025 14:00:00 +0000

A Deep Dive into "Out of Time: Automated Lip Sync in the Wild"

The post The SyncNet Research Paper, Clearly Explained appeared first on Towards Data Science.

MobileNetV1 Paper Walkthrough: The Tiny Giant

Muhammad Ardi — Thu, 04 Sep 2025 17:35:47 +0000

Understanding and implementing MobileNetV1 from scratch with PyTorch

The post MobileNetV1 Paper Walkthrough: The Tiny Giant appeared first on Towards Data Science.

Positional Embeddings in Transformers: A Math Guide to RoPE & ALiBi

Sathya Krishnan Suresh — Tue, 26 Aug 2025 14:00:00 +0000

Learn APE, RoPE, and ALiBi positional embeddings for GPT — intuitions, math, PyTorch code, and experiments on TinyStories

The post Positional Embeddings in Transformers: A Math Guide to RoPE & ALiBi appeared first on Towards Data Science.

From Genes to Neural Networks: Understanding and Building NEAT (Neuro-Evolution of Augmenting Topologies) from Scratch

Carlos Redondo — Mon, 11 Aug 2025 18:03:44 +0000

Practical Neuroevolution: Reproducing NEAT’s Innovations and Code Walkthrough

The post From Genes to Neural Networks: Understanding and Building NEAT (Neuro-Evolution of Augmenting Topologies) from Scratch appeared first on Towards Data Science.