Machine Learning | Towards Data Science https://towardsdatascience.com/tag/machine-learning/ Publish AI, ML & data-science insights to a global community of data professionals. Tue, 16 Dec 2025 09:35:11 +0000 en-US hourly 1 https://wordpress.org/?v=6.8.3 https://towardsdatascience.com/wp-content/uploads/2025/02/cropped-Favicon-32x32.png Machine Learning | Towards Data Science https://towardsdatascience.com/tag/machine-learning/ 32 32 Lessons Learned from Upgrading to LangChain 1.0 in Production https://towardsdatascience.com/lessons-learnt-from-upgrading-to-langchain-1-0-in-production/ Mon, 15 Dec 2025 10:30:00 +0000 https://towardsdatascience.com/?p=607893 What worked, what broke, and why I did it

The post Lessons Learned from Upgrading to LangChain 1.0 in Production appeared first on Towards Data Science.

]]>
The Machine Learning “Advent Calendar” Day 14: Softmax Regression in Excel https://towardsdatascience.com/the-machine-learning-advent-calendar-day-14-softmax-regression-in-excel/ Sun, 14 Dec 2025 18:12:00 +0000 https://towardsdatascience.com/?p=607910 Softmax Regression is simply Logistic Regression extended to multiple classes.

By computing one linear score per class and normalizing them with Softmax, we obtain multiclass probabilities without changing the core logic.

The loss, the gradients, and the optimization remain the same.
Only the number of parallel scores increases.

Implemented in Excel, the model becomes transparent: you can see the scores, the probabilities, and how the coefficients evolve over time.

The post The Machine Learning “Advent Calendar” Day 14: Softmax Regression in Excel appeared first on Towards Data Science.

]]>
The Skills That Bridge Technical Work and Business Impact https://towardsdatascience.com/the-skills-that-bridge-technical-work-and-business-impact/ Sun, 14 Dec 2025 14:30:29 +0000 https://towardsdatascience.com/?p=607866 In the Author Spotlight series, TDS Editors chat with members of our community about their career path in data science and AI, their writing, and their sources of inspiration. Today, we’re thrilled to share our conversation with Maria Mouschoutzi.  Maria is a Data Analyst and Project Manager with a strong background in Operations Research, Mechanical […]

The post The Skills That Bridge Technical Work and Business Impact appeared first on Towards Data Science.

]]>
The Machine Learning “Advent Calendar” Day 13: LASSO and Ridge Regression in Excel https://towardsdatascience.com/the-machine-learning-advent-calendar-day-13-lasso-and-ridge-regression-in-excel/ Sat, 13 Dec 2025 16:56:00 +0000 https://towardsdatascience.com/?p=607908 Ridge and Lasso regression are often perceived as more complex versions of linear regression. In reality, the prediction model remains exactly the same. What changes is the training objective. By adding a penalty on the coefficients, regularization forces the model to choose more stable solutions, especially when features are correlated. Implementing Ridge and Lasso step by step in Excel makes this idea explicit: regularization does not add complexity, it adds preference.

The post The Machine Learning “Advent Calendar” Day 13: LASSO and Ridge Regression in Excel appeared first on Towards Data Science.

]]>
NeurIPS 2025 Best Paper Review: Qwen’s Systematic Exploration of Attention Gating https://towardsdatascience.com/neurips-2025-best-paper-review-qwens-systematic-exploration-of-attention-gating/ Sat, 13 Dec 2025 10:16:00 +0000 https://towardsdatascience.com/?p=607899 This one little trick can bring about enhanced training stability, the use of larger learning rates and improved scaling properties

The post NeurIPS 2025 Best Paper Review: Qwen’s Systematic Exploration of Attention Gating appeared first on Towards Data Science.

]]>
The Machine Learning “Advent Calendar” Day 12: Logistic Regression in Excel https://towardsdatascience.com/the-machine-learning-advent-calendar-day-12-logistic-regression-in-excel/ Fri, 12 Dec 2025 17:15:00 +0000 https://towardsdatascience.com/?p=607901 In this article, we rebuild Logistic Regression step by step directly in Excel.
Starting from a binary dataset, we explore why linear regression struggles as a classifier, how the logistic function fixes these issues, and how log-loss naturally appears from the likelihood.
With a transparent gradient-descent table, you can watch the model learn at each iteration—making the whole process intuitive, visual, and surprisingly satisfying.

The post The Machine Learning “Advent Calendar” Day 12: Logistic Regression in Excel appeared first on Towards Data Science.

]]>
Decentralized Computation: The Hidden Principle Behind Deep Learning https://towardsdatascience.com/the-power-of-decentralization/ Fri, 12 Dec 2025 15:47:00 +0000 https://towardsdatascience.com/?p=607888 Most breakthroughs in deep learning — from simple neural networks to large language models — are built upon a principle that is much older than AI itself: decentralization. Instead of relying on a powerful “central planner” coordinating and commanding the behaviors of other components, modern deep-learning-based AI models succeed because many simple units interact locally […]

The post Decentralized Computation: The Hidden Principle Behind Deep Learning appeared first on Towards Data Science.

]]>
The Machine Learning “Advent Calendar” Day 11: Linear Regression in Excel https://towardsdatascience.com/the-machine-learning-advent-calendar-day-11-linear-regression-in-excel/ Thu, 11 Dec 2025 16:31:00 +0000 https://towardsdatascience.com/?p=607891 Linear Regression looks simple, but it introduces the core ideas of modern machine learning: loss functions, optimization, gradients, scaling, and interpretation.
In this article, we rebuild Linear Regression in Excel, compare the closed-form solution with Gradient Descent, and see how the coefficients evolve step by step.
This foundation naturally leads to regularization, kernels, classification, and the dual view.
Linear Regression is not just a straight line, but the starting point for many models we will explore next in the Advent Calendar.

The post The Machine Learning “Advent Calendar” Day 11: Linear Regression in Excel appeared first on Towards Data Science.

]]>
Optimizing PyTorch Model Inference on AWS Graviton https://towardsdatascience.com/optimizing-pytorch-model-inference-on-aws-graviton/ Wed, 10 Dec 2025 12:00:00 +0000 https://towardsdatascience.com/?p=607814 Tips for accelerating AI/ML on CPU — Part 2

The post Optimizing PyTorch Model Inference on AWS Graviton appeared first on Towards Data Science.

]]>
The Machine Learning “Advent Calendar” Day 9: LOF in Excel https://towardsdatascience.com/the-machine-learning-advent-calendar-day-9-lof-in-excel/ Tue, 09 Dec 2025 17:45:00 +0000 https://towardsdatascience.com/?p=607869 In this article, we explore LOF through three simple steps: distances and neighbors, reachability distances, and the final LOF score. Using tiny datasets, we see how two anomalies can look obvious to us but completely different to different algorithms. This reveals the key idea of unsupervised learning: there is no single “true” outlier, only definitions. Understanding these definitions is the real skill.

The post The Machine Learning “Advent Calendar” Day 9: LOF in Excel appeared first on Towards Data Science.

]]>
Personal, Agentic Assistants: A Practical Blueprint for a Secure, Multi-User, Self-Hosted Chatbot https://towardsdatascience.com/personal-agentic-assistants-a-practical-blueprint-for-a-secure-multi-user-self-hosted-chatbot/ Tue, 09 Dec 2025 16:30:00 +0000 https://towardsdatascience.com/?p=607863 Build a self-hosted, end-to-end platform that gives each user a personal, agentic chatbot that can autonomously vector-search through files that the user explicitly allows it to access.

The post Personal, Agentic Assistants: A Practical Blueprint for a Secure, Multi-User, Self-Hosted Chatbot appeared first on Towards Data Science.

]]>
A Realistic Roadmap to Start an AI Career in 2026 https://towardsdatascience.com/a-realistic-roadmap-to-start-an-ai-career-in-2026/ Tue, 09 Dec 2025 12:00:00 +0000 https://towardsdatascience.com/?p=607855 How to learn AI in 2026 through real, usable projects

The post A Realistic Roadmap to Start an AI Career in 2026 appeared first on Towards Data Science.

]]>
Bridging the Silence: How LEO Satellites and Edge AI Will Democratize Connectivity https://towardsdatascience.com/bridging-the-silence-how-leo-satellites-and-edge-ai-will-democratize-connectivity/ Mon, 08 Dec 2025 19:00:00 +0000 https://towardsdatascience.com/?p=607853 Why on-device intelligence and low-orbit constellations are the only viable path to universal accessibility

The post Bridging the Silence: How LEO Satellites and Edge AI Will Democratize Connectivity appeared first on Towards Data Science.

]]>
How to Create an ML-Focused Newsletter https://towardsdatascience.com/how-to-create-an-ml-focused-newsletter/ Mon, 08 Dec 2025 13:30:00 +0000 https://towardsdatascience.com/?p=607842 Learn how to make a newsletter with AI tools

The post How to Create an ML-Focused Newsletter appeared first on Towards Data Science.

]]>
The Machine Learning “Advent Calendar” Day 7: Decision Tree Classifier https://towardsdatascience.com/the-machine-learning-advent-calendar-day-7-decision-tree-classifier/ Sun, 07 Dec 2025 14:30:00 +0000 https://towardsdatascience.com/?p=607847 In Day 6, we saw how a Decision Tree Regressor finds its optimal split by minimizing the Mean Squared Error.
Today, for Day 7 of the Machine Learning "Advent Calendar", we switch to classification. With just one numerical feature and two classes, we explore how a Decision Tree Classifier decides where to cut the data, using impurity measures like Gini and Entropy.
Even without doing the math, we can visually guess possible split points. But which one is best? And do impurity measures really make a difference? Let us build the first split step by step in Excel and see what happens.

The post The Machine Learning “Advent Calendar” Day 7: Decision Tree Classifier appeared first on Towards Data Science.

]]>
Artificial Intelligence, Machine Learning, Deep Learning, and Generative AI — Clearly Explained https://towardsdatascience.com/artificial-intelligence-machine-learning-deep-learning-and-generative-ai-clearly-explained/ Sun, 07 Dec 2025 13:00:00 +0000 https://towardsdatascience.com/?p=607834 Understanding AI in 2026 — from machine learning to generative models

The post Artificial Intelligence, Machine Learning, Deep Learning, and Generative AI — Clearly Explained appeared first on Towards Data Science.

]]>
Reading Research Papers in the Age of LLMs https://towardsdatascience.com/reading-research-papers-in-the-age-of-llms/ Sat, 06 Dec 2025 16:00:00 +0000 https://towardsdatascience.com/?p=607833 How I keep up with papers with a mix of manual and AI-assisted reading

The post Reading Research Papers in the Age of LLMs appeared first on Towards Data Science.

]]>
The Machine Learning “Advent Calendar” Day 6: Decision Tree Regressor https://towardsdatascience.com/the-machine-learning-advent-calendar-day-6-decision-tree-regressor/ Sat, 06 Dec 2025 14:30:00 +0000 https://towardsdatascience.com/?p=607840 During the first days of this Machine Learning Advent Calendar, we explored models based on distances. Today, we switch to a completely different way of learning: Decision Trees.
With a simple one-feature dataset, we can see how a tree chooses its first split. The idea is always the same: if humans can guess the split visually, then we can rebuild the logic step by step in Excel.
By listing all possible split values and computing the MSE for each one, we identify the split that reduces the error the most. This gives us a clear intuition of how a Decision Tree grows, how it makes predictions, and why the first split is such a crucial step.

The post The Machine Learning “Advent Calendar” Day 6: Decision Tree Regressor appeared first on Towards Data Science.

]]>
The Machine Learning “Advent Calendar” Day 5: Gaussian Mixture Model in Excel https://towardsdatascience.com/the-machine-learning-advent-calendar-day-5-gmm-in-excel/ Fri, 05 Dec 2025 17:00:00 +0000 https://towardsdatascience.com/?p=607838 This article introduces the Gaussian Mixture Model as a natural extension of k-Means, by improving how distance is measured through variances and the Mahalanobis distance. Instead of assigning points to clusters with hard boundaries, GMM uses probabilities learned through the Expectation–Maximization algorithm – the general form of Lloyd’s method.

Using simple Excel formulas, we implement EM step by step in 1D and 2D, and we visualise how the Gaussian curves or ellipses move during training. The means shift, the variances adjust, and the shapes gradually settle around the true structure of the data.

GMM provides a richer, more flexible way to model clusters, and becomes intuitive once the process is made visible in a spreadsheet.

The post The Machine Learning “Advent Calendar” Day 5: Gaussian Mixture Model in Excel appeared first on Towards Data Science.

]]>
On the Challenge of Converting TensorFlow Models to PyTorch https://towardsdatascience.com/on-the-challenge-of-converting-tensorflow-models-to-pytorch/ Fri, 05 Dec 2025 12:30:00 +0000 https://towardsdatascience.com/?p=607800 How to upgrade and optimize legacy AI/ML models

The post On the Challenge of Converting TensorFlow Models to PyTorch appeared first on Towards Data Science.

]]>