Deep Dives | Towards Data Science https://towardsdatascience.com/tag/deep-dives/ Publish AI, ML & data-science insights to a global community of data professionals. Mon, 15 Dec 2025 20:49:55 +0000

The Machine Learning “Advent Calendar” Day 15: SVM in Excel https://towardsdatascience.com/the-machine-learning-advent-calendar-day-15-svm-in-excel/ Mon, 15 Dec 2025 19:41:01 +0000 https://towardsdatascience.com/?p=607912 Instead of starting with margins and geometry, this article builds the Support Vector Machine step by step from familiar models. By changing the loss function and reusing regularization, SVM appears naturally as a linear classifier trained by optimization. This perspective unifies logistic regression, SVM, and other linear models into a single, coherent framework.
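
The "change the loss, keep the regularization" idea can be sketched in a few lines. This is a minimal illustration and not the article's Excel workbook: the function names, the toy 1-D dataset, and the penalty strength `lam` are all assumptions made for the example. The same regularized objective yields a linear SVM when the hinge loss is plugged in, and L2-regularized logistic regression when the logistic loss is plugged in.

```python
import math

def hinge_loss(margin):
    """SVM loss: zero once the point is beyond the margin (y * score >= 1)."""
    return max(0.0, 1.0 - margin)

def logistic_loss(margin):
    """Logistic regression loss, written in terms of the margin y * score."""
    return math.log1p(math.exp(-margin))

def objective(w, b, data, loss, lam):
    """Mean loss over the data plus an L2 penalty on the weights.

    `data` is a list of (features, label) pairs with labels in {-1, +1}.
    """
    total = 0.0
    for x, y in data:
        score = sum(wi * xi for wi, xi in zip(w, x)) + b
        total += loss(y * score)
    return total / len(data) + lam * sum(wi * wi for wi in w)

# Hypothetical 1-D toy data: two negatives on the left, two positives on the right.
toy_data = [([1.0], -1), ([2.0], -1), ([4.0], 1), ([5.0], 1)]

# Same linear model and penalty, two different losses.
svm_value = objective([1.0], -3.0, toy_data, hinge_loss, lam=0.01)
logreg_value = objective([1.0], -3.0, toy_data, logistic_loss, lam=0.01)
```

Any generic optimizer (gradient or subgradient descent, or a spreadsheet solver) could then minimize `objective` over `w` and `b`; only the `loss` argument decides which model comes out.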

The post The Machine Learning “Advent Calendar” Day 15: SVM in Excel appeared first on Towards Data Science.

Spectral Community Detection in Clinical Knowledge Graphs https://towardsdatascience.com/spectral-community-detection-in-clinical-knowledge-graphs/ Fri, 12 Dec 2025 10:30:00 +0000 https://towardsdatascience.com/?p=607884 Introduction How do we identify latent groups of patients in a large cohort? How can we find similarities among patients that go beyond the well-known comorbidity clusters associated with specific diseases? And more importantly, how can we extract quantitative signals that can be analyzed, compared, and reused across different clinical scenarios? The information associated to […]

A Realistic Roadmap to Start an AI Career in 2026 https://towardsdatascience.com/a-realistic-roadmap-to-start-an-ai-career-in-2026/ Tue, 09 Dec 2025 12:00:00 +0000 https://towardsdatascience.com/?p=607855 How to learn AI in 2026 through real, usable projects

The Machine Learning “Advent Calendar” Day 8: Isolation Forest in Excel https://towardsdatascience.com/the-machine-learning-advent-calendar-day-8-isolation-forest-in-excel/ Mon, 08 Dec 2025 18:26:42 +0000 https://towardsdatascience.com/?p=607851 Isolation Forest may look technical, but its idea is simple: isolate points using random splits. If a point is isolated quickly, it is an anomaly; if it takes many splits, it is normal.

Using the tiny dataset 1, 2, 3, 9, we can see the logic clearly. We build several random trees, measure how many splits each point needs, average the depths, and convert them into anomaly scores. Short depths become scores close to 1, long depths close to 0.

The Excel implementation is painful, but the algorithm itself is elegant. It scales to many features, makes no assumptions about distributions, and even works with categorical data. Above all, Isolation Forest asks a different question: not “What is normal?”, but “How fast can I isolate this point?”
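
The depth-to-score logic described above can be sketched as follows, using the same tiny dataset 1, 2, 3, 9. This is a simplified sketch under stated assumptions, not the article's Excel implementation: each point's isolation path is simulated independently instead of building shared trees, the helper names are hypothetical, and the score uses the standard Isolation Forest normalization `2 ** (-mean_depth / c(n))`, with `c(n)` computed from an exact harmonic number.

```python
import random

def isolation_depth(point, values, rng):
    """Count the random splits needed until `point` is alone in its partition."""
    depth = 0
    current = list(values)
    while len(current) > 1:
        lo, hi = min(current), max(current)
        if lo == hi:  # identical values cannot be separated further
            break
        split = rng.uniform(lo, hi)
        # Keep only the side of the split that contains the point.
        if point < split:
            current = [v for v in current if v < split]
        else:
            current = [v for v in current if v >= split]
        depth += 1
    return depth

def average_path_length(n):
    """Normalizing constant c(n) = 2 * H(n - 1) - 2 * (n - 1) / n."""
    if n <= 1:
        return 0.0
    harmonic = sum(1.0 / i for i in range(1, n))  # H(n - 1)
    return 2.0 * harmonic - 2.0 * (n - 1) / n

def anomaly_scores(values, n_trees=500, seed=0):
    """Average the isolation depth per point and convert it to a score in (0, 1]."""
    rng = random.Random(seed)
    c = average_path_length(len(values))
    scores = {}
    for p in values:
        depths = [isolation_depth(p, values, rng) for _ in range(n_trees)]
        mean_depth = sum(depths) / n_trees
        scores[p] = 2.0 ** (-mean_depth / c)
    return scores

scores = anomaly_scores([1, 2, 3, 9])
```

The value 9 is usually cut off by the first split, so its average depth is short and its score is the highest of the four; the value 2 sits between two close neighbors and needs more splits, so its score is lower.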

Optimizing PyTorch Model Inference on CPU https://towardsdatascience.com/optimizing-pytorch-model-inference-on-cpu/ Mon, 08 Dec 2025 12:00:00 +0000 https://towardsdatascience.com/?p=607812 Flyin’ Like a Lion on Intel Xeon

How to Climb the Hidden Career Ladder of Data Science https://towardsdatascience.com/the-hidden-career-ladder-of-data-science/ Sun, 07 Dec 2025 16:00:00 +0000 https://towardsdatascience.com/?p=607836 The behaviors that get you promoted

YOLOv1 Paper Walkthrough: The Day YOLO First Saw the World https://towardsdatascience.com/yolov1-paper-walkthrough-the-day-yolo-first-saw-the-world/ Fri, 05 Dec 2025 14:00:00 +0000 https://towardsdatascience.com/?p=607816 A detailed walkthrough of the YOLOv1 architecture and its PyTorch implementation from scratch

On the Challenge of Converting TensorFlow Models to PyTorch https://towardsdatascience.com/on-the-challenge-of-converting-tensorflow-models-to-pytorch/ Fri, 05 Dec 2025 12:30:00 +0000 https://towardsdatascience.com/?p=607800 How to upgrade and optimize legacy AI/ML models

Build and Deploy Your First Supply Chain App in 20 Minutes https://towardsdatascience.com/build-and-deploy-your-first-supply-chain-app-in-20-minutes/ Thu, 04 Dec 2025 15:00:00 +0000 https://towardsdatascience.com/?p=607808 A factory operator who discovered happiness by switching from notebooks to Streamlit (image generated with GPT-5.1 by Samir Saci)

The Architecture Behind Web Search in AI Chatbots https://towardsdatascience.com/the-architecture-behind-web-search-in-ai-chatbots-2/ Thu, 04 Dec 2025 06:19:55 +0000 https://towardsdatascience.com/?p=607822 And what this means for generative engine optimization (GEO)

JSON Parsing for Large Payloads: Balancing Speed, Memory, and Scalability https://towardsdatascience.com/json-parsing-for-large-payloads-balancing-speed-memory-and-scalability/ Tue, 02 Dec 2025 15:30:00 +0000 https://towardsdatascience.com/?p=607786 Benchmarking JSON libraries for large payloads

The Machine Learning “Advent Calendar” Day 1: k-NN Regressor in Excel https://towardsdatascience.com/day-1-k-nn-regressor-in-excel-how-distance-drives-prediction/ Mon, 01 Dec 2025 19:52:19 +0000 https://towardsdatascience.com/?p=607778 This first day of the Advent Calendar introduces the k-NN regressor, the simplest distance-based model. Using Excel, we explore how predictions rely entirely on the closest observations, why feature scaling matters, and how heterogeneous variables can make distances meaningless. Through examples with continuous and categorical features, including the California Housing and Diamonds datasets, we see the strengths and limitations of k-NN, and why defining the right distance is essential to reflect real-world structure.
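
To illustrate why feature scaling matters for a distance-based model, here is a minimal k-NN regressor sketch. The four-row dataset (rooms, income) and its target values are made up for the example and do not come from the California Housing or Diamonds datasets; the helper names are hypothetical, and standardization is just one possible choice of scaling.

```python
import math

def fit_standardizer(rows):
    """Return a function that rescales each feature to zero mean and unit variance."""
    cols = list(zip(*rows))
    means = [sum(c) / len(c) for c in cols]
    # Fall back to 1.0 for constant columns to avoid dividing by zero.
    stds = [math.sqrt(sum((v - m) ** 2 for v in c) / len(c)) or 1.0
            for c, m in zip(cols, means)]

    def scale(row):
        return [(v - m) / s for v, m, s in zip(row, means, stds)]

    return scale

def knn_predict(train_x, train_y, query, k=1):
    """Average the targets of the k training rows closest to `query` (Euclidean)."""
    ranked = sorted((math.dist(x, query), y) for x, y in zip(train_x, train_y))
    return sum(y for _, y in ranked[:k]) / k

# Hypothetical data: [rooms, income], where income has a much larger numeric range.
train_x = [[2, 30000], [3, 31000], [8, 30500], [9, 90000]]
train_y = [100, 110, 300, 400]
query = [8.5, 31000]

# Raw distances are dominated by income, so the 3-room row looks closest.
raw_prediction = knn_predict(train_x, train_y, query, k=1)

# After standardization both features count, so the 8-room row is closest.
scale = fit_standardizer(train_x)
scaled_prediction = knn_predict([scale(r) for r in train_x], train_y, scale(query), k=1)
```

On raw values the nearest neighbor is chosen almost entirely by income, because a difference of a few hundred in income outweighs any difference in room count. Once both columns are standardized, the neighbor with a similar number of rooms wins instead.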

Why AI Alignment Starts With Better Evaluation https://towardsdatascience.com/why-ai-alignment-starts-with-better-evaluation/ Mon, 01 Dec 2025 13:00:00 +0000 https://towardsdatascience.com/?p=607771 You can’t align what you don’t evaluate

Neural Networks Are Blurry, Symbolic Systems Are Fragmented. Sparse Autoencoders Help Us Combine Them. https://towardsdatascience.com/neuro-symbolic-systems-the-art-of-compromise-2/ Thu, 27 Nov 2025 17:24:06 +0000 https://towardsdatascience.com/?p=607756 Neural and symbolic models compress the world in fundamentally different ways, and Sparse Autoencoders (SAEs) offer a bridge to connect them.

I Cleaned a Messy CSV File Using Pandas. Here’s the Exact Process I Follow Every Time. https://towardsdatascience.com/i-cleaned-a-messy-csv-file-using-pandas-heres-the-exact-process-i-follow-every-time/ Wed, 26 Nov 2025 19:13:17 +0000 https://towardsdatascience.com/?p=607742 Stop guessing at data cleaning. Use this repeatable 5-step Python workflow to diagnose and fix the most common data flaws.

Why CrewAI’s Manager-Worker Architecture Fails — and How to Fix It https://towardsdatascience.com/why-crewais-manager-worker-architecture-fails-and-how-to-fix-it/ Tue, 25 Nov 2025 18:45:38 +0000 https://towardsdatascience.com/?p=607733 A real-world analysis of why CrewAI’s hierarchical orchestration misfires—and a practical fix you can implement today.

Ten Lessons of Building LLM Applications for Engineers https://towardsdatascience.com/ten-lessons-of-building-llm-applications-for-engineers/ Tue, 25 Nov 2025 13:00:00 +0000 https://towardsdatascience.com/?p=607729 Practical field notes on workflows, structure, and evaluation from two years of building with engineering domain experts.

Your Next ‘Large’ Language Model Might Not Be Large After All https://towardsdatascience.com/your-next-large-language-model-might-not-be-large-afterall-2/ Sun, 23 Nov 2025 14:00:00 +0000 https://towardsdatascience.com/?p=607711 A 27M-parameter model just outperformed giants like DeepSeek R1, o3-mini, and Claude 3.7 on reasoning tasks

Generative AI Will Redesign Cars, But Not the Way Automakers Think https://towardsdatascience.com/generative-ai-will-redesign-cars-but-not-the-way-automakers-think/ Fri, 21 Nov 2025 12:30:00 +0000 https://towardsdatascience.com/?p=607694 Traditional manufacturers are using revolutionary technology for incremental optimization instead of fundamental re-imagination

How Relevance Models Foreshadowed Transformers for NLP https://towardsdatascience.com/how-relevance-models-foreshadowed-transformers-for-nlp/ Thu, 20 Nov 2025 14:00:00 +0000 https://towardsdatascience.com/?p=607686 Tracing the history of LLM attention: standing on the shoulders of giants
