Data Science | Towards Data Science https://towardsdatascience.com/category/data-science/ Publish AI, ML & data-science insights to a global community of data professionals. Mon, 15 Dec 2025 18:28:04 +0000 en-US hourly 1 https://wordpress.org/?v=6.8.3 https://towardsdatascience.com/wp-content/uploads/2025/02/cropped-Favicon-32x32.png Data Science | Towards Data Science https://towardsdatascience.com/category/data-science/ 32 32 6 Technical Skills That Make You a Senior Data Scientist https://towardsdatascience.com/6-technical-skills-that-make-you-a-senior-data-scientist/ Mon, 15 Dec 2025 15:43:00 +0000 https://towardsdatascience.com/?p=607905 Beyond writing code, these are the design-level decisions, trade-offs, and habits that quietly separate senior data scientists from everyone else.

The post 6 Technical Skills That Make You a Senior Data Scientist appeared first on Towards Data Science.

]]>
Geospatial exploratory data analysis with GeoPandas and DuckDB https://towardsdatascience.com/geospatial-exploratory-data-analysis-with-geopandas-and-duckdb/ Mon, 15 Dec 2025 13:17:00 +0000 https://towardsdatascience.com/?p=607897 In this article, I’ll show you how to use two popular Python libraries to carry out some geospatial analysis of traffic accident data within the UK. I was a relatively early adopter of DuckDB, the fast OLAP database, after it became available, but only recently realised that, through an extension, it offered a large number […]

The post Geospatial exploratory data analysis with GeoPandas and DuckDB appeared first on Towards Data Science.

]]>
7 Pandas Performance Tricks Every Data Scientist Should Know https://towardsdatascience.com/7-pandas-performance-tricks-every-data-scientist-should-know/ Thu, 11 Dec 2025 13:30:00 +0000 https://towardsdatascience.com/?p=607878 What I've learned about making Pandas faster after too many slow notebooks and frozen sessions

The post 7 Pandas Performance Tricks Every Data Scientist Should Know appeared first on Towards Data Science.

]]>
How to Climb the Hidden Career Ladder of Data Science https://towardsdatascience.com/the-hidden-career-ladder-of-data-science/ Sun, 07 Dec 2025 16:00:00 +0000 https://towardsdatascience.com/?p=607836 The behaviors that get you promoted

The post How to Climb the Hidden Career Ladder of Data Science appeared first on Towards Data Science.

]]>
A Product Data Scientist’s Take on LinkedIn Games After 500 Days of Play https://towardsdatascience.com/a-product-data-scientists-take-on-linkedin-games-after-500-days-of-play/ Fri, 05 Dec 2025 15:30:00 +0000 https://towardsdatascience.com/?p=607818 What a simple puzzle game reveals about experimentation, product thinking, and data science

The post A Product Data Scientist’s Take on LinkedIn Games After 500 Days of Play appeared first on Towards Data Science.

]]>
Bootstrap a Data Lakehouse in an Afternoon https://towardsdatascience.com/bootstrap-a-data-lakehouse-in-an-afternoon/ Thu, 04 Dec 2025 13:30:00 +0000 https://towardsdatascience.com/?p=607806 Using Apache Iceberg on AWS with Athena, Glue/Spark and DuckDB

The post Bootstrap a Data Lakehouse in an Afternoon appeared first on Towards Data Science.

]]>
The Best Data Scientists are Always Learning https://towardsdatascience.com/part-1-the-best-data-scientists-are-always-learning/ Thu, 04 Dec 2025 12:00:00 +0000 https://towardsdatascience.com/?p=607798 Why continuous learning matters & how to come up with topics to study

The post The Best Data Scientists are Always Learning appeared first on Towards Data Science.

]]>
JSON Parsing for Large Payloads: Balancing Speed, Memory, and Scalability https://towardsdatascience.com/json-parsing-for-large-payloads-balancing-speed-memory-and-scalability/ Tue, 02 Dec 2025 15:30:00 +0000 https://towardsdatascience.com/?p=607786 Benchmarking JSON libraries for large payloads

The post JSON Parsing for Large Payloads: Balancing Speed, Memory, and Scalability appeared first on Towards Data Science.

]]>
How to Use Simple Data Contracts in Python for Data Scientists https://towardsdatascience.com/how-to-use-simple-data-contracts-in-python-for-data-scientists/ Tue, 02 Dec 2025 14:00:00 +0000 https://towardsdatascience.com/?p=607784 Stop your pipelines from breaking on Friday afternoons using simple, open-source validation with Pandera.

The post How to Use Simple Data Contracts in Python for Data Scientists appeared first on Towards Data Science.

]]>
Metric Deception: When Your Best KPIs Hide Your Worst Failures https://towardsdatascience.com/metric-deception-when-your-best-kpis-hide-your-worst-failures/ Sat, 29 Nov 2025 15:00:00 +0000 https://towardsdatascience.com/?p=607765 The most dangerous KPIs aren’t broken; they’re the ones trusted long after they’ve lost their meaning.

The post Metric Deception: When Your Best KPIs Hide Your Worst Failures appeared first on Towards Data Science.

]]>
Data Science in 2026: Is It Still Worth It? https://towardsdatascience.com/data-science-in-2026-is-it-still-worth-it/ Fri, 28 Nov 2025 15:30:00 +0000 https://towardsdatascience.com/?p=607767 An honest view from a 10-year AI Engineer

The post Data Science in 2026: Is It Still Worth It? appeared first on Towards Data Science.

]]>
Everyday Decisions are Noisier Than You Think — Here’s How AI Can Help Fix That https://towardsdatascience.com/everyday-decisions-are-noisier-than-you-think-heres-how-ai-can-help-fix-that/ Thu, 27 Nov 2025 14:00:00 +0000 https://towardsdatascience.com/?p=607744 From insurance premiums to courtrooms: the impact of noise

The post Everyday Decisions are Noisier Than You Think — Here’s How AI Can Help Fix That appeared first on Towards Data Science.

]]>
I Cleaned a Messy CSV File Using Pandas .  Here’s the Exact Process I Follow Every Time. https://towardsdatascience.com/i-cleaned-a-messy-csv-file-using-pandas-heres-the-exact-process-i-follow-every-time/ Wed, 26 Nov 2025 19:13:17 +0000 https://towardsdatascience.com/?p=607742 Stop guessing at data cleaning. Use this repeatable 5-step Python workflow to diagnose and fix the most common data flaws.

The post I Cleaned a Messy CSV File Using Pandas .  Here’s the Exact Process I Follow Every Time. appeared first on Towards Data Science.

]]>
RISAT’s Silent Promise: Decoding Disasters with Synthetic Aperture Radar https://towardsdatascience.com/risats-silent-promise-decoding-disasters-with-synthetic-aperture-radar/ Wed, 26 Nov 2025 13:30:00 +0000 https://towardsdatascience.com/?p=607737 The high-resolution physics turning microwave echoes into real-time flood intelligence

The post RISAT’s Silent Promise: Decoding Disasters with Synthetic Aperture Radar appeared first on Towards Data Science.

]]>
How to Implement Three Use Cases for the New Calendar-Based Time Intelligence https://towardsdatascience.com/use-cases-for-the-new-calendar-based-time-intelligence/ Tue, 25 Nov 2025 14:30:00 +0000 https://towardsdatascience.com/?p=607731 Starting with the September 2025 Release of Power BI, Microsoft introduced the new Calendar-based Time Intelligence feature. Let’s see what can be done by implementing three use cases. The future looks very interesting with this new feature.

The post How to Implement Three Use Cases for the New Calendar-Based Time Intelligence appeared first on Towards Data Science.

]]>
Data Science Mistakes That Could Ruin Your Learning Path, and How to Avoid Them https://towardsdatascience.com/struggling-with-data-science-5-common-beginner-mistakes/ Mon, 24 Nov 2025 20:30:00 +0000 https://towardsdatascience.com/?p=607722 Avoid these mistakes to fast track your data science career.

The post Data Science Mistakes That Could Ruin Your Learning Path, and How to Avoid Them appeared first on Towards Data Science.

]]>
Empirical Mode Decomposition: The Most Intuitive Way to Decompose Complex Signals and Time Series https://towardsdatascience.com/preprocessing-signal-data-with-empirical-mode-decomposition/ Sat, 22 Nov 2025 15:00:00 +0000 https://towardsdatascience.com/?p=607709 A step-by-step breakdown of empirical mode decomposition to help you extract patterns from time series

The post Empirical Mode Decomposition: The Most Intuitive Way to Decompose Complex Signals and Time Series appeared first on Towards Data Science.

]]>
Overfitting vs. Underfitting: Making Sense of the Bias-Variance Trade-Off https://towardsdatascience.com/overfitting-versus-underfitting/ Sat, 22 Nov 2025 13:00:00 +0000 https://towardsdatascience.com/?p=607704 The best models live in the sweet spot: generalizing well, learning enough, but not too much

The post Overfitting vs. Underfitting: Making Sense of the Bias-Variance Trade-Off appeared first on Towards Data Science.

]]>
Modern DataFrames in Python: A Hands-On Tutorial with Polars and DuckDB https://towardsdatascience.com/modern-dataframes-in-python-a-hands-on-tutorial-with-polars-and-duckdb/ Fri, 21 Nov 2025 17:00:00 +0000 https://towardsdatascience.com/?p=607702 How I learned to handle growing datasets without slowing down my entire workflow

The post Modern DataFrames in Python: A Hands-On Tutorial with Polars and DuckDB appeared first on Towards Data Science.

]]>
How To Build a Graph-Based Recommendation Engine Using EDG and Neo4j https://towardsdatascience.com/how-to-build-a-recommendation-engine-using-edg-and-neo4j/ Fri, 21 Nov 2025 15:30:00 +0000 https://towardsdatascience.com/?p=607699 Use a shared taxonomy to connect RDF and property graphs—and power smarter recommendations with inferencing

The post How To Build a Graph-Based Recommendation Engine Using EDG and Neo4j appeared first on Towards Data Science.

]]>