Towards Data Science

Python 3.14 and its New JIT Compiler
Programming

A technical overview and some benchmarks

Thomas Reid

Jun 19

10 min read
Building a Custom GStreamer Plugin for NVIDIA DeepStream
Deep Learning

Why Custom Inference in DeepStream?

David Redó Nieto

Jun 19

10 min read

Latest

I Tried to Schedule My ETL Pipeline. Here’s What I Didn’t Expect.
Data Engineering

What I thought was a scheduling problem turned out to be a portability problem first

Ibrahim Salami

Jun 19

8 min read
Parse Scanned PDFs for RAG with EasyOCR: Free OCR Gives You Words, Not a Document
Large Language Models

Enterprise Document Intelligence [Vol.1 #5quinquies] – Same 1974 scanned PDF, two engines. EasyOCR recovers text.…

Kezhan Shi

Jun 19

15 min read
GPU-Resident Top-K for Agentic RAG: I Built a CUDA Kernel So My Retrieval Step Would Stop Bouncing Off the GPU
Agentic AI

The PCIe transfer latency is silently bottlenecking your agentic inference. Here is how building a…

Anubhab Banerjee

Jun 19

31 min read
Structured Outputs with LLMs: JSON Mode, Function Calling, and When to Use Each
Large Language Models

Getting reliable, readable responses out of your LLM, and knowing which tool to reach for

Maria Mouschoutzi

Jun 18

13 min read
How Powerful is Claude Fable (Mythos) 5 for Coding?
Large Language Models

Learn about the upsides and downsides of Claude Fable 5

Eivind Kjosbakken

Jun 18

9 min read
Proteins: A Mosaic Pattern to Rule Them All?
Machine Learning

For decades, the existence of the hydrophobic core, a region in the 3D structure of…

Francisco Javier Lobo-Cabrera

Jun 18

12 min read
Dispatching the Parsed RAG Question: Chunk Strategy, Model Tier, Activations, Audit
Large Language Models

Enterprise Document Intelligence [Vol.1 #6c] – The decisions the parser makes on top of the…

angela shi

Jun 18

28 min read
The Power and Pitfalls of Vector-Based Image Search
Artificial Intelligence

A hands-on guide to setting up image similarity search in Milvus, and why visual replication…

Soner Yıldırım

Jun 18

8 min read
Your Churn Threshold Is a Pricing Decision
Data Science

How unit economics should set your classification cutoff, and why they rarely do.

Fabio Oliveira

Jun 17

15 min read

See all of the latest

Editor’s Picks

You Probably Don’t Need an Agent Framework
Large Language Models

Most LLM applications need a clear workflow, not an autonomous agent. Here’s how to build…

Shuai Guo

Jun 17

19 min read
Drilling Into AI’s Financial Sustainability
Artificial Intelligence

Budgets for AI tokens can’t be infinite, no matter how much hyperscalers wish they were

Stephanie Kirmer

Jun 16

8 min read
I Built 11 Models to Predict the 2026 World Cup. They Crown Four Different Champions.
Data Science

A single model hands you a single answer and no sense of how much it…

Ari Joury, PhD

Jun 15

11 min read
Solving the 3Blue1Brown String Probability Problem (Without AI)
Data Science

Let’s practice data science thinking through a probability problem

Jarom Hulet

Jun 13

9 min read
A Harness for Every Task: Putting a Team of Claudes on One Job
Agentic AI

Claude can now write its own harness on the fly, custom-built for the task at…

Chien Vu Minh

Jun 12

28 min read
BI Is Dead, Long Live BI
Agentic AI

The true bottleneck was never the analysis.

Mahdi Karabiben

Jun 11

9 min read
When GPU Utilization Lies: The Hidden Systems Problem Slowing Modern AI
Artificial Intelligence

Why “average utilization” lies about how full your GPUs really are

Arjun Kaarat

Jun 11

13 min read
How to Train a Scoring Model in the Age of Artificial Intelligence
Data Science

A structured methodology for comparing candidate models, testing stability, and selecting a robust final score

JUNIOR JUMBONG

Jun 10

18 min read
Physical AI: What It Is and What It Is Not
Artificial Intelligence

A quick guide to separating Physical AI from world models, embodied AI, physics AI, and…

Shuai Guo

Jun 10

9 min read

The Variable Newsletter

Exciting Changes Are Coming to the TDS Author Payment Program
Writing

Authors can now benefit from updated earning tiers and a higher article cap

TDS Editors

Mar 2

2 min read
TDS Newsletter: Vibe Coding Is Great. Until It’s Not.
The Variable

Sorting through the good, bad, and ambiguous aspects of vibe coding

TDS Editors

Feb 5

4 min read

Deep Dives

The Secret to Reproducible and Portable Optimization: ORPilot’s Intermediate Representation (IR)
Agentic AI

Why production-level AI optimization modeling agent needs reproducibility and portability, and how IR helps achieve…

Guangrui Xie

Jun 17

15 min read
The System Always Knows: Why Local Efficiency and System Performance Are Not the Same Problem
Data Science

How local optimization in last‑mile delivery can quietly break the system

Arjun Kaarat

Jun 15

15 min read
GPU Time-Slicing for Concurrent LLM Agents on Kubernetes
Agentic AI

A systems-level deep dive into the hidden microarchitectural costs of Kubernetes GPU time-slicing, and what…

Anubhab Banerjee

Jun 14

22 min read
Larger Context Windows Don’t Fix RAG — So I Built a System That Does
Large Language Models

Increasing context size in RAG systems doesn’t improve accuracy for aggregation tasks—it makes errors harder…

Emmimal P Alexander

Jun 13

15 min read
Why Decade-Old Residual Connections Still Power All of AI (And Why That’s a Problem)
Large Language Models

For nearly a decade, this part of neural networks barely changed. DeepSeek is trying to…

Moulik Gupta

Jun 12

16 min read
Stop Returning Flat Text from a PDF: The Relational Tables RAG Needs
Large Language Models

Enterprise Document Intelligence [Vol.1 #5B] – One PDF in, a relational set of DataFrames out:…

Kezhan Shi

Jun 11

29 min read