-

Parse Scanned PDFs for RAG with EasyOCR: Free OCR Gives You Words, Not a Document
Large Language ModelsEnterprise Document Intelligence [Vol.1 #5quinquies] – Same 1974 scanned PDF, two engines. EasyOCR recovers text.…
15 min read -

GPU-Resident Top-K for Agentic RAG: I Built a CUDA Kernel So My Retrieval Step Would Stop Bouncing Off the GPU
Agentic AIThe PCIe transfer latency is silently bottlenecking your agentic inference. Here is how building a…
31 min read -

Structured Outputs with LLMs: JSON Mode, Function Calling, and When to Use Each
Large Language ModelsGetting reliable, readable responses out of your LLM, and knowing which tool to reach for
11 min read -

Learn about the upsides and downsides of Claude Fable 5
9 min read -

For decades, the existence of the hydrophobic core, a region in the 3D structure of…
12 min read -

Dispatching the Parsed RAG Question: Chunk Strategy, Model Tier, Activations, Audit
Large Language ModelsEnterprise Document Intelligence [Vol.1 #6c] – The decisions the parser makes on top of the…
28 min read -

How unit economics should set your classification cutoff, and why they rarely do.
15 min read -

The Secret to Reproducible and Portable Optimization: ORPilot’s Intermediate Representation (IR)
Agentic AIWhy production-level AI optimization modeling agent needs reproducibility and portability, and how IR helps achieve…
15 min read -

Most LLM applications need a clear workflow, not an autonomous agent. Here’s how to build…
19 min read
