Eugene Yan (@eugeneyan) / X

Eugene Yan

4,774 posts

Eugene Yan

@eugeneyan

MTS @AnthropicAI. Prev: Principal Applied Scientist @Amazon, led ML @ Alibaba, Lazada, Healthtech startup.

Seattle ⇄ SF

Joined April 2009

Eugene Yan
@eugeneyan
Sep 10, 2021
The first rule of machine learning: Start without machine learning
Eugene Yan
@eugeneyan
Nov 25, 2021
I'm excited to share something I've been working on for a while—ApplyingML.com! It collects the tacit, tribal, ghost knowledge on how to apply machine learning from papers, guides, and interviews with ML practitioners.
ApplyingML - Papers, Guides, and Interviews with ML practitioners
From applyingml.com
Eugene Yan
@eugeneyan
May 8, 2023
Ran a simple benchmark (Mandelbrot sets) between Mojo & Python. The speedup is impressive, and it benefits from Python's libraries. • Python: 1,184ms • Mojo: 27ms 🤯 • Python (vectorized): 240ms • Mojo (vectorized): 2ms
324K
Eugene Yan
@eugeneyan
May 13, 2023
x.com/karpathy/statu…
Andrej Karpathy
@karpathy
Apr 15, 2023
Replying to @sinclanich
np.array people keep reaching for much fancier things way too fast these days
194K
Eugene Yan
@eugeneyan
May 5, 2023
Started a list of open-source LLMs with commercial licenses so you can fine-tune your own applications. Contributions welcome! 🙏
GitHub - eugeneyan/open-llms: 📋 A list of open LLMs available for commercial use.
From github.com
156K
Eugene Yan
@eugeneyan
Aug 28, 2023
“HuggingFace’s leaderboards show how truly blind they are because they actively hurting the open source movement by tricking it into creating a bunch of models that are useless for real usage.” Ouch.
newsletter.semianalysis.com
Google Gemini Eats The World – Gemini Smashes GPT-4 By 5X, The GPU-Poors
Compute Resources That Make Everyone Look GPU-Poor
439K
Eugene Yan
@eugeneyan
Jul 9, 2025
How do you build an LLM-evaluator / LLM-as-Judge? The book for "AI Evals for PMs and Engineers" has a chapter devoted to it (35% discount: maven.com/parlance-labs/…) First, we need to define the right metrics. For example, we can start by listing the failure modes from our error
8.7K
Eugene Yan
@eugeneyan
Apr 2, 2023
This is why CS fundamentals continue to be crucial: LLaMA 30B only needs 4gb of memory if we use mmap(). Not sure why this works but one reason could be that 30B weights are sparse. Thus, lazy loading the fraction of needed weights reduces memory usage. github.com/ggerganov/llam…
311K
Eugene Yan
@eugeneyan
Feb 26, 2025
agent ≈ model + tools, within a for-loop + environment
rahul
@rahulgs
Feb 24, 2025
Anthropic's AI Engineer source code is fully public / there is no server there is no separate backend. they just use the same api.anthropic.com/v1/messages api in a loop with tool use all packaged into a single file: gist.githubusercontent.com/1rgs/e4e13ac9a… tools available: 1. dispatch_agent -
132K
Eugene Yan
@eugeneyan
Oct 16, 2023
You're a great engineer if you know: • Simple is beautiful • To focus on the problem, not the tech • How to reuse what already exists • Launching is the start, not the end • How to step back & let others lead • How to get different views & change your mind • Your customer
Zach Wilson
@EcZachly
Oct 15, 2023
You’re a great engineer if you know the definition of: - idempotent - monoid - decoupled - dependency injection - unit - functional programming - asynchronous vs parallel programming - thread locking - eventual consistency - exactly-once semantics - lambda vs kappa
173K
Eugene Yan
@eugeneyan
May 27, 2025
A love letter to @claude_code: Over the past two days, I built a stock analysis web app for loved ones that includes auth, charting tools, stock data/llm apis, db persistance, and more. (see 1 min demo) The velocity was only possible with claude code—please try it if you haven't
00:00
174K
Eugene Yan
@eugeneyan
Aug 2, 2023
Wrote abt patterns for LLM systems/products • Evals: Track performance • RAG: Add external knowledge • Finetuning: Improve specific tasks • Caching: Reduce latency & cost • Guardrails: Ensure output quality • Defensive UX: Anticipate & manage errors
Patterns for Building LLM-based Systems & Products
From eugeneyan.com
141K
Eugene Yan
@eugeneyan
Jan 16, 2021
Starting a machine learning project? • How to frame the problem? • What methodology to adopt? • How to design the system? • Estimating potential ROI? Learn how other companies did it via their papers & tech blogs.
GitHub - eugeneyan/applied-ml: 📚 Papers & tech blogs by companies sharing their work on data...
From github.com
Eugene Yan
@eugeneyan
Jun 25, 2025
Wrote an intro to evals for long-context Q&A systems: • How it differs from basic Q&A • What dimensions & metrics to eval on • How to build llm-evaluators • How to build eval datasets • Benchmarks: narratives, technical docs, multi-docs
Evaluating Long-Context Question & Answer Systems
From eugeneyan.com
30K