Log inSign up
Eugene Yan
4,774 posts
user avatar
Eugene Yan
@eugeneyan
MTS @AnthropicAI. Prev: Principal Applied Scientist @Amazon, led ML @ Alibaba, Lazada, Healthtech startup.
Seattle ⇄ SF
eugeneyan.com
Joined April 2009
664
Following
27.5K
Followers
  • user avatar
    Eugene Yan
    @eugeneyan
    Sep 10, 2021
    The first rule of machine learning: Start without machine learning
  • user avatar
    Eugene Yan
    @eugeneyan
    Nov 25, 2021
    I'm excited to share something I've been working on for a while—ApplyingML.com! It collects the tacit, tribal, ghost knowledge on how to apply machine learning from papers, guides, and interviews with ML practitioners.
    ApplyingML - Papers, Guides, and Interviews with ML practitioners
    From applyingml.com
  • user avatar
    Eugene Yan
    @eugeneyan
    May 8, 2023
    Ran a simple benchmark (Mandelbrot sets) between Mojo & Python. The speedup is impressive, and it benefits from Python's libraries. • Python: 1,184ms • Mojo: 27ms 🤯 • Python (vectorized): 240ms • Mojo (vectorized): 2ms
    324K
  • user avatar
    Eugene Yan
    @eugeneyan
    May 13, 2023
    x.com/karpathy/statu…
    user avatar
    Andrej Karpathy
    @karpathy
    Apr 15, 2023
    Replying to @sinclanich
    np.array people keep reaching for much fancier things way too fast these days
    194K
  • user avatar
    Eugene Yan
    @eugeneyan
    May 5, 2023
    Started a list of open-source LLMs with commercial licenses so you can fine-tune your own applications. Contributions welcome! 🙏
    GitHub - eugeneyan/open-llms: 📋 A list of open LLMs available for commercial use.
    From github.com
    156K
  • user avatar
    Eugene Yan
    @eugeneyan
    Aug 28, 2023
    “HuggingFace’s leaderboards show how truly blind they are because they actively hurting the open source movement by tricking it into creating a bunch of models that are useless for real usage.” Ouch.
    newsletter.semianalysis.com
    Google Gemini Eats The World – Gemini Smashes GPT-4 By 5X, The GPU-Poors
    Compute Resources That Make Everyone Look GPU-Poor
    439K
  • user avatar
    Eugene Yan
    @eugeneyan
    Jul 9, 2025
    How do you build an LLM-evaluator / LLM-as-Judge? The book for "AI Evals for PMs and Engineers" has a chapter devoted to it (35% discount: maven.com/parlance-labs/…) First, we need to define the right metrics. For example, we can start by listing the failure modes from our error
    8.7K
  • user avatar
    Eugene Yan
    @eugeneyan
    Apr 2, 2023
    This is why CS fundamentals continue to be crucial: LLaMA 30B only needs 4gb of memory if we use mmap(). Not sure why this works but one reason could be that 30B weights are sparse. Thus, lazy loading the fraction of needed weights reduces memory usage. github.com/ggerganov/llam…
    311K
  • user avatar
    Eugene Yan
    @eugeneyan
    Feb 26, 2025
    agent ≈ model + tools, within a for-loop + environment
    user avatar
    rahul
    Ramp
    @rahulgs
    Feb 24, 2025
    Anthropic's AI Engineer source code is fully public / there is no server there is no separate backend. they just use the same api.anthropic.com/v1/messages api in a loop with tool use all packaged into a single file: gist.githubusercontent.com/1rgs/e4e13ac9a… tools available: 1. dispatch_agent -
    132K
  • user avatar
    Eugene Yan
    @eugeneyan
    Oct 16, 2023
    You're a great engineer if you know: • Simple is beautiful • To focus on the problem, not the tech • How to reuse what already exists • Launching is the start, not the end • How to step back & let others lead • How to get different views & change your mind • Your customer
    user avatar
    Zach Wilson
    @EcZachly
    Oct 15, 2023
    You’re a great engineer if you know the definition of: - idempotent - monoid - decoupled - dependency injection - unit - functional programming - asynchronous vs parallel programming - thread locking - eventual consistency - exactly-once semantics - lambda vs kappa
    173K
  • user avatar
    Eugene Yan
    @eugeneyan
    May 27, 2025
    A love letter to @claude_code: Over the past two days, I built a stock analysis web app for loved ones that includes auth, charting tools, stock data/llm apis, db persistance, and more. (see 1 min demo) The velocity was only possible with claude code—please try it if you haven't
    00:00
    174K
  • user avatar
    Eugene Yan
    @eugeneyan
    Aug 2, 2023
    Wrote abt patterns for LLM systems/products • Evals: Track performance • RAG: Add external knowledge • Finetuning: Improve specific tasks • Caching: Reduce latency & cost • Guardrails: Ensure output quality • Defensive UX: Anticipate & manage errors
    Patterns for Building LLM-based Systems & Products
    From eugeneyan.com
    141K
  • user avatar
    Eugene Yan
    @eugeneyan
    Jan 16, 2021
    Starting a machine learning project? • How to frame the problem? • What methodology to adopt? • How to design the system? • Estimating potential ROI? Learn how other companies did it via their papers & tech blogs.
    GitHub - eugeneyan/applied-ml: 📚 Papers & tech blogs by companies sharing their work on data...
    From github.com
  • user avatar
    Eugene Yan
    @eugeneyan
    Jun 25, 2025
    Wrote an intro to evals for long-context Q&A systems: • How it differs from basic Q&A • What dimensions & metrics to eval on • How to build llm-evaluators • How to build eval datasets • Benchmarks: narratives, technical docs, multi-docs
    Evaluating Long-Context Question & Answer Systems
    From eugeneyan.com
    30K

New to X?

Sign up now to get your own personalized timeline!

Create account

By signing up, you agree to the Terms of Service and Privacy Policy, including Cookie Use.

Terms of Service|Privacy Policy|Cookie Policy|Accessibility|Ads info|© 2026 X Corp.
Don't miss what's happening
People on X are the first to know.
Log inSign up