samir gadre
145 posts
samir gadre
@sy_gadre
@anthropicai | prev: @columbia
bk
sagadre.github.io
Joined June 2020
490
Following
715
Followers
  • samir gadre
    @sy_gadre
    Aug 17, 2024
    recently joined the training team @AnthropicAI ! the rumors are true, everyone is smart and nice ! some ant content i did to commemorate the occasion
    19K
  • samir gadre
    @sy_gadre
    Mar 18, 2024
    sharing some highlights from our recent paper: language models scale reliably with over-training and on downstream tasks! arxiv: arxiv.org/abs/2403.08540 104 models, 11M to 7B parameters, varying numbers of tokens, 3 datasets, eval’d on 46 tasks: github.com/mlfoundations/… 1/11
    29K
  • samir gadre
    @sy_gadre
    Jun 26, 2022
    I won an NSF GRF--thanks to .@SongShuran, mentors, and collaborators for all the support! Sharing my apps in case they're helpful to others. personal: sagadre.github.io/data/nsf_perso… research: sagadre.github.io/data/nsf_resea… 1/2
  • samir gadre
    @sy_gadre
    May 6, 2021
    excited to share Act the Part (AtP), a framework to learn how to interact with articulated objects to discover and segment their parts! arxiv: arxiv.org/abs/2105.01047 website w/ demo: atp.cs.columbia.edu joint work w/ .@ehsanik and .@SongShuran 1/5
  • samir gadre
    @sy_gadre
    Apr 28, 2023
    can we create better models by curating better web-scale datasets? our experiments suggest yes! check out our newly released DataComp, a collaborative benchmark to bootstrap data-centric research excited to see what we build together🙂
    Gabriel Ilharco
    @gabriel_ilharco
    Apr 28, 2023
    Introducing DataComp, a new benchmark for multimodal datasets! We release 12.8B image-text pairs, 300+ experiments and a 1.4B subset that outcompetes compute-matched CLIP runs from OpenAI & LAION 📜 arxiv.org/abs/2304.14108 🖥️ github.com/mlfoundations/… 🌐 datacomp.ai
    5.2K
  • samir gadre
    @sy_gadre
    Sep 26, 2023
    open lm is here! no more secrets in language modeling❤️‍🔥confident that together we'll find all the "little tricks"™️
    This Post is from an account that no longer exists.
    1.9K
  • samir gadre
    @sy_gadre
    Jun 19, 2024
    excited this is out! long live datacomp.ai !
    Vaishaal Shankar
    @Vaishaal
    Jun 18, 2024
    I am really excited to introduce DataComp for Language Models (DCLM), our new testbed for controlled dataset experiments aimed at improving language models. 1/x
    2.1K
  • samir gadre
    @sy_gadre
    Aug 12, 2022
    is a model (work of art) ever really complete? we introduce PAINT🎨 to touch up CLIP models on target tasks while keeping the model open-vocabulary and maintaining performance elsewhere. we also find PAINTing on a task can improve performance on related tasks🧵
    Gabriel Ilharco
    @gabriel_ilharco
    Aug 12, 2022
    The year is 2032. A model was trained on all images, videos and text on the web, using over 100 yottaFLOPs. It still thinks this is an image of a dog. To fix models post-hoc, check out PAINT!🎨 📜 arxiv.org/abs/2208.05592 💻 github.com/mlfoundations/… 🌐 model-patching.github.io
  • samir gadre
    @sy_gadre
    Jun 18, 2023
    im at CVPR presenting CoW (cow.cs.columbia.edu) on thursday afternoon also excited to talk to folks about DataComp (datacomp.ai) feel free to reach out!
    1.8K
  • samir gadre
    @sy_gadre
    Jun 19, 2022
    en route to my first vision conference! message me if you want to chat in new orleans #CVPR2022
  • samir gadre
    @sy_gadre
    Apr 22, 2024
    A 7B Mamba model trained with open_lm infra (github.com/mlfoundations/…)! Congrats to the TRI team that worked on this!
    Sedrick Keh
    @sedrickkeh2
    Apr 22, 2024
    📢 Releasing TRI's open-source Mamba-7B trained on 1.2T tokens of RefinedWeb! Mamba-7B is the largest fully recurrent Mamba model trained and is a state-of-the-art recurrent LLM. 🚀🚀🚀 huggingface.co/TRI-ML/mamba-7…
    GitHub - mlfoundations/open_lm: A repository for research on medium sized language models.
    From github.com
    1.8K
  • samir gadre
    @sy_gadre
    Mar 18, 2024
    Replying to @sy_gadre
    Key takeaway? Fit scaling laws to small-scale runs trained near compute-optimal, predict the ✨downstream error✨ (average top-1) of large ✨over-trained✨ runs 7/11
    412
  • samir gadre
    @sy_gadre
    Nov 29, 2022
    Want to patch bugs in your model while maintaining performance elsewhere? Check out PAINT🎨, which we'll be presenting on Thur Dec 2 @ 4p in Hall J arxiv.org/abs/2208.05592 I'll be at NeurIPS for the week, so feel free to reach out! (1/2)
  • samir gadre
    @sy_gadre
    May 30, 2023
    a workshop at ICCV on dataset curation! excited for ~*the next generation~ of datasets
    Vaishaal Shankar
    @Vaishaal
    May 30, 2023
    1/9 I am excited to announce that our workshop "Towards the Next Generation of Computer Vision Datasets" will be happening at ICCV 2023 in Paris. We will feature DataComp submissions, other data-centric papers, and invited talks by experts. datacomp.ai/workshop
    823
