samir gadre (@sy_gadre) / X

145 posts

@anthropicai | prev: @columbia

Joined June 2020

samir gadre
@sy_gadre
Aug 17, 2024
recently joined the training team @AnthropicAI ! the rumors are true, everyone is smart and nice ! some ant content i did to commemorate the occasion
samir gadre
@sy_gadre
Mar 18, 2024
sharing some highlights from our recent paper: language models scale reliably with over-training and on downstream tasks!
arxiv: arxiv.org/abs/2403.08540
104 models, 11M to 7B parameters, varying numbers of tokens, 3 datasets, eval’d on 46 tasks: github.com/mlfoundations/… 1/11
samir gadre
@sy_gadre
Jun 26, 2022
I won an NSF GRF--thanks to .@SongShuran, mentors, and collaborators for all the support! Sharing my apps in case they're helpful to others. personal: sagadre.github.io/data/nsf_perso… research: sagadre.github.io/data/nsf_resea… 1/2
samir gadre
@sy_gadre
May 6, 2021
excited to share Act the Part (AtP), a framework to learn how to interact with articulated objects to discover and segment their parts! arxiv: arxiv.org/abs/2105.01047 website w/ demo: atp.cs.columbia.edu joint work w/ .@ehsanik and .@SongShuran 1/5
samir gadre
@sy_gadre
Apr 28, 2023
can we create better models by curating better web-scale datasets? our experiments suggest yes! check out our newly released DataComp, a collaborative benchmark to bootstrap data-centric research. excited to see what we build together🙂
Gabriel Ilharco
@gabriel_ilharco
Apr 28, 2023
Introducing DataComp, a new benchmark for multimodal datasets! We release 12.8B image-text pairs, 300+ experiments and a 1.4B subset that outcompetes compute-matched CLIP runs from OpenAI & LAION 📜 arxiv.org/abs/2304.14108 🖥️ github.com/mlfoundations/… 🌐 datacomp.ai
samir gadre
@sy_gadre
Sep 26, 2023
open lm is here! no more secrets in language modeling❤️‍🔥confident that together we'll find all the "little tricks"™️
This Post is from an account that no longer exists.
samir gadre
@sy_gadre
Jun 19, 2024
excited this is out! long live datacomp.ai !
Vaishaal Shankar
@Vaishaal
Jun 18, 2024
I am really excited to introduce DataComp for Language Models (DCLM), our new testbed for controlled dataset experiments aimed at improving language models. 1/x
samir gadre
@sy_gadre
Aug 12, 2022
is a model (work of art) ever really complete? we introduce PAINT🎨 to touch up CLIP models on target tasks while keeping the model open-vocabulary and maintaining performance elsewhere. we also find PAINTing on a task can improve performance on related tasks🧵
Gabriel Ilharco
@gabriel_ilharco
Aug 12, 2022
The year is 2032. A model was trained on all images, videos and text on the web, using over 100 yottaFLOPs. It still thinks this is an image of a dog. To fix models post-hoc, check out PAINT!🎨 📜 arxiv.org/abs/2208.05592 💻 github.com/mlfoundations/… 🌐 model-patching.github.io
samir gadre
@sy_gadre
Jun 18, 2023
im at CVPR presenting CoW (cow.cs.columbia.edu) on thursday afternoon. also excited to talk to folks about DataComp (datacomp.ai). feel free to reach out!
samir gadre
@sy_gadre
Jun 19, 2022
en route to my first vision conference! message me if you want to chat in new orleans #CVPR2022
samir gadre
@sy_gadre
Apr 22, 2024
A 7B Mamba model trained with open_lm infra (github.com/mlfoundations/…)! Congrats to the TRI team that worked on this!
Sedrick Keh
@sedrickkeh2
Apr 22, 2024
📢 Releasing TRI's open-source Mamba-7B trained on 1.2T tokens of RefinedWeb! Mamba-7B is the largest fully recurrent Mamba model trained and is a state-of-the-art recurrent LLM. 🚀🚀🚀 huggingface.co/TRI-ML/mamba-7…
GitHub - mlfoundations/open_lm: A repository for research on medium sized language models.
From github.com
samir gadre
@sy_gadre
Mar 18, 2024
Replying to @sy_gadre
Key takeaway? Fit scaling laws to small-scale runs trained near compute-optimal, predict the ✨downstream error✨ (average top-1) of large ✨over-trained✨ runs 7/11
samir gadre
@sy_gadre
Nov 29, 2022
Want to patch bugs in your model while maintaining performance elsewhere? Check out PAINT🎨, which we'll be presenting on Thur Dec 2 @ 4p in Hall J arxiv.org/abs/2208.05592 I'll be at NeurIPS for the week, so feel free to reach out! (1/2)
samir gadre
@sy_gadre
May 30, 2023
a workshop at ICCV on dataset curation! excited for ~*the next generation~ of datasets
Vaishaal Shankar
@Vaishaal
May 30, 2023
1/9 I am excited to announce that our workshop "Towards the Next Generation of Computer Vision Datasets" will be happening at ICCV 2023 in Paris. We will feature DataComp submissions, other data-centric papers, and invited talks by experts. datacomp.ai/workshop