Data Engine
Notes from Andrej Karpathy talks
Hypothesis:
Unknown unknowns: The dataset is always imperfect. Not all scenarios are well represented yet, and the data can always be more diverse.
Capable base model/architecture: Improving the dataset improves the AI/product guarantees
Inspirations:
"The only sure certain way I have seen of making progress on any task is, you curate the dataset that is clean and varied and you grow it and you pay the labeling cost and I know that works.โ
"Potentially nitpicky but competitive advantage in AI goes not so much to those with data but those with a data engine. And whoever can spin it fastest. Slide from Tesla to ~illustrate but concept is generalโ
QualEval: Qualitative Evaluation for Model Improvement