Training computation vs. dataset size in notable AI systems, by researcher affiliation

Computation is measured in total petaFLOP, where one petaFLOP is 10¹⁵ floating-point operations. The values are estimated from AI literature, with some uncertainty. Training dataset size refers to the volume of text used to train a model.
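As a minimal sketch of the unit defined above, the following converts between raw floating-point operation counts and petaFLOP. The function names and the example value are hypothetical and chosen only for illustration; they are not taken from the chart's data.

```python
# 1 petaFLOP = 10^15 floating-point operations (total count, not a rate).
FLOP_PER_PETAFLOP = 10**15


def flop_to_petaflop(total_flop: float) -> float:
    """Convert a total operation count to petaFLOP."""
    return total_flop / FLOP_PER_PETAFLOP


def petaflop_to_flop(petaflop: float) -> float:
    """Convert petaFLOP back to a total operation count."""
    return petaflop * FLOP_PER_PETAFLOP


if __name__ == "__main__":
    # Hypothetical training run of 3e23 operations (illustrative only).
    example_flop = 3e23
    print(f"{example_flop:.1e} FLOP = {flop_to_petaflop(example_flop):.1e} petaFLOP")
```

Because the values span many orders of magnitude, scientific notation is the practical way to print and compare them.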

[Chart: training dataset size (100 to 1 trillion) plotted against training computation (petaFLOP). Systems are grouped by researcher affiliation: Academia, Academia and industry collaboration, Industry, Other. Time range: Jul 2, 1950 to Dec 24, 2024.]
