Measuring the performance of our models on real-world tasks

OpenAI introduces GDPval-v0, a new evaluation that measures model performance on real-world economically valuable tasks across 44 occupations.


Comments

Leave a Reply

Your email address will not be published. Required fields are marked *