As for what we'll see in the course of this livestream, we are going to be having a lot of demos but are setting up to start with with evaluations. OpenAI has just declared that GPT-5 has set a fresh level on quite a few benchmarks, together with SWE-Bench – https://smedleyz344gdw9.loginblogin.com/profile