View All Jobs 1571

Research Engineering Lead, Post-training Evaluation

Create the next generation of model evaluation systems with a focus on evaluation infrastructure.
San Francisco Bay Area
Senior
$360,000 - 440,000 USD / year
3 weeks ago

✨ About The Role

- Lead the creation of the next generation of model evaluation systems with a focus on evaluation infrastructure - Own evaluation infrastructure and visualizations, working with researchers to streamline the evaluation process - Optimize performance and improve observability of the system, incorporating new evaluation methods from academic papers and internal research ideas - Take ownership over the organization and presentation of results, ensuring high-quality and accurate evaluations - Collaborate with the team to produce models used by millions of users, contributing to the advancement of artificial general intelligence

⚡ Requirements

- Strong technical background in scientific computing, distributed computing, statistics, and ideally some machine learning experience - Experience working in complex technical environments and debugging ML systems - Proficiency with Python, Kubernetes, distributed infrastructure, GPUs, and large-scale data systems - Ability to work collaboratively in a team environment and take ownership over tasks that move the team forward - Willingness to dive into large ML codebases to help debug and optimize performance
+ Show Original Job Post
























Research Engineering Lead, Post-training Evaluation
San Francisco Bay Area
$360,000 - 440,000 USD / year
Engineering
About OpenAI
Building artificial general intelligence