
Staff Software Engineer, ML Training Platform - Remote Eligible

Design and implement a self-service ML platform for continuous model iteration and improvement.
Remote
Expert
1 week ago

✨ About The Role

- The Staff Software Engineer will work on the Machine Learning Platform team at Reddit, focusing on foundational ML infrastructure.
- Responsibilities include architecting, implementing, and maintaining systems that power Feeds Ranking, Content Understanding, and Recommendations.
- The role involves building tools that enable machine learning engineers and data scientists to improve the ML software development lifecycle.
- The engineer will analyze bottlenecks in distributed systems and optimize for performance and cost-efficiency.
- Mentoring team members in adopting a rigorous DevOps approach is also a key responsibility.

⚡ Requirements

- The ideal candidate will have over 8 years of experience in a production software development environment or building data systems.
- A degree in Machine Learning, Engineering, Computer Science, or a related discipline is essential.
- Experience with the design and architecture of large-scale Machine Learning systems is crucial for success in this role.
- Familiarity with ML frameworks such as TensorFlow, PyTorch, or JAX is required.
- The candidate should have hands-on experience with Kubernetes, Docker, or other container orchestration systems.

Engineering
About Reddit
The front page of the internet.