View All Jobs 2268

Site Reliability Engineer - Remote Eligible

Design and build scalable infrastructure for public sector customers to enhance their operations.
Washington DCSan Francisco Bay Area
Mid-Level
3 weeks ago

✨ About The Role

- The Site Reliability Engineer will design and build reliable and scalable infrastructure for public sector customers. - Responsibilities include administering systems from hardware to Kubernetes, ensuring standardized infrastructure for deployment. - The role involves owning the reliability of systems by being on-site with customers and troubleshooting issues directly. - Collaboration with engineering and security teams is essential to meet the unique needs of infrastructure and use cases. - The engineer will automate routine tasks and standardize infrastructure offerings to support team scalability. - The position requires travel to customer sites and working closely with on-site clients and remote colleagues.

âš¡ Requirements

- The ideal candidate will have over 5 years of experience operating infrastructure and systems at scale. - A strong background in managing systems and infrastructure in secure environments is essential. - Hands-on experience with containerization technologies such as Docker and orchestration platforms like Kubernetes is required. - Proficiency in scripting languages, particularly Python, for automating routine tasks is necessary. - The candidate should possess strong troubleshooting skills across the entire stack, including infrastructure, systems, and applications. - A proactive approach to problem-solving and a willingness to learn new skills to ensure team and customer success is crucial. - The ability to thrive in dynamic environments and navigate ambiguity with ease will be beneficial.
+ Show Original Job Post
























Site Reliability Engineer - Remote Eligible
Washington DCSan Francisco Bay Area
Engineering
About OpenAI
Building artificial general intelligence