View All Jobs 1591

Lead Site Reliability Engineer

Develop and execute strategies for cluster synchronization to ensure system consistency and performance.
Austin
Senior
2 months ago
Observe.AI

Observe.AI

Contact center AI platform.

Job is no longer active
8 Similar Jobs at Observe.AI

✨ About The Role

- Implement and maintain strategies to enhance the reliability and observability of complex systems - Work closely with a team of skilled engineers, providing guidance and standards for reliability, resilience, and scalability - Collaborate with engineering teams in India to ensure smooth coordination and communication - Develop and execute strategies to ensure multiple production clusters are synchronized in terms of features, uptime, functionality, and SLA compliance - Design and document comprehensive test plans to proactively identify and mitigate risks, ensuring continuous system integrity and performance

⚑ Requirements

- Ideal candidate would have around 7 years of experience in site reliability engineering or related fields, with a strong background in managing high-availability systems - Proficient in one of python or shell scripting, and extensive experience with AWS cloud environments and infrastructure management - Demonstrated ability to lead projects and mentor junior engineers, fostering a collaborative and productive environment - Excellent communication skills to effectively manage team interactions and articulate technical challenges and solutions to stakeholders - Comfort with working in predetermined flexible hours to interact with teams across different time zones, ensuring project alignment and timely delivery
+ Show Original Job Post
























Lead Site Reliability Engineer
Austin
Engineering
Job is no longer active
About Observe.AI
Contact center AI platform.