https://bayt.page.link/v1TUmrkCw1dqRip19
Create a job alert for similar positions

Job Description

  1. Design and implement highly available and scalable systems, ensuring the reliability and performance of the company's website or application.
  2. Collaborate with cross-functional teams to define and establish desired service levels
  3. Monitor systems and applications, proactively identifying and resolving any performance bottlenecks or availability issues.
  4. Develop and maintain monitoring tools, alerts, and dashboards to provide visibility into system health and performance.
  5. Conduct post-incident analyses to identify root causes and implement preventive measures to avoid future incidents.
  6. Automate repetitive tasks and processes to improve efficiency and reduce manual intervention.
  7. Create and maintain documentation for system architecture, configuration, and troubleshooting procedures.
  8. Perform capacity planning and resource allocation to ensure optimal system performance and scalability.
  9. Collaborate with development teams to implement and deploy new features and enhancements, ensuring they meet reliability and performance standards.
  10. Stay up to date with industry best practices, new technologies, and emerging trends in site reliability engineering.

 

Preferred Candidate

Degree
Bachelor's degree / higher diploma
You have reached your limit of 15 Job Alerts. To create a new Job Alert, delete one of your existing Job Alerts first.
Similar jobs alert created successfully. You can manage alerts in settings.
Similar jobs alert disabled successfully. You can manage alerts in settings.