Job Description
There’s nothing more exciting than being at the center of a rapidly growing field in technology and applying your skills to drive innovation and modernize the world's most complex and mission-critical systems.
As an AWS Cloud Site Reliability Engineer-III at JPMorgan Chase within Corporate Technology, you will solve complex and broad business problems with simple and straightforward solutions. Through code and cloud infrastructure, you will configure, maintain, monitor, and optimize applications and their associated infrastructure, independently decomposing and iteratively improving on existing solutions. You will be a significant contributor to your team, sharing your knowledge of the end-to-end operations, availability, reliability, and scalability of your application or platform.
Job responsibilities
- Guiding and supporting others in building designs at the appropriate level and gaining consensus from peers where appropriate.
- Collaborating with other Site Reliability Engineers, software engineers, and teams to design and implement deployment approaches using automated continuous integration and continuous delivery pipelines.
- Collaborating to design, develop, test, and implement availability, reliability, and scalability solutions in their applications.
- Implementing infrastructure, configuration, and network as code for the applications and platforms in your remit.
- Collaborating with technical experts, key stakeholders, and team members to resolve complex problems.
- Understanding service level indicators and utilizing service level objectives to proactively resolve issues before they impact customers.
- Supporting the adoption of site reliability engineering best practices and Site Reliability Engineer-specific responsibilities like monitoring and automation.
Required qualifications, capabilities, and skills
- Formal training or certification in software engineering concepts and 3+ years of applied experience.
- Hands-on experience in system support and cloud deployment with Kubernetes and Amazon ECS/EKS.
- Experience with the full Software Development Life Cycle.
- Exposure to agile practices such as CI/CD and application resiliency.
- Experience deploying and supporting CI/CD pipelines using PySpark/Databricks in the cloud.
- Experience migrating data solutions in the AWS cloud.
- Hands-on Python programming experience, including troubleshooting.
- Experience working with Kubernetes, Terraform, and AWS Cloud Services.
- Knowledge of best practices for automating and supporting data lakes.
- Familiarity with containerization and cloud deployment.
- Cloud/Terraform/Kubernetes certification and experience serving as a Site Reliability Engineer Bar Raiser.
Preferred qualifications, capabilities, and skills
- Site Reliability Engineer training or certification.