
Site Reliability Developer 4

2 days ago 2026/06/03
100-499 Employees · Other Business Support Services

Job description

Job Summary:


As a Principal Cloud Engineer (SRE), you will play a key role in ensuring the reliability, performance, and scalability of modern cloud-based data platforms. This position involves close collaboration with development, operations, and security teams to automate processes, monitor system health, and maintain optimal uptime for critical production workloads. You will leverage your technical expertise to design, automate, and maintain large-scale data pipelines and lakehouse infrastructure, supporting mission-critical data engineering and analytics initiatives.


Key Responsibilities:


  • Design, implement, and maintain scalable, secure cloud infrastructure for large data platforms (data lakes, data warehouses, and lakehouse solutions) on OCI, AWS, Azure, or GCP.
  • Collaborate with Data Engineering teams to build robust, automated ETL/ELT pipelines using tools such as Apache Spark, Databricks, Kafka, or Oracle Cloud Data Integration.
  • Implement site reliability engineering best practices tailored for data systems: SLO/SLI definition, error budgeting, automated monitoring, data integrity validation, and incident response for data workloads.
  • Design and optimize data storage solutions leveraging both structured and unstructured storage (object storage and data lake/lakehouse platforms such as Delta Lake and Iceberg).
  • Automate infrastructure provisioning and CI/CD deployments for data pipelines and analytic workloads with tools like Terraform, Ansible, or CloudFormation.
  • Instrument and monitor data platform components for performance, availability, resource consumption, and data quality using observability tools (e.g., Grafana, Splunk).
  • Troubleshoot and resolve complex data pipeline or infrastructure issues, conducting root cause analyses and post-incident reviews.
  • Advocate for and implement security, governance, and compliance best practices—including data privacy, encryption, and access controls.
  • Mentor junior team members and promote knowledge sharing around data platform reliability.

Qualifications:


  • Bachelor’s or Master’s in Computer Science, Engineering, Data Science, or related field, or equivalent experience.
  • 6 or more years of experience in cloud engineering, SRE, or DevOps roles, with at least 4 years supporting data engineering initiatives.
  • Practical experience designing and operating large-scale cloud-based data platforms (data lakes, warehouses, or lakehouses).
  • Strong hands-on skills with infrastructure-as-code (e.g., Terraform), automation (Python/Scala), and containerization (Kubernetes, Docker).
  • Familiarity with data processing frameworks (Apache Spark, Databricks, Hadoop), as well as orchestration tools (Airflow, Oozie, or similar).
  • Working knowledge of distributed storage, data formats (Parquet, Avro), and modern analytics platforms.
  • Solid understanding of networking, cloud security, and regulatory compliance for data platforms.
  • Strong analytical, troubleshooting, and communication skills.
  • Preferred certifications: Cloud Architect/Engineer (OCI, AWS, Azure, GCP), Databricks, or relevant data engineering credentials.

As a world leader in cloud solutions, Oracle uses tomorrow’s technology to tackle today’s challenges. We’ve partnered with industry leaders in almost every sector, and we continue to thrive after 40+ years of change by operating with integrity.


We know that true innovation starts when everyone is empowered to contribute. That’s why we’re committed to growing an inclusive workforce that promotes opportunities for all.


Oracle careers open the door to global opportunities where work-life balance flourishes. We offer competitive benefits based on parity and consistency and support our people with flexible medical, life insurance, and retirement options. We also encourage employees to give back to their communities through our volunteer programs.


We’re committed to including people with disabilities at all stages of the employment process. If you require accessibility assistance or accommodation for a disability at any point, let us know by emailing accommodation-request_mb@oracle.com or by calling +1 888 404 2494 in the United States.


Oracle is an Equal Employment Opportunity Employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, national origin, sexual orientation, gender identity, disability and protected veterans’ status, or any other characteristic protected by law. Oracle will consider for employment qualified applicants with arrest and conviction records pursuant to applicable law. 



Responsibilities:

Work with the Site Reliability Engineering (SRE) team on the shared full-stack ownership of a collection of services and/or technology areas. Understand the end-to-end configuration, technical dependencies, and overall behavioral characteristics of production services. Take responsibility for the design and delivery of the mission-critical stack, with a focus on security, resiliency, scale, and performance. Hold authority for end-to-end performance and operability. Partner with development teams in defining and implementing improvements in service architecture. Articulate the technical characteristics of services and technology areas, and guide development teams to engineer and add premier capabilities to the Oracle Cloud service portfolio. Understand and communicate the scale, capacity, security, and performance attributes and requirements of the service and technology stack. Demonstrate a clear understanding of automation and orchestration principles. Act as the ultimate escalation point for complex or critical issues that have not yet been documented as Standard Operating Procedures (SOPs). Use a deep understanding of service topologies and their dependencies to troubleshoot issues and define mitigations. Understand and explain the effect of product architecture decisions on distributed systems. Bring professional curiosity and a desire to develop a deep understanding of services and technologies.



Qualifications:

Career Level - IC4


This job post has been translated by AI and may contain minor differences or errors.
