Job Description
Role Overview:
As a Cloud and Platform Engineer within the Office of the CTO, working on the AI Automation team at McAfee, you will play a vital role in providing our customers with cutting-edge protection and peace of mind.
You will build scalable ML infrastructure on AWS and EKS that leverages McAfee’s rich data ecosystems. The pipelines you create will continuously retrain, validate, and approve ML models, delivering incremental value in pursuit of the best protection in cybersecurity. You will design and maintain CI/CD pipelines for seamless deployment and implement infrastructure as code (IaC). You’ll work closely with Data Scientists to set up model automation workflows that enable high-frequency retraining and deployment.
This is a remote position in India. We will only consider candidates currently in India and are not offering relocation assistance at this time.
About the Role:
- Design and manage scalable AWS cloud infrastructure for MLOps and DevOps workflows, including Kubernetes clusters with Amazon EKS.
- Implement infrastructure as code (IaC) using tools like Terraform, Ansible, or CloudFormation.
- Build and maintain CI/CD pipelines using Jenkins or GitLab CI/CD to automate deployment processes.
- Set up monitoring and logging solutions (e.g., Prometheus, Grafana, ELK Stack, or CloudWatch) to ensure system health and performance.
- Develop scalable pipelines for ML model training, validation, deployment, and monitoring.
- Optimize resource allocation, cost management, and cloud security for AWS services.
- Collaborate with Data Scientists to design, deploy, and maintain production-scale ML models.
About You:
- 6-8 years of hands-on experience with AWS and Kubernetes/EKS; a bachelor’s degree in IT, Software Engineering, or Computer Science is preferred.
- Proficient in Python, Linux shell scripting, and tools like Terraform, Ansible, and Jenkins.
- Skilled in monitoring tools such as Prometheus, Grafana, and CloudWatch; knowledge of MLOps tools (Kubeflow, MLflow, Ray) is a plus.
- Experienced in managing cloud networking, security, and scaling, and in working with big data platforms like Databricks and Delta tables.
- Familiar with Machine Learning and Generative AI concepts, and with tools like PySpark and NumPy.
- AWS (e.g., Solutions Architect) or Kubernetes (e.g., CKA) certifications are a plus.
- Self-motivated with a proven track record of managing production-grade AWS infrastructure and ML pipelines in collaboration with international teams.
Company Overview:
McAfee is a leader in personal security for consumers. Focused on protecting people, not just devices, McAfee consumer solutions adapt to users’ needs in an always-online world, empowering them to live securely through integrated, intuitive solutions that protect their families and communities with the right security at the right moment.
Company Benefits and Perks:
We work hard to embrace diversity and inclusion and encourage everyone at McAfee to bring their authentic selves to work every day. We offer a variety of social programs, flexible work hours, and family-friendly benefits to all of our employees.
- Bonus Program
- Pension and Retirement Plans
- Medical, Dental and Vision Coverage
- Paid Time Off
- Paid Parental Leave
- Support for Community Involvement
We're serious about our commitment to diversity, which is why McAfee prohibits discrimination based on race, color, religion, gender, national origin, age, disability, veteran status, marital status, pregnancy, gender expression or identity, sexual orientation, or any other legally protected status.