Job description
Company Description
馃憢馃徏We're Nagarro
聽we are a Digital Product Engineering company that is scaling in a big way! We build products, services, and experiences that inspire, excite, and delight. We work at scale across all devices and digital mediums, and our people exist everywhere in the world (17700+ experts across 39 countries, to be exact). Our work culture is dynamic and non-hierarchical. We are looking for great new colleagues. That is where you come in!聽
Job Description
REQUIREMENTS:
- Total experience of 6聽years+
- Strong expertise in Python and backend engineering with experience building scalable, distributed microservices.
- Hands-on experience designing and delivering end-to-end RAG (Retrieval-Augmented Generation) workflows in production systems.
- Solid understanding of ML solution design, including embeddings, retrieval, ranking, feature engineering, and evaluation strategies.
- Experience with vector databases (FAISS, Pinecone, Milvus, Weaviate) and implementing chunking, indexing, vector search, re-ranking, caching, and memory patterns.
- Knowledge of LLM/NLP engineering, including prompt engineering, model integration, orchestration tools (LangChain/LlamaIndex), and evaluation instrumentation.
- Experience productionizing ML systems with observability, online/offline parity, and performance optimization across latency, throughput, and cost.
- Strong backend integration skills using REST/gRPC APIs, Docker, Kubernetes, CI/CD, and cloud platforms (AWS/GCP/Azure).
- Ability to independently design, ship, and operate reliable, scalable, and cost-efficient ML-backed backend systems with strong ownership mindset.
RESPONSIBILITIES:
- Design and build core backend services powering AI/ML runtime including orchestration, session/state management, and tools/services integration.
- Implement end-to-end retrieval and memory systems covering ingestion, embeddings, indexing, vector search, ranking, caching, and lifecycle management.
- Productionize ML workflows with feature/metadata services, model integration contracts, and evaluation hooks.
- Drive performance, reliability, and cost optimization with strong SLO ownership and observability practices (logs, metrics, tracing, guardrails).
- Collaborate with applied ML teams on model routing, prompts/tools, evaluation datasets, and safe releases.
- Translate business requirements into scalable technical designs, define NFR benchmarks, and review architecture for extensibility and best practices.
- Lead troubleshooting, root-cause analysis, and POCs to validate technology and design decisions.
Qualifications
Bachelor鈥檚 or master鈥檚 degree in computer science, Information Technology, or a related field.
This job post has been translated by AI and may contain minor differences or errors.