Submitting more applications increases your chances of landing a job.

Here’s how busy the average job seeker was last month:

Opportunities viewed

Applications submitted

Keep exploring and applying to maximize your chances!

Looking for employers with a proven track record of hiring women?

Click here to explore opportunities now!
We Value Your Feedback

You are invited to participate in a survey designed to help researchers understand how best to match workers to the types of jobs they are searching for

Would You Be Likely to Participate?

If selected, we will contact you via email with further instructions and details about your participation.

You will receive a $7 payout for answering the survey.


https://bayt.page.link/NfWJe8u6qb3CUSFr9
Back to the job results

AI Systems Architect - LLM & Vector Infrastructure

2 days ago 2026/06/10
Other Business Support Services
Create a job alert for similar positions
Job alert turned off. You won’t receive updates for this search anymore.

Job description

We are seeking a senior AI Systems Architect to design and implement AI-native application cores where Large Language Models (LLMs), vector databases, retrieval systems, and agent frameworks form the primary computational layer of our web and mobile applications.
This role is responsible for architecting scalable AI pipelines, retrieval-augmented generation (RAG) systems, memory architectures, AI agents, and orchestration workflows integrated with our development stack (Web, Mobile, n8n automation, and AI services).
The ideal candidate understands that AI is not a feature, it is the operating system of the product.
Key Responsibilities 1.
AI Core Architecture Design Design AI-first system architecture for web and mobile applications Architect RAG pipelines using vector databases Define long-term memory, short-term memory, and contextual state systems Implement multi-agent AI systems Design AI orchestration layers 2.
Vector Database & Embedding Systems Select and implement vector databases such as: Pinecone Weaviate Qdrant Milvus Supabase (pgvector) Optimize embedding strategies Implement hybrid search (semantic + keyword) Design scalable indexing pipelines 3.
LLM Integration & Optimization Work with models such as: OpenAI APIs Anthropic Meta (LLaMA) DeepSeek Alibaba (Qwen) Implement structured output pipelines Design evaluation and prompt testing frameworks Optimize cost-performance ratio 4.
AI Agent Systems & Orchestration Build autonomous AI agents Design tool-calling systems Integrate with: n8n LangGraph / LangChain style agent flows Implement memory-aware agents 5.
Production AI Engineering Build monitoring systems for hallucination detection Design guardrails and validation layers Implement evaluation datasets and benchmarking Ensure security of AI pipelines Build scalable infrastructure (Docker, Kubernetes, GPU optimization) Technical Expertise 5+ years software engineering experience 2+ years building production AI systems Deep knowledge of: Vector embeddings & similarity search RAG architectures Tokenization and context window optimization Fine-tuning & LoRA concepts Prompt evaluation frameworks Experience with Python (mandatory) Experience with FastAPI / backend services Experience designing scalable APIs Architecture Experience Designing distributed systems Microservices & event-driven architecture Experience with PostgreSQL + pgvector Experience deploying LLM systems in production

This job post has been translated by AI and may contain minor differences or errors.

You’ve reached the maximum limit of 15 job alerts. To create a new alert, please delete an existing one first.
Job alert created for this search. You’ll receive updates when new jobs match.
Are you sure you want to unapply?

You'll no longer be considered for this role and your application will be removed from the employer's inbox.