Staff AI Engineer
Wati View all jobs
- Shenzhen, Guangdong
- Permanent
- Full-time
- Core LLM Infrastructure: Architect and lead our AI production stack, including multi-provider LLM gateway optimization, token budget management, and low-latency inference routing across OpenAI, Gemini, and other providers.
- Agentic AI & RAG: Design and implement scalable RAG (Retrieval-Augmented Generation) systems, multi-step AI agent workflows, and tool-calling infrastructure (MCP), ensuring high accuracy and reliability in customer interactions.
- Voice & Multimodal AI: Lead the evolution of our voice AI layer (WebRTC/realtime) and cross-channel agent coordination across text, voice, and connected messaging platforms.
- AI Production Lifecycles: Own the "Engineering-to-AI" loop: building automated pipelines for data collection, cleaning, fine-tuning orchestration, and model versioning.
- Performance & Cost Optimization: Continuously optimize API costs, token budgets, latency, and caching strategies to ensure our 4-billion-message scale remains sustainable and performant.
- Evaluation & Benchmarking: Build the infrastructure for systematic AI quality assessment, identifying failure modes and ensuring model improvements are grounded in real-world production metrics.
- Technical Roadmap: Drive technology decisions in close collaboration with engineering leadership, selecting frameworks and architectural patterns that will define our AI future.
- Systems Expert: 5+ years of professional experience in backend or infrastructure engineering. Mastery of at least one high-performance language (Go, Rust, or C++) and deep proficiency in Python.
- AI Deployment Mastery: Proven track record of taking LLMs/NLP models from experiments to high-traffic production. You understand multi-provider orchestration, prompt engineering at scale, and model drift management.
- Data Pipeline Experience: Strong experience building data pipelines for AI workloads, including document processing, embedding generation, and vector search.
- Product-Minded Engineer: You don't just build for the sake of tech; you understand how AI performance impacts customer outcomes and business value.
- Autonomous Builder: You thrive in environments with high ambiguity and can design, code, and deploy complex systems independently.
- Experience with vector databases (e.g., Qdrant, Milvus, Pinecone) and RAG architecture patterns.
- Familiarity with agentic frameworks, tool-calling protocols (MCP, function calling), or multi-agent orchestration.
- Experience with real-time voice/audio AI pipelines (WebRTC, LiveKit, or similar).
- Infrastructure-as-Code experience with GCP/AWS, Docker, and Kubernetes.