TeamPlus
Previous Work Experience 6–10 yrs
Key Skills &
Requirements
Build and operate ML-backed backend systems (RAG, retrieval, ranking,
embeddings) powering large-scale consumer experiences. This is a hands-on
role with real ownership across quality, latency, and cost.
Role Responsibilities • Build and operate core backend services for AI product runtime: orchestration,
state/session, policy enforcement, tools/services integration
• Implement retrieval + memory primitives end-to-end: chunking, embeddings
generation, indexing, vector search, re-ranking, caching, freshness and deletion
semantics
• Productionize ML workflows and interfaces: feature/metadata services,
online/offline parity, model integration contracts, and evaluation
instrumentation
• Drive performance and cost optimization (P50/P95 latency, throughput, cache
hit rates, token/call cost, infra efficiency) with strong SLO ownership
• Add observability-by-default: tracing, structured logs, metrics, guardrail
signals,
failure taxonomy, and reliable fallback paths
• Collaborate with applied ML on model routing, prompt/tool schemas,
evaluation
datasets, and release safety gates
What we’re looking for (must-have):
• 6–10 years building backend systems in production, including at least 2–3