Previous Work Experience: 5 years
Own automation for AI APIs, agentic workflows, and RAG systems. Design
LLM evaluation tests and strengthen reliability, regression, and performance
coverage.
Role Responsibilities
• Build and maintain automation suites for AI APIs, agentic workflows, and
RAG systems.
• Implement LLM evaluation tests (accuracy, consistency, performance).
• Collaborate with ML & full-stack teams to validate releases.
• Contribute to reliability, regression, and performance test coverage.
Requirements
• 5+ years of QA automation experience.
• Strong proficiency with Python/JS testing frameworks.
• Familiarity with LLM or ML testing preferred.