
Sales - AI QA Engineer
- Singapore
- Permanent
- Full-time
- Design automated and manual test plans for GenAI agents, summarization modules, and LLM outputs.
- Develop and maintain unit and integration test suites for GenAI pipelines and agent workflows
- Develop evaluation scripts and tools to validate factual accuracy, token usage, and latency.
- Collaborate with prompt engineers and data scientists to red-team and edge-test agent behavior.
- Create regression testing pipelines for model updates and prompt changes.
- Support telemetry reviews and UX teams with actionable insights from QA cycles.
- Implement regression testing and failure detection for backend services and APIs.
- Support CI/CD quality gates and automated test triggers for model or prompt updates.
- Help define and monitor QA metrics such as error rates, fallbacks triggered, and invalid outputs.
- 4+ years of experience in QA, testing engineering, ML, data engineering, governance, or backend development, with recent focus on GenAI and LLMs.
- We're looking for someone with an eagerness and ability to learn new skills and solve dynamic problems in an encouraging and expansive environment.
- Experience testing ML, NLP, or GenAI products (especially RAG and prompt-driven systems).
- Comfort with ambiguity. Ability to audit a full orchestrator and business context layer for sales.
- Proficiency in Python (FastAPI, LangChain, or similar frameworks), prompt engineering, and RESTful API design.
- Hands-on experience with LLM APIs, embeddings, vector databases, and RAG workflows.
- Experience working with monitoring and observability tools (e.g., Prometheus, OpenTelemetry, Weights & Biases).
- Familiarity with telemetry and evaluation frameworks for AI agents.
- Experience working with data science teams on insights generation leveraging LLMs.
- Knowledge of project management, productivity, and design tools such as Wrike and Sketch.
- Strong time management skills with the ability to collaborate across multiple teams globally.
- Able to balance competing priorities, long-term projects, and ad hoc requirements.
- Ability to work in a fast-paced, dynamic, constantly evolving business environment.
- B.S. Degree in Computer Science/Engineering, or equivalent work experience.
- Strong experience articulating and translating business questions into AI solutions.
- Communicate results and insights effectively to partners and senior leaders, as well as both technical and non-technical audiences.
- Familiarity with LangSmith, Trulens, Weights & Biases, or other LLMOps tools.
- Experience with hallucination detection, chain-of-thought reasoning QA, or trust scoring.
- Understanding of both backend API testing and UI/UX acceptance criteria.
- Other complementary technologies for distributed systems architecture and asynchronous messaging, agent communication, and catching like RabbitMQ, Redis, and Valkey are preferred.
- Advanced Degree (MS or Ph.D.) in Economics, Electrical Engineering, Statistics, Data Science, or a similar quantitative field is preferred.