Job Description
Resaro was founded on the belief that AI will change the world in ways we cannot even imagine, but every new technology needs safeguards to advance. Resaro builds custom AI testing software that helps organisations validate the performance, robustness, and safety of mission‑critical AI systems -- spanning computer vision, generative AI, and autonomous systems. Our clients include government, military, and commercial organisations deploying AI in high‑stakes environments. We work through embedded teams deployed on‑site or closely integrated with our clients.
What You’ll Do
Build, deploy, and maintain AI models and model‑based features in production -- from fine‑tuning through serving and monitoring.
Design and implement evaluation methodologies, test plans, and quality frameworks for AI systems (LLM, CV, RL).
Build agentic AI systems using frameworks like CrewAI, LangGraph, LlamaIndex, and MCP.
Create synthetic data generation pipelines, adversarial test cases, and benchmark datasets...