Evaluation Scenario Writer - AI Agent Testing Specialist

Mindrift • SG
30+ days ago
Job type
  • Remote
Job description

Please submit your CV in English and indicate your level of English proficiency.

Mindrift connects specialists with project-based AI opportunities for leading tech companies, focused on testing, evaluating, and improving AI systems. Participation is project-based, not permanent employment.

What this opportunity involves

You’ll create challenging coding test cases that push AI coding systems to their limits:

  • Review and refine realistic coding tasks based on provided production codebases with realistic scope, requirements and information sources
  • Write comprehensive functional tests that validate actual end-to-end behavior and edge cases, not just superficial checks
  • Craft “fair but hard” challenges where the AI has all the context it needs, but has to work for it (information scattered across files and external sources, complex reasoning required)
  • Analyze AI failures to understand what the model struggles with vs. what it masters
  • Iterate based on feedback from expert QA reviewers who score your work on 7 quality criteria

What we look for

This opportunity is a good fit for experienced developers, software engineers, and/or test automation specialists open to part-time, non-permanent projects. Ideally, contributors will have:

  • Degree in Computer Science, Software Engineering, or a related field
  • 5+ years in software development, primarily Python (pytest, async/await, subprocess, file operations)
  • Background in full-stack development, with an equal focus on building React-based interfaces and robust back-end systems
  • Experience writing tests (functional, integration), not just running them
  • Experience with Docker containers (running evaluations locally in containers)
  • CI/CD understanding (GitHub Actions as a user: triggers, labels, reading results)
  • English proficiency: B2
How it works

Apply → Pass qualification(s) → Join a project → Complete tasks → Get paid

Effort estimate

Tasks for this project are estimated to take 20 hours to complete, depending on complexity. This is an estimate and not a schedule requirement; you choose when and how to work. Tasks must be submitted by the deadline and meet the listed acceptance criteria to be accepted.

Payment

  • Paid contributions, with rates up to $40/hour
  • Fixed project rate or individual rates, depending on the project
  • Some projects include incentive payments
  • Note: Rates vary based on expertise, skills assessment, location, project needs, and other factors. Higher rates may be offered to highly specialized experts. Lower rates may apply during onboarding or non-core project phases. Payment details are shared per project.