Mindrift1 follower72 jobs

Freelance Agent Evaluation Engineer

Mid levelPart-timeRemoteAustraliaSoftware engineeringQuality engineeringPosted 8 hours agoVerified 16 hours ago
Pending fitX of Y criteria met

About the job

This job is an exciting opportunity to work with Mindrift, connecting specialists with project-based AI opportunities for leading tech companies. You'll play a crucial role in evaluating AI coding agents by creating realistic tasks and criteria, all while collaborating with cutting-edge technology. The team values flexibility and encourages contributors to choose their own working hours while meeting project deadlines.

Meet the team

About the company

  • Explore
    • Jobs
    • Companies
    • People
    • Communities
    • Hatch Hotlist 2025
  • Hiring
    • Permanent hires
  • Resources
    • Blog
    • Community stories
    • Career advice
    • Customer stories
    • Help centre
  • Hatch
    • About
    • Careers
    • Contact
    • Hatch updates
    • Media enquiries
  • ยฉ 2026 Hatch
  • Privacy
  • Terms
Think you're a good fit?See what the hiring team are looking for

You'll be responsible for

๐Ÿ› ๏ธ

Creating tasks

Building challenging tasks and evaluation criteria within realistic simulated environments to evaluate AI coding agents.
๐Ÿ”

Reviewing code

Analyzing code written by AI agents, identifying successes and failures, and designing edge cases.
๐Ÿค–

Iterating with AI agents

Collaborating with AI agents to verify tests and ensure they effectively evaluate solutions.

Key criteria

๐Ÿ’ป

Software development experience

5+ years in software development, primarily with Python.

View

๐Ÿ”

Testing expertise

Proven experience writing functional and integration tests.

View

๐ŸŒ

Full-stack development background

Experience building React-based interfaces and robust back-end systems.

View

View more

A meaningful career starts with a match

View your fit

5 criteria for this job
Software development experience
5+ years in software development, primarily with Python.

View

Testing expertise
Proven experience writing functional and integration tests.

View

Full-stack development background
Experience building React-based interfaces and robust back-end systems.

View

Docker familiarity
Experience with Docker containers and infrastructure tools.

View

English proficiency
B2 level proficiency for effective communication.

View

Similar jobs

View all
Mindrift
Mindrift
Mechanical Engineer & Python Expert - Freelance AI TrainerNSW ยท Part-time
This job offers an exciting opportunity to connect with leading tech companies through project-based AI tasks. Contributors will play a vital role in designing and solving computational engineering problems, making a meaningful impact on AI systems. The team values collaboration and innovation, providing a flexible environment where engineers can thrive.
Mindrift
Mindrift
Freelance Agent Evaluation EngineerVIC ยท Part-time
This job offers an exciting opportunity to work on cutting-edge AI projects, focusing on evaluating coding agents through realistic developer tasks. Contributors will play a vital role in shaping the future of AI by creating and testing challenges that push the boundaries of current models. The team values collaboration, innovation, and a flexible work environment, allowing contributors to choose their hours and work at their own pace.
Mindrift
Mindrift
Freelance Agent Evaluation EngineerNSW ยท Part-time
This job offers an exciting opportunity to work on project-based AI tasks that evaluate coding agents for leading tech companies. Contributors will engage in creating realistic developer tasks, reviewing AI-generated code, and iterating based on expert feedback. The team values collaboration and flexibility, allowing contributors to choose their working hours while ensuring high-quality outcomes.
Mindrift
Mindrift
Freelance Agent Evaluation EngineerQLD ยท Part-time
This job offers an exciting opportunity to work with Mindrift, connecting specialists with project-based AI opportunities. Contributors will engage in creating and evaluating tasks for AI coding agents, making a significant impact on the development of AI systems. The team values collaboration and innovation, allowing contributors to choose their working hours while meeting project deadlines.