Mindrift1 follower72 jobs

Freelance Agent Evaluation Engineer

Mid levelPart-timeRemoteQueensland, AustraliaSoftware engineeringQuality engineeringPosted 8 hours agoVerified 16 hours ago
Pending fitX of Y criteria met

About the job

This job offers an exciting opportunity to work with Mindrift, connecting specialists with project-based AI opportunities. Contributors will engage in creating and evaluating tasks for AI coding agents, making a significant impact on the development of AI systems. The team values collaboration and innovation, allowing contributors to choose their working hours while meeting project deadlines.

Meet the team

About the company

  • Explore
    • Jobs
    • Companies
    • People
    • Communities
    • Hatch Hotlist 2025
  • Hiring
    • Permanent hires
  • Resources
    • Blog
    • Community stories
    • Career advice
    • Customer stories
    • Help centre
  • Hatch
    • About
    • Careers
    • Contact
    • Hatch updates
    • Media enquiries
  • ยฉ 2026 Hatch
  • Privacy
  • Terms
Think you're a good fit?See what the hiring team are looking for

You'll be responsible for

๐Ÿ› ๏ธ

Creating tasks

Building challenging tasks and evaluation criteria within realistic simulated environments to evaluate AI coding agents.
โœ…

Writing tests

Developing tests that accurately accept correct solutions and reject incorrect ones, ensuring fairness in evaluation.
๐Ÿค–

Iterating with AI agents

Collaborating with AI agents to verify that tests catch real problems and analyzing code written by agents to improve task design.

Key criteria

๐Ÿ’ป

Software development experience

5+ years in software development, primarily using Python.

View

๐Ÿ”

Test writing skills

Proven experience writing functional and integration tests.

View

๐ŸŒ

Full-stack development background

Experience building React-based interfaces and robust back-end systems.

View

View more

A meaningful career starts with a match

View your fit

5 criteria for this job
Software development experience
5+ years in software development, primarily using Python.

View

Test writing skills
Proven experience writing functional and integration tests.

View

Full-stack development background
Experience building React-based interfaces and robust back-end systems.

View

Docker familiarity
Experience with Docker containers and infrastructure tools.

View

English proficiency
B2 level English proficiency for effective communication.

View

Similar jobs

View all
Mindrift
Mindrift
Mechanical Engineer & Python Expert - Freelance AI TrainerNSW ยท Part-time
This job offers an exciting opportunity to connect with leading tech companies through project-based AI tasks. Contributors will play a vital role in designing and solving computational engineering problems, making a meaningful impact on AI systems. The team values collaboration and innovation, providing a flexible environment where engineers can thrive.
Mindrift
Mindrift
Freelance Agent Evaluation EngineerVIC ยท Part-time
This job offers an exciting opportunity to work on cutting-edge AI projects, focusing on evaluating coding agents through realistic developer tasks. Contributors will play a vital role in shaping the future of AI by creating and testing challenges that push the boundaries of current models. The team values collaboration, innovation, and a flexible work environment, allowing contributors to choose their hours and work at their own pace.
Mindrift
Mindrift
Freelance Agent Evaluation EngineerNSW ยท Part-time
This job offers an exciting opportunity to work on project-based AI tasks that evaluate coding agents for leading tech companies. Contributors will engage in creating realistic developer tasks, reviewing AI-generated code, and iterating based on expert feedback. The team values collaboration and flexibility, allowing contributors to choose their working hours while ensuring high-quality outcomes.
Mindrift
Mindrift
Freelance Agent Evaluation EngineerAustralia ยท Part-time
This job is an exciting opportunity to work with Mindrift, connecting specialists with project-based AI opportunities for leading tech companies. You'll play a crucial role in evaluating AI coding agents by creating realistic tasks and criteria, all while collaborating with cutting-edge technology. The team values flexibility and encourages contributors to choose their own working hours while meeting project deadlines.