Founding Applied AI Engineer (Eval-Driven)

Mid levelFull-timeHybridSydney NSW, AustraliaDataInformation technologySoftware engineeringPosted 3 hours ago

View your fit0 of 9 criteria met

About the job

This job is for an Applied AI Engineer (Eval-driven) at an early-stage company focused on eradicating cost overruns in construction. The mission is to tackle a $3 trillion problem that impacts cities and builder-client relationships. The team values collaboration and innovation, working together to build AI-powered solutions that help builders maintain control over project costs.

You'll be responsible for

📊

Defining evaluation problems

Establishing success criteria, failure modes, datasets, labeling guidelines, and score functions to ensure effective evaluation processes.

🔧

Building and maintaining an evaluation harness

Creating regression tests, edge-case suites, and quality dashboards to prevent backsliding and ensure consistent performance.

🔄

Implementing workflow systems end-to-end

Managing the entire process from data to model/LLM components, post-processing, and acceptance testing until they meet evaluation thresholds.

Skills you'll need

🐍

Strong Python skills

Practical experience in shipping ML/AI systems beyond experimentation, ensuring robust and reliable solutions.

📈

Experience designing evals for ML/LLM systems

Demonstrated ability in offline metrics, gold sets, error analysis, regression testing, and monitoring to ensure high-quality outputs.

⚙️

Comfort with data science and engineering tasks

Ability to handle data wrangling, feature/label design, model/LLM iteration, and productionization effectively.

Meet the team

About the company

View your fit

0 of 5 criteria met

Applied AI experience

Proven experience shipping ML/AI systems with Python.

View

Evaluation design expertise

Demonstrated experience designing evaluations for ML/LLM systems.

View

End-to-end workflow implementation

Experience implementing complete workflow systems from data to testing.

View

Collaboration with stakeholders

Ability to translate real-world requirements into testable specifications.

View

Data science and engineering skills

Comfortable with data wrangling, feature design, and model iteration.

View

Founding Applied AI Engineer (Eval-Driven)

About the job

You'll be responsible for

Defining evaluation problems

Building and maintaining an evaluation harness

Implementing workflow systems end-to-end

Skills you'll need

Strong Python skills

Experience designing evals for ML/LLM systems

Comfort with data science and engineering tasks

Meet the team

About the company

View your fit

A meaningful career starts with a match

Similar jobs

A meaningful career starts with a match

Similar jobs