This job involves authoring and reviewing technical tasks that evaluate AI model performance on mathematics problems. It plays a crucial role in ensuring the quality and effectiveness of mathematical evaluations. The team values flexibility and collaboration, allowing for a fully remote work environment with a commitment of around 20 hours per week.
View
View
View