As the Design Generation Evaluation owner, you'll build the infrastructure for quality monitoring across Design Generation. Your expertise in evaluation methodologies will help teams assess their systems effectively and deliver high-quality outputs. Collaboration is key: you will work closely with various teams to create a seamless evaluation ecosystem.
You'll be responsible for
🔍
Optimizing evaluation systems
Understand and optimize existing evaluation systems including LLM-as-Judge frameworks and visual quality models, analyzing their strengths and limitations.
⚙️
Designing automated evaluation pipelines
Design and implement automated evaluation pipelines that score generated designs at scale, balancing accuracy with computational cost.
🔗
Integrating evaluation systems
Integrate evaluation systems into continuous deployment pipelines, creating automated quality gates that catch regressions before production.