Program manager - Evaluations Job at Luma AI, Stanford, CA

Y3g3SVhTUnMzSTJ3YW95d3VJZ1g0N0VmZ2c9PQ==
  • Luma AI
  • Stanford, CA

Job Description

About the Role

Luma is pushing the boundaries of generative AI, building tools that redefine how visual content is created. We're seeking a candidate to help shape and scale the way we understand, measure, and improve model performance. In this role, you'll partner with researchers, engineers, and technical artists to evaluate our models against real-world creative use cases, design frameworks that capture qualitative nuance, and identify actionable insights that guide development.

This is not a checkbox metrics role - it's about building evaluative systems that match the complexity of human perception, creativity, and intention.

Responsibilities
  • Evaluate generative model performance across diverse tasks, prompts, and modalities.
  • Identify key failure modes, regression patterns, and edge cases that impact product quality.
  • Develop and maintain qualitative evaluation frameworks that are scalable and reusable.
  • Collaborate closely with technical artists and engineers to align evaluations with model capabilities and target use cases.
  • Translate high-level product goals into concrete evaluative criteria.
  • Lead qualitative studies, side-by-side comparisons, and human-in-the-loop evaluation efforts.
  • Provide detailed feedback that informs model fine-tuning, dataset curation, and product UX.
  • Stay informed about emerging evaluation standards in generative AI and creative tools.
Qualifications
  • Master's degree or higher in Cognitive Science, Human-Computer Interaction (HCI), Design Research, Psychology, Media Studies, or a related field.
  • 5+ years of experience in product evaluation, UX research, model testing, or similar roles that involve structured qualitative assessment.
  • Deep familiarity with creative workflows and real-world use cases for generative models (e.g., animation, filmmaking, digital art, VFX).
  • Strong systems thinking and the ability to define abstract qualities (like believability, identity retention, or scene coherence) in clear evaluative terms.
  • Experience working cross-functionally with engineers, researchers, and creatives.
  • Excellent written communication skills and the ability to synthesize nuanced judgments into clear, actionable insights.
Nice to Have
  • Background in motion, visual effects, or storytelling pipelines
  • Experience evaluating AI-generated media (video, images, 3D)
  • Prior work on building internal tools for qualitative data collection or scoring
  • Familiarity with prompt engineering and reference-based input methods

Job Tags

Similar Jobs

Dana-Farber Cancer Institute

Legal Project Manager Job at Dana-Farber Cancer Institute

OverviewAs a member of the Office of General Counsel (OGC), the Legal Project Manager, Legal Operations, plays a crucial role in enhancing operational efficiency and business processes. This position is responsible for leading the planning, execution, and closure of legal... 

Trestonsecurity

CA Corporate Security Officer, Beverly Hills (Full-Time / Part-Time, Day Shift) (Beverly Hills) Job at Trestonsecurity

 ...CA Corporate Security Officer, Beverly Hills (Full-Time / Part-Time, Day & Evening Shift) Location: Beverly Hills, CA Pay Rate: $27/hr - $28/hr DOE Schedule: MondaySunday: 08:3016:30 (Morning) /MondaySunday: 16:3000:30 (Evening) Start Date: June 16, 2025... 

Hartford Hospital

Nurse Practitioner or Physician Assistant - Surgical Critical Care - Hartford, CT (Nights) Job at Hartford Hospital

 ...licenses and dues Enhanced Tuition Assistance and Higher Education Partnerships...  ...: Hartford Hospital is seeking a Physician Assistant or Nurse Practitioner to join...  ..., general surgery, neurosurgery, interventional radiology andother critical care patients as the... 

Jackson Furniture Ind.

Local Class A CDL Truck Driver - 1st Shift Job at Jackson Furniture Ind.

Jackson Furniture is seeking a LOCAL Class A Truck Driver!!! MUST have valid and current Class A CDL Must be able to work Monday-Friday (Overtime Saturday) Candidate will be responsible for pickup and delivery betweenthe Cleveland Facilities. On occasion travel...

SupportFinity

Executive Creative Director (Portland) Job at SupportFinity

 ...Join to apply for the Executive Creative Director role at Tillamook County Creamery Association 16 hours ago Be among the first 25 applicants Join to apply for the Executive Creative Director role at Tillamook County Creamery Association Get AI-powered advice on...