Flowd

Aerospace Engineering Expert - Visual Reasoning Benchmark

United States | $100 - $125 | Posted 3 days ago
Contract | Remote

Job Description

Mercor is building the Industrial Technical Visual Reasoning Benchmark, an evaluation dataset that tests how well AI models reason over authentic engineering visuals. We are hiring Aerospace Engineering experts to author benchmark problems in their field.

For each problem you will source a genuine professional artifact (system schematics, assembly drawings, maintenance diagrams, aerodynamics diagrams, propulsion schematics, flight system diagrams), write a problem that cannot be solved from the text alone, determine a concise and unambiguous final answer, and write a step-by-step worked solution at a professional level. You will tag each problem with its domain, artifact type, skill tested, and difficulty. Every problem is independently reviewed by another domain expert before it ships.

This is a part-time, remote, hourly engagement for practicing professionals with hands-on experience reading real aerospace engineering artifacts. Simplified textbook figures and questions answerable by simple lookup are out of scope; the work rewards genuine structural, spatial, and multi-step reasoning.
