About the Company
We are a seed-stage AI company building the industry standard for evaluating and benchmarking large language models on real enterprise tasks.
About the Role
As a Research Scientist, you will develop new benchmarks, methodologies, and evaluation pipelines that shape how cutting-edge models are assessed, compared, and deployed in production environments. Your work will directly influence model selection and safety decisions across foundation model labs, high-growth AI product companies, and Fortune-scale enterprises.
Responsibilities
Benchmarking & Model Analysis
Design New Benchmarks from Scratch
Advance Automated Evaluation Methodologies
Cross-functional Collaboration
Qualifications
Required Skills
Preferred Skills
Pay range and compensation package
Equal Opportunity Statement
Visa sponsorship available. Relocation support. Health & dental coverage. Lunch + dinner provided, snacks & coffee. Unlimited PTO. Weekly happy hours with community guests. Team events (bowling, hiking, rock climbing, etc.). Swag program (hats, etc.).
Work Environment & Culture
In-person, San Francisco HQ (required). Core hours: 9-5, some teammates extend voluntarily. Most team members work 1 weekend day per week (flexible). High-ownership, low-ego, collaborative. Live demos Mondays, team lunch Thursdays, community Fridays. Early-stage pace, applied focus, not academic publishing.
Tech Environment
(while research-focused, exposure beneficial) Backend: Python / Django. Frontend: React + TypeScript. Infra: AWS. Evaluation frameworks + internal tooling.
Why This Role Is Unique
The company already collaborates with foundation model labs, high-growth AI vertical product companies, and Fortune 500 enterprises (not publicly facing). Vals AI: $5M seed raised, runway of 2+ years at current burn. Only one research scientist is being hired, which means true founding impact. Opportunity to define industry standards for model trust, reliability, and certification. Positioned to become the rating agency for generative AI.