Research Engineer Model Evaluations Jobs

88 jobs (page 1)

Categories

All Categories

Engineering (31)

Software/IT (6)

Research Engineer , Model…

Menlo Ventures (San Francisco, CA)

…working together to build beneficial AI systems. About the role As a Research Engineer on the Model Evaluations team, you'll lead the design and ... intersection of research and engineering to develop and implement model evaluations that give us insight into emerging capabilities and build robust… more

job goal (01/13/26)
- Save Job - Related Jobs - Block Source
Research Engineer , Frontier AI…

OpenAI (San Francisco, CA)

A leading AI research firm in San Francisco seeks a Research Engineer to evaluate model capabilities in finance. The candidate should possess strong ... Responsibilities include identifying significant financial workflows and refining AI evaluations . A passion for AGI/ASI measurement and proficiency in Excel… more

job goal (01/13/26)
- Save Job - Related Jobs - Block Source
AI Research Engineer , Enterprise…

Scale AI, Inc. (San Francisco, CA)

Scale AI is seeking a technically rigorous and driven AI Research Engineer to join our Enterprise Evaluations team. This high-impact role is critical to our ... a passion for tackling complex evaluation challenges, and thrives in a dynamic, fast-paced research environment. We are looking for an engineer who can think… more

job goal (01/12/26)
- Save Job - Related Jobs - Block Source
AI Engineer - Evaluations

Hippocratic AI (Palo Alto, CA)

…unless explicitly noted otherwise in the job description. About the Role As an AI Engineer - Evaluations at Hippocratic AI , you'll define and build the systems ... About Us Hippocratic AI has developed a safety-focused Large Language Model (LLM) for healthcare. The company believes that a safe LLM can dramatically improve… more

job goal (01/13/26)
- Save Job - Related Jobs - Block Source
Founding Audio AI Research Engineer

David AI (New York, NY)

…the latest research insights. About this role As a Founding Audio AI Research Engineer , you'll define the research agenda that shapes how ... our Research team As an audio data research company, we believe model capabilities come...and PyTorchExperience designing and conducting rigorous ML experiments and evaluations . Bonus points if you have PhD or Masters… more

job goal (01/13/26)
- Save Job - Related Jobs - Block Source
Senior Research Engineer

Amadeus Search (San Francisco, CA)

Role: Senior Research Engineer Employment Type: Full-time Location: On-site, San Francisco (5 days/week, no remote option) Compensation: $200,000 - $275,000 base ... and RL infrastructure to power next-generation AI development. The Role The Senior Research Engineer will be responsible for designing scalable RL recipes,… more

job goal (01/13/26)
- Save Job - Related Jobs - Block Source
ML Safety Research Engineer

Apple Inc. (San Francisco, CA)

…Description Our team, part of Apple Services Engineering, is looking for an ML Research Engineer to lead the design and continuous development of automated ... and research teams to translate experimental findings into actionable model improvements and safety mitigations Publish internal reports and external papers… more

job goal (01/13/26)
- Save Job - Related Jobs - Block Source
Research Engineer - Post Training

Cerebras (Palo Alto, CA)

…models and reinforcement learning algorithms. Design and implement robust and scalable model evaluations . Design and implement large-scale model training ... and capable of understanding and addressing real-world challenges. As a post-training engineer , you will enhance the model \'s reasoning and coding ability… more

job goal (01/13/26)
- Save Job - Related Jobs - Block Source
Research Engineer , Notifcations

OpenAI (San Francisco, CA)

About the Team The ChatGPT team works across research , engineering, product, and design to bring OpenAI's technology to the world. We seek to learn from deployment ... the Role We are looking for a Machine Learning Engineer to join our Notifications team , focused on...sharp product intuition. You should be comfortable operating across research and product boundaries: thinking from first principles, running… more

job goal (01/13/26)
- Save Job - Related Jobs - Block Source
AI research engineer

Monograph (San Francisco, CA)

…and innovate on the AI brains behind Pathos' AI Therapist. Combine research , data science, and engineering to create models, orchestration, and evaluation systems ... through fine-tuning strategies, training data curation, building RL environments, new model architectures and other AI innovations. Improve evaluation of AI quality:… more

job goal (01/13/26)
- Save Job - Related Jobs - Block Source
[Expression of Interest] Research…

Menlo Ventures (San Francisco, CA)

[Expression of Interest] Research Scientist/ Engineer , Honesty About Anthropic Anthropic's mission is to create reliable, interpretable, and steerable AI systems. ... working together to build beneficial AI systems. About the role As a Research Scientist/ Engineer focused on honesty within the Finetuning Alignment team, you\'ll… more

job goal (01/12/26)
- Save Job - Related Jobs - Block Source
Research Software Engineer…

Google Inc. (San Jose, CA)

Research Software Engineer , Multimodal AI corporate_fare Google place San Jose, CA, USA Apply Bachelor's degree or equivalent practical experience. 5 years of ... Experience with prompt engineering, few-shot learning, post-training techniques, and evaluations . Familiarity with large-scale model training and deployment.… more

job goal (01/13/26)
- Save Job - Related Jobs - Block Source
Founding Research Engineer

The LLM Data Company (San Francisco, CA)

…LLM Data Company (YC X25) provides post-training data and RL environments to foundation model labs and frontier applied AI companies. We have raised $3.6m from Tier ... reward functions, and evaluator scaffolds for internal and customer-facing tasks Drive research at the intersection of scalable infra and modern RL frameworks to… more

job goal (01/13/26)
- Save Job - Related Jobs - Block Source
Member of Technical Staff - ML Research…

Liquid AI (San Francisco, CA)

…across ML, systems, and product The opportunity to shape multimodal foundation model research with both scientific rigor and real‑world impact About ... data (eg, image‑text, video, visual documents, audio) You've contributed to research papers, open‑source projects, or production‑grade multimodal model systems… more

job goal (01/12/26)
- Save Job - Related Jobs - Block Source
Research Engineer , Multilingual…

Cartesia (San Francisco, CA)

…at scale. What you'll do Design and build large-scale multilingual datasets for model training. Build evaluations of multilingual speech models, both via manual ... audio tokens and 1T video tokens-let alone do this on-device. We're pioneering the model architectures that will make this possible. Our founding team met as PhDs at… more

job goal (01/12/26)
- Save Job - Related Jobs - Block Source
AIML - Machine Learning Engineer…

Apple Inc. (Cupertino, CA)

… model -based auto-grading are thoughtfully leveraged to power these evaluations . Additionally, this role researches and develops auto-grading methodology & ... AIML - Machine Learning Engineer , Responsible AI Cupertino, California, United States Machine...text generation, diffusion models for image generation, and mixed model systems for multimodal applications. As a member of… more

job goal (01/13/26)
- Save Job - Related Jobs - Block Source
Senior AI Software Engineer

Essential Software Inc. (Rockville, MD)

…model drift, accuracy, latency, cost, and user experience. Run experiments and evaluations to improve model performance and outcomes for researchers and ... Sr. AI Software Engineer Essential Software Inc. (ESI) Overview Essential Software...to the National Cancer Institute's (NCI) large-scale data and research initiatives, providing secure, cloud-based platforms for scientific discovery… more

job goal (01/13/26)
- Save Job - Related Jobs - Block Source
Staff Software Engineer , GenAI, Data…

Google (Mountain View, CA)

…on GenAI products and services. Our team has been instrumental in performing LLM/GenAI model evaluations and model fine-tuning (eg, using SFT and RLHF ... Staff Software Engineer , GenAI, Data Quality corporate_fare Google place Mountain...to push technology forward. Power the next Large Language Model (LLM) advancements with data. We are building Google-wide… more

job goal (01/13/26)
- Save Job - Related Jobs - Block Source
Software Engineer , Benchmarking

Epoch (Berkeley, CA)

Epoch AI is looking for a Software Engineer who will help us evaluate frontier AI models, enabling researchers, developers, and policymakers to better understand AI ... identity traits, etc.). We are looking for a Software Engineer to help us expand and develop our AI...benchmarks so we can quickly and painlessly evaluate new model releases. Develop new benchmarks : Contribute to the… more

job goal (01/13/26)
- Save Job - Related Jobs - Block Source
Test Engineer -AI/LLM

OPPO US Research Center (Palo Alto, CA)

OPPO US Research Center is seeking a full-time meticulous and innovative AI/LLM Test Engineer to join our cutting-edge AI team. In this critical role, you will ... also seeking a Contractor based LLM Evaluation & QA Engineer to support the testing and validation of large...to support the testing and validation of large language model (LLM)-powered applications. You will help implement test strategies,… more

job goal (01/13/26)
- Save Job - Related Jobs - Block Source

"Juju

Account Login

Sign Up

Forgot your password?

Advanced Search