- Menlo Ventures (San Francisco, CA)
- …working together to build beneficial AI systems. About the role As a Research Engineer on the Model Evaluations team, you'll lead the design and ... intersection of research and engineering to develop and implement model evaluations that give us insight into emerging capabilities and build robust… more
- Scale AI, Inc. (San Francisco, CA)
- Scale AI is seeking a technically rigorous and driven AI Research Engineer to join our Enterprise Evaluations team. This high-impact role is critical to our ... a passion for tackling complex evaluation challenges, and thrives in a dynamic, fast-paced research environment. We are looking for an engineer who can think… more
- OpenAI (San Francisco, CA)
- A leading AI research firm in San Francisco seeks a Research Engineer to evaluate model capabilities in finance. The candidate should possess strong ... Responsibilities include identifying significant financial workflows and refining AI evaluations . A passion for AGI/ASI measurement and proficiency in Excel… more
- Hippocratic AI (Palo Alto, CA)
- …unless explicitly noted otherwise in the job description. About the Role As an AI Engineer - Evaluations at Hippocratic AI , you'll define and build the systems ... About Us Hippocratic AI has developed a safety-focused Large Language Model (LLM) for healthcare. The company believes that a safe LLM can dramatically improve… more
- Menlo Ventures (San Francisco, CA)
- [Expression of Interest] Research Scientist/ Engineer , Honesty About Anthropic Anthropic's mission is to create reliable, interpretable, and steerable AI systems. ... working together to build beneficial AI systems. About the role As a Research Scientist/ Engineer focused on honesty within the Finetuning Alignment team, you\'ll… more
- David AI (New York, NY)
- …the latest research insights. About this role As a Founding Audio AI Research Engineer , you'll define the research agenda that shapes how ... our Research team As an audio data research company, we believe model capabilities come...and PyTorchExperience designing and conducting rigorous ML experiments and evaluations . Bonus points if you have PhD or Masters… more
- Amadeus Search (San Francisco, CA)
- Role: Senior Research Engineer Employment Type: Full-time Location: On-site, San Francisco (5 days/week, no remote option) Compensation: $200,000 - $275,000 base ... and RL infrastructure to power next-generation AI development. The Role The Senior Research Engineer will be responsible for designing scalable RL recipes,… more
- Apple Inc. (San Francisco, CA)
- …Description Our team, part of Apple Services Engineering, is looking for an ML Research Engineer to lead the design and continuous development of automated ... and research teams to translate experimental findings into actionable model improvements and safety mitigations Publish internal reports and external papers… more
- Cerebras (Palo Alto, CA)
- …models and reinforcement learning algorithms. Design and implement robust and scalable model evaluations . Design and implement large-scale model training ... and capable of understanding and addressing real-world challenges. As a post-training engineer , you will enhance the model \'s reasoning and coding ability… more
- OpenAI (San Francisco, CA)
- About the Team The ChatGPT team works across research , engineering, product, and design to bring OpenAI's technology to the world. We seek to learn from deployment ... the Role We are looking for a Machine Learning Engineer to join our Notifications team , focused on...sharp product intuition. You should be comfortable operating across research and product boundaries: thinking from first principles, running… more
- Liquid AI (San Francisco, CA)
- …across ML, systems, and product The opportunity to shape multimodal foundation model research with both scientific rigor and real‑world impact About ... data (eg, image‑text, video, visual documents, audio) You've contributed to research papers, open‑source projects, or production‑grade multimodal model systems… more
- Monograph (San Francisco, CA)
- …and innovate on the AI brains behind Pathos' AI Therapist. Combine research , data science, and engineering to create models, orchestration, and evaluation systems ... through fine-tuning strategies, training data curation, building RL environments, new model architectures and other AI innovations. Improve evaluation of AI quality:… more
- Google Inc. (San Jose, CA)
- Research Software Engineer , Multimodal AI corporate_fare Google place San Jose, CA, USA Apply Bachelor's degree or equivalent practical experience. 5 years of ... Experience with prompt engineering, few-shot learning, post-training techniques, and evaluations . Familiarity with large-scale model training and deployment.… more
- The LLM Data Company (San Francisco, CA)
- …LLM Data Company (YC X25) provides post-training data and RL environments to foundation model labs and frontier applied AI companies. We have raised $3.6m from Tier ... reward functions, and evaluator scaffolds for internal and customer-facing tasks Drive research at the intersection of scalable infra and modern RL frameworks to… more
- Cartesia (San Francisco, CA)
- …at scale. What you'll do Design and build large-scale multilingual datasets for model training. Build evaluations of multilingual speech models, both via manual ... audio tokens and 1T video tokens-let alone do this on-device. We're pioneering the model architectures that will make this possible. Our founding team met as PhDs at… more
- Apple Inc. (Cupertino, CA)
- … model -based auto-grading are thoughtfully leveraged to power these evaluations . Additionally, this role researches and develops auto-grading methodology & ... AIML - Machine Learning Engineer , Responsible AI Cupertino, California, United States Machine...text generation, diffusion models for image generation, and mixed model systems for multimodal applications. As a member of… more
- BlackLine (Pleasanton, CA)
- …Work, Play and Grow at BlackLine! Make Your Mark: As a Machine Learning Operations Engineer , you will play a pivotal role in bridging the gap between data science ... SLAs). Mentor senior engineers and drive design reviews for ML pipelines, model registries, and agentic runtime environments. Lead incident response and reliability… more
- BlackLine (Pleasanton, CA)
- …and Grow at BlackLine! Make Your Mark: The Principal AI/ML Operations Engineer leads the architecture, automation, and operationalization of both machine learning ... SLAs). Mentor senior engineers and drive design reviews for ML pipelines, model registries, and agentic runtime environments. Lead incident response and reliability… more
- Essential Software Inc. (Rockville, MD)
- …model drift, accuracy, latency, cost, and user experience. Run experiments and evaluations to improve model performance and outcomes for researchers and ... Sr. AI Software Engineer Essential Software Inc. (ESI) Overview Essential Software...to the National Cancer Institute's (NCI) large-scale data and research initiatives, providing secure, cloud-based platforms for scientific discovery… more
- Google (Mountain View, CA)
- …on GenAI products and services. Our team has been instrumental in performing LLM/GenAI model evaluations and model fine-tuning (eg, using SFT and RLHF ... Staff Software Engineer , GenAI, Data Quality corporate_fare Google place Mountain...to push technology forward. Power the next Large Language Model (LLM) advancements with data. We are building Google-wide… more