• Menlo Ventures (San Francisco, CA)
    …working together to build beneficial AI systems. About the role As a Research Engineer on the Model Evaluations team, you'll lead the design and ... intersection of research and engineering to develop and implement model evaluations that give us insight into emerging capabilities and build robust… more
    job goal (01/13/26)
  • OpenAI (San Francisco, CA)
    A leading AI research firm in San Francisco seeks a Research Engineer to evaluate model capabilities in finance. The candidate should possess strong ... Responsibilities include identifying significant financial workflows and refining AI evaluations. A passion for AGI/ASI measurement and proficiency in Excel… more
    job goal (01/13/26)
  • Scale AI, Inc. (San Francisco, CA)
    Scale AI is seeking a technically rigorous and driven AI Research Engineer to join our Enterprise Evaluations team. This high-impact role is critical to our ... a passion for tackling complex evaluation challenges, and thrives in a dynamic, fast-paced research environment. We are looking for an engineer who can think… more
    job goal (01/12/26)
  • Hippocratic AI (Palo Alto, CA)
    …unless explicitly noted otherwise in the job description. About the Role As an AI Engineer - Evaluations at Hippocratic AI, you'll define and build the systems ... About Us Hippocratic AI has developed a safety-focused Large Language Model (LLM) for healthcare. The company believes that a safe LLM can dramatically improve… more
    job goal (01/13/26)
  • David AI (New York, NY)
    …the latest research insights. About this role As a Founding Audio AI Research Engineer, you'll define the research agenda that shapes how ... our Research team As an audio data research company, we believe model capabilities come...and PyTorch Experience designing and conducting rigorous ML experiments and evaluations. Bonus points if you have PhD or Masters… more
    job goal (01/13/26)
  • Amadeus Search (San Francisco, CA)
    Role: Senior Research Engineer Employment Type: Full-time Location: On-site, San Francisco (5 days/week, no remote option) Compensation: $200,000 - $275,000 base ... and RL infrastructure to power next-generation AI development. The Role The Senior Research Engineer will be responsible for designing scalable RL recipes,… more
    job goal (01/13/26)
  • Apple Inc. (San Francisco, CA)
    …Description Our team, part of Apple Services Engineering, is looking for an ML Research Engineer to lead the design and continuous development of automated ... and research teams to translate experimental findings into actionable model improvements and safety mitigations Publish internal reports and external papers… more
    job goal (01/13/26)
  • Cerebras (Palo Alto, CA)
    …models and reinforcement learning algorithms. Design and implement robust and scalable model evaluations. Design and implement large-scale model training ... and capable of understanding and addressing real-world challenges. As a post-training engineer, you will enhance the model's reasoning and coding ability… more
    job goal (01/13/26)
  • OpenAI (San Francisco, CA)
    About the Team The ChatGPT team works across research, engineering, product, and design to bring OpenAI's technology to the world. We seek to learn from deployment ... the Role We are looking for a Machine Learning Engineer to join our Notifications team, focused on...sharp product intuition. You should be comfortable operating across research and product boundaries: thinking from first principles, running… more
    job goal (01/13/26)
  • Monograph (San Francisco, CA)
    …and innovate on the AI brains behind Pathos' AI Therapist. Combine research, data science, and engineering to create models, orchestration, and evaluation systems ... through fine-tuning strategies, training data curation, building RL environments, new model architectures and other AI innovations. Improve evaluation of AI quality:… more
    job goal (01/13/26)
  • Menlo Ventures (San Francisco, CA)
    [Expression of Interest] Research Scientist/Engineer, Honesty About Anthropic Anthropic's mission is to create reliable, interpretable, and steerable AI systems. ... working together to build beneficial AI systems. About the role As a Research Scientist/Engineer focused on honesty within the Finetuning Alignment team, you'll… more
    job goal (01/12/26)
  • Google Inc. (San Jose, CA)
    Research Software Engineer, Multimodal AI. Google, San Jose, CA, USA. Bachelor's degree or equivalent practical experience. 5 years of ... Experience with prompt engineering, few-shot learning, post-training techniques, and evaluations. Familiarity with large-scale model training and deployment.… more
    job goal (01/13/26)
  • The LLM Data Company (San Francisco, CA)
    …LLM Data Company (YC X25) provides post-training data and RL environments to foundation model labs and frontier applied AI companies. We have raised $3.6m from Tier ... reward functions, and evaluator scaffolds for internal and customer-facing tasks Drive research at the intersection of scalable infra and modern RL frameworks to… more
    job goal (01/13/26)
  • Liquid AI (San Francisco, CA)
    …across ML, systems, and product The opportunity to shape multimodal foundation model research with both scientific rigor and real‑world impact About ... data (e.g., image‑text, video, visual documents, audio) You've contributed to research papers, open‑source projects, or production‑grade multimodal model systems… more
    job goal (01/12/26)
  • Cartesia (San Francisco, CA)
    …at scale. What you'll do Design and build large-scale multilingual datasets for model training. Build evaluations of multilingual speech models, both via manual ... audio tokens and 1T video tokens-let alone do this on-device. We're pioneering the model architectures that will make this possible. Our founding team met as PhDs at… more
    job goal (01/12/26)
  • Apple Inc. (Cupertino, CA)
    model-based auto-grading are thoughtfully leveraged to power these evaluations. Additionally, this role researches and develops auto-grading methodology & ... AIML - Machine Learning Engineer, Responsible AI Cupertino, California, United States Machine...text generation, diffusion models for image generation, and mixed model systems for multimodal applications. As a member of… more
    job goal (01/13/26)
  • Essential Software Inc. (Rockville, MD)
    model drift, accuracy, latency, cost, and user experience. Run experiments and evaluations to improve model performance and outcomes for researchers and ... Sr. AI Software Engineer Essential Software Inc. (ESI) Overview Essential Software...to the National Cancer Institute's (NCI) large-scale data and research initiatives, providing secure, cloud-based platforms for scientific discovery… more
    job goal (01/13/26)
  • Google (Mountain View, CA)
    …on GenAI products and services. Our team has been instrumental in performing LLM/GenAI model evaluations and model fine-tuning (e.g., using SFT and RLHF ... Staff Software Engineer, GenAI, Data Quality. Google, Mountain...to push technology forward. Power the next Large Language Model (LLM) advancements with data. We are building Google-wide… more
    job goal (01/13/26)
  • Epoch (Berkeley, CA)
    Epoch AI is looking for a Software Engineer who will help us evaluate frontier AI models, enabling researchers, developers, and policymakers to better understand AI ... identity traits, etc.). We are looking for a Software Engineer to help us expand and develop our AI...benchmarks so we can quickly and painlessly evaluate new model releases. Develop new benchmarks: Contribute to the… more
    job goal (01/13/26)
  • OPPO US Research Center (Palo Alto, CA)
    OPPO US Research Center is seeking a full-time meticulous and innovative AI/LLM Test Engineer to join our cutting-edge AI team. In this critical role, you will ... also seeking a Contractor based LLM Evaluation & QA Engineer to support the testing and validation of large language model (LLM)-powered applications. You will help implement test strategies,… more
    job goal (01/13/26)