- Anthropic (San Francisco, CA)
- …innovations in GPU performance and systems engineering. As a GPU Performance Engineer , you'll architect and implement the foundational systems ... Strong candidates will have a track record of delivering transformative GPU performance improvements in production ML systems and will be excited to… more
- Anthropic (San Francisco, CA)
- A leading AI research and development firm in San Francisco is seeking a GPU Performance Engineer to innovate GPU performance and systems ... engineering. This role involves architecting systems that maximize performance and developing optimizations for large language models. Ideal candidates have strong… more
- Relace (San Francisco, CA)
- A tech company specializing in ML infrastructure is seeking a Machine Learning Engineer who excels at making models faster and more efficient through performance ... tuning and optimization. The ideal candidate will have a strong background in systems-level ML engineering, experience with CUDA, and familiarity with distributed training frameworks. This is an in-person role in San Francisco's FiDi district, ideal for those… more
- Oracle (Redwood City, CA)
- Senior Principal Software Engineer - AI GPU Innovation Oracle Cloud Infrastructure's (OCI) architecture development engineering team is seeking a highly driven ... GPU platform software & system development engineer ...as well as in‑house development, design reviews, system integration, performance testing and characterization. You will interact closely with… more
- San Francisco Compute Co. (San Francisco, CA)
- …located in San Francisco is looking for a dedicated professional to manage GPU training clusters. You will ensure the smooth operation of high- performance ... for hardware management. Applicants must have experience with Linux, networking fundamentals, and GPU clusters. The role offers a salary ranging from $170k to $300k… more
- NVIDIA Corporation (Santa Clara, CA)
- …NVIDIA is leading the way in groundbreaking developments in Artificial Intelligence, High- Performance Computing and Visualization. The GPU , our invention, serves ... driver experience; background and strength with complex system-level debugging, performance analysis and test design Familiarity with kernel level...time posted on Posted 8 Days Ago Senior Software Engineer - GPU and SOC locations US,… more
- Hamilton Barnes ? (San Francisco, CA)
- …/Senior Site Reliability Engineer to ensure the reliability and performance of GPU -powered infrastructure. You will design and maintain large-scale ... GPU clusters, build automation pipelines, and develop monitoring systems. Ideal candidates have hands-on Kubernetes and Slurm experience, along with deep Linux systems knowledge. This role is crucial for shaping the operational backbone of cutting-edge AI… more
- Oracle (Redwood City, CA)
- …leading cloud technology company in Redwood City is seeking a Senior Principal Software Engineer for AI GPU Innovation. The role involves developing AI hardware ... cloud services. Responsibilities include benchmarking AI accelerators and working on performance optimization. The position offers a competitive salary range and… more
- Boeing (Berkeley, MO)
- …Defense Space & Security (BDS) is seeking a Senior Design and Analysis Engineer to serve the Air Dominance program in Berkeley, MO . Position Responsibilities: ... operational and functional requirements Oversees the team that monitors supplier performance to ensure system integration and compliance with requirements Solves… more
- Oracle (Redwood City, CA)
- …located in Redwood City is seeking an experienced Senior Principal Software Engineer to innovate in AI infrastructure. You will evaluate cutting-edge hardware, ... collaborate with engineering teams, and lead in performance analysis. The ideal candidate has over 10 years of software development experience and knowledge of AI… more
- NVIDIA Corporation (Santa Clara, CA)
- …and reasoning models across multi-node distributed environments. Built in Rust for performance and Python for extensibility, Dynamo orchestrates GPU shards, ... Principal Software Engineer - Large-Scale LLM Memory and Storage Systems...outgrow the memory and compute budget of any single GPU , this platform enables efficient, resilient deployment of cutting-edge… more
- OpenAI (San Francisco, CA)
- …ideas become experiments and products). About the Role As a Training Performance Engineer , you'll drive efficiency improvements across our distributed training ... and uptime. This role blends deep systems understanding with practical performance engineering-analyzing GPU kernel performance , collective communication… more
- NVIDIA (Santa Clara, CA)
- …and reasoning models across multi-node distributed environments. Built in Rust for performance and Python for extensibility, Dynamo orchestrates GPU shards, ... outgrow the memory and compute budget of any single GPU , this platform enables efficient, resilient deployment of cutting-edge...cutting-edge LLM workloads. We are seeking a Principal Systems Engineer to define the vision and roadmap for memory… more
- OpenAI (San Francisco, CA)
- …to us than unfettered growth. About the Role OpenAI is looking for an experienced Performance Engineer to help us scale the performance , reliability, and ... apply deep technical expertise to optimize infrastructure and application-level performance across mission-critical products like ChatGPT and our developer API.… more
- Hamilton Barnes ? (San Francisco, CA)
- …Engineer /Senior Site Reliability Engineer , you'll own the reliability, performance , and automation of this GPU -powered infrastructure, ensuring seamless ... to the market and help shape the operational backbone of one of the largest GPU clusters in private deployment. If you want to build and operate infrastructure for… more
- Adobe (San Jose, CA)
- Principal Machine Learning Engineer , Firefly Join to apply for the Principal Machine Learning Engineer , Firefly role at Adobe Principal Machine Learning ... among the first 25 applicants Join to apply for the Principal Machine Learning Engineer , Firefly role at Adobe Get AI-powered advice on this job and more exclusive… more
- Meetingcpp GmbH (San Francisco, CA)
- [Remote] Performance Oriented, Sr. C++ Engineer for core animation runtime at Rive published at 10.04.2025 13:14 Location: Remote Company: Rive Remote support ... space! Rive is looking for a very experienced C++ Engineer , with an obsession for making code FAST, to...runtime. You will be part of a self-directed, low-level, performance oriented team that specializes in GPU … more
- P-1 AI (San Francisco, CA)
- …and bend it to our will. Our first product is Archie, an AI engineer capable of quantitative and spatial reasoning over physical product domains that performs at ... the level of an entry‑level design engineer . We aim to put an Archie on every...research team. Your focus will be on making large‑scale GPU training run reliably, efficiently, and fast on a… more
- Oracle (Seattle, WA)
- …required across Kernel, NIC, switch, transport, protocol, storage, and GPU communications. Develop production‑grade, high‑ performance software features with ... Sr Principal Software Engineer , Networking - AI Infrastructure Innovation OCI (Oracle...is pioneering the creation of next‑generation AI/HPC networking for GPU superclusters at massive scale. Our mission is to… more
- Character.AI (San Francisco, CA)
- …Storage) Experience working with GPUs Nice to have Experience with large GPU clusters and high- performance computing/networking Experience with supporting large ... Machine Learning Infrastructure Engineer Join to apply for the Machine Learning...deployments, manage experiments, and generally support our research Maximize GPU allocation and utilization for both serving and training… more