• OpenAI (San Francisco, CA)
    …and low-latency connection management. Have 5+ years of experience as a software engineer and systems architect working on high-scale, high-reliability ... About the Team Our Inference team brings OpenAI's most capable research and.... About the Role We're looking for a senior engineer to design and build the load balancer that… more
    job goal (01/13/26)
    - Save Job - Related Jobs - Block Source
  • Databricks Inc. (San Francisco, CA)
    Staff Software Engineer - GenAI inference P-1285 About This Role As a staff software engineer for GenAI inference , you will lead the ... architecture, development, and optimization of the inference engine that powers Databricks Foundation Model API. You'll bridge research advances and production… more
    job goal (01/13/26)
    - Save Job - Related Jobs - Block Source
  • Menlo Ventures (San Francisco, CA)
    About This Role As a software engineer for GenAI inference , you will help design, develop, and optimize the inference engine that powers Databricks' ... are fast, scalable, and efficient. Your work will touch the full GenAI inference stack - from kernels and runtimes to orchestration and memory management. What… more
    job goal (01/13/26)
    - Save Job - Related Jobs - Block Source
  • DatologyAI (Redwood City, CA)
    …are in office 4 days a week. About the Role We're looking for an engineer with deep experience building and operating large-scale training and inference systems. ... infrastructure that powers both our internal ML research workflows and the high-performance inference pipelines that deliver curated data to our customers. As one of… more
    job goal (01/13/26)
    - Save Job - Related Jobs - Block Source
  • Menlo Ventures (San Francisco, CA)
    …and contribute to our innovative projects. Position Overview We are looking for a Software Engineer to work at the forefront of deploying our cutting-edge AI ... of our embodied systems. You will be responsible for optimizing AI inference processes from lightweight to billion-parameter models, ensuring our robots operate with… more
    job goal (01/13/26)
    - Save Job - Related Jobs - Block Source
  • OpenAI (San Francisco, CA)
    …tighter coordination with product and research. About the Role We're looking for a software engineer to help us serve OpenAI's multimodal models at scale. You'll ... About the Team OpenAI's Inference team powers the deployment of our most...with research. Are comfortable dealing with systems that span networking , distributed compute, and high-throughput data handling. Have familiarity… more
    job goal (01/13/26)
    - Save Job - Related Jobs - Block Source
  • Google Inc. (Sunnyvale, CA)
    Software Engineer III, Infrastructure, Inference Control Plane corporate_fare Google place Sunnyvale, CA, USA Apply Bachelor's degree or equivalent practical ... UI design and mobile; the list goes on and is growing every day. As a software engineer , you will work on a specific project critical to Google's needs with… more
    job goal (01/13/26)
    - Save Job - Related Jobs - Block Source
  • jobr.pro (Sunnyvale, CA)
    …UI design and mobile; the list goes on and is growing every day. As a software engineer , you will work on a specific project critical to Google's needs with ... Large Language Models (LLM) and other Machine Learning (ML) models for inference . Experience building GPU-related software . Experience with compilers or ML… more
    job goal (01/13/26)
    - Save Job - Related Jobs - Block Source
  • NVIDIA Corporation (Santa Clara, CA)
    Senior Technical Marketing Engineer - AI Inference at Scale page is loaded## Senior Technical Marketing Engineer - AI Inference at Scalelocations: US, ... intelligence. Our data center platforms integrate CPUs, GPUs, DPUs, networking , and a full-stack software ecosystem to...scale. We are looking for a Senior Technical Marketing Engineer to join our growing accelerated computing product team.… more
    job goal (01/12/26)
    - Save Job - Related Jobs - Block Source
  • Etched.ai, Inc. (San Jose, CA)
    networking solutions for large-scale inference workloads. As a Pod Software Engineer , you will focus on developing and qualifying software ... Overview We are seeking highly motivated and skilled Pod Networking Software Engineers to join our System...communication amongst Sohu inference nodes in multi-rack inference clusters. You will collaborate closely with kernel, platform,… more
    job goal (01/13/26)
    - Save Job - Related Jobs - Block Source
  • Neara (Palo Alto, CA)
    Job type: Full Time Department: Backend Engineer Work type: On-Site About A rchetype AI Archetype AI is developing the world's first AI platform to bring AI into the ... jobsarchetypeaiio. About the Role Were looking for a highly motivated backend engineer with a passion for building performant, scalable, and resilient distributed… more
    job goal (01/12/26)
    - Save Job - Related Jobs - Block Source
  • San Francisco Compute Co. (San Francisco, CA)
    …we make: a liquid market for GPU offtake. About the Role As a Principal Software Engineer - Networking , you'll be responsible for designing and operating ... distributed systems that can withstand node or cluster-wide failures Architect software -defined networking solutions that integrate with underlay switches and… more
    job goal (01/13/26)
    - Save Job - Related Jobs - Block Source
  • Pantera Capital (Palo Alto, CA)
    …share knowledge with their teammates. About the Role As part of the Network Software and Services for AI (nssAI) team at xAI, you'll build cutting‑edge software ... inter team collaboration and the data center for hands‑on experience using the software you write and identifying other opportunities of improvement. Focus Building … more
    job goal (01/13/26)
    - Save Job - Related Jobs - Block Source
  • NVIDIA (Austin, TX)
    Senior Software Architect, AI Networking page is loaded## Senior Software Architect, AI Networkinglocations: US, CA, Santa Clara: US, TX, Austin: US, CA, ... impact on the world.## Join NVIDIA as a Senior Software Architect for AI Networking and help...Networking and help define the future of AI inference infrastructure. In this groundbreaking role, you will drive… more
    job goal (01/12/26)
    - Save Job - Related Jobs - Block Source
  • NVIDIA Corporation (Austin, TX)
    Senior Software Architect, AI Networking page is loaded## Senior Software Architect, AI Networkinglocations: US, CA, Santa Clara: US, TX, Austin: US, CA, ... lasting impact on the world.Join NVIDIA as a Senior Software Architect for AI Networking and help...Networking and help define the future of AI inference infrastructure. In this groundbreaking role, you will drive… more
    job goal (01/12/26)
    - Save Job - Related Jobs - Block Source
  • Google Inc. (Sunnyvale, CA)
    Senior Software Engineer , Machine Learning, Kernal Apply Note: By applying to this position you will have an opportunity to share your preferred working location ... like PyTorch or JAX. 3 years of experience in software development for machine learning model inference ...goes on and is growing every day. As a software engineer , you will work on a… more
    job goal (01/13/26)
    - Save Job - Related Jobs - Block Source
  • Rockstar (San Francisco, CA)
    …promise is simple: they make your AI system better. They are hiring a Backend Software Engineer (ML Infrastructure) to help design, build, and scale the core ... or raw compute, but a full-stack backend for fine-tuning, reinforcement learning, inference , and long-term model maintenance. Their customers are Series A-C AI… more
    job goal (01/13/26)
    - Save Job - Related Jobs - Block Source
  • NVIDIA Corporation (Santa Clara, CA)
    Principal Software Engineer - Large-Scale LLM Memory and Storage Systems page is loaded## Principal Software Engineer - Large-Scale LLM Memory and ... storage (GPU, CPU, local disk, and remote memory) for high-throughput, low-latency inference .* Partner closely with GPU architecture, networking , and platform… more
    job goal (01/13/26)
    - Save Job - Related Jobs - Block Source
  • Eridu AI (Saratoga, CA)
    Sr. Staff/Principal Software Engineer - SONiC - SAI Eridu AI isa Silicon Valley hardware startup focused on accelerating training and inference performance ... . We are seeking a highly experienced Senior Staff Engineer to lead the architecture, development, and integration of...integration of OCP SAI (Switch Abstraction Interface) with SONiC ( Software for Open Networking in the Cloud).… more
    job goal (01/13/26)
    - Save Job - Related Jobs - Block Source
  • Vinci4d (Palo Alto, CA)
    …optimized. Our teams rely on high-performance computing environments, secure networking infrastructure, and cloud integrations to accelerate development. We're ... looking for a C++-heavy engineer who writes production application code-owning features end-to-end and making them consumable by customers. Role Summary This role… more
    job goal (01/13/26)
    - Save Job - Related Jobs - Block Source