- OpenAI (San Francisco, CA)
- …Inference team. This role involves designing and implementing infrastructure for large- scale multimodal models, focusing on high-performance delivery of audio ... and image inputs. You'll collaborate closely with researchers and product teams to push the boundaries of AI technology, ensuring reliable production services. If you thrive in fast-paced environments and enjoy tackling complex challenges, this opportunity… more
- Tether Operations Limited (San Francisco, CA)
- …integrating text, visual, and audio modalities. Engineer scalable training and inference pipelines optimized for large‑ scale multimodal datasets and ... the full development pipeline from data processing & data loading to training, inference , and optimization. Experience working with large‑ scale text data, or… more
- Pantera Capital (Palo Alto, CA)
- …various modalities, including image, video, and audio. As an AI infrastructure researcher/ engineer on multimodal , you will develop and optimize scalable ... frameworks and software for large- scale machine-learning tasks, including model training, inference ,...training systems. Profiling, debugging, and optimizing GPU utilization for multimodal model training and inference . Building scalable… more
- Eloquent AI (San Francisco, CA)
- …AI At Eloquent AI, we're building the next generation of AI Operators- multimodal , autonomous systems that execute complex workflows across fragmented tools with ... future of financial services. Your Role As an AI Engineer at Eloquent AI, you will be at the...problem-solving skills, with the ability to customize, optimize, and scale AI models for real-world enterprise applications. If you're… more
- OpenAI (San Francisco, CA)
- …in ways far beyond text. In this role, you will: Design and implement inference infrastructure for large- scale multimodal models. Optimize systems for ... boundaries of what AI can do. We're expanding into multimodal inference , building the infrastructure needed to...software engineer to help us serve OpenAI's multimodal models at scale . You'll be part… more
- Virtue AI (San Francisco, CA)
- …we're looking for passionate builders to join our core team. What You'll Do As an Inference Engineer , you will own how models are served in production. Your job ... in AI security, its AI-native architecture unifies automated red‑teaming, real‑time multimodal guardrails, and systematic governance for enterprise apps and agents.… more
- NVIDIA Corporation (Santa Clara, CA)
- Senior Deep Learning Software Engineer , Inference page is loaded## Senior Deep Learning Software Engineer , Inferencelocations: US, CA, Santa Clara: US, ... Posted Yesterdayjob requisition id: JR2002670NVIDIA seeks a Senior Software Engineer specializing in Deep Learning Inference for...tuning of DL models in various domains like LLM, Multimodal and Generative AI.* Scale performance of… more
- Eloquent AI, Inc. (San Francisco, CA)
- …At Eloquent AI, we're building the next generation of AI Operators- multimodal , autonomous systems that execute complex workflows across fragmented tools with ... future of financial services. Your Role As an AI Engineer at Eloquent AI, you will be at the...problem‑solving skills, with the ability to customize, optimize, and scale AI models for real-world enterprise applications. If you're… more
- Pulse (San Francisco, CA)
- …founding experience is a plus About the Role Specialize in low-latency, high-throughput inference for OCR and multimodal models. Own profiling, batching, and ... data infrastructure: extracting accurate, structured information from complex documents at scale . We have a breakthrough approach to document understanding that… more
- Menlo Ventures (San Francisco, CA)
- …leaders working together to build beneficial AI systems. About the role Our Inference team is responsible for building and maintaining the critical systems that ... to life by serving our models via the industry's largest compute-agnostic inference deployments. We are responsible for the entire stack from intelligent request… more
- Neara (Palo Alto, CA)
- Job type: Full Time Department: Backend Engineer Work type: On-Site About A rchetype AI Archetype AI is developing the world's first AI platform to bring AI into the ... a foundation model for the physical world, a real-time multimodal LLM for real life, transforming real-world data into...the Role Were looking for a highly motivated backend engineer with a passion for building performant, scalable, and… more
- Institute of Foundation Models (Sunnyvale, CA)
- …consistent labels across large- scale datasets. Key Responsibilities: Training & Inference Systems Support the training of multimodal foundation models (eg, ... Experience Required Proficient in data collection, cleaning, and transformation at scale , including designing robust pipelines for multimodal datasets (eg,… more
- Liquid AI (San Francisco, CA)
- …Spun out of MIT, our mission is to build efficient AI systems at every scale . Our Liquid Foundation Models (LFMs) operate where others can't: on-device, at the edge, ... You If: You have experience with machine learning at scale You have worked with audio models and understand...frameworks like DeepSpeed, FSDP, or Megatron-LM You've worked with multimodal data (eg audio, text, image, video) You've contributed… more
- Crusoe Energy Systems LLC (San Francisco, CA)
- …The Crusoe Cloud Managed AI team seeks an ambitious and experienced Senior Software Engineer to join their team. You'll have a pivotal role in shaping the ... architecture and scalability of our next-generation AI inference platform. You will lead the design and implementation...This role gives you the opportunity to build and scale infrastructure capable of handling millions of API requests… more
- Crusoe Energy Systems LLC (San Francisco, CA)
- …etc. AI/ML Expertise: Experience in Generative AI (Large Language Models, Multimodal ). Familiarity with AI infrastructure, including training, inference , and ... people can create ambitiously with AI - without sacrificing scale , speed, or sustainability. Be a part of the...cloud infrastructure. About This Role: As a Principal Software Engineer on the Managed AI team at Crusoe, you'll… more
- Menlo Ventures (San Francisco, CA)
- …data pipelines to handle advanced language model training requirements Optimize large scale training and inference pipelines for stable and efficient ... together to build beneficial AI systems. About the role As a Staff Infrastructure Engineer on our team you will work end to end, identifying and addressing key… more
- OpenAI (San Francisco, CA)
- …training and inference infrastructure that powers frontier models at massive scale . Our systems unify how researchers train and serve models, abstracting away ... focus on advancing model capabilities while we handle the scale , efficiency, and reliability required to bring those models...life. About the Role We are looking for an engineer to design and implement the dataset infrastructure that… more
- Neara (Palo Alto, CA)
- …incidents. Support AI/ML infrastructure for training and serving models at scale , including GPU clusters, pipelines, and inference services. Contribute ... Job type: Full Time . Department: Backend Engineer . Work type: Remote About A rchetype...a foundation model for the physical world, a real-time multimodal LLM for real life, transforming real-world data into… more
- Virtue AI (San Francisco, CA)
- …Helm‑based enterprise installs Preferred Qualifications Experience deploying ML / LLM inference systems at scale Familiarity with vLLM, sglang, Triton, ... in AI security, its AI-native architecture unifies automated red‑teaming, real‑time multimodal guardrails, and systematic governance for enterprise apps and agents.… more
- Amazon (San Francisco, CA)
- …Robotics team, where you'll contribute to breakthrough foundation models run at production scale . As a Software Development Engineer embedded in our science ... Software Development Engineer , Frontier AI & Robotics Job ID: 2914306...your expertise in CUDA and TensorRT to achieve unprecedented inference efficiency at Amazon scale . In this… more