- Databricks Inc. (San Francisco, CA)
- A leading AI-focused technology company in San Francisco is seeking a Software Engineer for GenAI inference . In this role, you'll design, develop, and ... optimize the inference engine powering the Foundation Model API. You will collaborate closely with researchers and engage in performance-critical system challenges,… more
- Databricks Inc. (San Francisco, CA)
- Staff Software Engineer - GenAI inference P-1285 About This Role As a staff software engineer for GenAI inference , you will lead the ... low latency, and robust scaling. Your work will encompass the full GenAI inference stack: kernels, runtimes, orchestration, memory, and integration with… more
- Menlo Ventures (San Francisco, CA)
- About This Role As a software engineer for GenAI inference , you will help design, develop, and optimize the inference engine that powers Databricks' ... our large language model (LLM) serving systems are fast, scalable , and efficient. Your work will touch the full..., and efficient. Your work will touch the full GenAI inference stack - from kernels and… more
- Apple Inc. (Seattle, WA)
- …with data, ultimately helping teams derive insights that drive product success. As a Staff GenAI Engineer on the Apple Data Platform group's GenAI Platform ... Apple. Description Join Apple's Data Platform as a Staff GenAI Engineer , where you'll be at the... optimization enabling teams across Apple to rapidly create scalable , context‑aware GenAI solutions. You will collaborate… more
- NVIDIA Corporation (Santa Clara, CA)
- …Frameworks ( and ) team. Megatron Core and NeMo Framework are open-source, scalable and cloud-native frameworks built for researchers and developers working on Large ... and Multimodal (MM) foundation model pretraining and post-training. Our GenAI Frameworks provide end-to-end model training, including pretraining, alignment,… more
- Rivian (Palo Alto, CA)
- … Engineer , you will be instrumental in building and maintaining a scalable training and inference platform using both Databricks and open‑source technologies. ... Scalable ML Infrastructure: Design and implement a scalable training and inference platform using Databricks and open‑source technologies to support ML/AI… more
- Rivian (Palo Alto, CA)
- …in California seeks an ML Ops Engineer to build and maintain a scalable training and inference platform. The role involves managing ML/AI models, utilizing ... cloud technologies, and collaborating with cross-functional teams. Candidates should have 5+ years in ML Ops, experience with distributed training frameworks, proficiency in programming languages like Python, and a solid background in cloud technologies. This… more
- Freddie Mac (Mclean, VA)
- …governance and ethical deployment frameworks. Familiarity with CI/CD practices for ML Ops and scalable inference APIs. Keys to Success in this Role: Deep Gen AI ... greater purpose. Position Overview: We are seeking an Software Engineer , Professional - Gen AI (Data) Scientist with a...of performance and reliability. Key Responsibilities: Design and implement scalable AI Agents, Agentic Workflows and GenAI … more
- Northeastern University (Boston, MA)
- …models to production JOB DUTIES Model Development & Deployment Design and implement scalable model serving architectures for both GenAI (LLMs, diffusion models) ... Sr. Machine Learning Engineer Apply locations Boston, MA (Main Campus) time...traditional ML models Build and maintain real-time and batch inference pipelines with high availability and fault tolerance Optimize… more
- Genentech (San Francisco, CA)
- …and optimise workflows. We also work on scaling up model training and inference , evaluating the quality of AI/ML models and output, and building impactful ... the scientific needs. The Opportunity: As a machine learning engineer in AI Enablement, you will be working closely...everyone in between. You'll build, own, and constantly improve scalable AI/ML based systems that unlock the potential of… more
- NVIDIA Corporation (Santa Clara, CA)
- …diverse real world workloads. Megatron Core and NeMo Framework are open-source, scalable and cloud-native frameworks built for researchers and developers working on ... and Multimodal (MM) foundation model pretraining and post-training. Our GenAI Frameworks provide end-to-end model training, including pretraining, reasoning,… more
- Teradata Corporation (SE) (Boston, MA)
- …be a critical force in defining how Teradata evolves to deliver intelligent, scalable , and secure AI capabilities tightly integrated into our hybrid cloud platform. ... Teradata's foundational AI strategy - driving capabilities that enable agentic AI, GenAI , and advanced analytics on the Teradata VantageCloud platform. Design … more
- Teradata Corporation (SE) (Honolulu, HI)
- …be a critical force in defining how Teradata evolves to deliver intelligent, scalable , and secure AI capabilities tightly integrated into our hybrid cloud platform. ... Teradata's foundational AI strategy - driving capabilities that enable agentic AI, GenAI , and advanced analytics on the Teradata VantageCloud platform. Design … more
- Press Ganey Associates LLC (Chicago, IL)
- …and online inference of AI systems.Deploy ML models, LLMs and GenAI systems into production, ensuring reliability, efficiency, and scalability across cloud or ... Staff Data Engineer ( Boston or Chicago ) page is...and generative AI solutions. This position focuses on creating scalable , reliable systems and processes that streamline the developer… more
- Socotra, Inc. (San Francisco, CA)
- Build the Future of Scalable AI at TrueFoundry At TrueFoundry , we're redefining how ML teams train, deploy, and scale their models. Our LLMOps and MLOps platform ... on Kubernetes-with the same muscle as Big Tech. We're looking for an Engineer who is passionate about scaling deep learning workloads, optimizing multi-GPU training,… more
- Dana-Farber Cancer Institute (Boston, MA)
- The Senior Artificial Intelligence & Machine Learning Engineer I/Scientist I works within the Artificial Intelligence Operations and Data Science Services group ... of Harvard Medical School. The Senior Artificial Intelligence & Machine Learning Engineer I/Scientist I works in a team environment on both short-term priorities… more
- harvey.ai (San Francisco, CA)
- … GenAI ‑native applications - such as supporting high‑throughput model inference , managing streaming and long‑running API interactions, and designing abstractions ... today - and we're just getting started. Role Overview As a Backend Platform Engineer at Harvey, you will help build and operate the cohesive backend platform that… more
- EarlyStage Partners (Sunnyvale, CA)
- …What are we looking for? We're in search of a Senior Generative AI Engineer who brings deep expertise in building and deploying large language models and AI ... ML/AI experience with at least 1 year of hands-on experience in GenAI /LLM projects Track record of successfully deploying ML/AI systems in production environments… more
- Rubrik, Inc. (Palo Alto, CA)
- …Innovate at the intersection of AI, security, and distributed systems: Design scalable mechanisms for agent discovery and classification . Contribute to the design ... infrastructure . Experience You'll Need: You are passionate about building secure, scalable AI infrastructure that enables enterprises to safely harness the power of… more
- Forsta (Boston, MA)
- Staff Data Engineer ( Boston or Chicago ) page is loaded **Staff Data Engineer ( Boston or Chicago )**remote typeHybrid locationsChicago, IL time typeFull time ... Ganey is looking to hire a self-motivated Staff Data Engineer with data platform experience. The Staff Data ...and generative AI solutions. This position focuses on creating scalable , reliable systems and processes that streamline the developer… more