- Amazon (San Francisco, CA)
- …of software development experience, strong programming skills, and a passion for AI / ML . The position offers flexibility and an inclusive culture. #J-18808-Ljbffr ... San Francisco. In this role, you will help shape the future of AI and cloud computing by designing and operating innovative infrastructure solutions. Candidates… more
- PlanetiQ (Boston, MA)
- …to design, build, and maintain high-performance computing clusters and cloud solutions for AI / ML applications. The role offers a competitive base salary from ... $180k to $230k, with benefits like stock options, and a hybrid working model. The ideal candidate will have strong skills in automation, site reliability engineering, and a passion for Earth sciences. #J-18808-Ljbffr more
- Hewlett Packard Enterprise Development LP (San Jose, CA)
- Senior Software Engineer - HPC & AI Advanced Development This role has been designated as 'Remote/Teleworker', which means you will primarily work from home. ... Senior Software Engineer to join the Advanced Programming Team within the HPC & AI Advanced Development Organization. This position is remote within the… more
- NVIDIA Corporation (Santa Clara, CA)
- …computing workloads.Observability is at the heart of this transformation. We are looking for a Senior AI & HPC Observability Engineer to design and build the ... Senior Engineer - AI and HPC Observability page is...PromQL, time-series databases, and large-scale monitoring systems.* Familiarity with AI / ML pipelines, GPU-based workloads, and HPC… more
- Accenture (California, MO)
- …advising and engaging with C‑Suite executives and senior leadership, translating complex AI and HPC technologies into business and strategic value Minimum 4+ ... (NVIDIA CUDA, AMD ROCm) and integration considerations for HPC / AI workloads Knowledge of AI / ML frameworks (TensorFlow, PyTorch) and their operational and… more
- Dyna Robotics (Redwood City, CA)
- …years in software with hands-on experience in ML model tuning and managing cloud environments. Join us to shape the future of AI -driven robotics. #J-18808-Ljbffr ... Machine Learning Infrastructure Engineer. This role involves designing scalable ML training platforms, optimizing high-performance computing systems, and ensuring… more
- Amazon (Herndon, VA)
- …of the following programming languages: C++, Python, CUDA, Bash. Deep GPU knowledge in HPC and/or AI / ML frameworks. Current, active US Government Security ... large analytical problems as massive scale? Amazon Web Services (AWS) is seeking a Senior Worldwide Specialist Solutions Architect focused on HPC to work with… more
- Lavendo (San Francisco, CA)
- …Cloud Computing, Infrastructure-as-Code Candidate Location: Remote US The Opportunity We are seeking a Senior AI / ML Specialist Solutions Architect to join ... a publicly traded company at the forefront of the AI revolution, offering an AI -centric cloud platform...maximize performance and business value Lead the transition of ML pipelines from POC to scalable production systems Build… more
- Ring Inc (San Francisco, CA)
- …The ideal candidate has significant experience in software engineering and management in AI / ML environments, particularly with Python and Golang, and is familiar ... with cloud platforms and distributed architectures. The organization offers a flexible remote-first work policy with comprehensive benefits. #J-18808-Ljbffr more
- Siemens AG (Santa Clara, CA)
- …C/C++, CUDA, and Python. Responsibilities include enhancing semiconductor manufacturing tools with AI and ML applications. This full-time position offers a ... hybrid work model and is part of a collaborative team in a competitive industry environment. #J-18808-Ljbffr more
- Labelbox (San Francisco, CA)
- …systems and high-impact research workflows across data, tooling, and infrastructure. Position Senior C++ Full-Stack Engineer - AI Data & Infrastructure Type: ... experience with data annotation, data quality, or evaluation systems Familiarity with AI / ML workflows, model training, or benchmarking pipelines Experience with… more
- Boson AI (Palo Alto, CA)
- About The Role We're looking for a Senior Site Reliability Engineer to help us run one of the most exciting GPU clusters around-our Toronto datacenter packed with ... of servers. You'll be hands‑on with the full lifecycle of HPC infrastructure: planning, building, testing, deploying, and keeping everything running smoothly.… more
- Roblox Corporation (San Mateo, CA)
- Senior Hardware Engineer - GPU & AI Infrastructure Every day, tens of millions of people come to Roblox to explore, create, play, learn, and connect with friends ... with a specific focus on GPU architecture (NVIDIA HGX/MGX platforms preferred), AI accelerators, or high-performance compute ( HPC ) systems. Deep Technical… more
- Pacific Northwest National Laboratory (Seattle, WA)
- …design, execution, and analysis of large‑scale machine learning experiments, including HPC ‑based experiments lasting weeks. Evaluate AI systems across ... with current ML research landscape, especially xAI, adversarial ML , AI safety, and deep learning science. Hands‑on experience analyzing internal structures… more
- Ll Oefentherapie (Nashville, TN)
- …triage automation, and diagnostic services. These are essential for running distributed AI / ML / HPC workloads across thousands of GPUs, leveraging technologies ... to scale and optimize Monitoring and Repair solutions for AI infrastructure components like GPU control plane and GPU...GPU data plane that provide computing resources to customer AI workloads. You will provide technical leadership to the… more
- NVIDIA Corporation (Santa Clara, CA)
- …looking for an experienced engineer to triage customers' hardware platform issues and AI / ML workloads in huge datacenters of rack-scale platforms, solve customer ... and the ability to analyze, optimize, and customize Linux environments for AI / ML workloads.* Containerized solutions experience with Docker, Kubernetes, Slurm*… more
- Support Revolution (San Jose, CA)
- …passionate, and committed engineers, technologists, and business leaders to join us. Job Summary: Senior AI Product Manager with strong experience in AI / ... Center, Cloud Computing, Enterprise IT, Hadoop/ Big Data, Hyperscale, HPC and IoT/Embedded customers worldwide. We are the #5...create intuitive AI experiences and developer tools. AI Integration: Work with ML engineers and… more
- Datacrunch (San Francisco, CA)
- … HPC GPU clusters, diagnosing performance issues, and designing efficient HPC jobs. Familiarity with ML model training environments. Understanding of ... access to intelligence. We're building a fully featured European AI cloud - with everything one needs to train,...vision and expectations. About the role We're seeking a Senior or Principal Site Reliability Engineer (SRE) to become… more
- NVIDIA Corporation (Santa Clara, CA)
- Senior Storage and Networking Product Engineer page is...are pioneers in making the impossible achievable, particularly within AI , ML , and HPC . Joining ... loaded## Senior Storage and Networking Product Engineerlocations: US, CA, Santa...architectures for storage environments, ensuring low-latency data paths for AI / ML and HPC workloads.* Configure… more
- NVIDIA Corporation (Santa Clara, CA)
- …NVIDIA NVLink, NVIDIA InfiniBand networking, NVIDIA Grace CPUs, and a fully optimized NVIDIA AI and HPC software stack. We are searching for a highly motivated ... for our data center platforms and products* Characterize real-world AI training, inference, and HPC workloads at...teams**Ways to stand** **out from the crowd: Experience with AI / ML frameworks (PyTorch, TensorFlow, JAX). Knowledge of… more