- Meta (Austin, TX)
- …fabric and host networking, communications lib and scheduling infrastructure. Required Skills: AI / HPC System Performance Engineer Responsibilities: Lead ... a loss-less fabric interconnect with minimal latency. To improve performance of these systems we constantly look...with teamwork and close collaboration Responsible for the overall performance of the communication system , including … more
- Oracle (Austin, TX)
- …possible. Responsibilities Lead architecture, system design, and implementation for high- performance RDMA solutions across OCI's AI / HPC platforms, ... If you thrive at the intersection of large-scale distributed systems , high-speed networking, and AI workloads, this...and performance tuning at scale. Familiarity with AI / HPC stacks and workloads: NCCL/RCCL/MPI, Slurm or… more
- Micron Technology, Inc. (Richardson, TX)
- …designing and optimizing High Bandwidth Memory (HBM) products for AI /ML, high- performance computing ( HPC ), and data-centric systems , collaborating across ... ever. Micron's Heterogeneous Integration Group (HIG) is shaping the future of AI and accelerated computing by developing sophisticated memory solutions! The team… more
- Oracle (Austin, TX)
- …the forefront of building a cutting-edge, ultra-high- performance GPU platform designed to support AI /ML/ HPC workloads. This is your chance to be part of the ... AI revolution, creating systems that allow customers...and diagnostic services. These are essential for running distributed AI /ML/ HPC workloads across thousands of GPUs, leveraging… more
- Oracle (Austin, TX)
- …metrics, logs, eBPF/perf, chaos/failure testing, and SLO-driven operations. Knowledge of AI / HPC workload patterns and their implications for storage, query ... Job Description OCI (Oracle Cloud) AI Infrastructure Innovation team is inventing the next...If you thrive at the intersection of large-scale distributed systems , database internals, and cloud platforms, this role offers… more
- Meta (Austin, TX)
- …fabric and host networking, communications lib and scheduling infrastructure. **Required Skills:** AI / HPC System Performance Engineer Responsibilities: ... a loss-less fabric interconnect with minimal latency. To improve performance of these systems we constantly look...teamwork and close collaboration 3. Responsible for the overall performance of the communication system , including … more
- Deloitte (Dallas, TX)
- …building secure networks and modern data centers , to enabling the adoption of AI or high- performance computing ( HPC ), you'll gain firsthand experience with ... organizations through Data Center and infrastructure transformation journeys, such as adopting AI , deploying high- performance computing ( HPC ) or edge… more
- Texas A&M University System (College Station, TX)
- …patching, and performance tuning.* Oversee networking, security, and infrastructure for HPC systems .* Lead the development of specialized HPC computing ... expertise and consultation for the design and deployment of HPC systems . Get in on the ground...the following:* Experience with High Performance Computing ( HPC ) environments* Advanced Linux system administration skills*… more
- Amazon (Austin, TX)
- …computing and its potential to overcome some of the biggest challenges in High Performance Computing ( HPC )? Do you have a unique combination of deep technical ... C++, Python, CUDA, Bash - Deep GPU knowledge in HPC and/or AI /ML frameworks. Preferred Qualifications -...life sciences or related discipline. - Working knowledge of HPC schedulers and distributed/parallel file systems , underlying… more
- Google (Austin, TX)
- …cycles, building tools, architecting and developing software for scalable distributed systems , including data platform, AI /ML, and infrastructure. + Experience ... products, and different customer segments/use cases of the emerging AI compute tech stack. **About the job** The Google...of our customers and helping shape the future of HPC . As the Senior Manager in High Performance… more
- Micron Technology, Inc. (Richardson, TX)
- …infrastructure. + Coordinate the management of enterprise SAN, NAS, and cloud storage systems to ensure reliability and performance . + Implement new storage ... learn, communicate and advance faster than ever. As an HPC Staff Engineer at Micron, you will join a...storage environments, including enterprise SAN NAS and cloud storage systems across the company's global infrastructure! Your role will… more
- Micron Technology, Inc. (Richardson, TX)
- …designing and optimizing High Bandwidth Memory (HBM) products for AI /ML, high- performance computing ( HPC ), and data-centric systems , collaborating across ... role offers the opportunity to contribute to ground breaking memory solutions for AI , HPC , and data-centric systems . **Responsibilities** + Contribute to… more
- Amazon (Austin, TX)
- …and operating AWS cloud offerings that enable high performance and scalability in AI /ML and HPC workloads. You are intrigued by the continuous release of ... Want to do industry leading work delivering continuous price performance improvements in the cloud for AI ...have tremendous interest in cloud scale and curious how systems and software decisions impact the user. You insist… more
- Micron Technology, Inc. (Richardson, TX)
- …designing and optimizing High Bandwidth Memory (HBM) products for AI /ML, high- performance computing ( HPC ), and data-centric systems , collaborating across ... Introduction** Micron's Heterogeneous Integration Group (HIG) is shaping the future of AI and accelerated computing by developing advanced memory solutions. The team… more
- Oracle (Austin, TX)
- …Responsibilities + Lead architecture, system design, and implementation for high- performance RDMA solutions across OCI's AI / HPC platforms, including ... If you thrive at the intersection of large-scale distributed systems , high-speed networking, and AI workloads, this... performance tuning at scale. + Familiarity with AI / HPC stacks and workloads: NCCL/RCCL/MPI, Slurm or… more
- Micron Technology, Inc. (Richardson, TX)
- …designing and optimizing High Bandwidth Memory (HBM) products for AI /ML, high- performance computing ( HPC ), and data-centric systems , collaborating across ... ever. Micron's Heterogeneous Integration Group (HIG) is shaping the future of AI and accelerated computing by developing sophisticated memory solutions! The team… more
- Oracle (Austin, TX)
- …the forefront of building a cutting-edge, ultra-high- performance GPU platform designed to support AI /ML/ HPC workloads. This is your chance to be part of the ... AI revolution, creating systems that allow customers...and diagnostic services. These are essential for running distributed AI /ML/ HPC workloads across thousands of GPUs, leveraging… more
- Micron Technology, Inc. (Richardson, TX)
- …designing and optimizing High Bandwidth Memory (HBM) products for AI /ML, high- performance computing ( HPC ), and data-centric systems , collaborating across ... ever. Micron's Heterogeneous Integration Group (HIG) is shaping the future of AI and accelerated computing by developing advanced memory solutions! The team focuses… more
- Oracle (Austin, TX)
- …the forefront of building a cutting-edge, ultra-high- performance GPU platform designed to support AI /ML/ HPC workloads. This is your chance to be part of the ... AI revolution, creating systems that allow customers...and diagnostic services. These are essential for running distributed AI /ML/ HPC workloads across thousands of GPUs, leveraging… more
- Oracle (Austin, TX)
- …the forefront of building a cutting-edge, ultra-high- performance GPU platform designed to support AI /ML/ HPC workloads. This is your chance to be part of the ... AI revolution, creating systems that allow customers...and diagnostic services. These are essential for running distributed AI /ML/ HPC workloads across thousands of GPUs, leveraging… more