• Meta (Austin, TX)
    …and host networking, communications lib and scheduling infrastructure. Required Skills: AI / HPC System Performance Engineer Responsibilities: Lead ... Summary: Meta's AI Training and Inference Infrastructure is growing exponentially...a loss-less fabric interconnect with minimal latency. To improve performance of these systems we constantly look… more
    job goal (12/12/25)
    - Save Job - Related Jobs - Block Source
  • Johns Hopkins University (Baltimore, MD)
    …Deployment and Design Develop and refine deployment strategies for scientific software on HPC and AI systems . Design computational workflows, selecting ... Agents). Performance Optimization Analyze and optimize the performance of AI models and HPC...Ensure compliance with security and regulatory standards for all HPC and AI systems . In… more
    job goal (12/12/25)
    - Save Job - Related Jobs - Block Source
  • Micron Technology, Inc. (Richardson, TX)
    …infrastructure. + Coordinate the management of enterprise SAN, NAS, and cloud storage systems to ensure reliability and performance . + Implement new storage ... learn, communicate and advance faster than ever. As an HPC Staff Engineer at Micron, you will join a...storage environments, including enterprise SAN NAS and cloud storage systems across the company's global infrastructure! Your role will… more
    DirectEmployers Association (12/10/25)
    - Save Job - Related Jobs - Block Source
  • Oracle (Indianapolis, IN)
    …of what's possible. Responsibilities Lead architecture, system design, and implementation for high- performance RDMA solutions across OCI's AI / HPC platforms, ... If you thrive at the intersection of large-scale distributed systems , high-speed networking, and AI workloads, this...and performance tuning at scale. Familiarity with AI / HPC stacks and workloads: NCCL/RCCL/MPI, Slurm or… more
    job goal (12/12/25)
    - Save Job - Related Jobs - Block Source
  • Oracle (Austin, TX)
    …the forefront of building a cutting-edge, ultra-high- performance GPU platform designed to support AI /ML/ HPC workloads. This is your chance to be part of the ... AI revolution, creating systems that allow customers...and diagnostic services. These are essential for running distributed AI /ML/ HPC workloads across thousands of GPUs, leveraging… more
    job goal (12/12/25)
    - Save Job - Related Jobs - Block Source
  • Oracle (Nashville, TN)
    …the forefront of building a cutting-edge, ultra-high- performance GPU platform designed to support AI /ML/ HPC workloads. This is your chance to be part of the ... AI revolution, creating systems that allow customers...and diagnostic services. These are essential for running distributed AI /ML/ HPC workloads across thousands of GPUs, leveraging… more
    job goal (12/12/25)
    - Save Job - Related Jobs - Block Source
  • Oracle (Salt Lake City, UT)
    …the forefront of building a cutting-edge, ultra-high- performance GPU platform designed to support AI /ML/ HPC workloads. This is your chance to be part of the ... AI revolution, creating systems that allow customers...and diagnostic services. These are essential for running distributed AI /ML/ HPC workloads across thousands of GPUs, leveraging… more
    job goal (12/12/25)
    - Save Job - Related Jobs - Block Source
  • Microsoft Corporation (Mountain View, CA)
    …the boundaries of scale, performance , and deployment, creating frontier AI systems that power transformative experiences across Microsoft. The Multimodal ... data libraries (Pandas, NumPy, etc.) OR equivalent experience. Experience with large-scale AI systems - design and deployment of distributed architectures,… more
    job goal (12/12/25)
    - Save Job - Related Jobs - Block Source
  • Oracle (Washington, DC)
    …the forefront of building a cutting-edge, ultra-high- performance GPU platform designed to support AI /ML/ HPC workloads. This is your chance to be part of the ... AI revolution, working with systems that allow...scale from tens to thousands of GPUs without compromising performance . Our team is responsible for designing and developing… more
    job goal (12/12/25)
    - Save Job - Related Jobs - Block Source
  • Oracle (Denver, CO)
    …metrics, logs, eBPF/perf, chaos/failure testing, and SLO-driven operations. Knowledge of AI / HPC workload patterns and their implications for storage, query ... Job Description OCI (Oracle Cloud) AI Infrastructure Innovation team is inventing the next...If you thrive at the intersection of large-scale distributed systems , database internals, and cloud platforms, this role offers… more
    job goal (12/12/25)
    - Save Job - Related Jobs - Block Source
  • General Dynamics Information Technology (Falls Church, VA)
    …Family: IT Infrastructure and Operations Skills: High Performance Computing ( HPC ),High- Performance Computing ( HPC ) Systems ,Scientific Research ... Experience working directly with researchers for support and troubleshooting Experience with HPC systems and tooling like SLURM, GPFS, Globus Experience with… more
    job goal (12/12/25)
    - Save Job - Related Jobs - Block Source
  • Argonne National Laboratory (Lemont, IL)
    …and evaluate vector databases and retrieval-augmented generation (RAG) and integrate agentic AI systems to meet the demands of large-scale fine-tuning and ... group, a vibrant multidisciplinary team of scientists and High Performance Computing ( HPC ) engineers. In the AL/ML...learning and statistics techniques. With the advancement of Exascale systems and the variety of novel AI more
    DirectEmployers Association (10/15/25)
    - Save Job - Related Jobs - Block Source
  • Curtiss-Wright Corporation (Davidson, NC)
    …evaluated when extending an offer. Your Challenge Develop architectures for high performance software, advanced analytics, and AI /ML solution to solve industry ... relationships with key customers to understand problems and required solutions for high performance software, advanced analytics, and AI / ML -20% Coordinate… more
    job goal (12/12/25)
    - Save Job - Related Jobs - Block Source
  • Sandia National Laboratories (Albuquerque, NM)
    …independent project, thesis, or dissertation). Experience in developing software and AI systems for enterprise and national security applications. Demonstrated ... with distributed training frameworks (MPI, Horovod, Ray), hyperparameter tuning, and HPC systems . Hands-on experience with model optimization techniques… more
    job goal (12/12/25)
    - Save Job - Related Jobs - Block Source
  • Micron Technology, Inc. (Richardson, TX)
    …engineering skillset to explore future memory architectures for high- performance compute ( HPC ) and artificial intelligence ( AI ) systems . You will have ... desired + Knowledge of memory hierarchies and GPU offloading in heterogeneous AI systems is highly desired + Knowledge of filesystems and familiarity with… more
    DirectEmployers Association (12/10/25)
    - Save Job - Related Jobs - Block Source
  • Oracle (Little Rock, AR)
    …in the RDMA cluster networking domain and enable seamless, accelerated High- Performance Compute ( HPC ), Artificial Intelligence and Machine Learning advancements. ... force, driving the development and design of state-of-the-art RDMA clusters tailored specifically for AI , ML, HPC workloads. We strive to be the go-to experts in… more
    job goal (12/12/25)
    - Save Job - Related Jobs - Block Source
  • Robert Half (Raymond, OH)
    …with Computer-Aided Engineering (CAE) tools (Pre/Post and Solving). Familiarity with High- Performance Computing ( HPC ) environments. Knowledge of AWS and Azure ... and collaboration with CAE leadership to ensure a secure, reliable, and high- performance environment. Key Responsibilities Daily Tasks Provide CAE Linux user support… more
    job goal (12/12/25)
    - Save Job - Related Jobs - Block Source
  • Micron Technology, Inc. (Richardson, TX)
    …designing and optimizing High Bandwidth Memory (HBM) products for AI /ML, high- performance computing ( HPC ), and data-centric systems , collaborating across ... ever. Micron's Heterogeneous Integration Group (HIG) is shaping the future of AI and accelerated computing by developing sophisticated memory solutions! The team… more
    job goal (12/12/25)
    - Save Job - Related Jobs - Block Source
  • Micron Technology, Inc. (San Jose, CA)
    …position in the Artificial Intelligence ( AI ), Machine Learning (ML) and High Performance Computing ( HPC ) business segments. You will be working on innovative ... you will be charged with defining and accomplishing the strategy for a High Performance Memory product portfolio that will further fortify Micron's leadership… more
    job goal (12/12/25)
    - Save Job - Related Jobs - Block Source
  • Meta (Bellevue, WA)
    …Qualifications Experience in leading teams working on high performance computing ( HPC ) and AI /ML systems , including: GPU/ASIC-based kernel development ... distributed systems for large scale training and serving, and systems architecture and performance Accelerator (GPU/ASIC) kernel development and optimization… more
    job goal (12/12/25)
    - Save Job - Related Jobs - Block Source