- Microsoft Corporation (Mountain View, CA)
- …the boundaries of scale, performance , and deployment, creating frontier AI systems that power transformative experiences across Microsoft. The Multimodal ... data libraries (Pandas, NumPy, etc.) OR equivalent experience. Experience with large-scale AI systems - design and deployment of distributed architectures,… more
- Micron Technology, Inc. (San Jose, CA)
- …position in the Artificial Intelligence ( AI ), Machine Learning (ML) and High Performance Computing ( HPC ) business segments. You will be working on innovative ... you will be charged with defining and accomplishing the strategy for a High Performance Memory product portfolio that will further fortify Micron's leadership… more
- Broadcom (San Jose, CA)
- …you apply. Job Description: Ethernet NIC product portfolio is designed for high performance computing and networking applications including AI and ML. This is ... of the next generation of Ethernet NIC solutions for AI /ML and High performance computing applications. We...networking is an added advantage. Experience analyzing and tuning performance for a variety of HPC workloads.… more
- Micron Technology, Inc. (San Jose, CA)
- …in growing the Artificial Intelligence ( AI ), Machine Learning (ML) and High- Performance Computing ( HPC ) business segments. You will be working on innovative ... of Work (SOWs), business term sheets, and other customer-facing documents for high- performance memory products. + Represent the Product Management team in Product… more
- Meta (Menlo Park, CA)
- …fabric and host networking, comms lib and scheduling infrastructure. **Required Skills:** AI / HPC Systems Performance Engineer Responsibilities: 1. ... **Summary:** Meta's AI Training and Inference Infrastructure is growing exponentially...workloads that expects a loss-less fabric interconnect. To improve performance of these systems we constantly look… more
- NVIDIA (Santa Clara, CA)
- …with AI / HPC workflows that use MPI + Experience analyzing and tuning performance for a variety of AI / HPC workloads. + Passion for continual learning ... GPU compute clusters that run demanding deep learning, high performance computing, and computationally intensive workloads. We seek a...storage systems like Lustre and GPFS for AI / HPC workloads + Familiarity with deep learning… more
- NVIDIA (Santa Clara, CA)
- …analyzing and tuning performance for a variety of AI / HPC workloads. Excellent problem-solving to analyze complex systems , identify bottlenecks, and ... and implement GPU compute clusters for deep learning and high- performance computing. What you'll be doing: + Provide leadership...storage systems like Lustre and GPFS for AI / HPC workload. Experience working with deep learning… more
- NVIDIA (Santa Clara, CA)
- …, time-series databases, and large-scale monitoring systems . + Familiarity with AI /ML pipelines, GPU-based workloads , and HPC environments. + Experience ... teams to optimize observability for model training, inference workloads, and HPC performance . + Leverage machine learning and statistical techniques… more
- Meta (Menlo Park, CA)
- …and host networking, communications lib and scheduling infrastructure. **Required Skills:** AI / HPC System Performance Engineer Responsibilities: 1. Lead ... **Summary:** Meta's AI Training and Inference Infrastructure is growing exponentially...a loss-less fabric interconnect with minimal latency. To improve performance of these systems we constantly look… more
- NVIDIA (Santa Clara, CA)
- …fit for you, we'd love to hear from you! NVIDIA is seeking a Senior High Performance Computing ( HPC ) and AI Networking Performance Research and Analysis ... In this exciting role, you will profile and analyze AI workloads on large GPUs and CPUs scale clusters...and platforms, such as HCAs, Switches, CPUs, GPUs, and Systems . You will develop performance analysis tools… more
- Meta (Menlo Park, CA)
- …These workloads expect a loss-less fabric interconnect with minimal latency. To improve performance of these systems we constantly look for opportunities across ... host networking, communications lib and scheduling infrastructure. **Required Skills:** AI / HPC Network Engineering Manager Responsibilities: 1. Manage engineers… more
- NVIDIA (Santa Clara, CA)
- …group at NVIDIA has openings for software architects in the field of AI and high- performance networking and system software. We research, develop, and ... and usable. + Creating proofs-of-concept to evaluate and motivate extensions in AI Frameworks (PyTorch/NEMO), HPC programming models (MPI, OpenSHMEM, PGAS), new… more
- NVIDIA (Santa Clara, CA)
- …Be Doing: + Primary responsibilities will include building and enabling robust AI / HPC infrastructure for customers + Support operational and reliability aspects ... of large-scale AI clusters, focusing on performance at scale,...in working with customers + Expertise with parallel file systems (eg Lustre, GPFS, BeeGFS, WekaIO) and high-speed interconnects… more
- NVIDIA (Santa Clara, CA)
- …looking for a Performance Engineer Intern focused on Deep Learning (DL) & High- Performance Computing ( HPC ) applications to join our diverse team. Our team is ... What you'll be doing: + Plan and execute GPU performance benchmarking across a wide range of HPC...quality, and want to be at the forefront of AI & HPC , we would love for… more
- Amazon (Santa Clara, CA)
- …computing and its potential to overcome some of the biggest challenges in High Performance Computing ( HPC )? Do you have a unique combination of deep technical ... C++, Python, CUDA, Bash - Deep GPU knowledge in HPC and/or AI /ML frameworks. Preferred Qualifications -...life sciences or related discipline. - Working knowledge of HPC schedulers and distributed/parallel file systems , underlying… more
- Google (Sunnyvale, CA)
- …cycles, building tools, architecting and developing software for scalable distributed systems , including data platform, AI /ML, and infrastructure. + Experience ... products, and different customer segments/use cases of the emerging AI compute tech stack. **About the job** The Google...of our customers and helping shape the future of HPC . As the Senior Manager in High Performance… more
- NVIDIA (Santa Clara, CA)
- …analyzing and tuning performance for a variety of AI / HPC workloads. Excellent problem-solving to analyze complex systems , identify bottlenecks, and ... deploy, and operate GPU Compute Clusters for EDA and high- performance computing workloads used across multiple teams and projects.... systems such as Lustre and GPFS for AI / HPC workload. + Familiarity with metrics collection… more
- NVIDIA (Santa Clara, CA)
- …management, and fabric scalability. + Experience working with benchmarking tools and performance analysis for large-scale HPC / AI networking deployments. + ... engine of modern Artificial Intelligence, Advanced Networking, and High Performance Computing ( HPC ) - the biggest technology...Published work, patents, or advanced certifications in networking or HPC systems . NVIDIA is widely considered to… more
- NVIDIA (Santa Clara, CA)
- …vision? What you will be doing: + Investigate opportunities to improve communication performance by identifying bottlenecks in today's systems . + Design and ... implement new communication technologies to accelerate AI and HPC workloads. + Explore innovative solutions in HW and SW for our next generation platforms as… more
- NVIDIA (Santa Clara, CA)
- …improved workflows and develop new, leading differentiated solutions. You will interact with HPC , OS, GPU compute, and systems specialist to architect, develop ... parallel computing. More recently, GPU deep learning ignited modern AI - the next era of computing. NVIDIA is...looking for an outstanding hands-on architect/engineer for a Senior HPC architect role to support deployment and bringup of… more