- NVIDIA (Santa Clara, CA)
- …be doing: + Build internal perf/power profiling and analysis tools and platform for AI workloads at cluster scale + Build debugging tools for common ... frameworks like PyTorch, TensorFlow, etc. + Knowledge of AI cluster job scheduling, storage management and...GPU cluster scale continuous profiling & analysis tools/platforms + Solid experience in large AI … more
- NVIDIA (Santa Clara, CA)
- …in administering CentOS/RHEL and/or Ubuntu Linux distributions + Solid understanding of cluster configuration management tools such as Ansible, Puppet, Salt + ... parallel computing. More recently, GPU deep learning ignited modern AI - the next era of computing. NVIDIA is...join us today! As a member of the GPU AI/HPC Infrastructure team, you will provide leadership in the… more
- NVIDIA (Santa Clara, CA)
- …looking for an experienced HPC Engineer to join the E2E software verification HPC/AI Infrastructure team. We are building supercomputers and HPC clusters based on ... We are looking for an outstanding architect for a senior HPC, be a key player to the most...be doing: + Designing, implementing and maintaining large scale HPC/AI clusters with monitoring, logging and alerting + Managing… more
- NVIDIA (Santa Clara, CA)
- …5K GPUs cluster. + Deep understanding of GPU computing and AI infrastructure. + Passion for solving complex technical challenges and optimizing system ... NVIDIA is the leader in AI, machine learning and datacenter acceleration. NVIDIA is...Solid experience with GPU clusters, and working knowledge of cluster configuration management tools such as BCM… more
- NVIDIA (Santa Clara, CA)
- …HW, and SW engineering and research teams to define a vision and roadmap for AI/HPC cluster observability. + Architect and lead teams to develop, test, and ... NVIDIA's Hardware Infrastructure organization is seeking a Senior or Principal Data and Observability Architect....We serve and collaborate directly with NVIDIA's rapidly growing AI, HW, and SW engineering and research teams across… more
- Google (Sunnyvale, CA)
- …storage, compute hardware, databases, file systems, data analytics, cluster management, and other software infrastructure areas. Preferred qualifications: ... We deliver enterprise-grade solutions that leverage Google's cutting-edge technology, and tools that help developers build more sustainably. Customers in more than… more
- NVIDIA (Santa Clara, CA)
- …JAX, or TensorFlow. + Deep understanding of GPU acceleration, CUDA programming, and cluster management tools like Kubernetes. + Strong programming skills in ... NVIDIA is searching for a senior or principal engineer who specializes in building...works on multimodal foundation models, large-scale robot learning, embodied AI, and physics simulation. Our past projects include Eureka… more
- NVIDIA (Santa Clara, CA)
- …cloud architectures, for a dynamic role. What You'll Be Doing: + Leverage AI-powered testing tools to improve test automation, increase coverage, and accelerate ... We are in search of a highly skilled Senior Test Developer Architect to join our dynamic...and optimization techniques. + Improve observability and monitoring through AI-powered anomaly detection, predictive analytics, and intelligent alerting. +… more
- NVIDIA (Santa Clara, CA)
- …software engineers with bare metal hardware experience to help scale up its AI Infrastructure. We expect you to have significant software engineering experience with ... to build and deploy leading infrastructure solutions for a broad range of AI-based applications. If you're creative, passionate about GPU hardware, and love having… more
- NVIDIA (Santa Clara, CA)
- …graphics, and revolutionized parallel computing. More recently, GPU deep learning ignited modern AI - the next era of computing. NVIDIA is a "learning machine" that ... parallel computing! More recently, GPU deep learning ignited modern AI - the next era of computing. NVIDIA is...high-frequency and low-power CPUs, GPUs, SoCs at block level, cluster level, and/or full chip level, with a focus… more
- NVIDIA (Santa Clara, CA)
- …architecture. Expertise with NVIDIA systems is a huge bonus. + Any experience with cluster software automation tools such as Ansible, Jenkins NVIDIA is leading ... people. Today, we're tapping into the unlimited potential of AI to define the next era of computing. An...GPU accelerated applications, such as Deep Learning. As a Senior NVLink System Software Bringup Engineer within our Fabric… more
- NVIDIA (Santa Clara, CA)
- …KVM/VMWare/Hyper-V environment. + Good knowledge and hands-on experience in model testing, AI tools/frameworks (TensorFlow, PyTorch, Cursor, etc.), NLP and ... OEM business. NVIDIA is also well positioned as the 'AI Computing Company', and NVIDIA GPUs are the brains...OS experience, reliability testing with various telemetries, scale out cluster, test plan development, CI/CD and DevOps experience to… more
- LinkedIn (Mountain View, CA)
- …This spans multiple areas including but not limited to: Providing tools and infrastructure that create a delightful development experience; Provide and support ... member facing products; Provide actionable insights that leverage data and metrics; Provide tools and data that help LinkedIn teams listen to their customers; and… more
- Google (Sunnyvale, CA)
- …related field. + 10 years of experience in product management, with a focus on developer tools or AI/ML. + Experience taking a product from 0 to 1 and from ... networking, storage, compute hardware, databases, file systems, data analytics, cluster management, and other software infrastructure areas. Preferred qualifications:… more
- NVIDIA (Santa Clara, CA)
- …development team to build and develop test content in C++. + Experience with AI-powered tools that rapidly accelerate automation. Those tools leverage ML, ... of high-quality software by focusing on code coverage and maintaining automation tools and infrastructure. + Contribute to the automation of manual test cases… more