- Meta (Seattle, WA)
- …on one of our network engineering teams is for you! **Required Skills:** Network Engineer , HPC Systems Network Strategy Responsibilities: 1. Design, deploy, ... be responsible for conceiving, developing, and deploying software, hardware and network systems and tools that improve reliability and efficiency in our global… more
- University of Washington (Seattle, WA)
- …central IT organization designs, implements, and operates Linux-based High-Performance Computing ( HPC ) clusters and storage systems to offer UW researchers ... unlock new possibilities for their research. This position focuses on supporting HPC efforts for the research computing group, while providing expertise and support… more
- Meta (Bellevue, WA)
- …evolving AI workload needs.We are hiring in multiple locations. **Required Skills:** Software Engineer , Systems ML - HPC Specialist Responsibilities: 1. ... **Summary:** Meta is seeking an AI Software Engineer to join our Research & Development teams....on the web.Some aspects of this role as an HPC specialist may include authoring components such as cuBLAS,… more
- Amazon (Seattle, WA)
- …physical or life sciences or related discipline. - Working knowledge of HPC schedulers and distributed/parallel file systems , underlying IT systems ... they refactor an application or designing entirely new cloud-based systems . Do you enjoy solving novel and unique technical...some of the biggest challenges in High Performance Computing ( HPC )? Do you have a unique combination of deep… more
- Meta (Bellevue, WA)
- **Summary:** Meta is seeking a Hardware Systems Engineer to join our Release to Production (RTP) team working on new NPI hardware. Our servers and data centers ... high-performance software and hardware technologies for AI at datacenter scale. Hardware Systems Engineer in RTP work closely with HW/SW co-design teams,… more
- Meta (Bellevue, WA)
- **Summary:** Meta is seeking an experienced Production Systems Engineer to join our Release to Production (RTP) team. Our servers and data centers are the ... and lifecycle of servers in production. **Required Skills:** Production Systems Engineer , Fleet AI Systems ...wide-reaching, at-scale systems issues. 21. Experience supporting AI/ HPC systems and/or related components at scale.… more
- Amazon (Seattle, WA)
- Description As a Cloud Support Engineer you will learn at an accelerated pace how to use and leverage many different cloud technologies to help our customers ... (WorkSpaces, WorkMail, WorkDocs, WorkLink, WorkSpaces Application Manager), Directory Service, Systems Manager, EC2 Image Builder, Migration Suite, Application Discovery… more
- Meta (Bellevue, WA)
- …planning, distributed and centralized control systems , modeling/provisioning/automation, monitoring/troubleshooting/analytics, and simulation/design/failure ... Software Engineers with a passion for networking and aptitude for building scalable distributed systems . Do you want to work on one of the most dynamic, fast-paced… more
- Meta (Bellevue, WA)
- …planning, distributed and centralized control systems , modeling/provisioning/automation, monitoring/troubleshooting/analytics, and simulation/design/failure ... Software Engineers with a passion for networking and aptitude for building scalable distributed systems . Do you want to work on one of the most dynamic, fast-paced… more
- Amazon (Seattle, WA)
- …to efficiently process genomic data at scale. Our team is seeking a Software Development Engineer to join the Usability & Interfaces team. In this role, you will own ... workflow and performance optimizations, and interfacing with high-performance computing ( HPC ). You will also get an opportunity to support...will help each team member develop into a better-rounded engineer and enable them to take on more complex… more
- Google (Kirkland, WA)
- …limitations in the areas of virtualization, global infrastructure, distributed ML and HPC systems , load balancing, networking, massive data storage, and ... + 15 years of experience building software and distributed systems . + 10 years of experience with machine learning...innovative, and cutting edge ML workloads. As a Principal Engineer , you will be responsible for driving the Google… more
- Amazon (Seattle, WA)
- …to use. The AWS Health AI team is looking for a Software Development Engineer to build a new customer experience that will simplify how life sciences researchers ... foundation models (BioFMs), generative AI (GenAI), and high-performance computing ( HPC ). We do this by orchestrating these models and...drug therapies. You will design and build new software systems at the cutting edge of Healthcare & Life… more
- NVIDIA (Redmond, WA)
- We are looking for a LLVM/MLIR Compiler Engineer for an exciting role in our Compute Compiler Team. We deliver features and improvements to CUDA and other compute ... the top valued diverse minds in GPU computing and systems software, doing what you enjoy. See your efforts...what you enjoy. See your efforts in action as HPC and DL developers use features and optimizations to… more
- Amazon (Seattle, WA)
- …order to solve our customers toughest problems. As a Runtime Software Development Engineer you will have experience with high-performance Linux drivers, HPC ... or architecture (design patterns, reliability and scaling) of new and existing systems experience - 5+ years of full software development life cycle, including… more
- Amazon (Seattle, WA)
- …cloud offerings that enable high performance and scalability in AI/ML and HPC workloads. AWS Infrastructure Services owns the design, planning, delivery, and ... goal of improving the current customer experience as well as developing improved systems for future designs. You will work directly with vendors and ODM/JDM design… more
- Amazon (Seattle, WA)
- …the lowest cost per inference instances in the cloud. More SAP, high performance computing ( HPC ), ML, and Windows workloads run on AWS than any other cloud. In this ... or architecture (design patterns, reliability and scaling) of new and existing systems experience - Experience programming with at least one software programming… more
- Amazon (Seattle, WA)
- …SRE (Site Reliability Engineering), or Resilience Engineering - 5+ years of SysDE ( Systems Development Engineer ) or equivalent experience - 5+ years of server ... that enable high performance and scalability in AI/ML and HPC workloads. You are intrigued by the continuous release...have tremendous interest in cloud scale and curious how systems and software decisions impact the user. You insist… more