- NVIDIA (Santa Clara, CA)
- …infrastructure at scale. We are looking for a talented and experienced engineer having experience with RAS(Reliability, Availability, and Serviceability ) and ... NVIDIA's Data Center products. + Define server-level reliability, availability, and serviceability requirements in collaboration with various customers like CSPs and… more
- NVIDIA (Santa Clara, CA)
- NVIDIA is looking for Senior Software Engineering to join NVIDIA in the Cumulus Linux team! We present you with an opportunity to be part of the team that develops ... implementing core infrastructure services, as well as Reliability, Availability and Serviceability features for Cumulus Linux, the Debian-based operating system for… more
- Google (Sunnyvale, CA)
- …+ Identify critical user journeys that capture operational requirements (eg, serviceability , automation, safety, repair cost, and deployment time). + Integrate ... design for operations into the architecture of platforms hardware and software and investigate alternative architectures and roadmaps to enable an optimized system design. + Engage in the development of the system architecture, including design intent,… more
- NVIDIA (Santa Clara, CA)
- …from idea to reality. + Define server-level reliability, availability, and serviceability requirements in collaboration with various customers like CSPs and deliver ... fault resilient solution at scale as per customer expectations. + Collaborate with hardware, software and firmware teams to drive failure analysis and large scale solution deployment. + Work with engineering teams across NVIDIA to ensure your software… more
- Amazon (Sunnyvale, CA)
- …Product Operations team is seeking a highly skilled and motivated Business Intelligence Engineer to design and run a world class Global Supply Chain business ... the level of their importance. As an Business Intelligence Engineer , you will be working in one of the...metrics and dashboards; build solutions at maximum scale and self- serviceability by users - Recognize and adopt best practices… more
- NVIDIA (Santa Clara, CA)
- …fundamentals of safety & security concept, RAS (Reliability, Availability, and Serviceability ) requirements for safety platforms. (preferred) + Good exposure in ... our best-in-class engineering teams are rapidly growing. If you're a creative and autonomous engineer with a real passion for technology, we want to hear from you.… more