- NVIDIA (Santa Clara, CA)
- NVIDIA is seeking hardworking and creative Memory System Resiliency Architect to join our Memory Subsystem team. At Nvidia, we have crafted a team of ... with design, product, and quality teams to ensure the memory system is resilient to errors. This...team, you will collaborate with DRAM vendors, Product engineering, System resiliency , DFT and Quality teams to… more
- NVIDIA (Santa Clara, CA)
- We are now looking for a Senior Resiliency and Safety Architect! NVIDIA is a learning machine that constantly evolves by seeking exciting opportunities that ... definition - Clocks, Resets, Boot Sequence, Power Management, Interrupts, Memory Controller, Virtualization, Security, System Performance, IO technologies,… more
- NVIDIA (Santa Clara, CA)
- We are now seeking a Senior Resiliency and Safety Architect! NVIDIA is a learning machine that constantly evolves by seeking exciting opportunities that matter ... + Run simulations to analyze Architectural Vulnerability Factor and Liveness of on-die memory + Develop diagnostics software components for Resiliency and Safety… more
- NVIDIA (Santa Clara, CA)
- …designing system architectures tailored for AI, covering CPU, GPU, memory , storage, and networking. + Hands-on involvement in the entire lifecycle-from design ... We are seeking a Distinguished Engineer to lead AI Resiliency at NVIDIA! Join NVIDIA and help push the...and leadership across organizations, with direct exposure to NVIDIA's senior leadership. What You'll Be Doing: + Define a… more
- NVIDIA (Santa Clara, CA)
- … system architecture, microprocessor, and microcontroller fundamentals (caches, buses, memory controllers, DMA, etc.) + Strong Operating systems fundamentals ... Background in solving problems that apply to large complex systems deployed at scale. + Testing and Validating drivers...system -level debugging is invaluable + Experience working on system level reliability and resiliency features. +… more
- NVIDIA (Santa Clara, CA)
- …for degraded mode operation of the system per datacenter requirements and improve resiliency of a GPU based systems + Identify gaps in platform debuggability ... GPU based AI server with focus on PCIe architecture, system engineering, software/firmware changes as per processor & I/O...I/O bus (PCIe, etc.) and CPU. + Architecting complex systems , I/O error handling from PCIe & other I/O… more
- NVIDIA (Santa Clara, CA)
- …with computer system architectures, SoC fundamentals (eg, caches, buses, memory controllers, debug), OS architectures, and networking systems (eg, ethernet, ... are not only operating at the highest performance level, but also achieving high resiliency to cyber attacks. The Drive OS Automotive Software team is looking for a… more