- Pantera Capital (Palo Alto, CA)
- …able to concisely and accurately share knowledge with their teammates. About the role As a Site Reliability Storage Engineer , you will play a pivotal ... access for xAI researchers working on advanced AI models. Enhance the reliability , performance, and cost-effectiveness of xAI's storage infrastructure to support… more
- Pantera Capital (Palo Alto, CA)
- …knowledge with their teammates. About the Role We are seeking a highly skilled Senior Site Reliability Storage Engineer to join our mission-driven team, ... CA, with up to 25% travel required. Required Qualifications 5+ years of experience as a Site Reliability Engineer or similar role, with a focus on building… more
- Pantera Capital (Palo Alto, CA)
- A cutting-edge technology company in California seeks a Site Reliability Storage Engineer to design and operate scalable storage systems for AI ... research. The ideal candidate will have strong programming skills in Rust or Go, and experience with IaC tools and Kubernetes. This role offers a competitive salary ranging from $180,000 to $440,000 and comprehensive benefits including equity and health… more
- Pantera Capital (Palo Alto, CA)
- A leading tech firm in Palo Alto is seeking a Senior Site Reliability Storage Engineer to design and optimize Kubernetes clusters. The ideal candidate ... Kubernetes orchestration and distributed systems. Responsibilities include managing system reliability , developing software for cluster provisioning, and enhancing infrastructure… more
- Apple Inc. (Seattle, WA)
- …company in Seattle is looking for an experienced Site Reliability Engineer to join their Block Storage SRE team. This role involves significant ... responsibility in shaping the platform for critical Apple Cloud services. You will handle unique challenges involving large datasets and utilize your skills in programming, Linux systems, and cloud environments. The position offers a competitive salary ranging… more
- The Aerospace Corporation (El Segundo, CA)
- …Summary The Aerospace Corporation is seeking a High-Performance Computing (HPC) Data Storage Engineer ( Site Reliability Engineer Staff III/IV ) to ... join Computational Services team that oversees HPC storage for the Aerospace Corporation.The team manages several HPC...You Need to be Successful Minimum Requirements for the Site Reliability Engineer Staff III… more
- Apple Inc. (Seattle, WA)
- …art and technology. Join Apple Services Engineering Cloud Service Infrastructure team, as a Site Reliability Engineer , to help support and scale cloud ... and frameworks which provide and support services like structured and unstructured storage , caching, queueing, searching, and much more at hyperscale. These form the… more
- harvey.ai (San Francisco, CA)
- …is being written today - and we're just getting started. Role Overview As a Software Engineer on the Site Reliability team at Harvey, you will ensure the ... , and functionality. What You Have 5+ years of experience in Site Reliability Engineering or similar roles supporting production environments Expertise… more
- Crusoe Energy Systems LLC (San Francisco, CA)
- …platform - and operational excellence is at the heart of that mission. As a Site Reliability Engineer focused on Operational Excellence, you will help ensure ... want to grow their technical career while supporting incident response, reliability , and continuous improvement across a large-scale distributed platform. You'll… more
- Internetwork Expert (Carlsbad, CA)
- The Senior Site Reliability Engineer (SRE) will be responsible for ensuring the availability, performance, scalability, and operational efficiency of the ... with disabilities to perform the essential functions. 5+ years of experience in Site Reliability Engineering, systems engineering, or DevOps roles. Expertise in… more
- Blueface Ltd (Chicago, IL)
- …the Global Operation team, you will be responsible for ensuring the reliability , scalability, and performance of Freewheel systems. Working closely with engineers ... and other operation sub-teams, you will manage infrastructure, optimize system reliability , automate daily operations, and resolve technical issues that impact… more
- Comcast (Chicago, IL)
- …the Global Operation team, you will be responsible for ensuring the reliability , scalability, and performance of Freewheel systems. Working closely with engineers ... and other operation sub-teams, you will manage infrastructure, optimize system reliability , automate daily operations, and resolve technical issues that impact… more
- CATHEXIS (Honolulu, HI)
- …empower our employees to create innovative and trusted results. We are looking for a dynamic Site Reliability Engineer (SRE) to join our team at Joint Base ... Pearl Harbor-Hickam. The Site Reliability Engineer (SRE) will...Python, Java, C/C++, Ruby, and JavaScript Experience with distributed storage technologies such as NFS, HDFS, Ceph, and Amazon… more
- SPACE EXPLORATION TECHNOLOGIES CORP (Redmond, WA)
- …this possible, with the ultimate goal of enabling human life on Mars. SR. LINUX SITE RELIABILITY ENGINEER SpaceX is looking for an experienced engineer ... work extended hours and weekends as needed. COMPENSATION AND BENEFITS Pay Range: Site Reliability Engineer /Senior: $160,000.00 - $220,000.00/year Your actual… more
- Crusoe Energy Systems LLC (San Francisco, CA)
- A technology firm based in California is seeking a Site Reliability Engineer to optimize their AI-optimized cloud infrastructure. The role involves building ... automation tools, driving reliability initiatives, and collaborating... initiatives, and collaborating with engineers to ensure high-performance storage systems. Candidates should have strong experience in SRE,… more
- Federated IT (Washington, DC)
- …understand both the technology and the mission. About the Role As the Lead Site Reliability Engineer for our ComputeBridge Engagement, you'll be responsible ... program as the contract grows What You'll Bring 3+ years of experience in site reliability , systems engineering, or hardware operations roles Deep expertise with… more
- Steampunk (Mclean, VA)
- Overview As a Site Reliability Engineer ,...more clouds (ie AWS, Azure, or GCP) and distributed storage technologies 5 Years of Experience with Git SCM ... Act as an individual contributor and mentor more junior team members Engineer and implement solutions and provide recommendations for continuous improvement for the… more
- Boson AI (Palo Alto, CA)
- About The Role We're looking for a Senior Site Reliability Engineer to help us run one of the most exciting GPU clusters around-our Toronto datacenter packed ... with NVIDIA H100 and A100 GPUs, over 20PB of Ceph storage , terabit networking, and hundreds of servers. You'll be hands‑on with the full lifecycle of HPC… more
- Hive (Seattle, WA)
- …to commercialize our machine learning models, we also need to grow our DevOps and Site Reliability team to maintain the reliability of our enterprise SaaS ... - Jenkins Network hardware - Arista/Cisco/Fortinet Hardware - HP/SuperMicro Storage - Ceph, S3 Database - Scylla, Postgres, Pivotal.../ IP, ICMP, SSH, DNS, HTTP, SSL / TLS, Storage systems, RAID, distributed file systems, NFS / iSCSI… more
- OpenAI (San Francisco, CA)
- …will work at the intersection of hardware and software, where speed and reliability are critical. Expect to manage fast-moving operations, quickly diagnose and fix ... Qualifications Experience as an infrastructure, systems, or distributed systems engineer in large-scale or high-availability environments Strong knowledge of… more