Site Reliability Engineer Job at OpenAI, San Francisco, CA

  • OpenAI
  • San Francisco, CA

Job Description

About the Team

Join the engineering teams that bring OpenAI’s ideas safely to the world!

The Applied Engineering team works across research, engineering, product, and design to bring OpenAI’s technology to consumers and businesses. We seek to learn from deployment and distribute the benefits of AI, while ensuring that this powerful tool is used responsibly and safely. Safety is more important to us than unfettered growth.

About the Role

We’re seeking a Site Reliability Engineer with experience in managing systems and infrastructure at scale. You’ll join a nimble team where you’ll help drive deployment of OpenAI’s technology into new environments and infrastructure to enable the critical missions in the public sector. This role engages cross-functionally with internal product, security, and compliance teams to build required functionality and ensure we’re delivering a scalable, reliable platform. The proximity to customers provides a unique opportunity to see the impact of your work first-hand.

This role is based in Washington, D.C. and San Francisco, CA, and requires travel to and work from customer sites.

In this role, you will:

  • Design and build performant, reliable, and scalable infrastructure, both on-premises and in the cloud, for our public sector customers.

  • Administer the systems from the hardware up to Kubernetes, ensuring our teams have a standardized infrastructure on which to deploy OpenAI’s technology.

  • Own the reliability of these systems by being on-site with the customer, utilizing observability tooling, and directly troubleshooting issues that arise as the first line of support.

  • Partner with teams across engineering and security to ensure the product supports the unique needs of the infrastructure and use-cases.

  • Automate routine tasks and standardize our infrastructure offerings to allow our team to scale as we continue to grow.

  • Partner with teams across the business, including engineering, security, and compliance, to enable our products to work within the unique constraints of new environments.

You might thrive in this role if you:

  • Hold an active US security clearance

  • Have 5+ years of experience operating infrastructure and systems at scale

  • Have worked out of secure environments, collaborating closely with both on-site clients and remote colleagues

  • Have hands-on experience with containers (Docker) and orchestration platforms (Kubernetes)

  • Have scripting experience with Python or an equivalent language for automating routine tasks

  • Own problems end-to-end and are willing to pick up whatever knowledge you're missing to get the job done, so that both your team and our customers succeed

  • Have strong troubleshooting skills across the entire stack (infrastructure, systems, and applications)

  • Thrive in dynamic environments and can navigate ambiguity with ease.

About OpenAI

OpenAI is an AI research and deployment company dedicated to ensuring that general-purpose artificial intelligence benefits all of humanity. We push the boundaries of the capabilities of AI systems and seek to safely deploy them to the world through our products. AI is an extremely powerful tool that must be created with safety and human needs at its core, and to achieve our mission, we must encompass and value the many different perspectives, voices, and experiences that form the full spectrum of humanity. 

We are an equal opportunity employer and do not discriminate on the basis of race, religion, national origin, gender, sexual orientation, age, veteran status, disability or any other legally protected status. 

For US Based Candidates: Pursuant to the San Francisco Fair Chance Ordinance, we will consider qualified applicants with arrest and conviction records.

We are committed to providing reasonable accommodations to applicants with disabilities, and requests can be made via this link.

At OpenAI, we believe artificial intelligence has the potential to help people solve immense global challenges, and we want the upside of AI to be widely shared. Join us in shaping the future of technology.

Compensation

$279K – $385K

The base pay offered may vary depending on multiple individualized factors, including market location, job-related knowledge, skills, and experience.

In addition to the salary range listed above, total compensation also includes generous equity and benefits.

  • Medical, dental, and vision insurance for you and your family

  • Mental health and wellness support

  • 401(k) plan with 50% matching

  • Unlimited time off and 13 company holidays per year

  • Paid parental leave (24 weeks paid birth-parent leave & 20-week paid parental leave) and family-planning support

  • Annual learning & development stipend ($1,500 per year)
