Senior Site Reliability Engineer

OneSignal Logo OneSignal United States

Date Posted

31 Oct, 2022

Work Location

San Mateo, CA, United States

Salary Offered

$110000 — $135000 yearly

Job Type

Full Time

Experience Required

1+ years

Remote Work

Allowed

Stock Options

Vacancies

1 available

OneSignal is a leading omnichannel customer engagement solution, powering personalized customer journeys across mobile and web push notifications, in-app messaging, SMS, and email. On a mission to democratize engagement, we enable over a million businesses to keep their users - readers, fans, players and shoppers - engaged and up to date by delivering 10 billion messages daily.

1 in 7 new apps launches using OneSignal! We support companies in 140 countries, including Zynga, USA Today, Bitcoin.com, Upwork, Tribune, and many more - from startups and small businesses just getting off the ground to established companies communicating with millions of customers.

We’re venture-backed by SignalFire, Rakuten Ventures, Y Combinator, HubSpot, and BAM Elevate (read more about our recent Series C!). We’re a remote-first company, offering remote work as the default option in the United States in California, New York, Pennsylvania, and Texas, as well as in the UK and Singapore - with plans to expand the locations we support in the future. We also have offices in San Mateo, California, New York City, and London, UK.

OneSignal has a lot of the great tech startup qualities you'd expect, but we don't stop there. Our massive scale and small team, emphasis on healthy life balance and kindness in all our interactions, and focus on ownership and personal growth make OneSignal a uniquely great place to work.

About The Team:

We have grown rapidly to where we are today serving billions of HTTP requests daily and sending upwards of over 10 billion messages daily. We achieved this scale by writing scale-sensitive components in languages like Rust and Go. This potent combination of high performance with efficient resource utilization has given us an incredible competitive edge.

With our UK partner Elements Global Services, we are hiring SREs to help us continue to scale by operating and engineering the future of our infrastructure. We are maintaining 99.95% uptime today, and we are investing to ensure we maintain that as the business continues to grow and as the product evolves.

Your primary task will be software engineering with a focus on infrastructure, operations, and automation. You'll be building systems to run our product, improving internal services, and advising product teams on architecture as it relates to the operability of the service.

The systems you'll be responsible for include all of the services which power our product. This ranges from off-the-shelf services like haproxy, nginx, Redis, PostgreSQL, Kafka, Kubernetes, etc. to our in-house services such as the Rails web app, various Rust backend services, and our high-performance API layer written in Go.

You'll be working with Kubernetes to automate our data center operations and writing operational services to automate database operations. One of the key challenges in this role is to not only understand systems to the point of being able to manually operate by hand but also to understand in sufficient detail to write software systems to automate such operations.

Our blog contains more information about the OneSignal Engineering career ladder, compensation model, remote-first culture, and our diverse team. Our salary bands are available on AngelList.

In a typical month, a Senior Site Reliability Engineer at OneSignal might:

Improve our CI/CD pipeline to improve deploy performance
Develop new tools to enable other developers to better spend their time
Add new code to the system to enable messaging users on a new platform
Help evaluate a new storage technology to further scale our stack
Provision and configure new hardware
Investigate network issues
Improve application and infrastructure monitoring

What you'll bring:

At least 4 years SRE experience
Experience operating reliable production systems at scale
Knowledge of Linux systems internals
Desire and ability to automate tasks
Experience with PostgreSQL
Operational experience deploying and managing Kubernetes
Experience working with Cloud Providers (AWS/GCP/Azure)

We value a variety of experiences, and these are not required. It would be an added bonus if you have experience in any of the following:

Recent experience writing Go and/or Rust
Experience working with Layers 1-3 of the OSI networking model
Experience with any of Redis, Kafka, etcd, ZooKeeper, nginx, haproxy

In keeping with our beliefs and goals, no employee or applicant will face discrimination/harassment based on: race, color, ancestry, national origin, religion, age, gender, marital domestic partner status, sexual orientation, gender identity, disability status, or veteran status. Above and beyond discrimination/harassment based on 'protected categories,' we also strive to prevent other, subtler forms of inappropriate behavior (e.g., stereotyping) from ever gaining a foothold in our office. Whether blatant or hidden, barriers to success have no place in our workplace.

About OneSignal

Engage customers through personalized omni-channel messaging

Company Size: 51 - 250 People

Year Founded: 2011

Country: United States

Senior Site Reliability Engineer

About OneSignal

Engage customers through personalized omni-channel messaging

Share This Job

More Full Time Jobs

Growth Marketing Manager

Data Engineer / ML/NLP Engineer

Founding Product Manager

Software Engineer - Frontend

Finance Manager

More Companies Hiring

Remo

Persana AI

Garage

Rio

Pythagora (GPT Pilot)

Our Company