Infrastructure Engineer

Cerebrium Logo Cerebrium United States

Date Posted

04 Apr, 2023

Work Location

New York, United States

Salary Offered

$100 — $140 yearly

Job Type

Full Time

Experience Required

3+ years

Remote Work

Allowed

Stock Options

Yes

Vacancies

1 available

Cerebrium is a machine learning platform that makes it easy for companies to fine-tune, deploy and monitor ML models in production. We abstract away the complexity that comes with managing ML infrastructure and the need to stay up to date with the latest ML research and technologies.

As an Infrastructure Engineer, you'll work closely with our team to build and operate our infrastructure at scale. You'll help us optimize our model deployment pipeline, implement GPU sharing and time-slicing on top of K8’s, implement parallism and implement scale this across 1000’s of machines to ensure our models and fine-tuning jobs run smoothly.

The ideal candidate has experience building and scaling infrastructure at a large scale, expertise in Kubernetes and serverless architectures, and excellent communication skills. You should be able to communicate complex topics clearly and work closely with customers - typically they are all machine learing engineers and software engineers.

How we work:

We focus on output. We don’t care what hours you work or from where you work. Just do what it takes to meet your weekly sprint. Finished early or just not having a good day - take the day.
We have a flat structure and want to constantly to be challenged by you. In terms of product and company decisions.
We ship multiple times a week - every time we can add value to the customer we ship it. Also you do weekly demos to team members on what you have been building.

Responsibilities:

Build and operate our infrastructure at scale
Optimize and deploy machine learning models and training jobs at scale
Improve GPU-sharing and time-slicing capability

Qualifications:

Experience building and scaling infrastructure at a large scale
Expertise in Kubernetes and serverless architectures
Experience with Python or Go
Experience with Infrastructure as code such as Terraform

We offer a remote, international work environment and only require 4 hours overlap with the team daily (9-18pm EST). If you're passionate about machine learning and are interested in joining a team dedicated to making it faster and easier to deploy machine learning models, please apply with your resume and cover letter.

Benefits

Competitive salary and meaningful equity
Flexible work environment – work remotely from home or from a WeWork which we sponsor
Health, dental, and vision benefits with 80% coverage for you
Unlimited PTO
Opportunities to speak and participate in events across the Cloud Native community
2-3 company off-sites a year. We have previously done Tulum and Greece.
Learning budget, and much more

About Cerebrium

Serverless Infrastructure Platform for AI

Company Size: 1 - 5 People

Year Founded: 2021

Country: United States

Share This Job

More Full Time Jobs

Social Media Manager

Bengaluru, India

Full Time

$600000 - $800000 yearly

SDR / Sr. SDR - APAC + India

Singapore, Singapore

Full Time

$400 - $1000 yearly

Senior Product Designer

Jakarta, Indonesia

Full Time

$3000 - $6000 yearly

Software Engineer

New York

Full Time

$110000 - $125000 yearly

Customer Data Enablement Manager

Toronto, Canada

Full Time

$75000 - $130000 yearly

More Companies Hiring

BEAMSTART is a global entrepreneurship community, serving as a catalyst for innovation and collaboration. With a mission to empower entrepreneurs, we offer exclusive deals with savings totaling over $1,000,000, curated news, events, and a vast investor database. Through our portal, we aim to foster a supportive ecosystem where like-minded individuals can connect and create opportunities for growth and success.

Our Company

Home

Jobs

Investors

Members