Infrastructure Engineer

Replicate, United States

Date Posted

16 Nov, 2024

Work Location

California, United States

Salary Offered

$200,000 to $279,975 yearly

Job Type

Full Time

Experience Required

3+ years

Remote Work

Allowed

Stock Options

Vacancies

1 available

At Replicate, we believe AI shouldn’t be exclusive to tech giants — it should be accessible to every software developer. Our goal is straightforward: build the best platform for creating, deploying, and running machine learning models. As an Infrastructure Engineer on the Platform team, you’ll play a key role in making generative AI available to everyone.

The Platform team at Replicate oversees the entire lifecycle of models, from packaging and deployment to serving, scaling, and monitoring. You’ll be developing the infrastructure that supports thousands of models and powers millions of predictions daily. This is a chance to build something truly innovative, where each decision you make has a tangible impact and allows your creativity to shine.

What you’ll be doing:

Designing and building our deployment and model-serving platform.
Building technology to operate the latest advancements in the ML and AI space.
Designing systems to maximize the utilization and reliability of our Kubernetes clusters and GPUs, including multi-regional traffic shifting and failover capabilities.
Owning and optimizing fair and reliable task allocation and queuing across a diverse set of customers with heterogeneous workloads.
Working with our Models team to speed up model inference through techniques like caching, weights management, machine configurations, and runtime optimizations in Python and PyTorch.
Working with technologies such as:
- Python, Go, and Node.js
- Kubernetes and Terraform
- Redis, Google BigQuery, and PostgreSQL

We're looking for the right person, not just someone who checks boxes, but it’s likely you have…

Experience building platforms at scale.
Worked in complex systems with many moving parts; you have opinions on monoliths vs. services.
Designed and implemented developer-friendly APIs to enable scalable and reliable integration.
Hands-on experience setting up and operating Kubernetes.
A passion for building tools that empower developers.
Strong communication and collaboration skills, with the ability to understand customer needs and distill complex topics into clear, actionable insights. We believe that programming is about more than writing code; building a platform requires a collaborative approach.
At least 3 years of full-time software engineering experience.

These aren’t hard requirements, but we definitely want to talk with you if…

You have worked on machine learning platform teams in the past.
You have experience working with or on teams that have put ML/AI into production, even though this role does not entail building ML models directly.
You have some exposure to serving Generative AI features where GPUs are costly commodities and workloads can take significant time to finish.

This role can be remote (anywhere in the United States) or in-person. We have a preference for timezones closer to PST. If possible, we like people to come into our San Francisco office at least 3 days a week.

About Replicate

Run machine learning models in the cloud

Company Size: 11 - 50 People

Year Founded: 2019

Country: United States
