Sapphire Ventures
Partnering with expansion-stage, enterprise software companies that we believe can become category leaders.
Sapphire Partners
Limited partner investing in exceptional early-stage venture fund managers.
Sapphire Sport
Partnering with early-stage companies at the nexus of technology and culture.
Menu close
Sapphire
hero

Portfolio Jobs

Looking for your next role? Take a look at these exciting jobs at Sapphire Ventures’ portfolio companies. Our Talent team is passionate about connecting you to your dream job!

Senior Software Engineer - Site Reliability Engineering

Rewst

Rewst

Software Engineering
Posted on Thursday, August 15, 2024

About Rewst

Rewst is a venture-backed B2B SaaS startup founded in January 2021. We’ve raised over $50M to date and have hundreds of paying customers.

We’re a low-code workflow automation platform targeted at the Managed Service Provider (MSP) market. To get an idea of how Rewst works and what it does, take a look at our YouTube channel. There you can find training videos from our education team, demos of the product by our ROC (Robotic Operations Center) team, and even customers showcasing how they use Rewst in their business.

We don’t have dedicated managers and we don’t put people in charge who don’t know how to write code themselves. We’re a relatively flat organization that prioritizes talented engineers who want to take charge. We see management, rules, and performance metrics as roadblocks to creativity and innovative thinking. We expect engineers to adhere to a single rule: create value for the organization. We tell you the problem, provide some constraints or example scenarios, and expect you to work with product management and stakeholders to craft the right solution and maximize the value to the business.

About You

As a Senior Site Reliability Engineer (SRE), you will ensure that Rewst’s systems are reliable, scalable, and performing optimally. You’re comfortable identifying system bottlenecks, automating tasks, and implementing best practices to prevent downtime and ensure high availability. You can talk to product stakeholders, record requirements, and ask the right questions where things are ambiguous. You have a passion for maintaining the stability of production systems and improving overall user experience. You are adept at using monitoring tools to foresee issues before they affect the user and have experience in incident management to resolve issues swiftly. You understand the balance between reliability and rapid feature development and are skilled at implementing strategies that align with business needs.

You have excellent skills in managing PostgreSQL databases, ensuring their performance, scalability, and reliability. You possess strong software engineering skills and can contribute to the development of internal tools and scripts to enhance system reliability. You’re a proactive engineer who thrives in environments where you can automate and improve existing processes. You are comfortable leading initiatives that improve system performance and reliability, and you can collaborate effectively with other engineers to achieve these goals. You have experience maintaining, scaling, and making improvements to Kafka and Redis systems. You are driven by results and are not afraid to make decisions that ensure the robustness and reliability of production systems.

Our Tech Stack

AWS

Docker

Kubernetes

Prometheus

Grafana

Elasticsearch

AWS CDK

Python

TypeScript

PostgreSQL

Kafka

Redis

Requirements

5+ years of experience as a Site Reliability Engineer, DevOps Engineer, or similar role.

Strong experience with cloud platforms such as AWS or Azure.

Proficiency in containerization and orchestration technologies such as Docker and Kubernetes.

Experience with infrastructure as code tools like AWS CDK.

Familiarity with monitoring and logging tools such as Prometheus, Grafana, and OpenTelemetry.

Strong scripting skills in languages such as Python or TypeScript.

Proven ability to design and implement automated CI/CD pipelines.

Excellent skills in managing PostgreSQL databases.

Strong software engineering skills, with the ability to write clean, maintainable code.

Experience maintaining, scaling, and improving Kafka and Redis systems.

Ability to debug complex issues in a distributed system and identify root causes.

Excellent communication skills, able to convey complex technical information to non-technical stakeholders.

Self-motivated and able to take ownership of projects with minimal oversight.

Experience with incident management and on-call rotation.

Above and Beyond

Experience working in a startup environment and excited to tackle new challenges.

Knowledge or experience with low-code tools or robotic process automation tools.

Familiarity with advanced backend concepts such as state machines, distributed systems, or network protocols.

Contributions to open-source projects related to SRE or DevOps.

Certifications in cloud technologies such as AWS Certified DevOps Engineer or Google Professional Cloud DevOps Engineer.