Senior Software Engineer - Site Reliability Engineering
Rewst
This job is no longer accepting applications
See open jobs at Rewst.See open jobs similar to "Senior Software Engineer - Site Reliability Engineering" Sapphire Ventures.About Rewst
Rewst is a venture-backed B2B SaaS startup founded in January 2021. We’ve raised over $50M to date and have hundreds of paying customers.
We’re a low-code workflow automation platform targeted at the Managed Service Provider (MSP) market. To get an idea of how Rewst works and what it does, take a look at our YouTube channel. There you can find training videos from our education team, demos of the product by our ROC (Robotic Operations Center) team, and even customers showcasing how they use Rewst in their business.
We don’t have dedicated managers and we don’t put people in charge who don’t know how to write code themselves. We’re a relatively flat organization that prioritizes talented engineers who want to take charge. We see management, rules, and performance metrics as roadblocks to creativity and innovative thinking. We expect engineers to adhere to a single rule: create value for the organization. We tell you the problem, provide some constraints or example scenarios, and expect you to work with product management and stakeholders to craft the right solution and maximize the value to the business.
About You
As a Senior Site Reliability Engineer (SRE), you will ensure that Rewst’s systems are reliable, scalable, and performing optimally. You’re comfortable identifying system bottlenecks, automating tasks, and implementing best practices to prevent downtime and ensure high availability. You can talk to product stakeholders, record requirements, and ask the right questions where things are ambiguous. You have a passion for maintaining the stability of production systems and improving overall user experience. You are adept at using monitoring tools to foresee issues before they affect the user and have experience in incident management to resolve issues swiftly. You understand the balance between reliability and rapid feature development and are skilled at implementing strategies that align with business needs.
You have excellent skills in managing PostgreSQL databases, ensuring their performance, scalability, and reliability. You possess strong software engineering skills and can contribute to the development of internal tools and scripts to enhance system reliability. You’re a proactive engineer who thrives in environments where you can automate and improve existing processes. You are comfortable leading initiatives that improve system performance and reliability, and you can collaborate effectively with other engineers to achieve these goals. You have experience maintaining, scaling, and making improvements to Kafka and Redis systems. You are driven by results and are not afraid to make decisions that ensure the robustness and reliability of production systems.
Our Tech Stack
• AWS
• Docker
• Kubernetes
• Prometheus
• Grafana
• Elasticsearch
• AWS CDK
• Python
• TypeScript
• PostgreSQL
• Kafka
• Redis
Requirements
• 5+ years of experience as a Site Reliability Engineer, DevOps Engineer, or similar role.
• Strong experience with cloud platforms such as AWS or Azure.
• Proficiency in containerization and orchestration technologies such as Docker and Kubernetes.
• Experience with infrastructure as code tools like AWS CDK.
• Familiarity with monitoring and logging tools such as Prometheus, Grafana, and OpenTelemetry.
• Strong scripting skills in languages such as Python or TypeScript.
• Proven ability to design and implement automated CI/CD pipelines.
• Excellent skills in managing PostgreSQL databases.
• Strong software engineering skills, with the ability to write clean, maintainable code.
• Experience maintaining, scaling, and improving Kafka and Redis systems.
• Ability to debug complex issues in a distributed system and identify root causes.
• Excellent communication skills, able to convey complex technical information to non-technical stakeholders.
• Self-motivated and able to take ownership of projects with minimal oversight.
• Experience with incident management and on-call rotation.
Above and Beyond
• Experience working in a startup environment and excited to tackle new challenges.
• Knowledge or experience with low-code tools or robotic process automation tools.
• Familiarity with advanced backend concepts such as state machines, distributed systems, or network protocols.
• Contributions to open-source projects related to SRE or DevOps.
• Certifications in cloud technologies such as AWS Certified DevOps Engineer or Google Professional Cloud DevOps Engineer.
This job is no longer accepting applications
See open jobs at Rewst.See open jobs similar to "Senior Software Engineer - Site Reliability Engineering" Sapphire Ventures.