The Site Reliability Engineer (SRE) is a critical role that will have a significant impact on our CData Connect Cloud product. We’re looking for someone who is excited about taking ownership of improving the existing infrastructure, designing the future of CData Cloud, and working with a diverse team. Attention to detail and eagerness to learn new technologies and systems are critical to success in this role.
Responsibilities include but are not limited to:
Define and help implement infrastructure improvements for CData Connect Cloud
Support and contribute improvements to the availability, scalability, latency, and efficiency of CData Connect Cloud
Define and measure production availability, and navigate known downtime and service-level outages
Increase product delivery velocity
Debug problems at scale for our mission-critical services and help our development teams implement lasting fixes to recurring issues
Execute, debug, and configure CI/CD pipelines
Analyze service requests and take appropriate action within defined SLAs
Define and implement monitoring metrics and alerts to ensure tools and environments meet SLAs for uptime and performance
Advance Infrastructure-as-Code and GitOps for the Cloud product team
Qualifications:
B.S. degree in Computer Science or a related technical field (e.g., EE, physics, or mathematics), or equivalent practical experience, including 5+ years of professional software development experience
Deep understanding of Linux and containerization
Experience with Kubernetes both as a developer and from an operations perspective
2+ years of experience working with public cloud infrastructure (Azure preferred)
Experience deploying and operating applications (Java or C# preferred)
Experience with GitOps-based workflows
Infrastructure-as-code experience with Terraform
Database experience (SQL Server preferred)
Experience with development practices and tools (JIRA, Git, Azure DevOps)
Experience with messaging systems and APIs
Hands-on experience with a variety of SRE tools and techniques
Working knowledge of networking (e.g., firewalls, routing, network topologies)
Benefits:
11 Paid Holidays
20 Days of PTO
Employer-paid Medical, Dental, and Vision plans (for employee only)
HSA with Company Contribution
Employee Assistance Program
401(k) with 6% Immediately Vested Company Match
Professional development opportunities