Site Reliability Engineering Professional role at IBM in Durham

IBM in Durham is hiring a Site Reliability Engineering Professional


This job might already be filled.

At IBM, work is more than a job – it’s a calling: To build. To design. To code. To consult. To think along with clients and sell. To make markets. To invent. To collaborate. Not just to do something better, but to attempt things you’ve never thought possible. Are you ready to lead in this new era of technology and solve some of the world’s most challenging problems? If so, lets talk.

Your Role and Responsibilities
IBM’s CIO Network Engineering Team has an exciting opportunity for a Networking Site Reliability Engineer for our complex, global corporate network. This role will be joining a collaborative, cross-organization environment with a cooperative “whatever-it takes” approach by each team member. Networking Site Reliability Engineers will focus on Availability, Performance, Automation, Efficiency & Change Management. This is a hands-on role with ability to work with SMEs across all networking product disciplines to drive stability and reliability concepts into every segment of our network with the goal to maintain the very best IBMer user experience. This role requires development skills to technically contribute to the quality of our network deployments and inspire both team members and product teams to embrace SRE concepts and disciplines to create a strong and successful culture. Strong candidates will have established experience in software development, networking, problem solving skills. Ideal candidates will have worked in both a software development and network engineering capacity.Responsibilities

  • Involvement in every facet of product segment support — from the earliest stages of influencing product architecture, design and development to deployment, troubleshooting, and performance analysis – to ensure a reliable quality product in production.
  • Ability to collaborate and communicate clearly on status and progress.
  • Design and build tools and automation to manage a rapidly growing number of networking devices and services.
  • Take initiative to do what must be done in order to keep critical network product segments operating.
  • When required, perform general OS updates/patches, networking/server/database configuration changes, installs and automation.
  • Participate in periodic on-call rotation in a 7X24, follow the sun environment.

Required Technical and Professional Expertise

  • 3-5 years of experience in application development
  • 3-5 years of demonstrated experience with Python, BASH or other scripting languages
  • Demonstrated and proven understanding of enterprise network architecture principles and practices
  • Experience using Linux and/or having Linux administration skills
  • Experience with deployment and integration of monitoring tools (e.g. Dynatrace, NewRelic, Instana, SevOne, etc…)
  • Experience using and coding to API frameworks
  • Experience with Database technologies (SQL and NoSQL)
  • Experience with code repository and revision control systems (e.g GitHub)
  • Exposure to Web Technologies (JSON/XML, HTML/CSS, Web Services)
  • Exposure to identity integration technologies such as LDAP, SAML/SSO

Preferred Technical and Professional Expertise

  • Network Certifications (e.g. CCNA, CCNP, CCIE, etc…)
  • Familiarity with full software development life cycle (SDLC): Analysis, Design, Coding, Testing, Deploying, Training, Maintaining and Operational Support.
  • Previous experience working in Agile concepts and methodologies