Pyramid Systems, Inc.

Senior Site Reliability Engineer

Job Locations US-DC
Posted Date 2 weeks ago (9/14/2022 9:31 PM)
Job ID 2022-1832
# of Openings 1
Category Software Engineering

Overview

Looking for that next challenge? Do you thrive in a fast-paced, challenging environment?

Pyramid Systems is seeking a Senior Site Reliability Engineer who is responsible for ensuring the availability, performance, and security of company websites and services. They work closely with developers and other engineers to identify and resolve issues that may impact website or service availability.

The individual will also monitor website and service performance and implement changes to improve reliability. In addition, senior site reliability engineers are responsible for developing and maintaining company website and service security policies and procedures.

Responsibilities

  • Spearhead the SRE patterns and practices in our team
  • Ensure that we meet the SLAs and SLOs agreed with our clients, and track the SLIs that measure them
  • Document every action so your findings turn into repeatable actions and then into automation.
  • Improve operational processes (such as deployments and upgrades) to make them as boring as possible.
  • Design, build and maintain core infrastructure that enables HUD’s services to scale to support hundreds of thousands of concurrent users.
  • Debug production issues across services and levels of the stack.
  • Plan the growth of HUD’s AWS infrastructure.
  • Join an on-call rotation to respond to incidents that impact client availability, and support service engineers with customer incidents.
  • Use your on-call shift to prevent incidents from ever happening.
  • Run our infrastructure with AWS Cloud, Terraform and GitLab CI/CD.
  • Build monitoring that alerts on symptoms rather than on outages.

Qualifications

  • 3+ years of experience managing cloud or virtualized infrastructure
  • Hands-on experience in architecting and operationalizing multi-tier production stacks
  • Passionately practices infrastructure automation using Terraform, Ansible, or similar
  • Knowledge of and familiarity with Docker and Kubernetes
  • Fluent in at least one programming language such as Ruby, Python, Go, or similar
  • Strong Linux system administration experience and fluency in Bash
  • Practices sound security engineering principles and is familiar with NIST 800-53 controls
  • Lives and breathes pipeline automation using AWS DevOps, Jenkins, or similar
  • Strong team player who communicates well and carries an infectious can-do attitude

Education

  • Bachelor’s Degree in Computer Science, Engineering, or related field
