£65K/yr to £85K/yr
England, United Kingdom
Permanent, Variable

Site Reliability Engineer

Posted by Twinstream Limited.

SITE RELIABILITY ENGINEER / CHELTENHAM (REMOTE) / UP TO £85K & GREAT BENEFTIS

Fantastic new opportunity for an experienced Site Reliability Engineer to join a dynamic organisation with ambitious growth plans. Excellent pay & extensive benefits package.

Remote working supported until DV Clearance is obtained, then onsite 4/5 days in Cheltenham.

In 2019, our founders were working as engineers solving complex cross domain problems in defence and security organisations.

TwinStream was formed to consolidate their collective expertise and experience into one business, providing technical excellence and exceptional service to their clients. The business is headquartered in Cheltenham with teams working both on-site with clients and remotely from home.

As we continue to expand our operations, we are looking for an experienced Site Reliability Engineer to join a new team that will deploy and maintain our established cross-domain system for a customer.

This is the perfect role for an experienced Engineer who is comfortable working in a managed serviced environment and wants to gain more experience with best-of-breed DevOps tools and techniques.

What's on Offer?

  • Highly competitive salary of £65,000 to £85,000 (depending on experience).
  • Pension - 8% pension plan.
  • Learning and Development - A £1,000 training budget to help with your development.
  • 25 days' holiday plus bank holidays.
  • Every quarter, we hold a meeting involving all team members from TwinStream - this allows us to meet up, chat about all things TwinStream, and enjoy team building and company updates.
  • Christmas and summer parties to celebrate our successes.
  • Opportunity to lease an electric vehicle via salary sacrifice
  • Health and Well-being - Access to workplace Mental Health First Aider
  • Life assurance

Key Responsibilities of the Site Reliability Engineer:

  • Collaborate with Feature Development teams to promote new component versions into production as efficiently as possible.
  • Maintain the system to agreed service level and availability objectives using real-time monitoring tools and system generated metrics.
  • Instrumentation of new system metrics and alerts to pre-empt issues and improve performance.
  • Respond to monitoring alerts and customer incidents, taking preventative/remedial action to minimise customer impact.
  • Liaising with key customer stakeholders to schedule capability changes and capture new service requirements as they arise.
  • Apply automation techniques to reduce manual operations burden.

Skills & Experience:

  • Must be eligible and willing to undergo DV clearance.
  • Experience in infrastructure automation tools (CloudFormation, Terraform or Ansible)
  • Experience working with docker containers & container orchestration tools (such as Kubernetes, OpenShift or Docker Swarm)
  • Experience using and maintaining CI / CD tools (such as Jenkins or GitHub actions)
  • Good understanding of relational databases and SQL
  • Linux command line, administration and shell scripting
  • Solid understanding of monitoring, auto-scaling, performance tuning, troubleshooting and disaster recovery best practices
  • Working knowledge of network security protocols
  • Working knowledge of AWS
  • Experience with monitoring tools such as InfluxDB, Prometheus or Grafana

What's Next?

If you have the passion and skills to be successful in this Site Reliability Engineer position, we would love to hear from you. APPLY NOW for immediate consideration.

Further Information:

Due to the industries in which some of our clients work, and to comply with their requirements, any offer would be conditional on achieving satisfactory Baseline Personnel Security Standard (BPSS) screening results, and on subsequently achieving and retaining Developed Vetting (DV) clearance.