Site Reliability Engineering Specialist

BT Security
Birmingham, United Kingdom
Closing date
25 Jun 2024

Why this job matters

Your role as a Site Reliability Engineering Specialist in the Secure Development SRE Team is to manage the implementation, operation and support of one or more Internal Security Tools used by BT.
We are a self-contained SRE team responsible for implementing, running and supporting a diverse range of tools which support the management of our core IT infrastructure. This includes tools which manage network and IT security through cyber threat detection; physical security systems securing our physical estate; compliance, automated software delivery and software discovery tools used by the whole of BT to secure all tools used in the business; and several other IT estate management functions. The team currently supports over 14 applications in an ever-expanding area of cyber and security tools.
This team is built on an ethos of continuous improvement, empowering the whole team to be actively engaged in improving our processes and our performance. Your role will be to drive this change and support the team in an ever-changing and complex arena.

This role can be based in either Birmingham or Manchester and follows a hybrid working pattern.

What you'll be doing
  • Executes the implementation of new software development life cycle automation tools, frameworks, and code pipelines (continuous integration/continuous delivery pipelines) whilst applying best practices with a focus on the re-use of application code; demonstrates consistent software delivery practices and produces continuous integration/continuous delivery platform solutions
  • Executes the implementation of automation technologies to ensure repeatability, eliminate toil, and reduce mean time to detection, resolution and repair of services
  • Proactively identifies and manages risk through regular assessment and diligent execution of controls and mitigations, proactively raising any concerns
  • Leads scale testing to measure, tune and optimise system performance
  • Executes metric/monitoring analysis that creates stability, security, and performance improvements
  • Designs, analyses, develops and troubleshoots highly-distributed large-scale production systems spanning on-prem and cloud-based hosting
  • Executes approaches that scale systems sustainably through mechanisms like automation and evolves systems by pushing for changes that improve reliability and velocity
  • Writes and delivers infrastructure as code software to improve the availability, scalability, latency, and efficiency of services
  • Implements robust monitoring and alerting systems and performs root cause analysis and post-mortems with an eye towards future prevention
  • Inspects queue and support processing to ensure early warning of support issues
  • Executes retrospective and preventive actions after each high severity production incident
  • Analyses complex systems from a reliability and resilience perspective and identifies sources of instability in distributed systems
  • Champions, continuously develops and shares knowledge with the team on emerging trends and changes in site reliability engineering best practices and industry standards
  • Mentors other site reliability engineers, helping to improve the team's abilities by acting as a technical resource
The skills you'll need to succeed
  • Incident Management: Ensures that any incidents affecting the processes and performance of relevant technology services or systems are managed appropriately to mitigate risk and minimise disruption.
  • Infrastructure Configuration: Designs, deploys and maintains highly available and safe networks and applications.
  • Continuous Integration / Deployment: Build, deploy and unit testing stages of the software release process into Production.
  • Service Assurance: Service-level management involving the monitoring and management of the quality of the key performance indicators (KPIs) of a product or service to provide stable and performant applications to end users.
  • Troubleshooting: Applies problem solving methods to repair failed products or processes.
  • Programming / Scripting: Provides automation to ensure repeatability, eliminating toil.
  • System Administration: Knowledge of Windows and Linux System Administration.
  • Project Management: Ability to plan projects, assess risks and opportunities, communicate with stakeholders, troubleshoot problems, and more.
  • Application Performance Monitoring & Alerting: Ensures suitable, modern and proactive monitoring and alerting are in place to raise and mitigate concerns in system performance before users become aware of them.
Experience you'd be expected to have

  • Broad technical experience of programming and scripting, e.g. Bash, Python and PowerShell
  • Broad technical experience across a range of IT infrastructure disciplines (e.g. networks, datacentre infrastructure, operating systems)
  • Experience of Continuous Improvement
  • Strong experience of communicating complex detail to technical and non-technical audiences
  • Working with wider programme/LoB delivery organisations
  • Team leading experience or an interest in leading a team; experience within change transformation is a further advantage
  • Experience in one or more of the following:
      • Application implementation / solution design
      • IT security and compliance
      • Physical Security
      • Datacentre infrastructure
      • Software development
      • Vendor management

At BT, we entertain, educate, and empower millions of people every single day. We're a brand built on connecting people - whether that's friends, family, businesses, or communities. Working here, you'll receive an attractive salary and a range of competitive benefits, but - more than that - you'll be joining an ambitious organisation with a culture of togetherness, collaboration, and inclusivity, that takes a genuine and proactive interest in your progress and development.
  • Competitive salary
  • 10% on target bonus
  • BT Pension scheme, minimum 5% Employee contribution, BT contribution 10%
  • 25 days annual leave (not including bank holidays), increasing with service
  • Huge range of flexible benefits including cycle to work, healthcare, season ticket loan
  • World-class training and development opportunities
  • Option to join BT Shares Saving schemes.
  • Discounted broadband, mobile and TV packages
  • Access to 100s of retail discounts including the BT shop
About us

BT is part of BT Group, along with EE, Openreach, and Plusnet.

Millions of people rely on us every day to help them live their lives, power their businesses, and keep their public services running. We connect friends to family, clients to colleagues, people to possibilities. We keep the wheels of business spinning, and the emergency services responding.

We value diversity and celebrate difference. 'We embed diversity and inclusion into everything that we do. It's fundamental to our purpose: we connect for good.'

We all stick to the same values: Personal, Simple, and Brilliant. From day one, you'll get stuck in to tough challenges, pitch in with ideas, make things happen. But you won't be alone: we'll be there with help and support, learning and development.

This is your chance to make a real difference to the world: to be part of the digital transformation of countless lives and businesses. Grab it.


Although these roles are listed as full-time, if you're in a job share partnership, work reduced hours, or work flexibly in any other way, please still get in touch.


Studies have shown that women and people who are disabled, LGBTQ+, neurodiverse or from ethnic minority backgrounds are less likely to apply for jobs unless they meet every single qualification and criterion. We're committed to building a diverse, inclusive, and authentic workplace where everyone can be their best, so if you're excited about this role but your past experience doesn't align perfectly with every requirement on the Job Description, please apply anyway - you may just be the right candidate for this or other roles in our wider team.
