SubBanner banner image

Senior Site Reliability Engineer - AWS Kubernetes

London, Greater London, South East, England

Apply by 5 Aug 2025

£100000 - £115000 per annum, Benefits: 15% bonus, 7% pension

Job Ref.: BH-51847-1

Job Description

A truly unique opportunity to help launch a brand new team within a global financial services provider. This new team of highly skilled Full Stack Infrastructure Engineers will cover Compute, Storage, Network and Cloud technologies. You will help design, implement, and manage robust infrastructure solutions, ensuring reliability, scalability, and performance.
 
Requirements:
  • Proven experience managing and optimizing a diverse infrastructure stack.
  • Extensive knowledge of cloud platforms (AWS, Azure, GCP) and infrastructure as code (Terraform, CloudFormation).
  • Familiarity of service mesh technologies (Istio, Linkerd).
  • Solid understanding of virtualization (VMware, Hyper-V) and containerization (Docker, Kubernetes) and orchestration.
  • Understanding of storage solutions (SAN, NAS, cloud storage) and backup systems.
  • Strong understanding of network protocols, routing, switching, and firewalls. • Experience with load balancers (F5, HAProxy, Nginx) and network monitoring tools.
  • Experience in DNS management and troubleshooting.
  • Experience in network security best practices.
  • Proficiency in monitoring and observability tools (Prometheus, Grafana, Splunk).
  • Proficiency in at least one scripting language (Python, Bash) for automation.
  • Experience with CI/CD pipeline management and DevOps practices.
  • Strong understanding of disaster recovery and business continuity planning.
  • Experience with performance tuning and capacity planning.
  • Understanding of chaos engineering principles and practices.
  • Skills in cost optimization for cloud infrastructure.


Specific Tools and Techniques:
  • Experience in using cloud native monitoring tools like AWS CloudWatch, Azure Monitor, and Google Cloud Operations Suite.
  • Experience with packet capture tools like Wireshark for troubleshooting network issues.
  • Experience in using traceroute utilities and performance analysis tools like perf for identifying and resolving bottlenecks.
  • Familiarity with tools such as ipconfig/ifconfig for viewing network configurations, flushing DNS, and diagnosing network issues.
  • Experience with SNMP-based tools for network device monitoring and performance management.
  • Experience in using NetFlow for network traffic analysis.
  • Experience with tools like iostat, vmstat, and dstat for monitoring storage and system performance.
  • Experience in tools like df, du, lsblk, and fdisk for managing and troubleshooting file systems and disk partitions.
  • Familiarity with tools like Prometheus and Grafana for monitoring and observability
APPLY NOW

Recent Jobs.

DevOps Engineer
Zürich, Switzerland

Job Title: Adobe Developer– Marketing Technology (MarTech) Location: Zurich Contract Duration: 01.07.2025 – 30.06.2026 (80% workload) Availability: ASAP Rate: Negotiable About the Role: We are seeking

Data Management Consultant
Paris La Defense, Ile De France, France

For our customer in Paris we are seeking a Data Governance Manager/ Data Architect (***FRENCH SPEAKING***) - This is a key hire and an active player fully integrated into the teamA blend of strong tec

FX Software Engineer - C# .Net Multithreading
London, Greater London, South East, England

Software EngineerInitial 6 Month Contract London - 2 days a week onsite £700 per day - Umbrella One of our Investment Bank clients is currently searching for a Software Engineer on an interim basis. Y