IAM Governance Job at Openkyber, Georgia

SHhzdGRSMUxlRkliS2lJSDFHSmVZMkh0SFE9PQ==
  • Openkyber
  • Georgia

Job Description

Description We are seeking a highly experienced Site Reliability Engineering (SRE) Senior Architect to lead the design, implementation, and optimization of large-scale, highly available, and secure production systems. The ideal candidate will bring strong expertise in architecting SRE frameworks, modern observability platforms, DevOps automation, cloud infrastructure, reliability engineering, and incident management across distributed systems. This role requires hands-on technical leadership, architectural vision, and the ability to collaborate with cross-functional engineering, platform, and security teams to drive service reliability, performance, and operational excellence.

Key Responsibilities SRE Architecture & Reliability Engineering:
  • Define and drive the overall SRE architecture, standards, best practices, and governance.
  • Design highly scalable, resilient, and fault-tolerant infrastructure and application architectures.
  • Lead the implementation of SLOs, SLIs, error budgets, and reliability KPIs across critical services.
  • Build strategies for system reliability automation and proactive production hardening.
Observability, Monitoring & Incident Management:
  • Architect and implement enterprise-level observability platforms (metrics, logs, traces, events).
  • Reduce alert fatigue by ensuring actionable alerting and fully automated runbooks.
  • Drive major incident response, root cause analysis (RCA), and post-incident reviews.
  • Establish incident lifecycle processes and on-call operational frameworks.
DevOps, Automation & Cloud Engineering:
  • Architect and automate CI/CD pipelines using modern DevOps toolsets.
  • Design IaC (Infrastructure-as-Code) solutions using Terraform, Ansible, Helm, or similar tools.
  • Enable automated deployment strategies-blue/green, rolling, canary-across cloud and hybrid environments.
  • Build self-healing, auto-remediation, and predictive scaling solutions.
Cloud Platform Expertise:
  • Architect cloud-native and hybrid solutions using AWS, Azure, or Google Cloud Platform.
  • Lead cloud migration strategies, cloud security integration, and compliance automation.
  • Implement cost-optimized, high-availability cloud architectures with deep understanding of distributed systems.
Chaos Engineering & Resilience:
  • Lead resilience testing programs including chaos engineering, fault injection, and DR simulation.
  • Develop resilience scorecards and reliability maturity frameworks.
Collaboration & Leadership:
  • Work with application, security, data, and infrastructure teams to ensure reliability is built into design.
  • Mentor and guide SRE and DevOps teams on best practices, automation, and scalable architecture.
  • Influence enterprise-level technical direction and participate in architecture review boards.
Required Skills & Experience:
  • 12+ years of IT experience with 6+ years in SRE/DevOps architecture.
  • Strong expertise in cloud platforms (AWS, Azure, Google Cloud Platform) and distributed systems.
  • Deep knowledge of: Kubernetes, Docker, service mesh Observability stacks (Prometheus, Grafana, ELK, Splunk, Datadog, New Relic)
  • CI/CD systems (Jenkins, GitHub Actions, GitLab, Harness, ArgoCD, etc.)
  • IaC tools (Terraform, CloudFormation, Helm)
  • Strong understanding of networking, cloud security, IAM, and compliance frameworks.
  • Experience with incident response, operational playbooks, and reliability governance.
  • Excellent communication and ability to influence leadership and engineering teams.
Preferred Qualifications:
  • Certifications in AWS/Azure/Google Cloud Platform architecture.
  • Experience with chaos engineering tools such as Gremlin or Litmus.
  • Background in financial services or other highly regulated industries.
  • Hands-on programming skills in Python, Go, or Java.

For applications and inquiries, contact: hirings@openkyber.com

Job Tags

Similar Jobs

Hutchinson Consulting

Facilities Manager Job at Hutchinson Consulting

 ...Facilities Manager Private Estate | Seattle area, WA Seeking a deeply experienced Facilities/Property Manager to oversee the operations and maintenance of a large private estate in the Seattle area, WA. In this role, you will be responsible for ensuring that the... 

Prime Staffing

Travel Paramedic Job at Prime Staffing

 ...Job Description Prime Staffing is seeking a travel Paramedic for a travel job in Wausau, Wisconsin. Job Description & Requirements ~ Specialty: Paramedic ~ Discipline: Allied Health Professional ~ Start Date: 12/29/2025~ Duration: 13 weeks ~36 hours... 

Jackson Hewitt Tax Service, Inc.

Experienced Tax Preparer Professional Job at Jackson Hewitt Tax Service, Inc.

 ...Experienced professional needed for Independently owned tax office. Competitive hourly rate plus bonus after tax season ends...  ...setting with great people. Required: 1 year as a professional preparer with experience with 1040's, sch C and various other schedules... 

Bert Ogden McAllen Nissan

BERT OGDEN MCALLEN NISSAN SALES MANAGER Job at Bert Ogden McAllen Nissan

 ...gross and increase F&I penetration. OTHER DUTIES: Must conduct periodic self-inspection for hazard assessment within the Nissan sales department and recommend and document action needed and action taken. Ensure that sales department employees follow safety... 

TD Securities (USA) LLC

Vice President, Investment Banking - Financial Sponsors (San Francisco) Job at TD Securities (USA) LLC

 ...Description TD Securities is a leading investment banking franchise with offices in New York, Houston, San Francisco, Toronto, Vancouver, Calgary, Montreal and London. Our team of professionals provides corporates, financial sponsors and government clients with capital...