Canary Wharfian - Online Investment Banking & Finance Community.

Job Details

Bulge Bracket Investment Banks

Senior Manager of Site Reliability Engineering

at J.P. Morgan

Experienced | No visa sponsorship

Posted 16 days ago


Senior Manager of Site Reliability Engineering at JPMorgan Chase leading reliability for Email services across SaaS, vendor appliances, and in-house applications. Owns non-functional requirements and champions site reliability practices including observability, resiliency, security, scalability, monitoring, instrumentation, and automation. Leads and coaches a team of reliability engineers, influences strategic planning, and drives continual improvement through blameless post-mortems and data-driven metrics. Requires SRE training and hands-on experience with programming, observability, CI/CD, and container orchestration technologies.

Compensation
Not specified

Currency: Not specified

City
Bengaluru
Country
India

Full Job Description

Location: Bengaluru, Karnataka, India

Guide and shape the future of technology at a globally recognized firm, driven by pride in ownership.

 

As a Senior Manager of Site Reliability Engineering at JPMorgan Chase within the Employee Platforms team, you are the non-functional requirement owner and champion for the applications in your remit. You are a key influencer in your team’s strategic planning, driving continual improvement in customer experience, resiliency, security, scalability, monitoring, instrumentation, and automation of the software in your area. In this role, you will lead the Reliability function for the firm’s Email services, a mixture of SaaS-based solutions on Microsoft 365, vendor appliances deployed within the firm, and a number of in-house applications. You will have not only strong skills in all aspects of reliability (observability, failure mode analysis, and an appreciation of resilient software patterns) but also strong influencing and leadership skills to drive the reliability agenda and manage a diverse team of reliability engineers. You will take control in times of ambiguity, showing thought leadership and innovation in tackling problems.

 

 

Job responsibilities

 

  • Demonstrates expertise in site reliability principles and an understanding of the fine balance between features, efficiency, and stability
  • Effectively negotiates with peers and executive partners to ensure optimal outcomes for all 
  • Drives the adoption of site reliability practices throughout the organization
  • Ensures your teams apply site reliability best practices and can demonstrate this empirically through stability and reliability metrics
  • Drives a culture of continual improvement and solicits real-time feedback to improve the customer’s experience
  • Ensures your team collaborates with other teams within your group’s specialization and avoids duplication of work where possible
  • Follows blameless, data-driven, post-mortem strategies and conducts regular team debriefs to enable learning from both successes and mistakes
  • Provides personalized coaching for entry to mid-level team members 
  • Ensures your team documents and shares their knowledge and innovations via internal forums, communities of practice, guilds, and conferences 
  • Supports the adoption of site reliability engineering best practices within your team
  • Is expected to complete the SRE Bar Raiser Program

 

Required qualifications, capabilities, and skills

  • Formal training or certification in Site Reliability concepts and 5+ years of applied experience. In addition, 2+ years of experience leading technologists to manage and solve complex technical items within your domain of expertise.
  • Advanced proficiency in site reliability culture and principles, with the ability to demonstrate how to implement site reliability across application and platform teams while avoiding common pitfalls
  • Experience leading technologists to manage and solve complex technological issues at a firmwide level
  • Ability to influence the team’s culture by championing innovation and change for success
  • Proficiency in at least one programming language (e.g., Python, Java Spring Boot, .NET, etc.)
  • Proficiency in observability tools (Grafana, Splunk, ThousandEyes, Apica)
  • Proficiency in Real User Monitoring and synthetic monitoring solutions, and an understanding of their differing needs and advantages
  • Demonstrated proficiency in software applications and technical processes within a technical discipline (e.g., cloud, artificial intelligence, machine learning, mobile, etc.)
  • Proficiency in continuous integration and continuous delivery tools (e.g., Jenkins, GitLab, Terraform, etc.)
  • Experience with container and container orchestration (e.g., ECS, Kubernetes, Docker, etc.)
  • Experience with troubleshooting common networking technologies and issues

     

Preferred qualifications, capabilities, and skills

  • Appreciation for reliability in a SaaS environment 
  • Understanding of Email technologies (Microsoft Exchange 2019, Microsoft Exchange Online, Proofpoint)
  • Ability to initiate and implement ideas to solve business problems
  • Passion for learning new technologies and driving innovative solutions

     

Influence your team’s strategic planning while driving continual site reliability improvements
