Lead Site Reliability Engineer

Smart Summary (powered by Roshi)
Establish an SRE site and build an effective, inclusive SRE team. Provide technical leadership and guidance to ensure availability and performance of mission critical services. Manage project priorities and deadlines. Lead Incident Management and drive MTTR as per the Incident SLA.

What is the job like

  • Establish an SRE site and help build an effective, inclusive SRE team.
  • Provide technical leadership for the local team and work closely with partner team technical leads and cloud leadership.
  • Provide guidance to other team members on managing the availability and performance of mission-critical services, on building automation to prevent problem recurrence, and on building automated responses for non-exceptional service conditions.
  • Manage execution of project priorities, deadlines, and deliverables.
  • Lead incident management during incidents.
  • Drive MTTR in line with the incident SLA.
  • Maintain 100% alert coverage across application, infrastructure, security, flows, etc.

Qualification:

  • 6-10 years of experience in distributed systems, storage systems, or databases; algorithms and data structures; and/or Unix/Linux systems internals (e.g., filesystems, system calls) and administration.
  • Experience designing, analyzing, and troubleshooting large-scale distributed systems.
  • Experience with MySQL or PostgreSQL databases.
  • Hands-on experience operating Kubernetes (k8s) on any cloud platform.
  • Excellent communication skills and a sense of ownership, with a systematic problem-solving approach.


 

Company: Zeta

Job Posted: a year ago

Job Type: Full-time

Work Mode: On-site

Experience Level: 8-12 Years

Category: Engineering

Locations: Hyderabad, Telangana, India

Qualification: Bachelor

Related Jobs

Senior Site Reliability Engineer

Zeta

Hyderabad, Telangana, India

Posted: a year ago

Work to understand arising issues and improve application performance by enacting monitoring solutions. Analyze current systems, reduce problems, and suggest solutions. Support monitoring, processes, tools, architecture, and root cause analysis. Develop and maintain monitoring systems, automate tasks, troubleshoot incidents, and improve system management efficiency. Identify areas for improvement and design scalable, reliable solutions. Monitor and act on alerts to prevent outages. Meet qualifications and possess necessary experience.

Site Reliability Engineer III

JPMorgan Chase & Co.

Hyderabad, Telangana, India

Posted: a year ago

JOB DESCRIPTION

There’s nothing more exciting than being at the center of a rapidly growing field in technology and applying your skillsets to drive innovation and modernize the world's most complex and mission-critical systems. As a Site Reliability Engineer III at JPMorgan Chase within the Consumer and Community Banking of Infrastructure and Production Management, you will solve complex and broad business problems with simple and straightforward solutions. Through code and cloud infrastructure, you will configure, maintain, monitor, and optimize applications and their associated infrastructure to independently decompose and iteratively improve on existing solutions. You are a significant contributor to your team by sharing your knowledge of end-to-end operations, availability, reliability, and scalability of your application or platform.

Job responsibilities

  • Guides and assists others in the areas of building appropriate level designs and gaining consensus from peers where appropriate
  • Collaborates with other software engineers and teams to design and implement deployment approaches using automated continuous integration and continuous delivery pipelines
  • Collaborates with other software engineers and teams to design, develop, test, and implement availability, reliability, scalability, and solutions in their applications
  • Implements infrastructure, configuration, and network as code for the applications and platforms in your remit
  • Understands service level indicators and utilizes service level objectives to proactively resolve issues before they impact customers
  • Develop, test and debug automated tasks (Apps, Systems, Infrastructure)
  • Troubleshoot priority incidents, facilitate blameless post-mortems

Required qualifications, capabilities, and skills

  • Minimum 7 years of overall experience in IT industry
  • Formal training or certification on site reliability engineering concepts and 3+ years applied experience
  • Proficient in at least one programming language such as Python, Java/Spring Boot
  • Proficient in site reliability culture and principles and familiarity with how to implement site reliability within an application or platform
  • Proficient knowledge of software applications and technical processes within a given technical discipline (e.g., Cloud, artificial intelligence, Android, etc.)
  • Experience in observability such as white and black box monitoring, service level objective alerting, and telemetry collection using tools such as Grafana, Dynatrace, Prometheus, Datadog, Splunk, and others
  • Experience with continuous integration and continuous delivery tools like Jenkins, GitLab, or Terraform
  • Familiarity with container and container orchestration such as ECS, Kubernetes, and Docker

Preferred qualifications, capabilities, and skills

  • Proficiency in one or more technology domains, may be a cross-domain expert able to solve complex and mission critical problems within a business or across the firm
  • Adept in the development of automated tools, systems, and services in multiple technology domains
  • Working knowledge of infrastructure components (e.g., routers, load balancers, cloud products, container systems, compute, storage and networks)
  • Excellent debugging and troubleshooting skills

ABOUT US

JPMorgan Chase & Co., one of the oldest financial institutions, offers innovative financial solutions to millions of consumers, small businesses and many of the world’s most prominent corporate, institutional and government clients under the J.P. Morgan and Chase brands. Our history spans over 200 years and today we are a leader in investment banking, consumer and small business banking, commercial banking, financial transaction processing and asset management. We recognize that our people are our strength and the diverse talents they bring to our global workforce are directly linked to our success. We are an equal opportunity employer and place a high value on diversity and inclusion at our company. We do not discriminate on the basis of any protected attribute, including race, religion, color, national origin, gender, sexual orientation, gender identity, gender expression, age, marital or veteran status, pregnancy or disability, or any other basis protected under applicable law. In accordance with applicable law, we make reasonable accommodations for applicants’ and employees’ religious practices and beliefs, as well as any mental health or physical disability needs.

ABOUT THE TEAM

Our Consumer & Community Banking division serves our Chase customers through a range of financial services, including personal banking, credit cards, mortgages, auto financing, investment advice, small business loans and payment processing. We’re proud to lead the U.S. in credit card sales and deposit growth and have the most-used digital solutions – all while ranking first in customer satisfaction.