Site Reliability Engineer

Job description 

Juniper is changing what’s possible in networking. We’re going beyond building the networks customers expect — we’re building the networks customers deserve. And the world is taking note. But to continue to excel, we have work to do. Change in our industry is accelerating. To power connections and empower change, we need radical thinkers, eternal optimists, and energized personalities. We need people like you.

Juniper is seeking a full-time SRE to join our talented team and support high-quality technology solutions that revolutionize wireless and wired networks, powered by Artificial Intelligence in the cloud. Juniper provides services through SaaS applications to several enterprises, including Fortune 100 and Fortune 500 customers. You will be responsible for maintaining and improving the company's production environment for rapid scaling and outstanding performance, and for sustaining high cloud uptime and reliability. Your primary responsibilities will be incident management and release management for cloud instances in various regions.

 

Responsibilities:

  • Manage system availability, health and service levels (SLAs, SLOs) of the large-scale cloud infrastructure, running in AWS and GCP.
  • Proactively monitor, diagnose, and analyze failures, and support software engineers in debugging production issues across microservices and distributed platforms. Work with the development team to resolve the issues found.
  • Participate in on-call rotation and resolution of issues in a 24x7 multi-cloud (AWS/GCP) environment.
  • Monitor metrics and performance of applications and cloud infrastructure.
  • Manage code releases, i.e., push code and patches to the cloud.
  • Own the entire lifecycle of incidents (incident management), including reporting, analyzing, and handling incidents through to closure, and writing RCAs.
  • Maintain a sharp focus on, and be able to analyze, scalability, reliability, high availability, performance, software maintainability, and operational challenges.
  • Write and maintain runbooks for knowledge-driven automated processes and bots.
  • Perform capacity planning based on performance, usage, and utilization stats.
  • Perform after-hours infrastructure updates and maintenance.
  • Follow SRE best practices and procedures.

 

Required Skills:

  • Bachelor’s degree in Computer Science or Computer Engineering or equivalent.
  • Minimum of 6-7 years of DevOps/SRE experience.
  • 5+ years of hands-on experience with AWS or GCP, EC2 (GCE), IAM, S3 (GCS), Docker, Kubernetes pods, Jenkins, Prometheus, CloudWatch (Stackdriver), Linux, Ansible, and Salt.
  • 5+ years of experience deploying code and infrastructure in AWS or GCP using continuous integration/continuous delivery (CI/CD) tools in production environments.
  • 5+ years of automation using Python, Golang, and/or shell scripting.
  • 6+ years of prior experience in developing metrics to monitor the health of infrastructure and applications.
  • 5+ years of experience in managing SaaS application infrastructure, with REST-based test automation experience using Python.
  • Basic understanding of Terraform, CloudFormation, or any other IaC tool is preferred.
  • Ideally, a detailed understanding of IP routing, security, and cloud services such as CGNAT, IPsec, IDP, and SD-WAN/SDN for different customer use cases.
  • The candidate should have a thorough understanding of networking fundamentals (TCP/IP, UDP, DHCP, DNS, ICMP, ARP, routing and switching).
  • General understanding of distributed systems. 
  • Understanding of data management technologies including relational and non-relational databases. 
  • Hands-on experience in operating large-scale cloud-based distributed applications.
  • Knowledge of build pipelines/infrastructure such as Jenkins, GitHub, and CI/CD would be an added advantage.
  • The ability to "fix the plane while in flight".

Company: Juniper Networks

Job Posted: a year ago

Job Type: Full-time

Work Mode: On-site

Experience Level: 8-12 Years

Category: Software Engineering

Locations: Bengaluru, Karnataka, India

Qualification: Bachelor or Master