The Job logo

What

Where

Sr. Site Reliability Engineer

ApplyJoin for More Updates

You must Sign In before continuing to the company website to apply.

Smart SummaryPowered by Roshi
Seeking a skilled Linux OS system administrator with a deep understanding of security and experience in networking administration. Must have practical experience with AWS services and be adept in designing and managing cloud-based solutions. The candidate should be proficient in implementing CI/CD pipelines and possess strong bash scripting skills. The ability to actively participate in team meetings, prioritize tasks, and contribute to system architecture improvement is essential. Immediate availability to address requests and incidents is required.

Your great at:

  • Perform Linux OS system administration tasks, including the development of bash scripts, OpenVPN configuration, and a strong focus on security.
  • Administer and maintain networking infrastructure, ensuring smooth operations and optimal performance.
  • Demonstrate a deep understanding of AWS services, leveraging them to design, deploy, and manage cloud-based solutions.
  • Proactively monitor the production and development environments, promptly responding to alerts through Slack, emails, and logz.io.
  • Implement and maintain CI/CD pipelines, automating the build, test, and deployment processes.
  • Actively participate in daily team meetings, providing updates on ongoing tasks, discussing priorities, and collaborating with colleagues.
  • Develop and execute tasks based on manager-defined priorities, ensuring timely completion within required timelines.
  • Contribute to design and brainstorming sessions, offering insights and ideas to enhance system architecture and operational efficiency.
  • Remain readily available during working hours to address immediate requests and incidents as they arise.
  • Constantly monitor the production and development environments, proactively addressing any issues or anomalies.

What it takes:

  • Minimum of 5 years of experience in Linux OS system administration, including bash script writing, OpenVPN configuration, and a strong focus on security.
  • Minimum of 5 years of experience in networking administration.
  • Minimum of 3 years of deep understanding and practical experience with AWS services.
  • Advantageous: Experience with GitHub, utilizing it for version control and collaboration.
  • Advantageous: Proficiency in Ansible for configuration management and automation.
  • Advantageous: Experience with TeamCity administration for continuous integration and delivery.
  • Advantageous: Proficiency in Nexus repository administration for artifact management.
  • Advantageous: Knowledge of Java programming.
Set alert for similar jobsSr. Site Reliability Engineer role in Bengaluru, India
Opentext Logo

Company

Opentext

Job Posted

a year ago

Job Type

Full-time

WorkMode

On-site

Experience Level

3-7 years

Category

Software Engineering

Locations

Bengaluru, Karnataka, India

Qualification

Bachelor

Applicants

Be an early applicant

Related Jobs

Groww Logo

Site Reliability Engineer

Groww

Gurgaon, Haryana, India

+2 more

Posted: a year ago

Monitor and troubleshoot system performance, availability, and security. Analyze metrics and trace data. Collaborate with development teams for scalability and reliability. Manage app releases and resolve production issues. Conduct root cause analysis. Optimize system performance and capacity planning. Utilize CI/CD tools.

Opentext Logo

Lead Site Reliability Engineer

Opentext

Waterloo, Ontario, Canada

+2 more

Posted: a year ago

What You Are Great At   Applying broad range of knowledge skills and experiences with an area of expertise to assignments that are received in the form of objectives. Determining how to use resources to meet schedules and goals. Providing guidance to peers within the latitude of established company policy. Using broad knowledge of the organization to impact strategy, policy, and process development as a technical authority and leader with vision for positive business outcomes Leading multi-functional strategic and tactical efforts. Providing leadership by assisting in triage for escalated production incidents. Being a change agent able to develop, implement and maintain policies and processes Collaborating with peer technology organizations, business, clients and management to review application, systems and infrastructure functionality and develop plans for improvement. Leading development and implementation of strategies focused on greater efficiencies to deliver systems. Identifying and implementing strategies to reduce platform Mean-Time-To-Resolution (MTTR) Reliability (SRE) practices and automation principles. Managing continuous improvement of service engineering, delivery, and operational practices. Reduces expenses by eliminating unnecessary downtime and disruptions. Understanding of current business and technology trends to find opportunities for improving services and reducing risk. Adopting and promoting an an SLO mindset with Disaster recovery best practices in mind Effectively navigating organization structure and culture to make positive outcomes.   What It Takes   10+ years of related experience, or equivalent Intermediate and advanced level certifications that demonstrate knowledge of Cloud and security concepts Extensive knowledge of: CaaS Technologies including Kubernetes, Google Anthos/Google Kubernetes Engine (GKE), Ingress and PaaS technologies Knowledge of (IaaS) technologies including Hypervisor (VMWare ESX), Routing (VMWare NSX-T) and Load Balancing (F5, etc.) Knowledge of monitoring and logging technologies including VMWare Tanzu Observability/Wavefront, Dynatrace and Splunk In depth knowledge of Network and Infrastructure security best practices including governance Experience in CI/CD Pipeline implementation Automation of build, Packaging and Release Management activities (Build automation, CI/ CD, GIT, Jenkins, Git) Experience with tools like JIRA, GIT/Bitbucket, Confluence, etc. Build self-healing and automated systems Design and build systems to collect, visualize, and store service health indicators Demonstrates ability to achieve successful outcomes in handling difficult situations and work with various customers and management levels. Demonstrates previously working in Agile team working in SCRUM and Kanban formats. Communicate effectively with technical and non-technical audiences. A self-starter with the ability to work independently and in a collaborative team environment