The Job logo

What

Where

Staff Site Reliability Engineer

ApplyJoin for More Updates

You must Sign In before continuing to the company website to apply.

About the role

Please note, this team is hiring across all levels and candidates are individually assessed and appropriately leveled based upon their skills and experience.

The SRE Data / Provisioner team supports the Netskope Data Product Suite, and Provisioner, a critical component of our foundational technologies and the single source of truth for all user data across all Netskope Apps. We are a team of software engineers focused on improving availability, latency, performance, efficiency, change management, monitoring, emergency response, and capacity planning of the engineering stacks. If you are passionate about solving complex problems and developing cloud services at scale, we would like to speak with you.

Job Responsibilities 

  • Partner closely with our development teams and product managers to architect and build features that are highly available, performant and secure
  • Develop innovative ways to smartly measure, monitor & report application and infrastructure health
  • Gain deep knowledge of our application stack
  • Experience improving the performance of micro-services and solve scaling/performance issues
  • Capacity management and planning
  • Function well in a fast-paced and rapidly-changing environment
  • Participate in 24X7 on-call rotations.

Preferred Qualifications

  • BS or MS in Computer Science or equivalent technical degree or related practical experience

Preferred Technical Skills:

  • 10+ years experience with troubleshooting Unix/Linux
  • Understanding of Networking concepts - TCP/IP, SSL/TLS, IPSec, GRE, VPN
  • Experience with algorithms, data structures, complexity analysis, and software design
  • Experience in one or more of the following: C, C++, Python, Go
  • Experience in managing a large-scale web operations role
  • Bonus points for experience with Ansible, Kubernetes, SQL and NoSQL datastores, CI/CD
  • Hands-on working with private or public cloud services in a highly available and scalable production environment. 

Desired Technical Skills:

  • Knowledge of distributed systems is a big plus.

 Additional Skills

  • Great written and verbal communication
  • Ability to work for a geo-distributed cross-functional group
  • Demonstrated ability to own and deliver projects independently
  • Demonstrated ability of technical mentoring and coaching 
  • Strong interpersonal communication skills (including listening, speaking, and writing) and the ability to work well in a diverse, team-focused environment with other SREs, developers, Product Managers, etc
Set alert for similar jobsStaff Site Reliability Engineer role in Bengaluru, India
Netskope Logo

Company

Netskope

Job Posted

a year ago

Job Type

Full-time

WorkMode

On-site

Experience Level

8-12 years

Category

Engineering

Locations

Bengaluru, Karnataka, India

Qualification

Bachelor or Master

Applicants

Be an early applicant

Related Jobs

Rubrik Logo

Staff Site Reliability Engineer - FedRAMP

Rubrik

Statesboro, Georgia, United States

Posted: a year ago

Deploy and operate security solutions and supporting infrastructure in cloud and datacenter environments. Manage a scalable and highly available solution for security logging. Develop and automate Security tasks, perform Production Readiness Assessments, and lead post-incident reviews. Experience in security engineering, logging, data management, scripting, cloud platforms, and security automation tools. Knowledge of NIST standards and certifications.

Freshworks Logo

Staff Engineer - Site Reliability

Freshworks

Chennai, Tamil Nadu, India

Posted: a year ago

Job Description SRE at Freshworks Site Reliability Engineering (SRE) combines software and systems engineering to build and run large-scale, massively distributed, fault-tolerant systems. SRE ensures that Freshwork's services—both our internally critical and our externally-visible systems—have reliability, uptime appropriate to customer's needs and a fast rate of improvement. Additionally SRE’s will keep an ever-watchful eye on our systems capacity and performance. Much of our SRE focuses on optimizing existing systems, building infrastructure and eliminating work through automation. On the SRE team, you’ll have the opportunity to manage the complex challenges of scale, while using your expertise in coding, algorithms, complexity analysis and large-scale system design.    Responsibilities: Design, write, and deliver software to improve the availability, latency, and efficiency of Freshwork’s Products & Platforms. Manage availability, latency and performance of mission critical services and build automation to prevent problem recurrence. Independently determine and develop architectural approaches and Infrastructure solutions. Defining strategy, vision, and roadmap to develop CI/CD, Application hosting, Security and Compliance standards and guidelines across Freshworks. Drive blameless postmortems for large scale incidents. Define and drive automation and orchestration strategies. Strategize cost optimization across Freshworks Cloud environment.   Qualifications Requirements: 12+ years of Software Engineering and Coding Experience in C# / Python / JavaScript / Golang (one or more).  12+ years of Experience handling Linux and Windows Systems at a very large scale.  6+ years of Hands-on experience on Containers & Container Orchestration Tools. 10+ years of proven Experience with designing, building, supporting and observing large-scale distributed systems/services/infrastructure. Strong Experience in Microservices Architecture, Service Mesh implementation and instrumenting XaaC (Infrastructure, Software, Network, Policy, Security) across global scale systems Hands-on Experience in defining and driving Disaster Recovery across Freshworks Products & Platforms. Proficiency in implementing FinOps and cloud cost optimization strategies. Experience and knowledge of incorporating testing, compliance and security requirements within code release pipelines.  Proficiency in algorithms, data structures, complexity analysis, and software design. Ability to turn technical deep-dives into code, networking, operating systems, and storage, with ability to participate in an executive strategy discussion. Data Mining & Data Analytics experience utilizing big data and\or relational data bases technologies. Excellent experience in designing & architecting solutions using OpenSource Software (OSS). Intellectual Curiosity, Problem Solving and Storytelling presentation.