Compute Cluster DevOps Engineer, GPU - HPC

NVIDIA

Bengaluru, Karnataka, India

Posted: 2 years ago

What you will be doing: Design, implement, and support large-scale infrastructure with monitoring, logging, and alerting to meet promised uptime. Engage in and improve the whole lifecycle of services, from inception and design through deployment, operation, and refinement. Support services before they go live through activities such as system design consulting, developing software platforms and frameworks, and capacity management. Maintain infrastructure and services once they are live by measuring and monitoring availability, latency, and overall system health. Scale systems sustainably through mechanisms like automation, and evolve systems by pushing for changes that improve reliability and velocity. Practice sustainable incident response and blameless postmortems. Understand complex and vast infrastructure and support it during on-call weeks. Work with different SMEs to help provide quality resolutions to customers' production issues.

What we need to see: BS degree in Computer Science or a related technical field involving coding (e.g., physics or mathematics), or equivalent. 4+ years of hands-on industry experience in the above-mentioned areas. Experience with automation around Linux system administration. Experience in one or more of the following: Python, Perl. Good understanding of open-source IT automation tools like Ansible and Salt. Interest in crafting, analyzing, and fixing large-scale distributed systems. Systematic problem-solving approach, coupled with strong communication skills and a sense of ownership and drive. Ability to debug and optimize code and automate routine tasks.

Ways to stand out from the crowd: Good hands-on experience with schedulers like LSF and SLURM. Experience in maintaining and writing automation around HPC clusters. Strong system administration skills in Linux.

Staff IT SRE Engineer

NVIDIA

Bengaluru, Karnataka, India

Posted: a year ago

Join our team as a DevOps Engineer to work on building and packaging infrastructure for autonomous vehicles, designing high-end architectures for manufacturing sites, and maintaining system health. We are looking for someone with experience in infrastructure automation, containerization, large-scale project management, and cloud-based platforms. Strong communication skills and a background in Linux platforms are required.

Cluster Optometrist

Lenskart.com

Bengaluru, Karnataka, India

Posted: 2 years ago

We are hiring for the position of Cluster Optometrist. Your responsibilities will include managing Lenskart stores, working as an in-store optometrist, mentoring and coaching junior optometrists, maximizing in-store revenue, and supporting the store team. You must have a good knowledge of optometry, eyewear products, and measurements. You should also be passionate about sales, retail, and customer service. Travel within India may be required. If you have at least 3 years of retail experience and strong leadership qualities, we would like to hear from you.

SRE Architect

Infosys

Bengaluru, Karnataka, India

Posted: a year ago

As an SRE Architect at Infosys in Bangalore, you will be responsible for solutioning, RFPs, technical solutions, tool implementation, and support and maintenance processes, and you will need knowledge of various IT areas.

GPU Unit Verification Engineer

NVIDIA

Bengaluru, Karnataka, India

Posted: 2 years ago

As a key member of our ASIC Verification team, you will verify the design and implementation of the industry's leading GPUs. You will be responsible for ensuring that the verification meets the hardware functional safety standards required by automotive applications. You will work with architects, designers, and pre- and post-silicon verification teams to accomplish your tasks.

GPU Unit Verification Engineer

NVIDIA

Bengaluru, Karnataka, India

Posted: 2 years ago

Join our ASIC Verification team and verify the design and implementation of industry-leading GPUs. You will be responsible for ensuring that verification meets functional safety standards for automotive applications. You will also develop verification infrastructure and collaborate with other teams to accomplish tasks.

Compute Cluster SRE Engineer, GPU - HPC
