Job description
In this role, you will work with the NVIDIA Cloud Functions and Web Services teams to design and deploy AI-based APIs for training, fine-tuning and inference of our Picasso-Edify products on public and private clouds, based on groundbreaking NVIDIA technology. You will also work with our customers and AI partners to improve and scale up our Cloud Services by driving adoption of end-to-end Machine Learning and Deep Learning solutions in the cloud.
What you'll be doing:
Set and drive best practices within the engineering organization for cloud-related development and deployment
Work with teams across NVIDIA to design, implement and deploy microservices
Participate in code reviews, debug production systems, coordinate incident management follow-up and contribute directly to the software system through code
Build and deploy AI/ML solutions at scale using NVIDIA's AI software on cloud-based GPU platforms
Address workload, security, privacy and architectural requirements, and understand and resolve technical challenges
Occasionally help debug critical service disruptions and bottlenecks, and provide fixes
Work with NVIDIA’s product and architecture teams by providing customer requirements, feedback and prioritization
What we need to see:
BE/MS in Computer Science or equivalent experience
10+ years of foundational expertise in Engineering, Computer Science, Data Science, or a related field
5+ years of working experience in cloud engineering roles
Proficiency in languages such as Go, Python, JavaScript or C/C++
Experience with authentication, identity management and cloud security
Experience using data analytics tools for debugging and dashboarding
Exposure to cloud storage and databases
Exposure to Deep Learning and Machine Learning concepts
Strong written and oral communication skills, with the ability to collaborate effectively with customers, management and engineering
Excellent verbal communication and presentation skills in English
Ways to stand out from the crowd:
Working experience with major or tier-2 cloud service providers (such as AWS, GCP and Azure), or with AI Datacenter Providers or HPC Datacenters
Experience with AI frameworks and tools on GPUs