As a Senior Lab Administrator II, you will be responsible for strategizing, planning, installing, configuring, maintaining, and debugging the complete Hardware lab. You will work with various teams to build and debug deployments, and provide and maintain the software lab. Your role involves driving capacity planning, creating the lab roadmap, monitoring and maintaining lab infrastructure, designing custom setups, identifying inefficiencies, automating processes, providing L2 trainings, developing monitoring tools, and resolving region-wise lab outages. You will also provide technical support and consult with test engineers and customers. Prior experience of 5 to 10 years in network and system administration is required, along with excellent knowledge in network, system administration, cloud platforms (Redhat Openshift, Vanilla K8s, Openstack), database management systems, routing and switching concepts, DevOps workflow, and JIRA management.
The Opportunity
As a Senior Lab Administrator II one would be responsible for Strategizing, Planning and Installing, configuring, maintaining, and debugging the complete Hardware lab.
The role also calls for working with various teams across Sandvine to build/debug the deployments that may be required for coding, testing, issue reproduction etc.
The Job
- Provide and maintain the software lab
- Drive Capacity planning
- Work with Area Product Owners to create Lab roadmap and providing inputs on budgeting
- Monitor and maintain the lab infrastructure used for orchestration (profiles, cloud), test automation, reporting
- Design and create Custom setups for customer demos and issues
- Identify inefficiencies and opportunities to automate the same
- Drive L2 trainings within team
- Test/Develop/Integrate tools that would help monitor, debug and help effectivity utilize the lab
Lab Setup and Health:
- Installation, Configuration, Maintenance of software and hardware Lab infrastructure
- Configuration, maintaining and debugging of
Sandvine hardware
Off the shelf hardware such as Dell/HP etc
Cloud
Redhat Openshift
Vanilla K8s
Openstack
Linux/Windows server
Storage servers such as NetApp
Database management – SQL/postgres etc
Routers/Switches – CISCO, Juniper etc
Third party tools such as IXIA, Landslide,
- Configuring and maintaining IT/network/server/application monitoring software such as Nagios
- Interacting with 3rd party vendors for selecting hardware/software
- Develop monitoring and automated scripts to monitor hardware
- Handle region wise lab outages where the issue has to be debugged and resolved in the least amount of time with minimal downtimes. Outages can range from
key critical hardware such as Storage/NFS servers(NetApp), Gateways, Core switches, Time servers, Authentication servers(LDAP)
key critical software which runs on these servers
Database servers
Technical Support
- Setting up custom deployments which have a high project value
- Consult w/test engineers in real-time when issues are discovered
- Support customer issues as needed
What skills you bring:
- 5 to 10years of relevant experience is required
- Excellent knowledge in Network and System Administration
- Servers
- L2/L3 Switches/Routers
- Excellent Cloud Knowledge - MANDATORY
- Redhat Openshift
- Vanilla K8s
- Openstack
- AWS (OPTIONAL)
- Good understanding of dockers, Kubernetes
- Excellent knowledge of Database Management systems
- SQL/PSQL/Sqlite - MANDATORY
- Strong in Routing and Switching concepts – Configuration/debugging
- Good understanding of DevOps workflow and tools
- JIRA management - MANDATORY
- GIT
- Be able to take the right decision to debug and resolve critical outages
- Evaluate and recommend the right hardware/software for a solution