Job description
What you'll do:
As a Support Engineering Manager, you will...
- Be a technical lead for organization and are regarded as a support leader by peers across the team
- Have a solid understanding of the business as it relates to the team’s supported services
- Lead or participate in Team wide user groups to exchange support information, procedures, and best practices
- Be an expert in system design and configuration and contribute regularly to physical architecture design for the organization, especially for new services
- Be involved in major technical decisions and work with development and QA teams to help increase service supportability, scalability, and performance
- Lead large initiatives that span many services such as efficiency and scaling
- Manage organization-wide outages ensuring accurate documentation and timely resolution
- Define aggressive SLAs and drive the team to meet them through metrics creation and analysis
- Conduct cross-team monitor planning and reviews
- Have a strong understanding of the technologies, tools, procedures, resources, and best practices used to improve operational excellence
- Determine best practices for code management, support, code builds, and code deployments
- Participate and may drive operational excellence across teams
- Display the leadership core values in every activity and may lead people or virtual teams
What you'll bring:
- Expertise in operational excellence, security compliance, and distributed systems’ resiliency
- Knowledge of professional software engineering practices & best practices for the full software development life cycle, including coding standards, code reviews, source control management, build processes, testing, and operations
- Expertise in operations of highly available and scalable software systems with large customer bases
- Excellent communication abilities, both verbal and written, including long-form business writing on complex topics
- Ability to drive effective teamwork, collaboration and commitment
- Excellent analytical problem solving and troubleshooting skills
- Ability to coordinate between technical teams, peers and business stakeholders
- Ability to quickly learn and adopt new technologies
- Experience managing on-call support coverage, escalation and critical issue management
- Experience in contributing to the definition and implementation of automation opportunities within an operations environment
- Experience in root cause analysis and error correction, identifying changes to procedures and systems to implement long term fixes and avoid repeating issue