Title and Summary
Lead Platform Architect-1
Overview:
Does building exciting platforms that will enable our customers across the world with faster, smarter and more effective solutions excite you? We are looking for a Platforms Architect that will work with our teams around the world to improve system reliability, improve operations and build strong partnerships with key stakeholder teams to own, monitor and maintain our services and infrastructure.
Our team thrives on mixing intuition with logic, art with science and magic with mathematics to create products that can change the world. Join us on our journey!
Role
- You’ll be responsible for the installation, administration, and lifecycle management of RHE Linux servers, focusing especially on platforms for Big Data solutions such as Hadoop.
- You take part in on-call as part of our ‘you build it, you run it’ philosophy. This includes improving operations, such as by adding metrics or building dashboards.
- Improve our incident management lifecycle to identify, mitigate, and learn from reliability risks.
- Develop strong partnerships with product teams to understand and proactively address future technology needs and current developer pain points.
- As part of the Platform team, you own, monitor, and maintain our development teams' backend services and infrastructure.
- Incident response, diagnosis, and follow-up on system outages or alerts across the infrastructure
- Increase developer productivity by building innovative tools that reduce maintenance overhead.
About You
- Previous experience as a Platform engineer, SRE, Platform Administrator, Infrastructure engineer, or DevOps engineer
- Strong experience with Linux and at least one of the following automation tools: Ansible, Puppet, Saltstack, Chef
- Infrastructure-as-code experience using shell scripting, Python, or GO
- Monitoring technologies such as Prometheus and Grafana
- Strong analytical and troubleshooting skills, fluency in high traffic / high availability system architecture
Desired Skills & Experience
- Understanding of TCP/IP networking
- Solid understanding of the OSI model and working knowledge of the key protocols from Layer 2 through Layer 7, including ARP, IP, TCP, UDP, and HTTP.
- Familiar with managing and maintaining Hadoop storage technologies for high availability and scale
- Ability to install, configure, and manage software lifecycle tools such as Red Hat Satellite (Spacewalk, etc.)
- Direct knowledge of architecting and managing Linux systems using secure configurations such as SE Linux
- Knowledge of building scalable and high-performance software applications and systems