The successful candidate will play an important role in maintaining a state-of-the-art email delivery platform that serves thousands of customers worldwide.
Technical experience:
- Understanding of how DNS works (SOA, NS, MX, DKIM, etc.)
- Strong understanding of Linux systems management (distributions, firewall, ssh, network troubleshooting)
- Familiarity with the SMTP protocol
- Familiarity with the HTTP protocol (request types, status codes, etc.)
- Strong understanding of networking and the OSI model (VLAN, link aggregation, macvlan); you can tell the difference between L4 and L7 protocols
- Basic knowledge of different DBMS (MySQL, PostgreSQL, MongoDB, Redis, etc.): basic setup, replication, clustering (you can tell us what you know about them)
- Experience with at least one monitoring system (Zabbix, Prometheus, CloudWatch)
- Containerisation (Docker, LXC) and the Docker Swarm orchestrator
- High availability (HAProxy, nginx, keepalived)
- IaC (Terraform, Ansible): nice to have, but not necessary
Responsibilities in the position:
- Build and maintain a hybrid ecosystem of 500+ servers and resolve problems within it
- Implement, maintain and continuously improve Operations team processes to meet target Service Level Objectives (SLOs), Service Level Agreements (SLAs) and uptime commitments
- Be an integral part of the incident management process. Act as the last-mile expert for escalated issues and provide timely and efficient solutions, and provide subject matter expert (SME) assistance to teammates during incident resolution (including on-call escalation both during and outside working hours)
- Support and optimize monitoring systems and related services around the product, including early incident detection and resolution with no or minimal service impact
- Architect and implement progressively better High Availability, Disaster Recovery and Business Continuity (Backup) solutions
- Lead the operational efforts in platform cloud expansion, including infrastructure, network, security and monitoring design and ongoing enhancements
- Contribute to and optimize the hybrid resource management and utilization strategy. Lead the cost optimization and performance improvement processes within the team
- Maintain up-to-date documentation for the infrastructure, systems, components, services and network layers
- Participate in the software release management cycle
We offer:
- A high salary
- B2B contract
- Transparent evaluation and remuneration policy with regular reviews
- Social package: paid vacation and sick leave (29 days)
- Continuous professional development
- Free English language courses