Why This Role Is Exciting
At Innovatrics, we build biometric technologies that empower people in more than 80 countries, and we’re now taking the next step: building our SaaS platform.
As a Senior DevOps Engineer, you’ll help design and operate the infrastructure powering our biometric services and new cloud-native SaaS solutions. Your work will ensure reliability, scalability, and automation across all environments, from development to global production.
You’ll be part of a highly skilled team leveraging AWS Cloud, Kubernetes, Terraform, ArgoCD, and modern observability tools to deliver resilient, automated, and observable systems that scale globally.
What You’ll Do
- Design, build, and operate cloud-native infrastructure supporting Innovatrics’ upcoming SaaS platform.
- Develop and manage containerized environments using Kubernetes and AWS services.
- Develop and maintain automation and reliability tooling, including deployment automation (e.g., canary deployments).
- Implement and evolve monitoring and observability systems (Prometheus, Grafana, OpenTelemetry).
- Support and optimize service communication and reliability across microservices (gRPC, message queues, Keycloak, Ignite).
- Contribute to on-call support, root cause analysis, and reliability improvements.
- Collaborate closely with developers, QA, and platform engineers to improve scalability, resilience, and delivery performance.
- Take responsibility for disaster recovery (DR) tests: tabletop exercises, partial failover, full DR, and chaos/resilience testing.
- Define SLIs and SLOs derived from the required SLAs, and ensure the SLOs are met.
Requirements
About You
- Proven experience in SRE, DevOps, or Platform Engineering in production or SaaS environments.
- Strong hands-on experience with AWS Cloud, Kubernetes, and Terraform.
- Familiarity with monitoring and observability tools (Prometheus, Grafana, OpenTelemetry).
- Experience with networking, security, and distributed systems fundamentals.
- Skilled in scripting or automation using Python, Go, or Bash.
- You work well in cross-functional teams and take ownership of system reliability and performance.
Nice-to-have:
- Experience with service mesh (Linkerd).
- Understanding of messaging systems (RabbitMQ, ActiveMQ).
- Familiarity with Keycloak for identity and access management.
- Background in building or supporting SaaS or multi-tenant systems.
Our Tech Stack
Infrastructure & Cloud: AWS, Kubernetes, Terraform, ArgoCD
Networking & Observability: OpenTelemetry, Linkerd, Prometheus, Grafana
CI/CD & Automation: GitLab CI/CD
Messaging & Integration: RabbitMQ, ActiveMQ, gRPC
Security & Access: Keycloak, Auth0
Caching & Data Layer: Redis/ElastiCache
Languages & Tools: Bash, Python, Go