Site Reliability Engineer

Paulo

ID Verified
2050/mo
Natal, Brazil
Looking for Full-time
8 hours/day
Availability: US / EU timezone

Profile Description

Site reliability engineer focused on ensuring high availability and performance of production systems. Experienced in monitoring, incident management, and automation. Proficient in cloud platforms, containerization, and infrastructure as code. Skilled in troubleshooting complex distributed systems. Strong understanding of SLAs, SLOs, and error budgets. Passionate about building reliable, scalable systems.

Top Skills

Python (Django/FastAPI) · 4 yearsTerraform · 4 years

Skills & Expertise

Software Development

TerraformAdvanced
Python (Django/FastAPI)Advanced

Work Experience

SRE

Nubank

Apr 2020 - Sep 2024

Key Achievements:

Maintained [phone hidden]% uptime for banking platform serving millions of customers. Automated incident response reducing MTTR by 50%. Implemented chaos engineering practices improving system resilience. Developed monitoring and alerting systems for proactive issue detection. Led post-mortem process driving continuous improvement.

Site ReliabilityKubernetesTerraformMonitoring ToolsIncident ManagementPython (Django/FastAPI)

Quick Stats

Age29 years
English LevelFluent
Other Languages
Portuguesefluent
ID VerificationVerified

Certifications

Certified Kubernetes Administrator

Education

Bachelor of Science in Computer Engineering

Universidade Federal do Rio Grande do Norte

2013 - 2018