#63525 - Site Reliability Engineer at Full Scale

About The Role

Join one of the Philippines’ fastest-growing tech companies. Open to Philippine-based candidates only.

About Us

Full Scale is a fully remote-first company that helps businesses build dedicated teams of skilled software engineers. We make finding and retaining experienced software talent easy and affordable.

About the role

We are seeking a Site Reliability Engineer to join our growing team. The ideal candidate has strong hands-on experience with Cloudflare, DataDome, and managing high-traffic, customer-facing websites. This role will focus on improving platform reliability, performance, scalability, and edge security for a large-scale web environment.

Key Responsibilities

Manage the reliability, availability, and performance of high-traffic web platforms.
Administer and optimize Cloudflare services, including CDN, caching, DNS, WAF, and rate limiting.
Configure and manage DataDome to mitigate bots, abuse, scraping, and malicious traffic.
Monitor production systems and respond to incidents affecting uptime, latency, and user experience.
Investigate outages and performance issues, conduct root cause analysis, and implement long-term fixes.
Collaborate with engineering teams to improve resiliency, observability, and deployment safety.
Support traffic scaling, capacity planning, and operational readiness for large-volume environments.
Implement automation and operational best practices to improve stability and efficiency.

Requirements

Proven experience in Site Reliability Engineering, DevOps, or Infrastructure Engineering.
Strong hands-on production experience with Cloudflare.
Experience with DataDome or similar bot protection / traffic filtering platforms.
Proven experience supporting high-traffic websites or large-scale web applications.
Strong understanding of CDN, caching, DNS, WAF, DDoS mitigation, and edge performance optimization.
Experience with monitoring, alerting, incident response, and root cause analysis.
Strong troubleshooting skills in live production environments.
Experience improving system reliability, scalability, and performance.
Strong communication and collaboration skills.

Nice to Have

Experience with AWS, GCP, or Azure.
Experience with Kubernetes, Terraform, or other infrastructure-as-code tools.
Background in consumer internet, search platforms, data platforms, or other high-scale digital environments.
Experience with Datadog, Grafana, Prometheus, or similar observability tools.

Benefits

Why Join Us

Fully remote – work from anywhere in the Philippines
Collaborative, high-performing engineering culture
Work on scalable, real-world systems with modern architecture

This listing was posted by a verified recruiter at Full Scale. Report this listing

Site Reliability Engineer (SRE)