Skip to content
← Back to job listings

Site Reliability Engineer (SRE)

Full Scale · Remote

Software DevelopmentRemoteQuick applyfull-timeabout 2 months ago

About The Role

Join one of the Philippines’ fastest-growing tech companies. Open to Philippine-based candidates only.

About Us

Full Scale is a fully remote-first company that helps businesses build dedicated teams of skilled software engineers. We make finding and retaining experienced software talent easy and affordable.

About the role

We are seeking a Site Reliability Engineer to join our growing team. The ideal candidate has strong hands-on experience with Cloudflare, DataDome, and managing high-traffic, customer-facing websites. This role will focus on improving platform reliability, performance, scalability, and edge security for a large-scale web environment.

Key Responsibilities

  • Manage the reliability, availability, and performance of high-traffic web platforms.
  • Administer and optimize Cloudflare services, including CDN, caching, DNS, WAF, and rate limiting.
  • Configure and manage DataDome to mitigate bots, abuse, scraping, and malicious traffic.
  • Monitor production systems and respond to incidents affecting uptime, latency, and user experience.
  • Investigate outages and performance issues, conduct root cause analysis, and implement long-term fixes.
  • Collaborate with engineering teams to improve resiliency, observability, and deployment safety.
  • Support traffic scaling, capacity planning, and operational readiness for large-volume environments.
  • Implement automation and operational best practices to improve stability and efficiency.

Requirements

  • Proven experience in Site Reliability Engineering, DevOps, or Infrastructure Engineering.
  • Strong hands-on production experience with Cloudflare.
  • Experience with DataDome or similar bot protection / traffic filtering platforms.
  • Proven experience supporting high-traffic websites or large-scale web applications.
  • Strong understanding of CDN, caching, DNS, WAF, DDoS mitigation, and edge performance optimization.
  • Experience with monitoring, alerting, incident response, and root cause analysis.
  • Strong troubleshooting skills in live production environments.
  • Experience improving system reliability, scalability, and performance.
  • Strong communication and collaboration skills.

Nice to Have

  • Experience with AWS, GCP, or Azure.
  • Experience with Kubernetes, Terraform, or other infrastructure-as-code tools.
  • Background in consumer internet, search platforms, data platforms, or other high-scale digital environments.
  • Experience with Datadog, Grafana, Prometheus, or similar observability tools.

Benefits

Why Join Us

  • Fully remote – work from anywhere in the Philippines
  • Collaborative, high-performing engineering culture
  • Work on scalable, real-world systems with modern architecture

This listing was posted by a verified recruiter at Full Scale. Report this listing