- Own reliability of production services running on AWS while steering the roadmap for platform resilience and building out the SRE team.
- Lead and grow a remote team of SREs—coaching, hiring, performance‑managing, and fostering a blameless culture.
- Set and enforce Service Level Objectives (SLOs), error budgets, and incident response processes.
- Drive automation via Infrastructure‑as‑Code (Pulumi / TypeScript), CI/CD, and observability pipelines.
- Represent the SRE discipline to product, engineering, and senior leadership across our global business.
- Hands on monitoring and incident response will be critical as the team grows.
- Leadership & People Management
- Build an SRE team of initially 3-6 engineers: goal setting, career development, regular 1:1s, and annual performance reviews.
- Ensure operational system knowledge is captured and that the team is kept "fresh" on operating and troubleshooting procedures.
- Recruit, onboard, and mentor new engineers; scale the team to meet business growth.
- Maintain an inclusive, psychologically‑safe culture centred on learning and continuous improvement.
- Own, and participate in, the on‑call roster for the team, ensuring equitable rotations and sustainable workloads.
- Service Level Management & Reliability
- Define, monitor, and enforce SLOs and error budgets across all production systems.
- Continuously analyse error‑budget burn to halt risky deployments and guide capacity decisions.
- Champion a data‑driven reliability mindset throughout engineering and product teams.
- Infrastructure Automation & Management
- Architect and implement Infrastructure‑as‑Code in Pulumi/TypeScript for AWS resources (EKS, MSK, SingleStore, MongoDB, S3, etc.).
- Lead large‑scale migration or modernisation projects (e.g., Kubernetes upgrades, multi‑AZ resilience).
- Eliminate toil—any manual task >2 engineer‑days/quarter or frequently repeated becomes an automation candidate.
- Incident Response & Post‑Mortem Leadership
- Participate in on-call monitoring and response roster.
- Serve as escalation point and incident commander.
- Ensure post‑mortems are published within 48 hours with actionable "never again" tasks tracked to closure.
- Improve runbooks and game‑day exercises; train engineers on incident command principles.
- Security & Compliance
- Enforce least‑privilege IAM policies and champion DevSecOps practices.
- Contribute to SOC 2 & ISO 27001 evidence collection and continuous control monitoring.
- Oversee security patch pipelines, vulnerability management, and secrets hygiene.
- Operational Excellence & Continuous Improvement
- Own reliability KPIs (MTTR, change failure rate, meantime between failures).
- Lead quarterly reliability reviews and drive the reliability roadmap.
- Partner with Product on capacity forecasts and cost‑optimisation initiatives
-
+We are seeking a highly skilled and motivated Site Reliability Engineer (SRE) with a strong focus on front-end application performance and reliability. + · ++ Otimizar rendimiento frontend para aplicaciones web y móviles asegurando tiempos de carga rápidos e interacciones suaves ...
Taguig1 linggo ang nakalipas
-
We are seeking a highly skilled and motivated Site Reliability Engineer (SRE) with a strong focus on front-end application performance and reliability. · ...
Taguig, Metro Manila1 buwan ang nakalipas
-
As an Application Reliability Engineer, you are responsible for ensuring the stability, performance, and resilience of the bank's core and digital applications. You will serve as the second-line technical expert for production systems, resolving complex incidents, preventing recu ...
Taguig1 araw ang nakalipas
-
Site Reliability Engineer/DevOps Engineer - MS Azure, DevOps · The EY Foundation teams develop systems and infrastructure for the Reporting & Analysis Platform for Tax and Other Regulations (RAPToR). Our work supports EY's software developers in creating key products for clients. ...
Taguig ₱900,000 - ₱1,800,000 (PHP) bawat taon1 araw ang nakalipas
-
Ensure the stability performance and resilience of the bank's core and digital applications as an Application Reliability Engineer. Serve as a second-line technical expert for production systems resolving complex incidents preventing recurring issues and driving continuous improv ...
Taguig, National Capital Region1 linggo ang nakalipas
-
Site Reliability Engineer/DevOps Engineer - MS Azure, DevOps · The EY Foundation teams develop systems and infrastructure for the Reporting & Analysis Platform for Tax and Other Regulations (RAPToR). Our work supports EY's software developers in creating key products for clients. ...
Taguig, National Capital Region ₱900,000 - ₱1,800,000 (PHP) bawat taon16 oras ang nakalipas
-
This role involves working as a Site Reliability Engineer maintaining production systems with technical support responsibility. · Provide technical support swiftly diagnosing and resolving production issues. · ...
Taguig, National Capital Region1 buwan ang nakalipas
-
We are urgently Hiring for: Site Reliability Engineers Hybrid BGC Up to · 155K Gross Monthly · The Role Implement and maintain Observability platforms such as Datadog Proactive monitoring of production and other environments to ensure stability availability security and integrit ...
Taguig, NCR, Philippines1 linggo ang nakalipas
-
Provide technical guidance to application teams on MQ Encryption in Transit. · Monitor, maintain, troubleshoot IBM MQ & Kafka clusters for optimal performance. · Develop automation scripts using Ansible, Python & shell scripts. · Implement monitoring tools (Prometheus, Grafana) t ...
Taguig3 linggo ang nakalipas
-
About the Role · We are seeking a highly skilled and motivated Site Reliability Engineer (SRE) with a strong focus on front-end application performance and reliability. In this role, you will ensure the scalability, availability, and responsiveness of our web and mobile user-faci ...
Taguig ₱900,000 - ₱1,800,000 (PHP) bawat taon1 araw ang nakalipas
-
This role is for an individual contributor who will play a critical role in maintaining the health and reliability of production systems. · Provide technical support and swiftly diagnose and resolve production issues to minimize downtime and ensure seamless operations. · ...
Taguig1 buwan ang nakalipas
-
As an Application Reliability Engineer you are responsible for ensuring the stability performance and resilience of the banks core and digital applications. · You will serve as the second-line technical expert for production systems resolving complex incidents preventing recurrin ...
Taguig1 linggo ang nakalipas
-
We're looking for experienced Site Reliability Engineer to develop, implement, optimize and maintain our platform. · We will be responsible for deploying and debugging cloud stacks, · educating teams on new cloud initiatives, · and ensuring the security of the cloud infrastructur ...
Taguig, Metro Manila1 buwan ang nakalipas
-
We are looking for Senior Site Reliability Engineer client in BGC to ensure production systems are always performing optimally and efficiently. · ...
Taguig City, NCR, Philippines ₱900,000 - ₱1,800,000 (PHP) bawat taon5 araw ang nakalipas
-
Ensure all tickets updated handled based set KPI's SLA's · Manage monitoring alerting logging tools ensure system health service uptime. · ...
Taguig1 buwan ang nakalipas
-
+We are a pro sports team. We work together seamlessly, passing the ball of innovation and collaboration to score success as one. · + · You will manage Amazon Web Services (AWS) infrastructure with a focus on security, high availability and cost using a Infrastructure-As-Code met ...
Taguig2 linggo ang nakalipas
-
We are looking for a Site Reliability Engineer to join our team. The ideal candidate will have at least 3 years of hands-on experience as a Site Reliability Engineer, good knowledge of Azure foundation components or Google Cloud Platform, and strong troubleshooting and performanc ...
Taguig1 linggo ang nakalipas
-
The company is looking for a Site Reliability Engineer to work on shifting schedule as needed. The ideal candidate should have experience with monitoring tools such as Grafana or Appdynamics and be familiar with ServiceNow, Confluent, Akamai or Adobe services. · ...
Taguig3 linggo ang nakalipas
- Trabaho sa kumpanya
Site Reliability Engineer/DevOps Engineer
Para lamang sa mga rehistradong miyembro
We seek dedicated Site Reliability Engineers (SREs) to maintain our high service standards. Our services are designed for global scalability continuous availability and seamless operation. · The SRE role involves managing and improving our Azure Cloud infrastructure ensuring our ...
Taguig, National Capital Region1 buwan ang nakalipas
-
Role Profile · We are looking for a Site Reliability Engineer (SRE) to join a product suite within the Risk Intelligence business. This role is based in Manila, Phillipines, and will be responsible for executing reliability engineering practices, supporting platform operations, a ...
Taguig Buong oras1 araw ang nakalipas
-
The Head of Site Reliability Engineering is a hybrid technical‑leadership role. · Own reliability of production services running on AWS while steering the roadmap for platform resilience and building out the SRE team. · ...
Taguig, National Capital Region6 araw ang nakalipas
Head of Site Reliability Engineering - Taguig - Acquire Intelligence
Paglalarawan
Job Description:
The Head of Site Reliability Engineering is a hybrid technical‑leadership role. You will:
Key Responsibilities
-
Site Reliability Engineer
Para lamang sa mga rehistradong miyembro Taguig
-
Site Reliability Engineer
Para lamang sa mga rehistradong miyembro Taguig, Metro Manila
-
Application Reliability Engineer
UNO Digital Bank- Taguig
-
Site Reliability Engineer
Para lamang sa mga rehistradong miyembro Taguig
-
Application Reliability Engineer
Para lamang sa mga rehistradong miyembro Taguig, National Capital Region
-
Site Reliability Engineer
Para lamang sa mga rehistradong miyembro Taguig, National Capital Region
-
Site Reliability Engineer
Para lamang sa mga rehistradong miyembro Taguig, National Capital Region
-
Site Reliability Engineer
Para lamang sa mga rehistradong miyembro Taguig, NCR, Philippines
-
Site Reliability Engineer
Para lamang sa mga rehistradong miyembro Taguig
-
Site Reliability Engineer
Philtech Inc.- Taguig
-
Site Reliability Engineer
Para lamang sa mga rehistradong miyembro Taguig
-
Application Reliability Engineer
Para lamang sa mga rehistradong miyembro Taguig
-
Site Reliability Engineer
Para lamang sa mga rehistradong miyembro Taguig, Metro Manila
-
Site Reliability Engineer
Para lamang sa mga rehistradong miyembro Taguig City, NCR, Philippines
-
Site Reliability Engineer
Para lamang sa mga rehistradong miyembro Taguig
-
Site Reliability Engineer
Para lamang sa mga rehistradong miyembro Taguig
-
Site Reliability Engineer
Para lamang sa mga rehistradong miyembro Taguig
-
Site Reliability Engineer
Para lamang sa mga rehistradong miyembro Taguig
-
Site Reliability Engineer/DevOps Engineer
Para lamang sa mga rehistradong miyembro Taguig, National Capital Region
-
Engineer, Site Reliability Engineering
Buong oras Para lamang sa mga rehistradong miyembro Taguig
-
Head of Site Reliability Engineering
Para lamang sa mga rehistradong miyembro Taguig, National Capital Region