Head of Site Reliability Engineering - Taguig - Acquire Intelligence

    Acquire Intelligence
    Acquire Intelligence Taguig

    1 day ago

    Accounting / Finance
    Description

    Job Description:

    The Head of Site Reliability Engineering is a hybrid technical‑leadership role. You will:

    • Own reliability of production services running on AWS while steering the roadmap for platform resilience and building out the SRE team.
    • Lead and grow a remote team of SREs—coaching, hiring, performance‑managing, and fostering a blameless culture.
    • Set and enforce Service Level Objectives (SLOs), error budgets, and incident response processes.
    • Drive automation via Infrastructure‑as‑Code (Pulumi / TypeScript), CI/CD, and observability pipelines.
    • Represent the SRE discipline to product, engineering, and senior leadership across our global business.
    • Hands-on monitoring and incident response will be critical as the team grows.

    Key Responsibilities

    • Leadership & People Management
        • Build an SRE team of initially 3-6 engineers: goal setting, career development, regular 1:1s, and annual performance reviews.
        • Ensure operational system knowledge is captured and that the team is kept "fresh" on operating and troubleshooting procedures.
        • Recruit, onboard, and mentor new engineers; scale the team to meet business growth.
        • Maintain an inclusive, psychologically‑safe culture centred on learning and continuous improvement.
        • Own, and participate in, the on‑call roster for the team, ensuring equitable rotations and sustainable workloads.
    • Service Level Management & Reliability
        • Define, monitor, and enforce SLOs and error budgets across all production systems.
        • Continuously analyse error‑budget burn to halt risky deployments and guide capacity decisions.
        • Champion a data‑driven reliability mindset throughout engineering and product teams.
    • Infrastructure Automation & Management
        • Architect and implement Infrastructure‑as‑Code in Pulumi/TypeScript for AWS resources (EKS, MSK, SingleStore, MongoDB, S3, etc.).
        • Lead large‑scale migration or modernisation projects (e.g., Kubernetes upgrades, multi‑AZ resilience).
        • Eliminate toil: any manual task that takes more than 2 engineer‑days per quarter, or that is frequently repeated, becomes an automation candidate.
    • Incident Response & Post‑Mortem Leadership
        • Participate in the on‑call monitoring and response roster.
        • Serve as escalation point and incident commander.
        • Ensure post‑mortems are published within 48 hours with actionable "never again" tasks tracked to closure.
        • Improve runbooks and game‑day exercises; train engineers on incident command principles.
    • Security & Compliance
        • Enforce least‑privilege IAM policies and champion DevSecOps practices.
        • Contribute to SOC 2 & ISO 27001 evidence collection and continuous control monitoring.
        • Oversee security patch pipelines, vulnerability management, and secrets hygiene.
    • Operational Excellence & Continuous Improvement
        • Own reliability KPIs (MTTR, change failure rate, mean time between failures).
        • Lead quarterly reliability reviews and drive the reliability roadmap.
        • Partner with Product on capacity forecasts and cost‑optimisation initiatives.

  • Job at company

    Site Reliability Engineer

    Registered members only

    We are seeking a highly skilled and motivated Site Reliability Engineer (SRE) with a strong focus on front-end application performance and reliability. · Optimize front-end performance for web and mobile applications, ensuring fast load times and smooth interactions ...

    Taguig

    1 week ago

  • Job at company

    Site Reliability Engineer

    Registered members only

    We are seeking a highly skilled and motivated Site Reliability Engineer (SRE) with a strong focus on front-end application performance and reliability. · ...

    Taguig, Metro Manila

    1 month ago

  • Job at company

    Application Reliability Engineer

    UNO Digital Bank

    As an Application Reliability Engineer, you are responsible for ensuring the stability, performance, and resilience of the bank's core and digital applications. You will serve as the second-line technical expert for production systems, resolving complex incidents, preventing recu ...

    Taguig

    1 day ago

  • Job at company

    Site Reliability Engineer

    Registered members only

    Site Reliability Engineer/DevOps Engineer - MS Azure, DevOps · The EY Foundation teams develop systems and infrastructure for the Reporting & Analysis Platform for Tax and Other Regulations (RAPToR). Our work supports EY's software developers in creating key products for clients. ...

    Taguig ₱900,000 - ₱1,800,000 (PHP) per year

    1 day ago

  • Job at company

    Application Reliability Engineer

    Registered members only

    Ensure the stability, performance, and resilience of the bank's core and digital applications as an Application Reliability Engineer. Serve as a second-line technical expert for production systems, resolving complex incidents, preventing recurring issues, and driving continuous improv ...

    Taguig, National Capital Region

    1 week ago

  • Job at company

    Site Reliability Engineer

    Registered members only

    Site Reliability Engineer/DevOps Engineer - MS Azure, DevOps · The EY Foundation teams develop systems and infrastructure for the Reporting & Analysis Platform for Tax and Other Regulations (RAPToR). Our work supports EY's software developers in creating key products for clients. ...

    Taguig, National Capital Region ₱900,000 - ₱1,800,000 (PHP) per year

    16 hours ago

  • Job at company

    Site Reliability Engineer

    Registered members only

    This role involves working as a Site Reliability Engineer, maintaining production systems with technical support responsibility. · Provide technical support, swiftly diagnosing and resolving production issues. · ...

    Taguig, National Capital Region

    1 month ago

  • Job at company

    Site Reliability Engineer

    Registered members only

    We are urgently hiring Site Reliability Engineers. Hybrid, BGC. Up to 155K gross monthly. · The role: Implement and maintain observability platforms such as Datadog. Proactive monitoring of production and other environments to ensure stability, availability, security, and integrit ...

    Taguig, NCR, Philippines

    1 week ago

  • Job at company

    Site Reliability Engineer

    Registered members only

    Provide technical guidance to application teams on MQ Encryption in Transit. · Monitor, maintain, and troubleshoot IBM MQ & Kafka clusters for optimal performance. · Develop automation scripts using Ansible, Python & shell scripts. · Implement monitoring tools (Prometheus, Grafana) t ...

    Taguig

    3 weeks ago

  • Job at company

    Site Reliability Engineer

    Philtech Inc.

    About the Role · We are seeking a highly skilled and motivated Site Reliability Engineer (SRE) with a strong focus on front-end application performance and reliability. In this role, you will ensure the scalability, availability, and responsiveness of our web and mobile user-faci ...

    Taguig ₱900,000 - ₱1,800,000 (PHP) per year

    1 day ago

  • Job at company

    Site Reliability Engineer

    Registered members only

    This role is for an individual contributor who will play a critical role in maintaining the health and reliability of production systems. · Provide technical support and swiftly diagnose and resolve production issues to minimize downtime and ensure seamless operations. · ...

    Taguig

    1 month ago

  • Job at company

    Application Reliability Engineer

    Registered members only

    As an Application Reliability Engineer, you are responsible for ensuring the stability, performance, and resilience of the bank's core and digital applications. · You will serve as the second-line technical expert for production systems, resolving complex incidents, preventing recurrin ...

    Taguig

    1 week ago

  • Job at company

    Site Reliability Engineer

    Registered members only

    We're looking for an experienced Site Reliability Engineer to develop, implement, optimize, and maintain our platform. · You will be responsible for deploying and debugging cloud stacks, · educating teams on new cloud initiatives, · and ensuring the security of the cloud infrastructur ...

    Taguig, Metro Manila

    1 month ago

  • Job at company

    Site Reliability Engineer

    Registered members only

    We are looking for a Senior Site Reliability Engineer for a client in BGC to ensure production systems are always performing optimally and efficiently. · ...

    Taguig City, NCR, Philippines ₱900,000 - ₱1,800,000 (PHP) per year

    5 days ago

  • Job at company

    Site Reliability Engineer

    Registered members only

    Ensure all tickets are updated and handled based on set KPIs and SLAs. · Manage monitoring, alerting, and logging tools to ensure system health and service uptime. · ...

    Taguig

    1 month ago

  • Job at company

    Site Reliability Engineer

    Registered members only

    We are a pro sports team. We work together seamlessly, passing the ball of innovation and collaboration to score success as one. · You will manage Amazon Web Services (AWS) infrastructure with a focus on security, high availability, and cost using an Infrastructure-as-Code met ...

    Taguig

    2 weeks ago

  • Job at company

    Site Reliability Engineer

    Registered members only

    We are looking for a Site Reliability Engineer to join our team. The ideal candidate will have at least 3 years of hands-on experience as a Site Reliability Engineer, good knowledge of Azure foundation components or Google Cloud Platform, and strong troubleshooting and performanc ...

    Taguig

    1 week ago

  • Job at company

    Site Reliability Engineer

    Registered members only

    The company is looking for a Site Reliability Engineer to work on a shifting schedule as needed. The ideal candidate should have experience with monitoring tools such as Grafana or AppDynamics and be familiar with ServiceNow, Confluent, Akamai, or Adobe services. · ...

    Taguig

    3 weeks ago

  • Job at company

    Site Reliability Engineer/DevOps Engineer

    Registered members only

    We seek dedicated Site Reliability Engineers (SREs) to maintain our high service standards. Our services are designed for global scalability, continuous availability, and seamless operation. · The SRE role involves managing and improving our Azure Cloud infrastructure, ensuring our ...

    Taguig, National Capital Region

    1 month ago

  • Job at company

    Engineer, Site Reliability Engineering

    Registered members only

    Role Profile · We are looking for a Site Reliability Engineer (SRE) to join a product suite within the Risk Intelligence business. This role is based in Manila, Philippines, and will be responsible for executing reliability engineering practices, supporting platform operations, a ...

    Taguig Full-time

    1 day ago

  • Job at company

    Head of Site Reliability Engineering

    Registered members only

    The Head of Site Reliability Engineering is a hybrid technical‑leadership role. · Own reliability of production services running on AWS while steering the roadmap for platform resilience and building out the SRE team. · ...

    Taguig, National Capital Region

    6 days ago
