Lead, Site Reliability Engineer (Hybrid-Flexible Options)

Details of the offer

Role Overview As an Application Site Reliability Engineer (SRE), you will play a critical role in ensuring the stability, scalability, and reliability of our products and services. You will work closely with cross-functional teams to design, develop, and deploy solutions that enhance the performance and uptime of our applications.

The Application SRE is part of the Enterprise Platform (EP) group and is responsible for supporting and running our standard platforms efficiently and effectively. You will be expected to collaborate closely with other functions within EP (DevOps/Cloud Platforms, Quality Engineering and Developer Experience) to provide robust, integrated and best-in-class solutions for our product engineering teams.
Responsibilities Implementing Site Reliability Engineering best practices, including error budgeting, service level objectives (SLOs), and monitoring and alerting systemsBuilding automation tools and processes to improve the efficiency and reliability of running our products and standard platformsPerforming capacity planning and system design to ensure that our systems can handle increasing traffic and loadTroubleshooting complex technical issues and providing root cause analysis to prevent future incidentsParticipating in incident calls to respond to system outages and emergenciesCollaborating with software developers to define and implement reliability requirements for new products/applications/servicesConducting post-mortem analyses to identify opportunities for improvement and prevent recurring issuesUsing data-based decision making to be proactive in the prevention of potential incidents and problemsSupporting product development teams in the implementation of tools, processes, and practices to improve stability, reliability, and extensibility of their productsCollaborate across the EP function to ensure that standard platforms are best-in-classDrive standard implementation of NFRs in new product development and own the "deep-dive" process to improve problematic applicationOverall management and governance of vulnerabilities and End-of-life within our productsYour profile Bachelor's degree in Computer Science, Information Technology, Software Engineering, or a related field.5+ years of experience supporting production applications (ie SRE and/or DevOps roles)Practical understanding of implementing SLOs and SLIsKnowledge of Windows and/or Linux Systems administration and networking fundamentalsExperience in implementing Observability and Alerting tools (eg Datadog, Splunk)Ability to automate application operations using tools such as Python, Java, Shell Scripting, Terraform, Chef, Puppet, SQL, AnsibleKnowledge of AWSExperience in supporting middleware such as databases, webservers, MQ and KafkaFamiliarity with containerization technologies, such as Docker and KubernetesExcellent problem-solving skills and attention to detailAbility to work well under pressure and prioritize tasks in a fast-paced environment.Fluency in English is essential.Ability to collaborate closely with others.Continual Improvement mindsetLeadership Responsibilities As a senior member of the team, you will be responsible for:
Overseeing initiatives and deliverables across the teamTechnical and design decisions made by the team.Coaching and mentoring members of the teamKeeping your finger on the pulse: identifying and developing new ideas and initiativesActing as an advocate for SRE across Enterprise Platform team and wider Broadridge communityReviewing work and improving SRE processesCollaborating with other teams outside Enterprise PlatformsContributing to the strategic direction of the functionWhat Broadridge Offers An opportunity to be part of a global leader in fintech innovation.A culture of inclusivity, collaboration, and professional development.Competitive salary, comprehensive benefits, and a commitment to work-life balance.Access to state-of-the-art technologies and tools.Continuous learning opportunities through professional development programs and educational assistance.Broadridge is proud to be an equal opportunity employer. We celebrate diversity and are dedicated to creating an inclusive environment for all employees. We encourage applications from individuals of all backgrounds.


#J-18808-Ljbffr


Nominal Salary: To be agreed

Source: Grabsjobs_Co

Requirements

Jr. Technical Service (Ts0) (Caloocan)

Jr. Technical Service (TS0) POWERKING INDUSTRIES CORPORATION Responsibilities: - Ensuring Service Job orders are done on time and with quality done by Techni...


Dempsey - National Capital Region

Published a month ago

Pega Support Engineer - Work-From-Home

We are hiring a Tier 1 support engineer to join our Managed Services team. In this role you will be supporting_ daily with a variety of Pega support issues. ...


Gratitude Philippines - National Capital Region

Published a month ago

Sap Fico Consultant

Position Overview: We are seeking a skilled SAP FICO (Finance and Controlling) Consultant with a minimum of 3 years of experience to join our Application Man...


Geco Philippines - National Capital Region

Published a month ago

Java Spring Boot Developer

We are seeking a skilled Java Spring Boot Developer for our locations in Manila and Cebu. The ideal candidate should have a minimum of 2 years of experience ...


Geco Philippines - National Capital Region

Published a month ago

Built at: 2024-11-23T14:54:08.805Z