Platform Reliability Engineer (For Pooling) @ Manulife
We’re looking for Platform Reliability Engineer who is passionate about building solutions that makes life better for others, enabling visibility and comprehension of application health and performance; motivated by opportunities to automate processes to improve system reliability and customers' experience; to experiment with new technologies and guide others.The team provides consulting and education on reliability practices to other teams, creates new tools and finds innovative ways to surface operational data and simplify the lives of our fellow colleagues. We are an integral part of the transformation to be a data-driven organization!Position Responsibilities: Support our Product Line Engineering (PLE) organizations to develop resilient and highly scalable applications, working closely with our operations staff to support these applications while maintaining a strong focus on application reliabilityChampion, promote, and deploy the effective use of innovative application monitoring and AIOps-based machine learning toolsFacilitate monitoring and custom instrumentation across our business-critical applications, including the consultation and management of agent-based, agentless, synthetic and scripted monitoring technologiesAddress incidents and problems within the platforms, with rotational accountability for on-call supportCollaborate with platform and software engineers, site reliability engineers, product managers and engineering leadership to uncover difficulties and opportunities to accelerate the delivery of new value through softwarePrototype and build new capabilities to increase the leverage of platform operations and securityDeliver an outstanding user experience to our engineers with a focus on reliabilityRequired Qualifications: A bachelor’s degree in Computer Science, Information Technology, Engineering, or a related field is often preferred.3-5 years’ experience in Observability and MonitoringProficiency in programming languages such as Powershell, Python, Java, Javascript, NodeJS, or Ruby for scripting and automation.Experience with monitoring and observability tools like New Relic, Broadcom CA APM, Dynatrace, and PRTG, Solarwinds, Grafana, ADXFamiliarity AIOps technologies such as MoogsoftProficient in cloud platforms (AWS, Azure, Google Cloud) and their native monitoring tools.Familiarity with containerization and…
Apply To This Job