System Reliability Engineer, Consultant job opportunity at AIA Group.



Date: More than 30 days ago
AIA Group System Reliability Engineer, Consultant
Experience: Highly Experienced
Pattern: full-time

Consultant

Degree: OND
Location: Kuala Lumpur, MY-AIA Malaysia, Malaysia

At AIA we’ve started an exciting movement to create a healthier, more sustainable future for everyone. As pioneering innovators for over 100 years, we’re now transforming our organisation to be faster, simpler and more connected, because we want to be even better equipped to develop digital solutions and experiences that help more people live Healthier, Longer, Better Lives. To get there, we need people with tech/digital/analytics expertise and passion to help develop positive, sustainable change through digitally enhanced experiences that will impact the lives of millions of people and create a healthier future for everyone. If you believe in developing a better tomorrow, read on.

About the Role

This role ensures the reliability, scalability, and performance of enterprise systems and services by applying software engineering principles to operations. The System / Site Reliability Engineer will collaborate with development and operations teams to build robust automation, monitor system health, respond to incidents, and continuously improve service availability and efficiency. This role is critical in bridging the gap between software development and IT operations, fostering a culture of resilience, observability, and proactive problem-solving.

Job Description

Ensure System Reliability and Availability
- Oversee application performance and report any deviations and issues.
- Collaborate with application engineers and developers in root cause identification.

Incident Management and Root Cause Analysis
- Participate in incident response efforts for production outages as Subject Matter Advisor.
- Provide insights from monitoring and in-depth code/database review.
- Assist with Application Operations post-mortem reviews.

Automation and Tooling
- Automate operational tasks such as monitoring and recovery.
- Develop scripts and tools to reduce manual toil and improve efficiency.

Monitoring and Observability
- Implement robust telemetry systems to monitor application health, latency, and error rates.
- Manage the Dynatrace platform and its integration with all application services.
- Assist the Application team in dashboard design and setup.

Security and Compliance
- Collaborate with Security teams to ensure systems meet regulatory and security standards (e.g., PCI-DSS, GDPR).
- Implement access controls, encryption, and audit mechanisms as and where required by the scope of the SRE team.

Capacity Planning and Performance Optimization
- Assist in analyzing usage trends to forecast demand and scale infrastructure accordingly.
- Participate in optimizing resource utilization to balance cost and performance.

Work closely with development, QA, and infrastructure teams to embed reliability into the SDLC. Promote SRE principles across teams to foster a culture of resilience and accountability. Maintain clear operational documentation, runbooks, and architecture diagrams.

Job Requirements
- Bachelor’s degree in Computer Science, Software Engineering, Information Technology, or a related field.
- 3–5 years of experience in Site Reliability Engineering, DevOps, or Software Engineering roles.
- Prior experience supporting front-end applications in production environments, preferably in financial services or regulated industries.
- Front-end performance monitoring: ability to instrument front-end code for custom metrics and traces.
- Experience with Real User Monitoring (RUM), Synthetic Monitoring, and Application Performance Monitoring (APM) tools (e.g., New Relic, Dynatrace, Datadog).
- Proficiency in setting up dashboards and alerts using tools like Dynatrace, Grafana, Prometheus, Elastic Stack, or Splunk.
- Familiarity with OpenTelemetry standards for distributed tracing.
- Scripting skills in Python, Bash, or JavaScript for automation and tooling.
- Experience with CI/CD pipelines (e.g., GitHub Flow).
- Hands-on experience with cloud platforms (AWS, Azure).
- Familiarity with containerization (Docker) and orchestration (Kubernetes).
- Understanding of secure coding practices for front-end applications.
- Awareness of financial compliance standards (e.g., PCI-DSS).

Build a career with us as we help our customers and the community live Healthier, Longer, Better Lives.

You must provide all requested information, including Personal Data, to be considered for this career opportunity. Failure to provide such information may influence the processing and outcome of your application. You are responsible for ensuring that the information you submit is accurate and up-to-date.
