Lead Observability Engineer - Elasticsearch, APM, Azure, ServiceNow job opportunity at Blue Yonder, Inc.



Date posted: 30+ days ago
Company: Blue Yonder, Inc.
Experience: 10 years
Pattern: full-time

Degree: General
Location: Hyderabad, India

Scope
- Receives work assignments through the ticketing system or from senior leadership.
- Provides Tier-4 engineering expertise, platform ownership, and technical leadership for all observability capabilities across hybrid cloud, on-premises, and SaaS environments.
- Leads the design, architecture, and maturity of the enterprise observability ecosystem with a primary focus on the Elastic Observability Platform, ensuring end-to-end visibility for infrastructure, cloud services, networks, and business-critical applications.
- Drives the enterprise strategy for logging, metrics, traces, synthetics, and alerting, including governance, standardization, and performance optimization.
- Partners closely with Cloud, Infrastructure, Security, Enterprise Applications, and SRE leadership to define observability frameworks, drive operational transparency, and strengthen service reliability.
- Ensures observability platforms meet enterprise requirements for security, performance, availability, compliance, and scalability.
- Oversees monitoring implementations for key SaaS applications including Workday, Salesforce, ServiceNow, and Microsoft 365, ensuring proactive issue detection and excellent user experience.
- Provides guidance, mentorship, and direction to observability engineers, SREs, and operational teams to uplift monitoring maturity and promote best-practice adoption.
- Acts as a strategic advisor during major incidents by providing real-time diagnostics and correlation insights and by driving RCA improvements.
- Required to provide on-call support during off-hours on weekdays, weekends, and holidays on a rotating basis.

Our Current Technical Environment:
- Tools & Platforms: Elastic Stack (Elasticsearch, Kibana, APM, Logstash, Beats/Elastic Agent), ServiceNow, Azure Monitor, API-driven integrations, SIEM/SOAR systems.
- Cloud Platforms: Azure, VMware, Kubernetes/Container platforms, Linux and Windows servers, enterprise network infrastructure.
- SaaS Applications: Workday, Salesforce, ServiceNow, Microsoft 365 (Teams, Exchange, SharePoint, OneDrive), commercial SaaS telemetry sources.
- Programming & Scripting: PowerShell, Python, Bash, API automation.
- Architecture & Engineering: Azure ARM templates, Terraform, Ansible, hybrid cloud architecture, observability governance, ILM, ML-based anomaly detection, synthetics.

What You’ll Do:
- Own and lead the architecture and roadmap for the Elastic Observability platform across the enterprise.
- Define and enforce governance standards for logs, metrics, traces, data retention, and alerting quality.
- Lead platform scaling initiatives, including cluster sizing, performance tuning, ILM tiering, and cost optimization.
- Architect, deploy, and maintain advanced Elastic Observability solutions across hybrid environments.
- Design executive-grade dashboards, correlation views, analytics boards, anomaly detection, and ML-based detections.
- Optimize ingestion pipelines, index structures, data flow, and search/query performance at scale.
- Integrate Elastic Observability with Azure, VMware, Kubernetes, network platforms, ServiceNow, and API sources.
- Define and lead enterprise monitoring standards across logs, metrics, traces, and synthetics.
- Drive cloud and on-prem monitoring maturity by improving instrumentation, coverage, and telemetry consistency.
- Establish alert engineering frameworks that reduce noise and improve detection fidelity.
- Lead design of synthetic transactions, user-experience monitoring, and availability baselines for SaaS apps.
- Ensure proactive monitoring of Workday, Salesforce, ServiceNow, and Microsoft 365 integrations.
- Serve as the observability lead during P1/P0 incidents by delivering real-time visibility and correlation insights.
- Drive MTTR/MTTD improvements through enhanced observability patterns and RCA alignment.
- Build and maintain operational runbooks, dashboards, and standard operating procedures.
- Work with Engineering, Cloud, Infrastructure, Applications, and Security leadership to improve observability adoption.
- Act as the senior technical advisor in major IT projects, shaping observability-by-design principles.
- Mentor and guide observability engineers, analysts, and SRE teams to uplift operational capabilities.
- Ensure all monitoring pipelines follow enterprise security, compliance, retention, and logging policies.
- Validate that new systems adhere to observability onboarding requirements and telemetry standards.

What We Are Looking For:
- Bachelor’s degree in Computer Science, Engineering, MIS, or equivalent experience.
- 7–10+ years of experience in observability engineering, SRE, monitoring platform ownership, or infrastructure operations.
- Deep, hands-on expertise with Elastic Stack (Elasticsearch, Kibana, Logstash, Beats/Elastic Agent, APM).
- Strong architectural knowledge of cloud (Azure/AWS) and hybrid observability patterns.
- Experience leading observability for infrastructure, cloud platforms, network systems, Kubernetes, and Microsoft 365.
- Proven experience designing monitoring for SaaS platforms (Workday, Salesforce, ServiceNow).
- Advanced scripting/automation experience (Python, PowerShell, Bash).
- Strong knowledge of API integrations, data pipelines, and log-flow engineering.
- Experience leading incident diagnostics and delivering visibility for RCA and operational improvement.
- Strong analytical, architectural, and troubleshooting skills with a platform-owner mindset.
- Demonstrated ability to influence cross-functional teams and drive enterprise observability adoption.
- Familiarity with Grafana, Prometheus, Splunk, AppDynamics, Dynatrace (preferred).
- Knowledge of Terraform, Ansible, Kubernetes, and infrastructure-as-code tools (preferred).
- Knowledge of ITIL processes, SRE principles, and operational governance.
- Excellent communication, leadership, and stakeholder-management skills.
- Empowering partner IT teams, such as Infrastructure and Apps, to self-serve by creating their own monitors, all within the unified guidance and framework established by Observability.

Our Values
If you want to know the heart of a company, take a look at their values. Ours unite us. They are what drive our success, and the success of our customers. Does your heart beat like ours? Find out here: Core Values

All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability or protected veteran status.
