Job Details
Location: Iselin, New Jersey, United States
Salary: Not specified
Company: ApnaWorker
We are hiring a Sr. Observability Engineer to lead the design, modernization, and operation of enterprise-scale observability and reliability platforms supporting mission-critical environments. Key skills include Prometheus (Advanced PromQL, Cardinality Management, Optimization), Grafana, Grafana Agent, Grafana Alloy, Grafana Mimir, Observability Platform Architecture, Monitoring & Alerting Engineering, Automation Development, Exporter Management (Node Exporter, Process Exporter, MQ Exporters), Dynatrace Integrations, OpenShift (OCP), Linux & Windows Infrastructure Monitoring, Hybrid Cloud & Enterprise Monitoring, BCP/DR Monitoring Architecture, Event-Driven Automation & Self-Service Platforms. Responsibilities include leading modernization initiatives from legacy monitoring platforms to modern cloud-native observability stacks, architecting scalable monitoring solutions across large-scale Linux, Windows, Mainframe, and Cloud environments, building resilient monitoring pipelines and observability platforms, developing automation for onboarding, exporter lifecycle management, target discovery, and remediation workflows, and troubleshooting metric ingestion, scrape failures, remote_write backpressure, and platform performance issues.