
Equinix
Global leader in data center and interconnection services, enabling digital transformation.
Staff Engineer, Product Software
Lead technical authority for global telemetry, building high‑throughput observability systems.
Job Highlights
About the Role
The Staff Engineer for NPE Observability leads the design and implementation of Equinix’s global telemetry fabric, bridging high‑scale distributed software with global network hardware. This role owns the technical integrity of streaming pipelines, ensuring telemetry from the global fleet is ingested, normalized, and processed with sub‑second latency, and drives technical excellence within the Network Platform Engineering group. Joining the NPE Observability team means tackling industry‑scale data challenges, operating a high‑performance engine that processes massive real‑time telemetry streams. The engineer combines deep service‑provider networking knowledge with modern big‑data patterns using tools such as Flink, Kafka, and ClickHouse to deliver carrier‑grade distributed systems where every millisecond and every data point matters. • Translate architectural blueprints into automated, scalable infrastructure for global telemetry. • Drive roadmap milestones aligning code with long‑term NPE vision. • Lead design of real‑time streaming telemetry and high‑performance databases. • Convert product requirements into technical specs and manage stakeholder relationships. • Define SLA metrics and ensure ITIL‑compliant observability platform. • Lead incident response, performing root‑cause analysis for cross‑platform outages. • Research emerging technologies and model‑driven telemetry to keep the platform cutting‑edge. • Prove concepts and define change‑management practices influencing broader business strategy.
Key Responsibilities
- ▸infrastructure automation
- ▸real‑time streaming
- ▸high‑performance db
- ▸sla metrics
- ▸incident response
- ▸tech research
What You Bring
• 5+ years in software engineering/distributed systems and 3+ years in large‑scale network engineering. • Bachelor’s degree in Computer Science, Computer Engineering, or a related field. • Mastery of SOLID, Clean Architecture; proficient in Java, Go, and SQL. • Expert in Model‑Driven Telemetry, gNMI over SNMP, and service‑provider protocols (BGP, IS‑IS, MPLS, QoS, EVPN, VXLAN). • Hands‑on experience with Apache Flink, Kafka, ClickHouse, Prometheus, Thanos, and Kubernetes. • Advanced Python or Go skills; built custom collectors or API integrations for network OS (Junos, Nokia, Arista).
Requirements
- ▸bachelor's
- ▸java
- ▸go
- ▸kubernetes
- ▸apache flink
- ▸bgp
Benefits
The position is based in the United States, with target pay ranges of $118,000‑$176,000 in Dallas and $142,000‑$212,000 in Redwood City, plus bonus, equity, and benefits. Compensation reflects role, level, location, and individual factors such as skills, experience, and education. • Employee Assistance Program for personal and professional support. • Comprehensive health, life, disability, and voluntary insurance plans. • Retirement plan with employer contributions. • Accrued paid time off and paid holidays. • Competitive, inclusive, sustainable benefits package.
Work Environment
Hybrid