Sr. Network Observability Engineer SME Job at Netpace Inc, Pleasanton, CA

VU9Xb0NENlBJNGc0cDRINStVZXVjU2o5NEE9PQ==
  • Netpace Inc
  • Pleasanton, CA

Job Description

Sr. Network Observability Engineer SME ( Azure/GCP/OCI, Grafana, MSFT/GCP/OCI tooling)

THIS JOB IS OPEN FOR FULLTIME/SALARIED CANDIDATES AS WELL

Key Responsibilities:

· Design and deploy scalable network observability frameworks for multi/hybrid-cloud environments (Azure, GCP, OCI) using Grafana, Prometheus, OpenTelemetry, and cloud-native tools.

· Implement custom dashboards, alerts, and log analytics for network performance metrics (latency, packet drops, BGP routing health, throughput) and security telemetry (firewall logs, flow logs, IDS/IPS).

· Integrate observability tools with cloud networking services:

o Azure: Monitor ExpressRoute/VNet Gateway metrics, NSG Flow Logs, Traffic Analytics.

o GCP: Stackdriver/Operations Suite for VPC flow logs, Firewall Insights, Network Intelligence Center.

o OCI: VCN Flow Logs, Network Visualizer, Service Connector Hub.

· Automate observability pipelines using Terraform, Python, or PowerShell to ingest, correlate, and visualize telemetry data.

· Troubleshoot network anomalies by analyzing packet captures (PCAP), NetFlow/sFlow, and distributed tracing data.

· Collaborate with SRE and DevOps teams to reduce MTTR via AI/ML-driven anomaly detection (e.g., Azure Sentinel, GCP Chronicle, OCI AI Anomaly Detection).

· Optimize costs by right-sizing monitoring tools and eliminating redundant telemetry data.

Required Skills & Experience:

· 8+ years in network observability, monitoring, or cloud operations, with expertise in Azure/GCP/OCI.

· Hands-on experience with:

o Grafana (dashboarding, Loki for logs, Mimir for metrics).

o Cloud-native tools: Azure Monitor, GCP Cloud Logging/Monitoring, OCI Observability & Management.

o Telemetry protocols: SNMP, gNMI, NetFlow/IPFIX, eBPF.

· Network diagnostics: Wireshark, tcpdump, traceroute, BGP route analytics.

· Automation/scripting: Python, Terraform, or equivalent IaC tools.

· Certifications (Preferred):

o Azure: AZ-120 (Monitoring), AZ-700 (Networking).

o GCP: Professional Cloud Network Engineer.

o OCI: Oracle Cloud Infrastructure Certified Architect.

o Grafana: Grafana Certified Associate (or higher).

Nice-to-Have:

· Experience with AIOps platforms (Dynatrace, New Relic, Splunk ITSI).

· Knowledge of Kubernetes networking observability (Calico, Cilium, Istio).

· Familiarity with compliance frameworks (ISO 27001, NIST CSF) for audit logging.

Job Tags

Full time,

Similar Jobs

Loud Solutions

Chief Business Officer Job at Loud Solutions

 ...Chief Business Officer (CBO) Future CEO Location: Austin, TX Type: Co-Founder CBO | Full-Time | Equity + Competitive Compensation Reports to: Founding CEO Stage: Series A (Targeting Series-B, $20M+ Raise) The Opportunity Were a Series A Robotics... 

KCD

Sr. Experiential Designer Job at KCD

Position Overview: The Sr. Experiential Designer will play a key role in leading experiential design projects, ensuring excellence in conceptualization, execution, and implementation. This individual will oversee and contribute to multiple concurrent projects, collaborating...

New Heritage Recruiters, Inc.

RN Care Coordinator Job at New Heritage Recruiters, Inc.

 ...Job Summary We are seeking a dedicated and compassionate RN Care Coordinator to join our team. SOLAR is a DHS funded Recuperative Care Program serving individuals experiencing homelessness that also have acute medical issues. The RN Care Coordinator will take a leadership... 

Davalyn Corporation

Computer Numerical Control Machinist Job at Davalyn Corporation

 ...position with comprehensive corporate benefits and opportunities for advancement within a highly skilled team environment. Position Summary:...  ...and perform first article inspections on high-precision parts Troubleshoot setup issues and recommend process improvements... 

Tatum by Randstad

Trust Officer Job at Tatum by Randstad

 ...join a small firm in Malvern as a Trust Administrator / Trust Officer. After a 60 - 90 day onboarding process, this role can be...  ...include Estate Administration - including preparing and filing probate documents Trust Administration - including preparing...