Senior Site Reliability Engineer, MCN

NeuralFabric

NeuralFabric

Software Engineering
Bengaluru, Karnataka, India
Posted on Jan 28, 2026

Meet the Team

Join the founding SRE team building Cisco’s next-generation multi-cloud networking (MCN) platform from the ground up. We are a high-impact, 5-person global team (spanning the US and India) dedicated to ensuring the reliability of a platform designed to handle billions of daily transactions across 9+ global regions. You will collaborate closely with Backend, DevOps, and QA teams to pioneer the future of multi-cloud connectivity.

Your Impact

As a Senior SRE, you will be instrumental in the MCN Alpha launch scheduled for Early 2026. You will design observability systems, optimize performance, and ensure the platform operates with 99.9%+ reliability across AWS, Azure, and GCP. This role offers the exciting opportunity to build self-healing infrastructure and AI-powered operations for a brand-new, large-scale networking platform.

  • Build Reliable Infrastructure: Design and operate highly available infrastructure across AWS (EKS), Azure (AKS), and GCP (GKE) regions.
  • Observability & Monitoring: Architect comprehensive monitoring, alerting, and dashboards using Prometheus, Thanos, and Grafana.
  • AI-Powered SRE: Build MCP servers and AI agents to detect anomalies, automate troubleshooting, and enable self-healing.
  • Automation Excellence: Write Python/Go code to automate operations, reduce manual toil, and improve system efficiency.
  • Incident Response: Lead incident response, conduct root cause analysis (RCA), and participate in a 24x7 on-call rotation to ensure 24/7 uptime.

Minimum Qualifications

  • 5-10 years of experience in SRE, DevOps, or infrastructure engineering with a focus on reliability at scale.
  • Proficiency in programming with Python or Go.
  • Expertise in Linux internals (networking, filesystems, memory management) and container orchestration (Kubernetes/EKS/AKS/GKE).
  • Experience with Infrastructure as Code (IaC) tools such as Terraform or Ansible and public cloud providers (AWS, Azure, or GCP).
  • Willingness to participate in a 24x7 on-call rotation.

Preferred Qualifications

  • Strong understanding of networking fundamentals (TCP/IP stack, BGP, IPsec, VPN).
  • Experience managing multi-cloud operations across AWS, Azure, and GCP simultaneously.
  • Knowledge of high-performance networking or VPP (Vector Packet Processing).
  • Experience with secrets management (IAM roles, RBAC, service accounts).
  • Familiarity with building AI-native SRE tools or MCP server development.

Why Cisco?

At Cisco, we’re revolutionizing how data and infrastructure connect and protect organizations in the AI era – and beyond. We’ve been innovating fearlessly for 40 years to create solutions that power how humans and technology work together across the physical and digital worlds. These solutions provide customers with unparalleled security, visibility, and insights across the entire digital footprint.

Fueled by the depth and breadth of our technology, we experiment and create meaningful solutions. Add to that our worldwide network of doers and experts, and you’ll see that the opportunities to grow and build are limitless. We work as a team, collaborating with empathy to make really big things happen on a global scale. Because our solutions are everywhere, our impact is everywhere.

We are Cisco, and our power starts with you.