Site Reliability Engineer, MCN

NeuralFabric

NeuralFabric

Software Engineering
Bengaluru, Karnataka, India
Posted on Jan 28, 2026

Meet the Team
Join the founding SRE team responsible for the reliability of Cisco’s next-generation multi-cloud networking (MCN) platform. We are a high-impact group of engineers dedicated to ensuring seamless operations across global cloud regions. You will work closely with senior SREs in a collaborative environment that prioritizes mentorship, continuous learning, and the implementation of cutting-edge AI-assisted operations.

Your Impact
As an SRE on our founding team, you will ensure Cisco’s multi-cloud networking platform operates reliably for our Early 2026 Alpha launch. You will build monitoring systems, automate operations, and manage infrastructure across AWS, Azure, and GCP, handling billions of daily transactions across 9+ global regions. This role offers the unique opportunity to shape the reliability of a foundational platform from day one, ensuring 99.9%+ uptime while utilizing AI-native SRE practices.

  • Create and maintain dashboards, alerts, and monitoring using Prometheus, Grafana, and ELK stack.
  • Write Python/Go scripts to automate deployment validation, health checks, and routine maintenance tasks.
  • Participate in a 24x7 on-call rotation, assist with incident response, and document troubleshooting procedures.
  • Operate Kubernetes infrastructure across AWS (EKS), Azure (AKS), and GCP (GKE) regions.
  • Utilize AI tools like Cursor and Codex to accelerate automation development and build MCP servers for operational intelligence.
  • Analyze system performance, identify bottlenecks, and implement optimizations.

Minimum Qualifications

  • 3-5 years of experience in SRE, DevOps, or infrastructure engineering.
  • Proficiency in programming with Python or Go (or a strong background in C++, Java, or Ruby).
  • Experience with Linux fundamentals, including command line, system administration, and basic networking.
  • Hands-on experience with monitoring tools such as Prometheus, Grafana, or ELK stack.
  • Experience with Cloud platforms (AWS, Azure, or GCP) and Infrastructure as Code (Terraform or Ansible).

Preferred Qualifications

  • Knowledge of networking basics (TCP/IP, DNS, load balancing).
  • Experience with container orchestration using Docker or Kubernetes.
  • Familiarity with CI/CD pipelines (GitHub Actions, Jenkins).
  • Understanding of secrets management concepts.
  • Eagerness to learn distributed systems, VPP (Vector Packet Processing), and multi-cloud operations.

Why Cisco?

At Cisco, we’re revolutionizing how data and infrastructure connect and protect organizations in the AI era – and beyond. We’ve been innovating fearlessly for 40 years to create solutions that power how humans and technology work together across the physical and digital worlds. These solutions provide customers with unparalleled security, visibility, and insights across the entire digital footprint.

Fueled by the depth and breadth of our technology, we experiment and create meaningful solutions. Add to that our worldwide network of doers and experts, and you’ll see that the opportunities to grow and build are limitless. We work as a team, collaborating with empathy to make really big things happen on a global scale. Because our solutions are everywhere, our impact is everywhere.

We are Cisco, and our power starts with you.