Intersight Site Reliability Engineer | Cloud |CI/CD | Linux | 10-14 Years |Bangalore
NeuralFabric
Meet the Team
As a Site Reliability Engineering (SRE) Technical Leader on the Intersight Team you will play a key role in ensuring the reliability, scalability, and security of our cloud platforms. The broader team is composed of experienced engineers who value innovation and accountability. You will represent the Intersight SRE team, working in a dynamic environment, tackling challenges with creativity, providing technical leadership in defining and delivering on the team's technical roadmap. You will collaborate with cross-functional teams, including software development, product management, customers and security teams, to design, influence, build, and maintain SaaS systems operating at multi-region scale. Your work will directly impact the success of our initiatives by ensuring the underlying platform infrastructure is robust, efficient, and aligned with operational excellence.
Your Impact
We are seeking an experienced Engineer to encourage and represent a high-performing team dedicated to ensuring the reliability and scalability of cloud services, with a focus on a rapidly growing next-generation project. The ideal candidate will have hands-on SRE or systems/network administration experience, with familiarity in AWS. This role involves close collaboration across product engineering, service engineering, and SRE teams in a high-trust, well-coordinated environment.
* Design, build, and optimize cloud and data infrastructure to ensure the high availability, reliability, and scalability of systems to meet customer needs, while implementing SRE principles such as monitoring, alerting, error budgets, and fault analysis.
* Collaborate closely with cross-functional teams, including customers, development, product management, and security teams, to create secure, scalable solutions and enhance operational efficiency through automation.
* Troubleshoot complex technical problems in production environments, perform root cause analyses, and contribute to continuous improvement efforts through postmortem reviews and proactive performance optimization.
* Lead the architectural vision and shape the team’s technical strategy and roadmap, balancing immediate needs with long-term goals, driving innovation, and influencing the technical direction.
* Serve as a mentor and technical leader, guiding teams and fostering a culture of engineering and operational excellence by sharing your deep knowledge and experience.
* Engage with customers and stakeholders to understand use cases and feedback, translating them into actionable insights and effectively influencing stakeholders at all levels.
* Utilize your strong programming skills to integrate software and systems engineering, building core data platform capabilities and automation to meet enterprise customer needs and roadmap objectives.
* Develop strategic roadmaps, processes, plans, and infrastructure to efficiently deploy new software components at an enterprise scale while enforcing engineering best practices.
Minimum Qualifications:
* Bachelor’s Degree in Computer Science with 8-14+ years of related experience
* Extensive practical experience in several of the following areas: Linux, Public Cloud, Docker, Kubernetes, Ansible, Networking, Security, Systems Administration, Software Development, and CI/CD
* Ability to design and implement scalable and well tested solutions, with focus on streamlining operations.
* Strong hands-on experience in cloud, preferably AWS.
* Strong Infrastructure as a Code skills, ideally with Terraform and EKS or Kubernetes.
* Experience with observability tools using Prometheus (Alertmanager), Grafana, Thanos, CloudWatch, OpenTelemetry, and the ELK
* Ability to write high quality code in Python, Go, or equivalent programming languages
* Good understanding of Unix/Linux systems, the kernel, system libraries, file systems, and client-server protocols.
Preferred Qualifications:
* Experience building/managing a cloud-based data platform, automation and orchestration of their infrastructure and maintaining high availability, system reliability at scale.
* Ability to handle multiple competing priorities in a fast-paced environment
* Have experience with architecting software and infrastructure at scale with a sense of ownership and accountability.
* Strong personal interest in learning, researching, and creating new technologies with high customer impact
* Superior verbal and written communication and presentation skills, ability to convey complex technical information to non-experts
* Excellent collaboration skills and the ability to bring out the best in a technically diverse team
* Certifications: CKA (Certified Kubernetes Administrator), CKAD (Certified Kubernetes Application Developer), AWS Certified DevOps Engineer, or equivalent certifications in cloud and security domains.
Why Cisco?
At Cisco, we’re revolutionizing how data and infrastructure connect and protect organizations in the AI era – and beyond. We’ve been innovating fearlessly for 40 years to create solutions that power how humans and technology work together across the physical and digital worlds. These solutions provide customers with unparalleled security, visibility, and insights across the entire digital footprint.
Fueled by the depth and breadth of our technology, we experiment and create meaningful solutions. Add to that our worldwide network of doers and experts, and you’ll see that the opportunities to grow and build are limitless. We work as a team, collaborating with empathy to make really big things happen on a global scale. Because our solutions are everywhere, our impact is everywhere.
We are Cisco, and our power starts with you.