At IBM Software, we transform client challenges into solutions. Building the world’s leading AI-powered, cloud-native products that shape the future of business and society. Our legacy of innovation creates endless opportunities for IBMers to learn, grow, and make an impact on a global scale. Working in Software means joining a team fueled by curiosity and collaboration. You’ll work with diverse technologies, partners, and industries to design, develop, and deliver solutions that power digital transformation. With a culture that values innovation, growth, and continuous learning, IBM Software places you at the heart of IBM’s product and technology landscape. Here, you’ll have the tools and opportunities to advance your career while creating software that changes the world.
We are looking for engineers with a passion for building and operating large-scale distributed systems in the cloud. This role provides an opportunity to work on complex infrastructure challenges across multiple domains, including storage, networking, compute, and security.
What You Will Do:
- Design, develop, and operate large-scale, high-performance infrastructure that powers Confluent Cloud.
- Build foundational software to improve reliability, scalability, and efficiency across cloud environments.
- Work on distributed systems challenges such as consensus algorithms, failover strategies, and resource allocation.
- Collaborate with teams across Confluent to optimize and enhance infrastructure for real-time data streaming use cases.
- Troubleshoot and improve system reliability, observability, and performance across multiple cloud providers (AWS, Azure, GCP).
- Contribute to open-source projects and leverage open-source technologies to drive business impact.
- 4+ years of relevant experience
- Strong fundamentals in distributed systems, cloud infrastructure, and networking.
- Experience in building and operating large-scale, high-availability systems.
- Deep understanding of cloud platforms (AWS, Azure, or GCP) and their services.
- Solid grasp of systems operations (disk, networking, OS-level optimizations).
- Proficiency in Java, Scala, C++, Go, or other statically typed languages.
- A self-starter with strong problem-solving skills and the ability to work in a fast-paced environment.
- BS, MS, or PhD in computer science or a related field, or equivalent work experience
- Experience in one or more of the following domains: storage, compute orchestration, networking, security, or performance engineering.
- Familiarity with Kubernetes, service meshes, and cloud-native architectures.
- Contributions to open-source infrastructure projects.