Senior Software Engineer - Azure Storage
Microsoft
Senior Software Engineer - Azure Storage
Multiple Locations, United States
Save
Overview
Do you love working with data, AI, and large distributed systems? Do you enjoy solving complex technical challenges and delighting customers while pushing the boundaries of cutting-edge technology at immense scale?
Azure Databricks is a Azure service jointly developed and operated by Databricks and Microsoft. It offers the same core Databricks platform but with deep integration into Azure’s ecosystem with a mission to simplify and democratize data and artificial intelligence. From foundational analytics to the bleeding edge of generative AI, our platform is trusted by customers to unlock the value of their data—securely, reliably, and at scale. We're not just building software—we're shaping the future of intelligent systems.
We are looking for a Senior Software Engineer - Azure Storage to join our Azure Databricks team with a specialized focus on improving service resiliency, debugging complex distributed systems, and delivering a world-class experience to customers. This is a unique opportunity to work at the intersection of software engineering, site reliability, and customer engagement. You won’t be just writing code—you’ll be diving into the depths of system performance and reliability, crafting tools to catch issues before customers do, and directly working with some of our most strategic customers to understand their pain points and create lasting solutions.
At Microsoft, our mission is to empower every person and every organization on the planet to achieve more. We embrace a growth mindset, foster innovation to empower others, and work together to achieve shared goals. Every day, we uphold our values—respect, integrity, and accountability—to cultivate a culture of inclusion where everyone can thrive.
Qualifications
- Bachelor's Degree in Computer Science, or related technical field AND 4+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, JavaScript, React, or Python
- OR equivalent experience.
- 4+ years hands-on experience managing live-site operations, leading incident response efforts for distributed systems, particularly within the Azure ecosystem, driving root cause analysis through detailed post-mortems to improve system reliability and performance tuning.
• 4+ years of Azure development experience.
- Ability to meet Microsoft, customer and/or government security screening requirements are required for this role. These requirements include, but are not limited to the following specialized security screenings: Microsoft Cloud Background Check: This position will be required to pass the Microsoft Cloud Background Check upon hire/transfer and every two years thereafter.
- 5+ years of experience debugging complex systems across the Azure ecosystem, including Azure Resource Manager (ARM), Compute Resource Provider (CRP), Network Resource Provider (NRP), Storage Resource Provider (SRP), MySQL, and Azure Kubernetes Service (AKS), etc.
Software Engineering IC4 - The typical base pay range for this role across the U.S. is USD $119,800 - $234,700 per year. There is a different range applicable to specific work locations, within the San Francisco Bay area and New York City metropolitan area, and the base pay range for this role in those locations is USD $158,400 - $258,000 per year.
Certain roles may be eligible for benefits and other compensation. Find additional benefits and pay information here: https://careers.microsoft.com/us/en/us-corporate-pay
Microsoft will accept applications for the role until August 12, 2025
#azurecorejobs
Responsibilities
- Design and implement features and tools that enhance the resiliency, scalability, and reliability of Azure Databricks services.
- Debug and solve complex issues across distributed systems—often without a playbook.
- Investigate customer-reported problems and proactively identify systemic patterns to eliminate root causes.
- Partner closely with teams across Azure to ensure that customer experience consistently exceeds expectations.
- Build monitoring, automation, and self-healing capabilities to reduce operational overhead and human intervention.
- Collaborate with Databricks Microsoft engineers and internal teams to strengthen our Azure-native integrations.