Senior Software Engineer
Microsoft
Senior Software Engineer
Multiple Locations, United States
Save
Overview
The Azure Compute platform is transforming industries and empowering individuals across the globe by delivering world-class cloud infrastructure to host services and workloads at scale. Within this platform, the Azure Holmes team is on a mission to build the world’s best platform for running critical workloads with uninterrupted availability, reliability and scalability. We are hiring a Senior Software Engineer to tackle some of the most complex and fascinating challenges in distributed systems.
As the core of the Azure cloud, the Azure Compute platform is a fault-tolerant distributed system built on commodity datacenter hardware. The Holmes team, a key part of this platform, delivers dynamic resource management capabilities that enhance customer availability and platform efficiency. Our services drive innovations such as placement reshaping, defragmentation, overbooking, and transparent maintenance—integrated through intelligent algorithms for optimal performance.
As a Senior Software Engineer, you will, design and build highly available, event-driven microservices that elevate customer experience. You will also collaborate with Microsoft Research to integrate cutting-edge ML/AI models, and contribute to the evolution of a platform that powers mission-critical workloads at global scale.
Microsoft’s mission is to empower every person and every organization on the planet to achieve more. As employees we come together with a growth mindset, innovate to empower others, and collaborate to realize our shared goals. Each day we build on our values of respect, integrity, and accountability to create a culture of inclusion where everyone can thrive at work and beyond.
Qualifications
- Bachelor's Degree in Computer Science, or related technical field AND 4+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python
- OR equivalent experience.
- 2+ years of demonstrated ability to work collaboratively and drive success across teams.
- 1+ year(s) of experience working in distributed systems.
- Ability to meet Microsoft, customer and/or government security screening requirements are required for this role. These requirements include, but are not limited to the following specialized security screenings: Microsoft Cloud Background Check: This position will be required to pass the Microsoft Cloud Background Check upon hire/transfer and every two years thereafter.
- Bachelor's Degree in Computer Science or related technical field AND 8+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript,
- OR Python
- OR Master's Degree in Computer Science or related technical field AND 6+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python
OR equivalent experience.
- 3+ years of experience working in distributed systems.
- 1+ year(s) of experience leading technical projects from beginning to end.
- Experience with ML/AI is a plus.
Software Engineering IC4 - The typical base pay range for this role across the U.S. is USD $119,800 - $234,700 per year. There is a different range applicable to specific work locations, within the San Francisco Bay area and New York City metropolitan area, and the base pay range for this role in those locations is USD $158,400 - $258,000 per year.
Certain roles may be eligible for benefits and other compensation. Find additional benefits and pay information here: https://careers.microsoft.com/us/en/us-corporate-pay
Microsoft will accept applications for the role until August 9, 2025.
#azurecorejobs
Responsibilities
- Collaborates with appropriate stakeholders to determine user requirements for a scenario.
- Drives identification of dependencies and the development of design documents for a product, application, service, or platform.
- Creates, implements, optimizes, debugs, refactors, and reuses code to establish and improve performance and maintainability, effectiveness, and return on investment (ROI).
- Leverages subject-matter expertise of product features and partners with appropriate stakeholders (e.g., project managers) to drive a workgroup's project plans, release plans, and work items.
- Acts as a Designated Responsible Individual (DRI) and guides other engineers by developing and following the playbook, working on call to monitor system/product/service for degradation, downtime, or interruptions, alerting stakeholders about status and initiates actions to restore system/product/service for simple and complex problems when appropriate.
- Proactively seeks new knowledge and adapts to new trends, technical solutions, and patterns that will improve the availability, reliability, efficiency, observability, and performance of products while also driving consistency in monitoring and operations at scale.