Design, develop, and maintain scalable data pipelines for ingestion, processing, and storage.
Build and optimize ETL workflows to ensure reliable data movement across systems.
Collaborate with data scientists, analysts, and business stakeholders to understand data needs.
Ensure data quality, integrity, and governance through validation and monitoring.
Develop and manage data models, metadata, and data dictionaries.
Work with big data technologies like Hadoop, Spark, and cloud platforms (AWS, Azure, GCP).
Create and maintain database systems, both SQL and NoSQL, for various use cases.
Implement data security best practices, including access control and encryption.
Monitor system performance and troubleshoot issues related to data pipelines and storage
In this role, you'll work in one of our IBM Consulting Client Innovation Centers (Delivery Centers), where we deliver deep technical and industry expertise to a wide range of public and private sector clients around the world. Our delivery centers offer our clients locally based skills and technical expertise to drive innovation and adoption of new technology.
Design, develop, and maintain scalable data pipelines for ingestion, processing, and storage.
Build and optimize ETL workflows to ensure reliable data movement across systems.
Collaborate with data scientists, analysts, and business stakeholders to understand data needs.
Ensure data quality, integrity, and governance through validation and monitoring.
Develop and manage data models, metadata, and data dictionaries.
Work with big data technologies like Hadoop, Spark, and cloud platforms (AWS, Azure, GCP).
Create and maintain database systems, both SQL and NoSQL, for various use cases.
Implement data security best practices, including access control and encryption.
Monitor system performance and troubleshoot issues related to data pipelines and storage
Strong experience with Master Data Management (MDM) tools and platforms (e.g., Informatica MDM, Reltio, IBM InfoSphere, SAP MDG).
Proven ability to design, implement, and support MDM architecture, including data integration, data quality, and data governance.
Proficiency in ETL development and data pipeline creation using tools like Informatica, Talend, or Apache NiFi.
Solid experience with relational (SQL) and NoSQL databases; ability to write complex queries and perform data profiling.
Knowledge of data modeling concepts, including conceptual, logical, and physical data models.
Familiarity with data quality frameworks, data stewardship practices, and metadata management.
Experience with cloud-based MDM deployments (AWS, Azure, or GCP) and cloud data platforms (e.g., Snowflake, BigQuery, Databricks).
Strong understanding of data governance policies and compliance standards (e.g., GDPR, HIPAA).
Ability to collaborate with cross-functional teams including business, data stewards, and enterprise architects.
Excellent problem-solving skills and the ability to document technical solutions and data flows.
Hiring manager and Recruiter should collaborate to create the relevant verbiage.