Data Steward
Bayer
Bangalore Urban, Karnataka, India
POSITION PURPOSE: Develop and deploy data-based sustainable solutions while working with R&D scientists, IT & data teams and answer important questions that drive key decisions for our business. YOUR TASKS AND RESPONSIBILITIES: Defines data quality rules and implement automated monitoring, reporting, and remediation solutions. Implements and fine-tunes data governance guidelines, policies, processes, and controls. Ensures data consistency across multiple systems and business units Coordinates design sessions with Stewards, Data Engineers, Engineering teams, Data Scientists, Product Managers, business and/or IT stakeholders, that result in design documentation and business metadata capture Participates in trainings and discussions to evangelize these frameworks and objectives - Governance, Data Quality, Data Wrangling, and Best Practices. Maintains records of adequate data collection, maintenance, and usage Implements and utilizes data solutions for data analysis and profiling using a variety of tools such as Postman, R or Python and following team’s established processes and methodologies Collaborates with other data stewards and engineers within the team and across teams on aligning delivery dates and integration efforts Utilizes root cause analysis to identify trends and assess impact of data quality issues. Supports data migration from legacy systems, data inserts and updates not supported by applications. Participates in data scraping, data curation and data compilation efforts. Ensures high quality of the data to end users. Ensures high quality of the inhouse data via data stewardship. Ensures adoption of taxonomy and ontology for the compiled data to end users Has digital mindset and knowledge on Python/R programming to automate Data stewardship workflows. Participates in Open Data efforts, making data FAIR (Findable, Accessible, Interoperable and Reusable) to strengthen effectiveness WHO YOU ARE: Master’s Bachelor's Degree in Computer Science, Engineering, Crop Science, Agriculture, or another related field. Solid experience in areas such as: Relevant business domain Querying SQL and/or NoSQL databases Managing data using APIs Semantic Intelligence and Knowledge Graph Manipulating data using scripting languages and/or data processing software (e.g. Python, R, Pipeline Pilot) and Data management/governance /ETL applications such as Tibco EBX, Talend or Indigo) Profiling data, summarizing, and reporting data quality metrics Ability to deliver detailed technical documentation Experience handling sensitive data. Experience in designing data catalogs, including data design, metadata structures, object relations, catalog population, etc. Knowledge on modern engineering technologies and data principles, for instance: Big Data, Cloud Computing, NoSQL, etc. Understanding of data architecture and modeling Knowledge of industry data practice/governance models (DAMA, CMMI, DGI, etc.) and data strategy frameworks (Gartner, St Gallen, etc.) Knowledge of data management best practices. Knowledge of business or data domain within a business unit