Sunday, January 29, 2017

Data Dictionary Creation (4 of 5)

Although the Data Dictionary was the tactical focus of the engagement described in the previous blogs, we gave it a back seat to the Culture and Data Governance activities … believing those to be foundational in order to make the Data Dictionary a viable tool.
The organization had more than 40 different apps or databases that contained redundant data that has not been correlated.  Our challenge was to develop a process and a format for identifying and capturing the correlations among and between those redundant data elements, the Data Dictionary being the result. Our approach was to involve the organization’s resources in the process, and to limit our scope to the three most significant systems:
1.     Constituent Information
2.     Contact Relationship Management
3.     Human Resources (Insurance Benefits & Payroll)
The goal was to enable the organization’s resources to continue the process for the remaining systems after our departure. While this effort was under way, they were in the process of evaluating replacement or consolidation solutions for Constituent and HR, but there was really nothing sacred at this point.
After reviewing each of the systems, we compiled key fields, descriptions, data types, and validation rules into a single multi-tab spreadsheet for strategic alignment. Acquiring adequate documentation for two of the systems proved to be a challenge.
After analyzing the data elements of the three source systems, we compiled a list of the unique constituent-related attributes that appeared in one or more of the systems. Working with the organization’s team, we identified Harmonized Names for all of the attributes, established data types and validation rules for them, and started the process of mapping them to the corresponding attributes in each of the source systems.
As an aid to understanding their data structures, we developed entity relationship diagrams for two of the source systems based on documentation provided. We also generated a diagram representing the recommended data structure of their standardized constituent dataset.

Overall, the Data Dictionary documented the mission-critical data attributes that are maintained across the entire application landscape. In the absence of either MDM or a strategic middleware strategy, the Data Dictionary will function as the rulebook that will govern the migration of data from the various applications into the Data Warehouse. Critical decisions regarding common identifiers, data rationalization, and transformation rules will be based on the contents of the Data Dictionary.  However, the Data Dictionary is also a dynamic document, and its maintenance will be the responsibility of the Data Governance Council.

No comments:

Post a Comment