<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Beyond Data &#187; master data management</title>
	<atom:link href="http://blog.vasukikasturi.com/tag/master-data-management/feed" rel="self" type="application/rss+xml" />
	<link>http://blog.vasukikasturi.com</link>
	<description>Tempered thoughts on Enterprise Data Management</description>
	<lastBuildDate>Mon, 15 Mar 2010 01:27:44 +0000</lastBuildDate>
	<generator>http://wordpress.org/?v=2.9.2</generator>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
			<item>
		<title>Root Cause for Data Quality Issues</title>
		<link>http://blog.vasukikasturi.com/data-quality/root-cause-data-quality-issues</link>
		<comments>http://blog.vasukikasturi.com/data-quality/root-cause-data-quality-issues#comments</comments>
		<pubDate>Mon, 15 Mar 2010 01:20:49 +0000</pubDate>
		<dc:creator>Vasuki Kasturi</dc:creator>
				<category><![CDATA[Data Quality]]></category>
		<category><![CDATA[Data Entry]]></category>
		<category><![CDATA[data governance]]></category>
		<category><![CDATA[Data Integration]]></category>
		<category><![CDATA[master data management]]></category>

		<guid isPermaLink="false">http://blog.vasukikasturi.com/?p=243</guid>
		<description><![CDATA[Data is impacted by numerous processes that bring data into your data environment, most of which affect its quality to some extent. Some processes bring data into your environment, referred to as the inflow, and some processes operate on the data, causing data issues. The fishbone below highlights the different causes for data to decay (in [...]]]></description>
			<content:encoded><![CDATA[<p>Data is impacted by numerous processes that bring data into your data environment, most of which affect its quality to some extent. Some processes bring data into your environment, referred to as the inflow, and some processes operate on the data, causing data issues. The fishbone below highlights the different causes for data to decay (in no set order).</p>
<div id="attachment_256" class="wp-caption alignnone" style="width: 630px"><a href="http://blog.vasukikasturi.com/wp-content/uploads/2010/03/bad_dq.jpg"><img class="size-full wp-image-256 " title="bad_dq" src="http://blog.vasukikasturi.com/wp-content/uploads/2010/03/bad_dq.jpg" alt="Root Cause for Data Quality Issues" width="620" height="410" /></a><p class="wp-caption-text">Root Cause for Data Quality Issues</p></div>
<p>It is difficult to prioritize this list, although philosophically I can say that lack of governance will most definitely lead to bad data. At the same time, the list is not finite or complete. Organizational events like mergers &amp; consolidations can also lead to bad data quality. The fins on the upper side are processes that bring data into your system, the inflow. The lower fins are internal processes that cause bad data to persist. Either set of fins can cause data corruption. I have summarized each fin below, without bloating this post.</p>
<ol>
<li>Legacy Migration: Refers to data that is often migrated from a legacy system. In most cases the data structures and data models are inconsistent between the legacy architecture and the new architecture.</li>
<li>System Migration: This is similar to the above, except that the changes are due to system upgrades. As applications evolve, designs change. New fields get added for which no historical data exists. Or, god forbid, some fields are deprecated or removed, which may lead to serious problems.</li>
<li>Workarounds: This is typical of the business community and packaged applications (ERP/CRM et al). Custom fields are heavily used (often with no documentation), which later leads to misinterpretation.</li>
<li>Manual Data Entry: Mostly happens when systems collect data from users via a &#8220;free text&#8221; field. Common examples include addresses and phone numbers. In the absence of standards, conventions or policies, data entry users tend to finish a transaction as quickly as possible rather than worry about the accuracy of the data. If the system is not self-correcting, users will never understand that they are introducing bad data.</li>
<li>Interfaces: These are the connectors from one system to another. For large enterprises, this is how data typically flows &#8211; Campaigns to Opportunity to Quote to Order to Manufacturing to Service. To compound matters, each system is sold/supported by a different vendor with no accountability for data correction.</li>
<li>Process Automation: This is different from the Interface issue discussed above. This is more about how a system process (within that system) is automated. As existing business processes are re-engineered (due to the dynamic nature of the business), applications get out of sync or new data assumptions are made. If this is not relayed to the IT team that supports the system, there will be some data corruption.</li>
<li>Time Decay: This is especially true for Master data (like customer), where the data was good at some point in time but has not been updated since. Consider email addresses (specifically work emails): as customer contacts move from one organization to another, their email changes with the move. The data you once had for this customer contact is no longer accurate.</li>
<li>Data Quality Programs: The irony. Yes, sometimes the data quality programs/initiatives are themselves a cause for bad data. This is mostly because of wrong assumptions about business data and the rules around the data. As a result, data may be cleansed incorrectly, merged too aggressively (in the case of Master Data Management), or purged when it should not be.</li>
<li>Lack of Ownership: Very few organizations have complete ownership of a system (a CRM is often shared by Sales and Marketing); they often share sections of the data. With shared ownership come conflicting business rules and priorities. Concepts like Data Quality Organizations or Data Stewards, which bring accountability to an enterprise, are new to most organizations.</li>
<li>Lack of Governance: Data Governance is a vast discipline that is beyond the scope of this post. It is about arriving at standard definitions for the common data, via metadata management. It is about analyzing, defining and baselining the current quality of the data, so that some of the quality metrics can be monitored. It's about MDM and a lot more. Lack of governance means the information management strategy is poorly executed, leading to more data issues.</li>
</ol>
<p>In summary, the reasons for bad data quality are many. Before we start looking at cleaning the data, it is prudent that we understand the root causes for bad data. Prioritize and strategize the cleanup activities; devise ongoing monitors to gauge the data and control the inflow of bad data.</p>
]]></content:encoded>
			<wfw:commentRss>http://blog.vasukikasturi.com/data-quality/root-cause-data-quality-issues/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Overview</title>
		<link>http://blog.vasukikasturi.com</link>
		<comments>http://blog.vasukikasturi.com#comments</comments>
		<pubDate>Sun, 21 Feb 2010 06:39:45 +0000</pubDate>
		<dc:creator>Vasuki Kasturi</dc:creator>
				<category><![CDATA[General]]></category>
		<category><![CDATA[data governance]]></category>
		<category><![CDATA[Data Quality]]></category>
		<category><![CDATA[enterprise data management]]></category>
		<category><![CDATA[master data management]]></category>

		<guid isPermaLink="false">http://blog.vasukikasturi.com/?page_id=179</guid>
		<description><![CDATA[Most organizations have come to the realization that data is an important enterprise asset, and that when tapped provides a competitive advantage in the marketplace. Like any asset, data requires careful management so it can be turned into useful information. It is this information that enables an organization’s strategic vision to improve its bottom line. [...]]]></description>
			<content:encoded><![CDATA[<p>Most organizations have come to the realization that data is an important enterprise asset, and that when tapped provides a competitive advantage in the marketplace. Like any asset, data requires careful management so it can be turned into useful information. It is this information that enables an organization’s strategic vision to improve its bottom line. Organizations that have succeeded in this effort most likely have implemented an enterprise data management (EDM) program.</p>
<p>The vision of EDM is to create and sustain a consistent view of business data for all the stakeholders in the enterprise. You deliver the vision using the three-pronged approach of People, Processes and Technologies. When implemented right, EDM provides the semantic layer required for constant insight into the business information across the enterprise. EDM is not a technology or even a component, but rather a framework of disciplines for managing the data across the enterprise.</p>
<p>In my view, an EDM program should cater to the following disciplines.</p>
<ul>
<li>Data Governance &amp; Stewardship</li>
<li>Data Integration</li>
<li>Master Data Management</li>
<li>Data Quality</li>
<li>Data Warehouse &amp; Business Intelligence</li>
<li>Information Lifecycle Management</li>
<li>Data Security</li>
</ul>
<p>This blog is my attempt to provide some practical insight into each of the disciplines of EDM. I understand that there are a zillion posts that cover EDM, and some well-regarded gurus who write on this. However, I will try to cover both the strategic and the tactical aspects of EDM, based on my experience. The blog will provide first-hand reviews of different tools, best practices and &#8220;pilot&#8221; solutions. You can check the &#8220;About&#8221; page to learn a bit about me and what I do.</p>
]]></content:encoded>
			<wfw:commentRss>http://blog.vasukikasturi.com/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>
