<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Business Intelligence Review &#187; Enterprise Data Warehousing</title>
	<atom:link href="http://www.bireview.org/bireviewblogs/archives/category/edwh/feed" rel="self" type="application/rss+xml" />
	<link>http://www.bireview.org/bireviewblogs</link>
	<description>All things Business Intelligence related. Read and participate!</description>
	<lastBuildDate>Thu, 27 May 2010 11:32:04 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.0</generator>
		<item>
		<title>BI User Profiles or the Dream of Self-Service BI</title>
		<link>http://www.bireview.org/bireviewblogs/archives/bi-user-profiles-or-the-dream-of-self-service-bi</link>
		<comments>http://www.bireview.org/bireviewblogs/archives/bi-user-profiles-or-the-dream-of-self-service-bi#comments</comments>
		<pubDate>Thu, 20 May 2010 08:14:30 +0000</pubDate>
		<dc:creator>Matthias Blume</dc:creator>
				<category><![CDATA[Business Analytics]]></category>
		<category><![CDATA[Enterprise Data Warehousing]]></category>
		<category><![CDATA[BI Tools]]></category>
		<category><![CDATA[business intelligence]]></category>
		<category><![CDATA[data analysis]]></category>
		<category><![CDATA[data mining]]></category>
		<category><![CDATA[data warehouse]]></category>

		<guid isPermaLink="false">http://www.bireview.org/bireviewblogs/?p=248</guid>
		<description><![CDATA[Software vendors have been feeding the hope for the perfectly flexible and intuitive BI tool ever since BI software came to the market. Of course the tool would also deliver the response times in line with speed of thought analysis.]]></description>
			<content:encoded><![CDATA[<p>Software vendors have been feeding the hope for the perfectly flexible and intuitive BI tool ever since BI software came to the market. Of course the tool would also deliver the response times in line with speed of thought analysis. Managers waiting for responses from the BI team working their way through projects and query requests would be issues of the past.</p>
<p>Slowly, more and more analyst firms and vendors draw a more differentiated picture and accept that there is a large range of user situations and needs which ask for a range of appropriate technical answers.</p>
<p>One of the most important goals of the data warehouse is to provide a single version of the information. In order to avoid potentially different data selections or transformations it would only be sensible to also have just one tool to access the single version of the information, eradicating the problem of different numbers for the same question once and for all. This might be the strongest motivation for BI professionals and business users who seek a universal BI end user tool that answers all business needs and widens the access bottle neck. The growing market opportunity for BI tools motivates the software vendors to promise technological breakthroughs and even miracles where required.</p>
<p>The challenge becomes apparent when taking a closer look at what different users need. At least three use cases need to be distinguished and give rise to different and conflicting software requirements:</p>
<ul>
<li>Most data warehouses will probably form the basis for a stream of ready-made reports being send out to many users across the organisation. Usually no interactivity is provided with the reports and most users would more likely become confused by features allowing for customisation. What is needed are capabilities to automatically and repeatedly generate reports combining several tabular and graphical elements.</li>
<li>Senior management wants the convenience to oversee the entire business and drill down into subject areas when indicated. This need can be answered by information dashboards providing basic flexibility via an intuitive GUI. The bigger challenge here is to select and define the pieces of information to be displayed in line with the rest of the organisation.</li>
<li>Follow-up questions triggered by the two first user groups and all other analysis give rise to non-standard or ad hoc queries. They involve not only tools with full query flexibility but most importantly data analysis skills and knowledge of the data sources restricting these queries to a small minority of the users for most organisations.  Data warehouse thought leader Ralph Kimball wrote in 1998: “Ad hoc query tools, as powerful as they are, can only be effectively used and understood by about 10% of all the potential end users of a data warehouse. … The very best ROLAP-oriented ad hoc query tools improve the 10% number to perhaps 20%.” <a id="_ftnref1" name="_ftnref1" href="#_ftn1"><sup>[1]</sup></a><br />
Also data discovery tools, data mining workbenches and even plain SQL tools fall in this broad category.<br />
See also Stephen Few&#8217;s article <a title="Permanent Link: Big BI is Stuck: Illustrated by SAP  BusinessObjects Explorer" rel="bookmark" href="http://www.perceptualedge.com/blog/?p=727">Big BI is Stuck: Illustrated by SAP  BusinessObjects Explorer</a> (his Blog is highly recommended in general).</li>
</ul>
<p>These three use cases could be further broken down, for example, by distinguishing between being OLAP focused or not, and vendors tend to showcase certain capabilities (such as data search) as constituting a whole new category.</p>
<p>It shows that a unified tool for all users and situations could probably only be developed at the expense of too many compromises and would probably satisfy no one in the end.</p>
<p>The even more utopian plan of BI delivery by pure self-service could only work with a <em>complete </em>data warehouse presented by a <em>clearly documented</em> user interface layer and in situations where queries are limited to <em>simple look ups</em> of stored facts. I’m not sure that the three conditions are fulfilled anywhere.</p>
<hr size="1" /><a id="_ftn1" name="_ftn1" href="#_ftnref1">[1]</a> <em> The Data Warehouse Lifecycle Toolkit</em>, Ralph Kimball et al., 1998, John Wiley &amp; Sons</p>
]]></content:encoded>
			<wfw:commentRss>http://www.bireview.org/bireviewblogs/archives/bi-user-profiles-or-the-dream-of-self-service-bi/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Business Intelligence 2.0, Is anyone doing it yet?</title>
		<link>http://www.bireview.org/bireviewblogs/archives/business-intelligence-2-0-is-anyone-doing-it-yet</link>
		<comments>http://www.bireview.org/bireviewblogs/archives/business-intelligence-2-0-is-anyone-doing-it-yet#comments</comments>
		<pubDate>Sat, 16 Jan 2010 17:59:32 +0000</pubDate>
		<dc:creator>Christopher Shortt</dc:creator>
				<category><![CDATA[Enterprise Data Architecture]]></category>
		<category><![CDATA[Enterprise Data Warehousing]]></category>
		<category><![CDATA[architecture]]></category>
		<category><![CDATA[business intelligence]]></category>
		<category><![CDATA[data warehouse]]></category>
		<category><![CDATA[process]]></category>

		<guid isPermaLink="false">http://www.bireview.org/bireviewblogs/?p=207</guid>
		<description><![CDATA[In the early part of 2007, Neil Raden wrote an article entitled "Business Intelligence 2.0: Simpler, More Accessible, Inevitable". In this article, he argues that the a data warehouse (BI 1.0) is a big, slow, expensive machine for cranking out analytical data, and when it does, the data is highly-structured, only allowing a rigidly defined set of of questions to be answered. He continues that in the day of doing Google searches and social network...]]></description>
			<content:encoded><![CDATA[<p>In the early part of 2007, <a href="http://www.intelligententerprise.com/experts/raden;jsessionid=A5PASNQLWP2AZQE1GHPCKHWATMY32JVN">Neil Raden</a> wrote an article entitled &#8220;<a href="http://www.intelligententerprise.com/channels/infomanagement/showArticle.jhtml?articleID=197002610">Business Intelligence 2.0: Simpler, More Accessible, Inevitable</a>&#8220;. In this article, he argues that the a data warehouse (BI 1.0) is a big, slow, expensive machine for cranking out analytical data, and when it does, the data is highly-structured, only allowing a rigidly defined set of of questions to be answered. He continues that in the day of doing Google searches and social networks, that finding the information you want is a matter of &#8220;mashing up&#8221; data from everywhere. That is, BI 2.0 needs to be able to get data from where ever it is, and do the integration on the fly.</p>
<p>He also mentions, as I have in a <a href="http://www.bireview.org/bireviewblogs/archives/data-integration-and-the-data-warehouse-who-does-what" target="_self">previous article here</a>, that master data management (MDM) systems can be used to remove from the traditional data warehouse the complex work of data integration. I see this happening more and more for the reasons I expressed earlier, mainly a reduced total cost of ownership and an increased usability of the information. MDM systems are perfectly suited to the &#8220;data integration&#8221; part of the overall business intelligence world, and as such, helps to allow the traditional data warehouse evolve to a smaller, quicker, and less expensive machine to operate. Also, sourcing a data warehouse from an MDM system means that analytical data can be provided far easier and quicker than before, and this supports Neil Radan&#8217;s BI 2.0.</p>
<p>I have worked with organizations that have come to the conclusion that the types, amounts and sources of information needed to run the business properly are constantly changing as the organization tries to keep up with or keep ahead of the competition in a more and more volatile business environment. What resulted was the organization stepping backward in the maturity of its data warehouse (according to the &#8221; <a href="http://www.tdwi.org/publications/display.aspx?ID=7199">Data Warehouse Maturity Model</a>&#8221; I mentioned in a previous article). One organization in particular, to get the data the business needs in front of them faster, did an end-run around the data warehouse and started pulling data from whatever source system they thought had what they needed. I have to admit that the web-based interface they built was very impressive, and they could indeed deliver information quickly.</p>
<p>The problem I had with it was simply that they had ignored all the &#8220;best practices&#8221; about doing good data integration, good data governance and good data management. Because they built their solution outside the normal project path of the organization, in order to avoid the lengthly project schedules and extra costs, they side-stepped everything the organization had put in place to mitigate just the kinds of problems that this new development brought up again. And the organization had suffered the same problems in the past! I was amazed that the business unit in question was allowed to do this, but in the end, you had to sympathize with them. All they wanted was the data in a form they could use, and quickly. The solution should have been to change the processes to allow for these kinds of tools, but still keeping the principles of proper data management in place.</p>
<p>So, they were trying to do Business Intelligence 2.0 by using Web 2.0 type features on top of an immature approach to enterprise data management. In the end, I think it will hurt them more than help them, but who knows. These days things change so fast inside and outside business&#8217;s that even problems like this can disappear amongst everything else that goes on.</p>
<p>I would like to know if anyone is or has worked somewhere where a proper implementation of &#8220;Business Intelligence 2.0&#8243; is or  has happened. Please share your experiences in the comments section.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.bireview.org/bireviewblogs/archives/business-intelligence-2-0-is-anyone-doing-it-yet/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Why Build a Data Warehouse When You Can Lease One?</title>
		<link>http://www.bireview.org/bireviewblogs/archives/why-build-a-data-warehouse-when-you-can-lease-one</link>
		<comments>http://www.bireview.org/bireviewblogs/archives/why-build-a-data-warehouse-when-you-can-lease-one#comments</comments>
		<pubDate>Tue, 05 Jan 2010 13:41:09 +0000</pubDate>
		<dc:creator>Christopher Shortt</dc:creator>
				<category><![CDATA[DWH Data Modeling]]></category>
		<category><![CDATA[Enterprise Data Architecture]]></category>
		<category><![CDATA[Enterprise Data Warehousing]]></category>
		<category><![CDATA[architecture]]></category>
		<category><![CDATA[business intelligence]]></category>
		<category><![CDATA[data warehouse]]></category>
		<category><![CDATA[data warehouse as a service]]></category>
		<category><![CDATA[process]]></category>

		<guid isPermaLink="false">http://www.bireview.org/bireviewblogs/?p=210</guid>
		<description><![CDATA[Back in 2008, I was working for a telco in the Dominican Republic, consulting for their data warehouse team. I proposed a complete re-architecture of the data warehouse and the reporting platforms that the data warehouse supported. When I had to cost out the project, I had to include some hardware, primarily for the Ab Initio ETL tool we were going to migrate to, but also for the development and test servers, and more tape backup machines. All in all, the tab for the entire project grew to almost 2 million USD. It was just another reminder to me of how expensive a data warehouse can be...]]></description>
			<content:encoded><![CDATA[<p>Back in 2008, I was working for a telco in the Dominican Republic, consulting for their data warehouse team. After a review of their data warehouse and several discussion with stakeholders, I proposed a complete re-architecture of the data warehouse and the reporting platforms that the data warehouse supported. When I had to cost out the project, I included some hardware, primarily for the Ab Initio ETL tool we were going to migrate to, but also for the development and test servers, and more tape backup machines. All in all, the tab for the entire project grew to almost 2 million USD. It was just another reminder to me of how expensive a data warehouse can be. In trying to convince the management board of the project, I conducted interviews with the major stakeholders to try and gauge the value of the data warehouse to the organization. Of course, a data warehouse supporting business intelligence was of enough value to the organization to justify it, but the numbers can be hard to swallow never the less.</p>
<p>All of that reminded me of the <a href="http://www.tdwi.org/publications/display.aspx?ID=7199">Data Warehouse Maturity Model</a> I have seen time and again at conferences I have attended and presented at. It shows that a data warehouse has reached its ultimate maturity when it achieves its potential for the return on the investment of building a data warehouse. That is, the business uses the data warehouse and the information within it to drive the business. I started to think about this while I was putting the business case together for this project. Too often, I have found that the decision makers in an organization are not always aware of the value of the data warehouse, and see it as a big, expensive, slow moving giant that they have to live with rather than really benefit from. It is beyond the scope of this article to discuss all the reasons why this might become the perception of the data warehouse, but it was sure the case here, which is why I needed to include in my business case the feedback from all the users of the data warehouse.</p>
<p>The organization in question was looking for ways to reduce the costs of the data warehouse and it occurred to me that one way to reduce the cost of it was to remove it all together from the business. I don&#8217;t mean throw it away (that would be insane), but why not get a little creative? Take all the hardware, the software, and the people that build, operate and maintain it all, and use it to spin off a separate business? This business would be initially funded by the parent organization, contracted by the parent to provide all the data warehouse services it did before but at a more favourable cost. It would operate as an independent business with the sole core competency of doing data warehousing, and be allowed to also sell its services to other organizations as well. By being wholly owned by the parent company and being able to expand its customer base beyond the parent, allows it to become a revenue centre for the parent. Eventually the separation would probably need to be more complete, I suppose, but still, it goes from being a cost centre to a revenue centre. Sweet!</p>
<p>Afterwards, I thought, hey, I&#8217;ve been doing software design and development for over 20 years, data warehousing for almost 10, and ran my own small consulting firm. Why don&#8217;t I just start up a business that provides a data warehouse as a service? Why, I thought to myself, that&#8217;s a great idea and would be a lot of fun! I know a couple of organizations that are moving in the direction of having other &#8220;systems as a service&#8221; anyway (at least outsourcing other major business support systems), so I could probable lock up a customer or two within a couple of years, and who knows? I started to seriously investigate the possibility. I would need to build/buy/rent a data center and build/rent/lease a bunch of servers and disk, not really a problem, but I would (ideally) need to come up with a data architecture that could support multiple clients, possibly across different industries. That would mean, design it, build it, test it, implement it, and then go sell it. Now that&#8217;s a lot of work. And I know that you can buy such industry specific data models. Teradata did, or maybe still does, have one, Oracle has one to, and I am sure there are others. Possible still, but a lot of work, and it would mean I might miss the opportunities I had because of the timing.</p>
<p>In continuing my investigation into setting all this up, I came across a company that already provides Data Warehousing as a Service (DaaS), called <a href="http://kognitio.com">Kognitio</a>. They have their own data centres, they have their own data models, and although they are the only ones I could find, but there might be others by now. So, someone had my genius idea before me (no surprise really). I still think that for an organization that is trying become more mature in their data warehousing, this is a good solution to get off the ground fast and jump the &#8220;gulf between &#8216;Inform Exectutives&#8217; and &#8216;Empower Knowledge Workers&#8217;&#8221; in the afore-mentioned data warehouse maturity model. If you are looking to implement a new data warehouse, or re-architect one, or try to out-source you existing one, I suggest you give them a call. I think I might, just to see if they need an affiliate in Switzerland, where I live.</p>
<p>What do you thing of Data Warehouse as a Service? Good idea or bad? Easy or difficult to implement? Too big a change to your organization or not? Let us know your thoughts in a comment!</p>
<span class="sfforumlink"><a href="http://www.bireview.org/bireviewblogs/forum/general/why-building-a-data-warehouse-when-you-can-lease-one/"><p><img src="http://www.bireview.org/bireviewblogs/wp-content/plugins/simple-forum/styles/icons/default/bloglink.png" alt="" /> Join the forum discussion on this post</p>
</a></span>]]></content:encoded>
			<wfw:commentRss>http://www.bireview.org/bireviewblogs/archives/why-build-a-data-warehouse-when-you-can-lease-one/feed</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>CAST &#8211; Software analysis to the extreme! &#8211; Part 2</title>
		<link>http://www.bireview.org/bireviewblogs/archives/cast-software-analysis-to-the-extreme-part-2</link>
		<comments>http://www.bireview.org/bireviewblogs/archives/cast-software-analysis-to-the-extreme-part-2#comments</comments>
		<pubDate>Wed, 02 Dec 2009 14:33:44 +0000</pubDate>
		<dc:creator>Christopher Shortt</dc:creator>
				<category><![CDATA[Enterprise Data Warehousing]]></category>
		<category><![CDATA[Software Tools Reviews]]></category>
		<category><![CDATA[CAST]]></category>
		<category><![CDATA[data warehouse]]></category>
		<category><![CDATA[process]]></category>
		<category><![CDATA[quality]]></category>
		<category><![CDATA[software]]></category>

		<guid isPermaLink="false">http://www.bireview.org/bireviewblogs/?p=193</guid>
		<description><![CDATA[In the first post I took a look at some of the challenges we faced within the data warehouse group that we hoped CAST could help us with. In this post, we will take a look at how the challenges were met by CAST.

The Solutions

To address the challenge of maintaining up-to-date, complete and accurate technical documentation of the data warehouse platform, CAST was used to analyze the production platform after every deliverable...]]></description>
			<content:encoded><![CDATA[<p>In the first post I took a look at some of the challenges we faced within the data warehouse group that we hoped CAST could help us with. In this post, we will take a look at how the challenges were met by CAST.</p>
<p><strong>The Solutions</strong></p>
<p>To address the challenge of maintaining up-to-date, complete and accurate technical documentation of the data warehouse platform, CAST was used to analyze the production platform after every deliverable. The Enlighten tool was made available to all the internal and external staff working on data warehouse projects, and the web-site created by CAST was made available to the entire organization.</p>
<p>And because we also had several on-going projects  (as everyone does), we would also analyze the UAT environments from time to time as well. This allowed the impact analysis to capture changes to a future version of the data warehouse platform as well as the current one.</p>
<p>The code quality analysis was reviewed on a periodic basis to keep track of the &#8220;hotspots&#8221;, as these problem areas would be included in projects that touched on those areas, so that the number of potential problem areas could be reduced over time.</p>
<p><strong>The Benefits</strong></p>
<p>By using CAST, we were able to reduce the costs of project affecting the data warehouse while increasing the quality of the deliverables. These cost saving were realized by having all the technical information of the platform in one place, always up-to-date, and easily available and navigatable. As well, by automating many of the otherwise manual tasks that never seem to get done, the cost of testing and doing impact analysis was signicantly reduced. Also, the abilty to set the expectations of quality and being able to measure those expectations objectively, let to better, more bug-free code being delivered on-time and under budget.</p>
<p><strong>Recommendation</strong></p>
<p>I highly recommend this product to anyone who has a large system or systems that they need to get under control or not lose control of. CAST helps to implement a layer of automated governance that can be otherwise far more expensive to implement, and, as mentioned above, can certainly reduce project costs through increased productivity and increased quality of deliverables. I haven&#8217;t worked on a project since where I didn&#8217;t wish CAST had been implemented there as well.</p>
<p>In the next post, I will look at some things that we could have done with CAST to further improve our business processes, and a couple of things to take into consideration when using CAST. If you have any questions, I urge you to contact them, and you can also leave questions in the comments below and I will answer the to the best of my ability.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.bireview.org/bireviewblogs/archives/cast-software-analysis-to-the-extreme-part-2/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>CAST &#8211; Software analysis to the extreme! &#8211; Part 1</title>
		<link>http://www.bireview.org/bireviewblogs/archives/cast-software-analysis-to-the-extreme</link>
		<comments>http://www.bireview.org/bireviewblogs/archives/cast-software-analysis-to-the-extreme#comments</comments>
		<pubDate>Tue, 01 Dec 2009 11:16:03 +0000</pubDate>
		<dc:creator>Christopher Shortt</dc:creator>
				<category><![CDATA[Enterprise Data Warehousing]]></category>
		<category><![CDATA[Software Tools Reviews]]></category>
		<category><![CDATA[CAST]]></category>
		<category><![CDATA[data warehouse]]></category>
		<category><![CDATA[process]]></category>
		<category><![CDATA[quality]]></category>
		<category><![CDATA[software]]></category>

		<guid isPermaLink="false">http://www.bireview.org/bireviewblogs/?p=180</guid>
		<description><![CDATA[I recently worked with an organization where we had looked at a tool that was able to analyze all the source code for the ETL jobs of their data warehouse and provide several benefits to the organization as a result. The tool is called CAST, and comes from a company called Cast Software. During the presentation of the tool and its features by the CAST team, I was very impressed by what the tool could do in terms of analyzing source code, and even more, what it could do with the information gathered from that analysis ...]]></description>
			<content:encoded><![CDATA[<blockquote><address style="text-align: center;">Disclaimer: The author does not have any commercial connection with the vendor of this tool.</address>
</blockquote>
<address> </address>
<div id="_mcePaste" style="overflow: hidden; position: absolute; left: -10000px; top: 152px; width: 1px; height: 1px;">I recently worked with an organization where we had looked at a tool that was able to analyze all the source code for the ETL jobs of their data warehouse and provide several benefits to the organization as a result. The tool is called CAST, and comes from a company called Cast Software (http://www.castsoftware.com/). During the presentation of the tool and its features by the CAST team, I was very impressed by what the tool could do in terms of analyzing source code, and even more, what it could do with the information gathered from that analysis. I was, however, hesitant to implement such a tool in this organization. This was because I didn&#8217;t feel that the data warehouse group operated at a sufficient level of maturity to be able to fully exploit the features of this software, and as a result, would be implementing a tool that would never get used.</div>
<div id="_mcePaste" style="overflow: hidden; position: absolute; left: -10000px; top: 152px; width: 1px; height: 1px;">Never the less, the organization wanted to implement the tool and they were, in fact, able to mature in their software development methodologies, more quickly than I had expected. In the end, the data warehouse group was able to take advantage of the tool, although not as completely as I would have liked. In this article, then, I will examine in more detail, why we decided to implement the tool, what it could do out of the box, what it couldn&#8217;t do out of the box (but could do later, with a little customization), what we did with it, and what we didn&#8217;t do with it but probably should have.</div>
<div id="_mcePaste" style="overflow: hidden; position: absolute; left: -10000px; top: 152px; width: 1px; height: 1px;">The Challenges</div>
<div id="_mcePaste" style="overflow: hidden; position: absolute; left: -10000px; top: 152px; width: 1px; height: 1px;">Because the organization&#8217;s data warehouse had been built in bits and pieces over a number of years, there was a lack of cohesive technical documentation. It existed, of course, but in documents spread over hundreds of project directories, and in the heads of the internal staff and in the heads of the external staff, if they were still there. This was always a problem in that, a) they were not sure what they had, or how it was built, b) doing an impact analysis across all the ETL tools involved (unix shell scripts, DataStage, custom PL/SQL, and some java) was very labour intensive, and c) with all the ongoing projects for the data warehouse, it was impossible to properly evaluate the quality of the deliverables from the vendors (most importantly, the source code).</div>
<div id="_mcePaste" style="overflow: hidden; position: absolute; left: -10000px; top: 152px; width: 1px; height: 1px;">Getting More Done Better With Less</div>
<div id="_mcePaste" style="overflow: hidden; position: absolute; left: -10000px; top: 152px; width: 1px; height: 1px;">They originally looked at CAST to solve the above problems. The tool could analyse all the source code across all the ETL tools we had, and in doing so, could provide a) complete technical documentation of the production data warehouse platform, b) end-to-end impact analysis, and c) provide a quantitative analysis of the source code delivered by the vendors and provide industry-standard measures of the quality of the code. Having all this would, in turn, allow the organisation to a) significantly reduce the learning curve for new internal staff and external vendors, b) significantly reduce the time and increase the quality of the impact analysis studies they carried out, and c) enable automated enforcement of the development standards and guidelines, reducing the number of errors in the deliverables whilst improving the overall code quality.</div>
<div id="_mcePaste" style="overflow: hidden; position: absolute; left: -10000px; top: 152px; width: 1px; height: 1px;">Code Analysis</div>
<div id="_mcePaste" style="overflow: hidden; position: absolute; left: -10000px; top: 152px; width: 1px; height: 1px;">CAST can parse just about all the source code you can throw at it, regardless of the language or tool it was written in. If they don&#8217;t already have a parser written for a specific tool, their universal parser can be tweaked very quickly. With our organization, we have some source code in Unix shell scripts, some ETL written using DataStage, and some just in PL/SQL. The reporting tool used was Business Objects. CAST had no trouble with anything we gave it. The one issue we did have was that the PL/SQL parser was not equipped to deal with dynamic SQL and PL/SQL that was data driven. Much to my surprise, it took the CAST Switzerland team only a handful of days to update their PL/SQL parser to handle this problem! When they were done, we had a tool that could analyze all the source code in our platform, along with the databases (mainly Oracle) so that we ended up with complete technical documentation (albeit using the included Enlighten tool, with a bit less on the generated web pages). To put it succinctly, they managed to do in 12 days, what the organisation had not been able to do in almost 10 years!</div>
<div id="_mcePaste" style="overflow: hidden; position: absolute; left: -10000px; top: 152px; width: 1px; height: 1px;">Impact Analysis</div>
<div id="_mcePaste" style="overflow: hidden; position: absolute; left: -10000px; top: 152px; width: 1px; height: 1px;">With all the source code analyzed, and all the database objects analyzed, the CAST Enlighten tool could generate an automated impact analysis in the time it would take to get a coffee to start the analysis doing it manually. In the past at this organization, doing a thorough impact analysis required extensive reviews of document when you could find them, reading all the source-to-target documents if they were up-to-date, talking to the developer if they were still on-site, and in the end, reading all the source code line by line if you could get access to it. As you can imagine, this was not only time-consuming, but frought with errors and omissions. It could take weeks on a larger change, but with CAST, it could be done in an afternoon, with just a little digging inside the tool itself.</div>
<div id="_mcePaste" style="overflow: hidden; position: absolute; left: -10000px; top: 152px; width: 1px; height: 1px;">Code Quality</div>
<div id="_mcePaste" style="overflow: hidden; position: absolute; left: -10000px; top: 152px; width: 1px; height: 1px;">Another big issue the organization suffered was that there were never the resources within the organization to do a proper inspection and review of the source code deliverables from the vendors. In some cases, the projects were just too big to manage with a small internal staff size, in other cases just too small to worry about. This led to visicous, lengthly cycles of testing and re-developing, as new bugs were not found until user acceptance testing, or, even worse during the pre-production system testing, and in a few cases during use after production rollout. Not to mention the inevitable commercial disagreements between the vendors and the organization where one would blame the other for the delays and the errors. Since the organization didn&#8217;t have the internal resources to do this due-diligence of the vendor deliverables, a lot of problems like the ones mentioned above crept into every project, pushing back delivery schedules and blowing the budgets of the projects.</div>
<div id="_mcePaste" style="overflow: hidden; position: absolute; left: -10000px; top: 152px; width: 1px; height: 1px;">The CAST tool, because it comes with a huge number (&gt;600 when we used it) of quality measures for source code, and because you can modify them to your taste or add new ones, all the delivered source code could be checked against these quality measures, even before they were manually tested. I didn&#8217;t pretend to understand all the types of measures they would calculate against the source code (things like Cyclomatic Complexity are the things of rocket scientists, I&#8217;m sure), but what I did understand was that they could tell you in an instant how well written the code was, how well it complied to your internal standards as well as industry standards, and where in the code you might have a hotspot later on (if the complexity measure was too high). All these things could be used to do an automated reinforcement of the development standards and guidelines of the organization, and acted as an independant third-party to help in the decision to accept or not the source code being delivered by a vendor.</div>
<div id="_mcePaste" style="overflow: hidden; position: absolute; left: -10000px; top: 152px; width: 1px; height: 1px;">The Solutions</div>
<div id="_mcePaste" style="overflow: hidden; position: absolute; left: -10000px; top: 152px; width: 1px; height: 1px;">To address the challenge of maintaining up-to-date, complete and accurate technical documentation of the data warehouse platform, CAST was used to analyze the production platform after every deliverable. The Enlighten tool was made available to all the internal and external staff working on data warehouse projects, and the web-site created by CAST was made available to the entire organization.</div>
<div id="_mcePaste" style="overflow: hidden; position: absolute; left: -10000px; top: 152px; width: 1px; height: 1px;">And because the orgranization also had several on-going projects, they would also analyze the UAT environments from time to time as well. This allowed the impact analysis to capture changes to a future version of the data warehouse platform as well as the current one.</div>
<div id="_mcePaste" style="overflow: hidden; position: absolute; left: -10000px; top: 152px; width: 1px; height: 1px;">The code quality analysis was reviewed on a periodic basis to keep track of the &#8220;hotspots&#8221;, as these problem areas would be included in projects that touched on those areas, so that the number of potential problem areas could be reduced over time.</div>
<div id="_mcePaste" style="overflow: hidden; position: absolute; left: -10000px; top: 152px; width: 1px; height: 1px;">The Benefits</div>
<div id="_mcePaste" style="overflow: hidden; position: absolute; left: -10000px; top: 152px; width: 1px; height: 1px;">By using CAST, the organization was able to reduce the costs of project affecting the data warehouse while increasing the quality of the deliverables. These cost saving were realized by having all the technical information of the platform in one place, always up-to-date, and easily available and navigatable. As well, by automating many of the otherwise manual tasks that never seem to get done, the cost of testing and doing impact analysis was signicantly reduced. Also, the abilty to set the expectations of quality and being able to measure those expectations objectively, let to better, more bug-free code being delivered on-time and under budget.</div>
<div id="_mcePaste" style="overflow: hidden; position: absolute; left: -10000px; top: 152px; width: 1px; height: 1px;">Extending the Benefits</div>
<div id="_mcePaste" style="overflow: hidden; position: absolute; left: -10000px; top: 152px; width: 1px; height: 1px;">There are a few things that could have been done with CAST and its features that were not taken advantage of by this organization.</div>
<div id="_mcePaste" style="overflow: hidden; position: absolute; left: -10000px; top: 152px; width: 1px; height: 1px;">Recommendation</div>
<div id="_mcePaste" style="overflow: hidden; position: absolute; left: -10000px; top: 152px; width: 1px; height: 1px;">I highly recommend this product to anyone who has a large system or systems that they need to get under control or not lose control of. CAST helps to implement a layer of automated governance that can be otherwise far more expensive to implement, and, as mentioned above, can certainly reduce project costs through increased productivity and increased quality of deliverables. I haven&#8217;t worked on a project since where I didn&#8217;t wish CAST had been implemented there as well.</div>
<div id="_mcePaste" style="overflow: hidden; position: absolute; left: -10000px; top: 152px; width: 1px; height: 1px;">If you have any questions, I urge you to contact them. You can also leave questions in the comments below and I will answer the to the best of my ability.</div>
<p>I recently worked with an organization where we had looked at a tool that was able to analyze all the source code for the ETL jobs of their data warehouse and provide several benefits to the organization as a result. The tool is called CAST, and comes from a company called <a href="http://www.castsoftware.com" target="_blank">Cast Software</a>.  I was hesitant to implement such a tool there because I felt that the data warehouse group didn&#8217;t operate at a sufficient level of maturity to be able to fully exploit the features of this software, and as a result, would be implementing a tool that would never get used. However, during the presentation of the tool and its features by the CAST team, I, for one, was so impressed by what the tool could do in terms of analyzing source code, and even more, what it does with the information gathered from that analysis, that we decided to implement the tool. In the end, the data warehouse group was able to take advantage of the tool, although not as completely as I would have liked. In the next few articles, then, I will examine in more detail, why we decided to implement the tool, what it could do out of the box, what it couldn&#8217;t do out of the box (but could do later, with a little customization), what we did with it, and what we didn&#8217;t do with it but probably should have.</p>
<p><strong>The Challenges</strong></p>
<p>Because our data warehouse had been built in bits and pieces over a number of years, there was a lack of cohesive technical documentation. It existed, of course, but in documents spread over hundreds of project directories, and in the heads of the internal staff and in the heads of the external staff, if they were still there. This was always a problem in that, a) they were not sure what they had, or how it was built, b) doing an impact analysis across all the ETL tools involved (unix shell scripts, DataStage, custom PL/SQL, and some java) was very labour intensive, and c) with all the ongoing projects for the data warehouse, it was impossible to properly evaluate the quality of the deliverables from the vendors (most importantly, the source code).</p>
<p><strong>Getting More Done Better With Less</strong></p>
<p>We originally looked at CAST to solve the above problems. The tool could analyse all the source code across all the ETL tools we had, and in doing so, could provide a) complete technical documentation of the production data warehouse platform, b) end-to-end impact analysis, and c) provide a quantitative analysis of the source code delivered by the vendors and provide industry-standard measures of the quality of the code. Having all this would, in turn, allow us to a) significantly reduce the learning curve for new internal staff and external vendors, b) significantly reduce the time and increase the quality of the impact analysis studies they carried out, and c) enable automated enforcement of the development standards and guidelines, reducing the number of errors in the deliverables whilst improving the overall code quality.</p>
<p><strong>Code Analysis</strong></p>
<p>CAST can parse just about all the source code you can throw at it, regardless of the language or tool it was written in. If they don&#8217;t already have a parser written for a specific tool, their universal parser can be tweaked very quickly. With our organization, we had some source code in Unix shell scripts, some ETL written using DataStage, and some just in PL/SQL. The reporting tool used was Business Objects. CAST had no trouble with anything we gave it. The one issue we did have was that the PL/SQL parser was not equipped to deal with dynamic SQL and PL/SQL that was data driven. Much to my surprise, though, it took the CAST Switzerland team only a handful of days to update their PL/SQL parser to handle this problem! When they were done, we had a tool that could analyze all the source code in our platform, along with the databases (mainly Oracle) so that we ended up with complete technical documentation (albeit using the included Enlighten tool, with a bit less on the generated web pages). To put it succinctly, they managed to do in 12 days, what we had not been able to do in almost 10 years!</p>
<p><strong>Impact Analysis</strong></p>
<p>With all the source code analyzed, and all the database objects analyzed, the CAST Enlighten tool could generate an automated impact analysis in the time it would take to get a coffee before starting to do the analysis manually. In the past, doing a thorough impact analysis required extensive reviews of document when you could find them, reading all the source-to-target documents if they were up-to-date, talking to the developer if they were still on-site, and in the end, reading all the source code line by line if you could get access to it. As you can imagine, this was not only time-consuming, but frought with errors and omissions. It could take weeks on a larger change, but with CAST, it could be done in an afternoon, with just a little digging inside the tool itself.</p>
<p><strong>Code Quality</strong></p>
<p>Another big issue we suffered with was that there were never the resources within the team to do a proper inspection and review of the source code deliverables from the vendors. In some cases, the projects were just too big to manage with our small internal staff size, in other cases just too small to worry about. This led to visicous, lengthly cycles of testing and re-developing, as new bugs were not found until user acceptance testing, or, even worse during the pre-production system testing, and in a few cases during use after production rollout. Not to mention the inevitable commercial disagreements between the vendors and us where one would blame the other for the delays and the errors. Since we didn&#8217;t have the internal resources to do this due-diligence of the vendor deliverables, a lot of problems like the ones mentioned above crept into every project, pushing back delivery schedules and blowing the budgets of the projects.</p>
<p>The CAST tool, because it comes with a huge number (&gt;600 when we used it) of quality measures for source code, and because you can modify them to your taste or add new ones, all the delivered source code could be checked against these quality measures, even before they were manually tested. I didn&#8217;t pretend to understand all the types of measures they would calculate against the source code (things like Cyclomatic Complexity are the things of rocket scientists, I&#8217;m sure), but what I did understand was that they could tell you in an instant how well written the code was, how well it complied with your internal standards as well as industry standards, and where in the code you might have a trouble spot later on (if the complexity measure was too high). All these things could be used to do an automated reinforcement of the development standards and guidelines of the organization, and acted as an independant third-party to help in the decision to accept or not the source code being delivered by a vendor.</p>
<p>continued in the Part 2&#8230;</p>
<p><strong><br />
</strong></p>
]]></content:encoded>
			<wfw:commentRss>http://www.bireview.org/bireviewblogs/archives/cast-software-analysis-to-the-extreme/feed</wfw:commentRss>
		<slash:comments>4</slash:comments>
		</item>
		<item>
		<title>Enough Projects Drowned Under the Waterfall</title>
		<link>http://www.bireview.org/bireviewblogs/archives/enough-projects-drowned-under-the-waterfall</link>
		<comments>http://www.bireview.org/bireviewblogs/archives/enough-projects-drowned-under-the-waterfall#comments</comments>
		<pubDate>Wed, 23 Sep 2009 14:41:55 +0000</pubDate>
		<dc:creator>Matthias Blume</dc:creator>
				<category><![CDATA[Development Methodologies]]></category>
		<category><![CDATA[agile methodologies]]></category>
		<category><![CDATA[business intelligence]]></category>
		<category><![CDATA[data warehouse]]></category>
		<category><![CDATA[process]]></category>

		<guid isPermaLink="false">http://www.bireview.org/bireviewblogs/?p=131</guid>
		<description><![CDATA[It is quite obvious that there is no lack of innovative IT development processes, when one browses through the literature and blogs about software development methodologies. Few are those who would defend the waterfall approach ...]]></description>
			<content:encoded><![CDATA[<p>It is quite obvious that there is no lack of innovative IT development processes, when one browses through the literature and blogs about software development methodologies. Few are those who would defend the waterfall approach without concessions. The reasons for it still being abundant sound more like excuses. The need for budgeting, out-tasking with rigid vendor models and simple resistance to change are frequent arguments. While calling Agile Models innovative, it is actually quite similar to the Bauhaus school for architecture and design of the 1920s and 1930s which are still being looked at as newfangled in many parts of the world. <a href="http://en.wikipedia.org/wiki/Agile_software_development" target="_blank">Agile development models</a> arose already in the 1990s ­­— ages ago in the IT world.</p>
<p>Isn’t it time to restrict the obedience to the business side deciding about what is being developed? Shouldn’t IT considerations at least be as seriously considered when it comes to how to deliver the right systems and quickly? Agile methods are particularly appropriate for BI systems which are close to the end users. That users often cannot precisely formulate their needs initially, is one of the strongest arguments for an incremental and iterative process. But there are aspects specific to BI favouring prototyping. Encouraging active user involvement throughout the development is certainly of high value for both sides. Delivering the parts of highest priority first and using that output can give very useful input for subsequent phases including the option of stopping the project, if first results made the project obsolete or changed the requirements drastically. Among the strongest critisms BI teams face are being slow and not delivering very useful results. Both issues have been strong motivations for agile approaches since the beginning.</p>
<p>As Wikipedia puts it, <a href="http://en.wikipedia.org/wiki/Agile_software_development" target="_blank">agile development</a> is &#8220;based on iterative development, where requirements and solutions evolve through collaboration between self-organizing cross-functional teams&#8221;. It is in particular &#8220;a business approach that aligns development with customer needs and company goals.&#8221;</p>
<p>We might need to reinterpret some of the principles of the <a href="http://en.wikipedia.org/wiki/Agile_Manifesto" target="_blank">agile manifesto</a>. The principle <em>working software is delivered frequently </em>might sometimes be read <em>results are delivered frequently. </em>It is the close collaboration between the BI team and the users supported by frequent output to the users keeping them up to date that matters here. A principle that seems very hard to swallow and contrary to most IT development culture says that <em>even late changes in requirements are welcomed.</em> Ok, maybe not all changes are appropriate and need to be expressed so late. But this openess and flexibility is key to agile development. It means accepting that it is hard to impossible for users to sign and close BI requirements before having seen the data.</p>
<p>The fact that relatively few organisations apply agile processes probably does not mean that people are all happy with the waterfall process. Many large organizations have difficulty bridging the gap between the traditional waterfall method and an agile one. It would also require getting around reasons predominantely external to BI such as overall budgeting, planning and roadmap management and prioritisation.</p>
<p>Wikipedia describes the <a href="http://en.wikipedia.org/wiki/Agile_software_development#Suitability_of_agile_methods" target="_blank">agile home ground</a> as projects of low criticality with a small team of senior developers and requirements that change very often. A consultative management culture would help.</p>
<p>Don&#8217;t get me wrong. Major infrastructure projects, be it enterprise data warehouse or something operational, are <em>not </em>good initial candidates for process innovation. Where the big picture is important to be kept in mind, where the long term maintenance and flexibility has to be assured and users interact with the deliverable only indirectly, a classical planned approach still has its virtue. But when it comes to small analytical marts and BI interfaces we should take a good dose of self-confidence and go agile even it takes a few serious discussions.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.bireview.org/bireviewblogs/archives/enough-projects-drowned-under-the-waterfall/feed</wfw:commentRss>
		<slash:comments>3</slash:comments>
		</item>
		<item>
		<title>Data Integration and the Data Warehouse, Who Does What?</title>
		<link>http://www.bireview.org/bireviewblogs/archives/data-integration-and-the-data-warehouse-who-does-what</link>
		<comments>http://www.bireview.org/bireviewblogs/archives/data-integration-and-the-data-warehouse-who-does-what#comments</comments>
		<pubDate>Mon, 21 Sep 2009 20:24:54 +0000</pubDate>
		<dc:creator>Christopher Shortt</dc:creator>
				<category><![CDATA[Enterprise Data Architecture]]></category>
		<category><![CDATA[Enterprise Data Warehousing]]></category>
		<category><![CDATA[architecture]]></category>
		<category><![CDATA[business intelligence]]></category>
		<category><![CDATA[data governance]]></category>
		<category><![CDATA[master data management]]></category>

		<guid isPermaLink="false">http://www.bireview.org/bireviewblogs/?p=95</guid>
		<description><![CDATA[In a <a title="Previous Post" href="http://www.bireview.org/bireviewblogs/archives/enterprise-data-warehouses-and-master-data-management">previous post</a>, I mentioned that one of the impacts of implementing a master data management system was removing some or all of the data integration responsibilities from the data warehouse. Since the data warehouse is the usual place for data integration, in particular customer data integration, I would like to explain myself and see if you agree, or at least see my point of view...]]></description>
			<content:encoded><![CDATA[<p>In a <a title="Previous Post" href="http://www.bireview.org/bireviewblogs/archives/enterprise-data-warehouses-and-master-data-management">previous post</a>, I mentioned that one of the impacts of implementing a master data management system was removing some or all of the data integration responsibilities from the data warehouse. Since the data warehouse is the usual place for data integration, in particular customer data integration, I would like to explain myself and see if you agree, or at least see my point of view.</p>
<p>What I mean is that in the big picture of enterprise data management, there are data applications that produce data, and data applications that consume data. In a lot of cases, a data application can (and must) play both roles. I make the distinction here to show one of the great promises of master data management: the reduction in the amount of duplication of work being done by software processes that must transform data between the applications that produce that data, and the applications that consume it. In the following diagram, we can see what can happen if several data consumer applications (right side) must connect directly to other data producing applications (left side) to get (and transform) the data they need.</p>
<p style="text-align: center;"><img class="aligncenter" title="Not a Pretty Picture.." src="/images/blogpics/mixedbag.png" alt="Not a Pretty Picture." width="600" height="200" /></p>
<h6 style="text-align: center;">Not a Pretty Picture.</h6>
<p>As you can see, it can get pretty messy, pretty quickly. There are things that exist in the IT world to try and reduce this complexity, SOA, EAI, etc. and some do a reasonable job, but in the end, there is more needed to really remove the duplication of effort. This is master data management (MDM). If we have an MDM hub in the middle of the above diagram, to act as a level of abstraction between data producers and data consumers, we can significantly simplify the picture and remove all redundancy.</p>
<p style="text-align: center;"><img class="aligncenter" title="Much better." src="/images/blogpics/mdmmiddle.png" alt="Much better." width="600" height="200" /></p>
<h6 style="text-align: center;">Much Better.</h6>
<p>The MDM hub works the most effectively, where there is an actual data store implemented within it. The efficiency arises from being able to separate source-system specific data transformations from source-system independent data transformations. It enables this by maintaining, at its core, a enterprise-level business entity view of the data held in the department level (purpose-specific) data applications. This business view of the data, is populated from the source system&#8217;s data using source-system specific transformations, and thereby transforming the data into a data model that represents how the business sees the data entities, rather than how the purpose-specific applications see the data. When the data is transformed in this way, into an enterprise level business view of the data, the enterprise as a whole can understand the data and use the data by doing further transformations of that data into purpose-specific data (i.e. data to improve marketing, customer care, logistics efficiency, etc.).  The beauty of the whole system is that the end-users of the data do not need to worry about directly transforming the source system data into their own purpose-specific data, especially when they might need the same type data from multiple source systems (each with their own way of storing that information). They only need to go to one place, where the structure of the data is oriented to how they see the data, and where the data means the same thing thing to everyone (enterprise level). They are then isolated from the changes underneath, that is if a source system is replaced, decommissioned, or radically changed, the impact is kept to a minimum. The only changes required are the source-system specific transformations, and nothing else.</p>
<p>To get back to my original argument, I believe that an MDM system is mandatory for efficient operation of any medium to large-sized organization, and gets more important as the enterprise gets bigger. And I like to keep the main roles of the data managers separate from one another (at least on a personnel level, it not at an organizational structure level). That is, I prefer to have those individuals who use the data to run the business (data consumer enterprise roles) focused on using the data, and those individuals who are responsible for producing the data, focused on the applications that produce the data, regardless of where they fit into the organization&#8217;s operational structure. Therefore, I maintain that those individual who are responsible for the MDM system focus on doing that to the best of their abilities, and those that use the MDM system, such as those who would use the MDM system as the main source for the data warehouse, focused on building and maintaining the data warehouse.</p>
<p>So, at the end of the day, the bulk (possibly) of your data integration would happen in the MDM system, using source-system specific transformations, which would then allow the data warehouse to focus on generating analytical data for downstream use , without having the added burden of being responsible for the data integration as well, having only to worry about using source-system independent transformations, like many other downstream applications.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.bireview.org/bireviewblogs/archives/data-integration-and-the-data-warehouse-who-does-what/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Enterprise Data Warehouses and Master Data Management</title>
		<link>http://www.bireview.org/bireviewblogs/archives/enterprise-data-warehouses-and-master-data-management</link>
		<comments>http://www.bireview.org/bireviewblogs/archives/enterprise-data-warehouses-and-master-data-management#comments</comments>
		<pubDate>Mon, 07 Sep 2009 20:54:51 +0000</pubDate>
		<dc:creator>Christopher Shortt</dc:creator>
				<category><![CDATA[Enterprise Data Warehousing]]></category>
		<category><![CDATA[business intelligence]]></category>
		<category><![CDATA[data governance]]></category>
		<category><![CDATA[data warehouse]]></category>
		<category><![CDATA[master data management]]></category>

		<guid isPermaLink="false">http://www.bireview.org/bireviewblogs/?p=22</guid>
		<description><![CDATA[In organizations where the data warehouse has not reached full maturity, which is pretty much the bulk of them, the data warehouse still operates as a department level. This means that the data warehouse was not designed from the ground up to serve the enterprise ...]]></description>
			<content:encoded><![CDATA[<p>In organizations where the data warehouse has not reached full maturity, which is pretty much the bulk of them, the data warehouse still operates as a department level. This means that the data warehouse was not designed from the ground up to serve the enterprise needs, but to serve the needs of one, or a few, department within the organization. What happens with this department level focus, is that a lot of time is spent reconciling the differences in the data that the data warehouse provides to the individual departments that require it, as each of those departments may have its own definitions for the data they are expecting.</p>
<p>The ultimate result of the data warehouse team having to do this reconciliation on an on-going basis, is the promotion of the idea of data governance at the enterprise level. Pushing the ownership of the meanings (thus the definitions of) the data within the data warehouse, back to the organization seems like the right thing to do, and of course, it is. It is, if it&#8217;s done properly, because what we really want to do by implementing some data governance in the organization, is to implement a master data management system, which data governance is an important part of. We are saying, &#8220;We know that the data must be managed at the enterprise level, not at the department level, and we must engage the entire enterprise in the management of the data.&#8221;.</p>
<p>The management of &#8220;data&#8221; at an enterprise level means removing the shackles of departmental ownership of the data while still maintaining the departmental systems that manage that data. This is, by far, the trickiest part, as it deals with people, rather than just the technology and we will examine this in a later post. For now, though, we need to understand the implications of what it means to &#8220;manage data at an enterprise level&#8221;. These implications include</p>
<ul>
<li>restructuring the organization a little to implement enterprise data governance,</li>
<li>looking at, planning for, and possibly phasing in a master data management system,</li>
<li>removing some or all of the responsibility for data integration from the data warehouse (controversial, I know),</li>
</ul>
<p>to mention but a few. We will go into more details about these and other issues in future posts as well.</p>
<p>Becoming more mature as an enterprise in how we deal with our data requires a shift in the way we look at, work with, and manage the data that we need to run our enterprise. It&#8217;s a lot of work, but the rewards are definitely worth it.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.bireview.org/bireviewblogs/archives/enterprise-data-warehouses-and-master-data-management/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>
