<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Business Intelligence Review &#187; business intelligence</title>
	<atom:link href="http://www.bireview.org/bireviewblogs/archives/tag/business-intelligence/feed" rel="self" type="application/rss+xml" />
	<link>http://www.bireview.org/bireviewblogs</link>
	<description>All things Business Intelligence related. Read and participate!</description>
	<lastBuildDate>Thu, 27 May 2010 11:32:04 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.0</generator>
		<item>
		<title>BI User Profiles or the Dream of Self-Service BI</title>
		<link>http://www.bireview.org/bireviewblogs/archives/bi-user-profiles-or-the-dream-of-self-service-bi</link>
		<comments>http://www.bireview.org/bireviewblogs/archives/bi-user-profiles-or-the-dream-of-self-service-bi#comments</comments>
		<pubDate>Thu, 20 May 2010 08:14:30 +0000</pubDate>
		<dc:creator>Matthias Blume</dc:creator>
				<category><![CDATA[Business Analytics]]></category>
		<category><![CDATA[Enterprise Data Warehousing]]></category>
		<category><![CDATA[BI Tools]]></category>
		<category><![CDATA[business intelligence]]></category>
		<category><![CDATA[data analysis]]></category>
		<category><![CDATA[data mining]]></category>
		<category><![CDATA[data warehouse]]></category>

		<guid isPermaLink="false">http://www.bireview.org/bireviewblogs/?p=248</guid>
		<description><![CDATA[Software vendors have been feeding the hope for the perfectly flexible and intuitive BI tool ever since BI software came to the market. Of course the tool would also deliver the response times in line with speed of thought analysis.]]></description>
			<content:encoded><![CDATA[<p>Software vendors have been feeding the hope for the perfectly flexible and intuitive BI tool ever since BI software came to the market. Of course the tool would also deliver the response times in line with speed of thought analysis. Managers waiting for responses from the BI team working their way through projects and query requests would be issues of the past.</p>
<p>Slowly, more and more analyst firms and vendors draw a more differentiated picture and accept that there is a large range of user situations and needs which ask for a range of appropriate technical answers.</p>
<p>One of the most important goals of the data warehouse is to provide a single version of the information. In order to avoid potentially different data selections or transformations it would only be sensible to also have just one tool to access the single version of the information, eradicating the problem of different numbers for the same question once and for all. This might be the strongest motivation for BI professionals and business users who seek a universal BI end user tool that answers all business needs and widens the access bottle neck. The growing market opportunity for BI tools motivates the software vendors to promise technological breakthroughs and even miracles where required.</p>
<p>The challenge becomes apparent when taking a closer look at what different users need. At least three use cases need to be distinguished and give rise to different and conflicting software requirements:</p>
<ul>
<li>Most data warehouses will probably form the basis for a stream of ready-made reports being send out to many users across the organisation. Usually no interactivity is provided with the reports and most users would more likely become confused by features allowing for customisation. What is needed are capabilities to automatically and repeatedly generate reports combining several tabular and graphical elements.</li>
<li>Senior management wants the convenience to oversee the entire business and drill down into subject areas when indicated. This need can be answered by information dashboards providing basic flexibility via an intuitive GUI. The bigger challenge here is to select and define the pieces of information to be displayed in line with the rest of the organisation.</li>
<li>Follow-up questions triggered by the two first user groups and all other analysis give rise to non-standard or ad hoc queries. They involve not only tools with full query flexibility but most importantly data analysis skills and knowledge of the data sources restricting these queries to a small minority of the users for most organisations.  Data warehouse thought leader Ralph Kimball wrote in 1998: “Ad hoc query tools, as powerful as they are, can only be effectively used and understood by about 10% of all the potential end users of a data warehouse. … The very best ROLAP-oriented ad hoc query tools improve the 10% number to perhaps 20%.” <a id="_ftnref1" name="_ftnref1" href="#_ftn1"><sup>[1]</sup></a><br />
Also data discovery tools, data mining workbenches and even plain SQL tools fall in this broad category.<br />
See also Stephen Few&#8217;s article <a title="Permanent Link: Big BI is Stuck: Illustrated by SAP  BusinessObjects Explorer" rel="bookmark" href="http://www.perceptualedge.com/blog/?p=727">Big BI is Stuck: Illustrated by SAP  BusinessObjects Explorer</a> (his Blog is highly recommended in general).</li>
</ul>
<p>These three use cases could be further broken down, for example, by distinguishing between being OLAP focused or not, and vendors tend to showcase certain capabilities (such as data search) as constituting a whole new category.</p>
<p>It shows that a unified tool for all users and situations could probably only be developed at the expense of too many compromises and would probably satisfy no one in the end.</p>
<p>The even more utopian plan of BI delivery by pure self-service could only work with a <em>complete </em>data warehouse presented by a <em>clearly documented</em> user interface layer and in situations where queries are limited to <em>simple look ups</em> of stored facts. I’m not sure that the three conditions are fulfilled anywhere.</p>
<hr size="1" /><a id="_ftn1" name="_ftn1" href="#_ftnref1">[1]</a> <em> The Data Warehouse Lifecycle Toolkit</em>, Ralph Kimball et al., 1998, John Wiley &amp; Sons</p>
]]></content:encoded>
			<wfw:commentRss>http://www.bireview.org/bireviewblogs/archives/bi-user-profiles-or-the-dream-of-self-service-bi/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Making Operational BI Operational&#8230;</title>
		<link>http://www.bireview.org/bireviewblogs/archives/making-operational-bi-operational</link>
		<comments>http://www.bireview.org/bireviewblogs/archives/making-operational-bi-operational#comments</comments>
		<pubDate>Tue, 19 Jan 2010 13:59:43 +0000</pubDate>
		<dc:creator>Christopher Shortt</dc:creator>
				<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[agile methodologies]]></category>
		<category><![CDATA[business intelligence]]></category>
		<category><![CDATA[data warehouse]]></category>
		<category><![CDATA[Operational BI]]></category>
		<category><![CDATA[process]]></category>
		<category><![CDATA[quality]]></category>
		<category><![CDATA[software]]></category>

		<guid isPermaLink="false">http://www.bireview.org/bireviewblogs/?p=229</guid>
		<description><![CDATA[In a recent article by Wayne W. Eckerson, Director, TDWI Research, entitled &#8220;Operational BI: Sorting Out Your Options&#8221;, the options are given for providing near realtime business intelligence. The point of the article seems to be that you can have your BI either fast or of high quality, but not both. Given that a data <a href='http://www.bireview.org/bireviewblogs/archives/making-operational-bi-operational'>[...]</a>]]></description>
			<content:encoded><![CDATA[<p>In a recent article by Wayne W. Eckerson,  Director, TDWI Research, entitled &#8220;Operational BI: Sorting Out Your Options&#8221;, the options are given for providing near realtime business intelligence. The point of the article seems to be that you can have your BI either fast or of high quality, but not both. Given that a data warehouse by definition is not a near realtime animal, and providing near realtime data means excluding data quality checks, data integration checks and all the other things that add latency to the data warehouse, it seems that you can&#8217;t have both.</p>
<p>But is that really true? In my experience, the only thing holding back a data warehouse from being able to provide near realtime data was not the data warehouse itself, but all the heavy software development processes around it, and a lack of communication with the business about the efforts put in to maintain good data quality. Both of these things are correctable, not just to provide near realtime BI, but to provide better BI in general. </p>
<p>Heavy Software Development Processes</p>
<p>This is almost always the result of doing software development using the waterfall approach. There are better ways (see: <a href="http://www.bireview.org/bireviewblogs/archives/enough-projects-drowned-under-the-waterfall">Enough Projects Drowned Under the Waterfall</a>). By employing a software development methodology that doesn&#8217;t bury itself in hand-crafted documentation (which always requires a lot of time), and takes a multi-streaming approach to getting things done (like implementing Data Governance or taking advantage of it if it already exists), the development cycle can be shortened and more reactive to change. It also helps to have this coupled to an architecture that can support change and extension easily.</p>
<p>Communication with the Business</p>
<p>Often, in an attempt to get around the lack of communication with the development group, business users will build their own solutions to generate the  BI they need quicker. This, of course, works because they are able to side-step all the (historically) heavy processes the data warehouse team has in place. What happens though, is that the solutions they build don&#8217;t take into account an enterprise view of the data, and always end up duplicating data transformation logic that already is or at least should be done somewhere else. This results from a lack of communication between the groups, and the lack of understanding between the business and the data warehouse team about the what is really needed and how what his really needed can be produced.</p>
<p>We Need to Talk and We Need to Act, at the Same Time</p>
<p>As mentioned above, the way out is to talk together, often and purposefully, and to act together, often and purposefully. But don&#8217;t wait for the talking to be done before you act, and don&#8217;t wait for the actions to be completed before talking again. Multi-streaming these activities makes them both go faster and results in a better solution through improved collaboration. Now that&#8217;s Agile Development!</p>
]]></content:encoded>
			<wfw:commentRss>http://www.bireview.org/bireviewblogs/archives/making-operational-bi-operational/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Business Intelligence 2.0, Is anyone doing it yet?</title>
		<link>http://www.bireview.org/bireviewblogs/archives/business-intelligence-2-0-is-anyone-doing-it-yet</link>
		<comments>http://www.bireview.org/bireviewblogs/archives/business-intelligence-2-0-is-anyone-doing-it-yet#comments</comments>
		<pubDate>Sat, 16 Jan 2010 17:59:32 +0000</pubDate>
		<dc:creator>Christopher Shortt</dc:creator>
				<category><![CDATA[Enterprise Data Architecture]]></category>
		<category><![CDATA[Enterprise Data Warehousing]]></category>
		<category><![CDATA[architecture]]></category>
		<category><![CDATA[business intelligence]]></category>
		<category><![CDATA[data warehouse]]></category>
		<category><![CDATA[process]]></category>

		<guid isPermaLink="false">http://www.bireview.org/bireviewblogs/?p=207</guid>
		<description><![CDATA[In the early part of 2007, Neil Raden wrote an article entitled "Business Intelligence 2.0: Simpler, More Accessible, Inevitable". In this article, he argues that the a data warehouse (BI 1.0) is a big, slow, expensive machine for cranking out analytical data, and when it does, the data is highly-structured, only allowing a rigidly defined set of of questions to be answered. He continues that in the day of doing Google searches and social network...]]></description>
			<content:encoded><![CDATA[<p>In the early part of 2007, <a href="http://www.intelligententerprise.com/experts/raden;jsessionid=A5PASNQLWP2AZQE1GHPCKHWATMY32JVN">Neil Raden</a> wrote an article entitled &#8220;<a href="http://www.intelligententerprise.com/channels/infomanagement/showArticle.jhtml?articleID=197002610">Business Intelligence 2.0: Simpler, More Accessible, Inevitable</a>&#8220;. In this article, he argues that the a data warehouse (BI 1.0) is a big, slow, expensive machine for cranking out analytical data, and when it does, the data is highly-structured, only allowing a rigidly defined set of of questions to be answered. He continues that in the day of doing Google searches and social networks, that finding the information you want is a matter of &#8220;mashing up&#8221; data from everywhere. That is, BI 2.0 needs to be able to get data from where ever it is, and do the integration on the fly.</p>
<p>He also mentions, as I have in a <a href="http://www.bireview.org/bireviewblogs/archives/data-integration-and-the-data-warehouse-who-does-what" target="_self">previous article here</a>, that master data management (MDM) systems can be used to remove from the traditional data warehouse the complex work of data integration. I see this happening more and more for the reasons I expressed earlier, mainly a reduced total cost of ownership and an increased usability of the information. MDM systems are perfectly suited to the &#8220;data integration&#8221; part of the overall business intelligence world, and as such, helps to allow the traditional data warehouse evolve to a smaller, quicker, and less expensive machine to operate. Also, sourcing a data warehouse from an MDM system means that analytical data can be provided far easier and quicker than before, and this supports Neil Radan&#8217;s BI 2.0.</p>
<p>I have worked with organizations that have come to the conclusion that the types, amounts and sources of information needed to run the business properly are constantly changing as the organization tries to keep up with or keep ahead of the competition in a more and more volatile business environment. What resulted was the organization stepping backward in the maturity of its data warehouse (according to the &#8221; <a href="http://www.tdwi.org/publications/display.aspx?ID=7199">Data Warehouse Maturity Model</a>&#8221; I mentioned in a previous article). One organization in particular, to get the data the business needs in front of them faster, did an end-run around the data warehouse and started pulling data from whatever source system they thought had what they needed. I have to admit that the web-based interface they built was very impressive, and they could indeed deliver information quickly.</p>
<p>The problem I had with it was simply that they had ignored all the &#8220;best practices&#8221; about doing good data integration, good data governance and good data management. Because they built their solution outside the normal project path of the organization, in order to avoid the lengthly project schedules and extra costs, they side-stepped everything the organization had put in place to mitigate just the kinds of problems that this new development brought up again. And the organization had suffered the same problems in the past! I was amazed that the business unit in question was allowed to do this, but in the end, you had to sympathize with them. All they wanted was the data in a form they could use, and quickly. The solution should have been to change the processes to allow for these kinds of tools, but still keeping the principles of proper data management in place.</p>
<p>So, they were trying to do Business Intelligence 2.0 by using Web 2.0 type features on top of an immature approach to enterprise data management. In the end, I think it will hurt them more than help them, but who knows. These days things change so fast inside and outside business&#8217;s that even problems like this can disappear amongst everything else that goes on.</p>
<p>I would like to know if anyone is or has worked somewhere where a proper implementation of &#8220;Business Intelligence 2.0&#8243; is or  has happened. Please share your experiences in the comments section.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.bireview.org/bireviewblogs/archives/business-intelligence-2-0-is-anyone-doing-it-yet/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Why Build a Data Warehouse When You Can Lease One?</title>
		<link>http://www.bireview.org/bireviewblogs/archives/why-build-a-data-warehouse-when-you-can-lease-one</link>
		<comments>http://www.bireview.org/bireviewblogs/archives/why-build-a-data-warehouse-when-you-can-lease-one#comments</comments>
		<pubDate>Tue, 05 Jan 2010 13:41:09 +0000</pubDate>
		<dc:creator>Christopher Shortt</dc:creator>
				<category><![CDATA[DWH Data Modeling]]></category>
		<category><![CDATA[Enterprise Data Architecture]]></category>
		<category><![CDATA[Enterprise Data Warehousing]]></category>
		<category><![CDATA[architecture]]></category>
		<category><![CDATA[business intelligence]]></category>
		<category><![CDATA[data warehouse]]></category>
		<category><![CDATA[data warehouse as a service]]></category>
		<category><![CDATA[process]]></category>

		<guid isPermaLink="false">http://www.bireview.org/bireviewblogs/?p=210</guid>
		<description><![CDATA[Back in 2008, I was working for a telco in the Dominican Republic, consulting for their data warehouse team. I proposed a complete re-architecture of the data warehouse and the reporting platforms that the data warehouse supported. When I had to cost out the project, I had to include some hardware, primarily for the Ab Initio ETL tool we were going to migrate to, but also for the development and test servers, and more tape backup machines. All in all, the tab for the entire project grew to almost 2 million USD. It was just another reminder to me of how expensive a data warehouse can be...]]></description>
			<content:encoded><![CDATA[<p>Back in 2008, I was working for a telco in the Dominican Republic, consulting for their data warehouse team. After a review of their data warehouse and several discussion with stakeholders, I proposed a complete re-architecture of the data warehouse and the reporting platforms that the data warehouse supported. When I had to cost out the project, I included some hardware, primarily for the Ab Initio ETL tool we were going to migrate to, but also for the development and test servers, and more tape backup machines. All in all, the tab for the entire project grew to almost 2 million USD. It was just another reminder to me of how expensive a data warehouse can be. In trying to convince the management board of the project, I conducted interviews with the major stakeholders to try and gauge the value of the data warehouse to the organization. Of course, a data warehouse supporting business intelligence was of enough value to the organization to justify it, but the numbers can be hard to swallow never the less.</p>
<p>All of that reminded me of the <a href="http://www.tdwi.org/publications/display.aspx?ID=7199">Data Warehouse Maturity Model</a> I have seen time and again at conferences I have attended and presented at. It shows that a data warehouse has reached its ultimate maturity when it achieves its potential for the return on the investment of building a data warehouse. That is, the business uses the data warehouse and the information within it to drive the business. I started to think about this while I was putting the business case together for this project. Too often, I have found that the decision makers in an organization are not always aware of the value of the data warehouse, and see it as a big, expensive, slow moving giant that they have to live with rather than really benefit from. It is beyond the scope of this article to discuss all the reasons why this might become the perception of the data warehouse, but it was sure the case here, which is why I needed to include in my business case the feedback from all the users of the data warehouse.</p>
<p>The organization in question was looking for ways to reduce the costs of the data warehouse and it occurred to me that one way to reduce the cost of it was to remove it all together from the business. I don&#8217;t mean throw it away (that would be insane), but why not get a little creative? Take all the hardware, the software, and the people that build, operate and maintain it all, and use it to spin off a separate business? This business would be initially funded by the parent organization, contracted by the parent to provide all the data warehouse services it did before but at a more favourable cost. It would operate as an independent business with the sole core competency of doing data warehousing, and be allowed to also sell its services to other organizations as well. By being wholly owned by the parent company and being able to expand its customer base beyond the parent, allows it to become a revenue centre for the parent. Eventually the separation would probably need to be more complete, I suppose, but still, it goes from being a cost centre to a revenue centre. Sweet!</p>
<p>Afterwards, I thought, hey, I&#8217;ve been doing software design and development for over 20 years, data warehousing for almost 10, and ran my own small consulting firm. Why don&#8217;t I just start up a business that provides a data warehouse as a service? Why, I thought to myself, that&#8217;s a great idea and would be a lot of fun! I know a couple of organizations that are moving in the direction of having other &#8220;systems as a service&#8221; anyway (at least outsourcing other major business support systems), so I could probable lock up a customer or two within a couple of years, and who knows? I started to seriously investigate the possibility. I would need to build/buy/rent a data center and build/rent/lease a bunch of servers and disk, not really a problem, but I would (ideally) need to come up with a data architecture that could support multiple clients, possibly across different industries. That would mean, design it, build it, test it, implement it, and then go sell it. Now that&#8217;s a lot of work. And I know that you can buy such industry specific data models. Teradata did, or maybe still does, have one, Oracle has one to, and I am sure there are others. Possible still, but a lot of work, and it would mean I might miss the opportunities I had because of the timing.</p>
<p>In continuing my investigation into setting all this up, I came across a company that already provides Data Warehousing as a Service (DaaS), called <a href="http://kognitio.com">Kognitio</a>. They have their own data centres, they have their own data models, and although they are the only ones I could find, but there might be others by now. So, someone had my genius idea before me (no surprise really). I still think that for an organization that is trying become more mature in their data warehousing, this is a good solution to get off the ground fast and jump the &#8220;gulf between &#8216;Inform Exectutives&#8217; and &#8216;Empower Knowledge Workers&#8217;&#8221; in the afore-mentioned data warehouse maturity model. If you are looking to implement a new data warehouse, or re-architect one, or try to out-source you existing one, I suggest you give them a call. I think I might, just to see if they need an affiliate in Switzerland, where I live.</p>
<p>What do you thing of Data Warehouse as a Service? Good idea or bad? Easy or difficult to implement? Too big a change to your organization or not? Let us know your thoughts in a comment!</p>
<span class="sfforumlink"><a href="http://www.bireview.org/bireviewblogs/forum/general/why-building-a-data-warehouse-when-you-can-lease-one/"><p><img src="http://www.bireview.org/bireviewblogs/wp-content/plugins/simple-forum/styles/icons/default/bloglink.png" alt="" /> Join the forum discussion on this post</p>
</a></span>]]></content:encoded>
			<wfw:commentRss>http://www.bireview.org/bireviewblogs/archives/why-build-a-data-warehouse-when-you-can-lease-one/feed</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>Enough Projects Drowned Under the Waterfall</title>
		<link>http://www.bireview.org/bireviewblogs/archives/enough-projects-drowned-under-the-waterfall</link>
		<comments>http://www.bireview.org/bireviewblogs/archives/enough-projects-drowned-under-the-waterfall#comments</comments>
		<pubDate>Wed, 23 Sep 2009 14:41:55 +0000</pubDate>
		<dc:creator>Matthias Blume</dc:creator>
				<category><![CDATA[Development Methodologies]]></category>
		<category><![CDATA[agile methodologies]]></category>
		<category><![CDATA[business intelligence]]></category>
		<category><![CDATA[data warehouse]]></category>
		<category><![CDATA[process]]></category>

		<guid isPermaLink="false">http://www.bireview.org/bireviewblogs/?p=131</guid>
		<description><![CDATA[It is quite obvious that there is no lack of innovative IT development processes, when one browses through the literature and blogs about software development methodologies. Few are those who would defend the waterfall approach ...]]></description>
			<content:encoded><![CDATA[<p>It is quite obvious that there is no lack of innovative IT development processes, when one browses through the literature and blogs about software development methodologies. Few are those who would defend the waterfall approach without concessions. The reasons for it still being abundant sound more like excuses. The need for budgeting, out-tasking with rigid vendor models and simple resistance to change are frequent arguments. While calling Agile Models innovative, it is actually quite similar to the Bauhaus school for architecture and design of the 1920s and 1930s which are still being looked at as newfangled in many parts of the world. <a href="http://en.wikipedia.org/wiki/Agile_software_development" target="_blank">Agile development models</a> arose already in the 1990s ­­— ages ago in the IT world.</p>
<p>Isn’t it time to restrict the obedience to the business side deciding about what is being developed? Shouldn’t IT considerations at least be as seriously considered when it comes to how to deliver the right systems and quickly? Agile methods are particularly appropriate for BI systems which are close to the end users. That users often cannot precisely formulate their needs initially, is one of the strongest arguments for an incremental and iterative process. But there are aspects specific to BI favouring prototyping. Encouraging active user involvement throughout the development is certainly of high value for both sides. Delivering the parts of highest priority first and using that output can give very useful input for subsequent phases including the option of stopping the project, if first results made the project obsolete or changed the requirements drastically. Among the strongest critisms BI teams face are being slow and not delivering very useful results. Both issues have been strong motivations for agile approaches since the beginning.</p>
<p>As Wikipedia puts it, <a href="http://en.wikipedia.org/wiki/Agile_software_development" target="_blank">agile development</a> is &#8220;based on iterative development, where requirements and solutions evolve through collaboration between self-organizing cross-functional teams&#8221;. It is in particular &#8220;a business approach that aligns development with customer needs and company goals.&#8221;</p>
<p>We might need to reinterpret some of the principles of the <a href="http://en.wikipedia.org/wiki/Agile_Manifesto" target="_blank">agile manifesto</a>. The principle <em>working software is delivered frequently </em>might sometimes be read <em>results are delivered frequently. </em>It is the close collaboration between the BI team and the users supported by frequent output to the users keeping them up to date that matters here. A principle that seems very hard to swallow and contrary to most IT development culture says that <em>even late changes in requirements are welcomed.</em> Ok, maybe not all changes are appropriate and need to be expressed so late. But this openess and flexibility is key to agile development. It means accepting that it is hard to impossible for users to sign and close BI requirements before having seen the data.</p>
<p>The fact that relatively few organisations apply agile processes probably does not mean that people are all happy with the waterfall process. Many large organizations have difficulty bridging the gap between the traditional waterfall method and an agile one. It would also require getting around reasons predominantely external to BI such as overall budgeting, planning and roadmap management and prioritisation.</p>
<p>Wikipedia describes the <a href="http://en.wikipedia.org/wiki/Agile_software_development#Suitability_of_agile_methods" target="_blank">agile home ground</a> as projects of low criticality with a small team of senior developers and requirements that change very often. A consultative management culture would help.</p>
<p>Don&#8217;t get me wrong. Major infrastructure projects, be it enterprise data warehouse or something operational, are <em>not </em>good initial candidates for process innovation. Where the big picture is important to be kept in mind, where the long term maintenance and flexibility has to be assured and users interact with the deliverable only indirectly, a classical planned approach still has its virtue. But when it comes to small analytical marts and BI interfaces we should take a good dose of self-confidence and go agile even it takes a few serious discussions.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.bireview.org/bireviewblogs/archives/enough-projects-drowned-under-the-waterfall/feed</wfw:commentRss>
		<slash:comments>3</slash:comments>
		</item>
		<item>
		<title>Data Integration and the Data Warehouse, Who Does What?</title>
		<link>http://www.bireview.org/bireviewblogs/archives/data-integration-and-the-data-warehouse-who-does-what</link>
		<comments>http://www.bireview.org/bireviewblogs/archives/data-integration-and-the-data-warehouse-who-does-what#comments</comments>
		<pubDate>Mon, 21 Sep 2009 20:24:54 +0000</pubDate>
		<dc:creator>Christopher Shortt</dc:creator>
				<category><![CDATA[Enterprise Data Architecture]]></category>
		<category><![CDATA[Enterprise Data Warehousing]]></category>
		<category><![CDATA[architecture]]></category>
		<category><![CDATA[business intelligence]]></category>
		<category><![CDATA[data governance]]></category>
		<category><![CDATA[master data management]]></category>

		<guid isPermaLink="false">http://www.bireview.org/bireviewblogs/?p=95</guid>
		<description><![CDATA[In a <a title="Previous Post" href="http://www.bireview.org/bireviewblogs/archives/enterprise-data-warehouses-and-master-data-management">previous post</a>, I mentioned that one of the impacts of implementing a master data management system was removing some or all of the data integration responsibilities from the data warehouse. Since the data warehouse is the usual place for data integration, in particular customer data integration, I would like to explain myself and see if you agree, or at least see my point of view...]]></description>
			<content:encoded><![CDATA[<p>In a <a title="Previous Post" href="http://www.bireview.org/bireviewblogs/archives/enterprise-data-warehouses-and-master-data-management">previous post</a>, I mentioned that one of the impacts of implementing a master data management system was removing some or all of the data integration responsibilities from the data warehouse. Since the data warehouse is the usual place for data integration, in particular customer data integration, I would like to explain myself and see if you agree, or at least see my point of view.</p>
<p>What I mean is that in the big picture of enterprise data management, there are data applications that produce data, and data applications that consume data. In a lot of cases, a data application can (and must) play both roles. I make the distinction here to show one of the great promises of master data management: the reduction in the amount of duplication of work being done by software processes that must transform data between the applications that produce that data, and the applications that consume it. In the following diagram, we can see what can happen if several data consumer applications (right side) must connect directly to other data producing applications (left side) to get (and transform) the data they need.</p>
<p style="text-align: center;"><img class="aligncenter" title="Not a Pretty Picture.." src="/images/blogpics/mixedbag.png" alt="Not a Pretty Picture." width="600" height="200" /></p>
<h6 style="text-align: center;">Not a Pretty Picture.</h6>
<p>As you can see, it can get pretty messy, pretty quickly. There are things that exist in the IT world to try and reduce this complexity, SOA, EAI, etc. and some do a reasonable job, but in the end, there is more needed to really remove the duplication of effort. This is master data management (MDM). If we have an MDM hub in the middle of the above diagram, to act as a level of abstraction between data producers and data consumers, we can significantly simplify the picture and remove all redundancy.</p>
<p style="text-align: center;"><img class="aligncenter" title="Much better." src="/images/blogpics/mdmmiddle.png" alt="Much better." width="600" height="200" /></p>
<h6 style="text-align: center;">Much Better.</h6>
<p>The MDM hub works the most effectively, where there is an actual data store implemented within it. The efficiency arises from being able to separate source-system specific data transformations from source-system independent data transformations. It enables this by maintaining, at its core, a enterprise-level business entity view of the data held in the department level (purpose-specific) data applications. This business view of the data, is populated from the source system&#8217;s data using source-system specific transformations, and thereby transforming the data into a data model that represents how the business sees the data entities, rather than how the purpose-specific applications see the data. When the data is transformed in this way, into an enterprise level business view of the data, the enterprise as a whole can understand the data and use the data by doing further transformations of that data into purpose-specific data (i.e. data to improve marketing, customer care, logistics efficiency, etc.).  The beauty of the whole system is that the end-users of the data do not need to worry about directly transforming the source system data into their own purpose-specific data, especially when they might need the same type data from multiple source systems (each with their own way of storing that information). They only need to go to one place, where the structure of the data is oriented to how they see the data, and where the data means the same thing thing to everyone (enterprise level). They are then isolated from the changes underneath, that is if a source system is replaced, decommissioned, or radically changed, the impact is kept to a minimum. The only changes required are the source-system specific transformations, and nothing else.</p>
<p>To get back to my original argument, I believe that an MDM system is mandatory for efficient operation of any medium to large-sized organization, and gets more important as the enterprise gets bigger. And I like to keep the main roles of the data managers separate from one another (at least on a personnel level, it not at an organizational structure level). That is, I prefer to have those individuals who use the data to run the business (data consumer enterprise roles) focused on using the data, and those individuals who are responsible for producing the data, focused on the applications that produce the data, regardless of where they fit into the organization&#8217;s operational structure. Therefore, I maintain that those individual who are responsible for the MDM system focus on doing that to the best of their abilities, and those that use the MDM system, such as those who would use the MDM system as the main source for the data warehouse, focused on building and maintaining the data warehouse.</p>
<p>So, at the end of the day, the bulk (possibly) of your data integration would happen in the MDM system, using source-system specific transformations, which would then allow the data warehouse to focus on generating analytical data for downstream use , without having the added burden of being responsible for the data integration as well, having only to worry about using source-system independent transformations, like many other downstream applications.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.bireview.org/bireviewblogs/archives/data-integration-and-the-data-warehouse-who-does-what/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Enterprise Data Warehouses and Master Data Management</title>
		<link>http://www.bireview.org/bireviewblogs/archives/enterprise-data-warehouses-and-master-data-management</link>
		<comments>http://www.bireview.org/bireviewblogs/archives/enterprise-data-warehouses-and-master-data-management#comments</comments>
		<pubDate>Mon, 07 Sep 2009 20:54:51 +0000</pubDate>
		<dc:creator>Christopher Shortt</dc:creator>
				<category><![CDATA[Enterprise Data Warehousing]]></category>
		<category><![CDATA[business intelligence]]></category>
		<category><![CDATA[data governance]]></category>
		<category><![CDATA[data warehouse]]></category>
		<category><![CDATA[master data management]]></category>

		<guid isPermaLink="false">http://www.bireview.org/bireviewblogs/?p=22</guid>
		<description><![CDATA[In organizations where the data warehouse has not reached full maturity, which is pretty much the bulk of them, the data warehouse still operates as a department level. This means that the data warehouse was not designed from the ground up to serve the enterprise ...]]></description>
			<content:encoded><![CDATA[<p>In organizations where the data warehouse has not reached full maturity, which is pretty much the bulk of them, the data warehouse still operates as a department level. This means that the data warehouse was not designed from the ground up to serve the enterprise needs, but to serve the needs of one, or a few, department within the organization. What happens with this department level focus, is that a lot of time is spent reconciling the differences in the data that the data warehouse provides to the individual departments that require it, as each of those departments may have its own definitions for the data they are expecting.</p>
<p>The ultimate result of the data warehouse team having to do this reconciliation on an on-going basis, is the promotion of the idea of data governance at the enterprise level. Pushing the ownership of the meanings (thus the definitions of) the data within the data warehouse, back to the organization seems like the right thing to do, and of course, it is. It is, if it&#8217;s done properly, because what we really want to do by implementing some data governance in the organization, is to implement a master data management system, which data governance is an important part of. We are saying, &#8220;We know that the data must be managed at the enterprise level, not at the department level, and we must engage the entire enterprise in the management of the data.&#8221;.</p>
<p>The management of &#8220;data&#8221; at an enterprise level means removing the shackles of departmental ownership of the data while still maintaining the departmental systems that manage that data. This is, by far, the trickiest part, as it deals with people, rather than just the technology and we will examine this in a later post. For now, though, we need to understand the implications of what it means to &#8220;manage data at an enterprise level&#8221;. These implications include</p>
<ul>
<li>restructuring the organization a little to implement enterprise data governance,</li>
<li>looking at, planning for, and possibly phasing in a master data management system,</li>
<li>removing some or all of the responsibility for data integration from the data warehouse (controversial, I know),</li>
</ul>
<p>to mention but a few. We will go into more details about these and other issues in future posts as well.</p>
<p>Becoming more mature as an enterprise in how we deal with our data requires a shift in the way we look at, work with, and manage the data that we need to run our enterprise. It&#8217;s a lot of work, but the rewards are definitely worth it.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.bireview.org/bireviewblogs/archives/enterprise-data-warehouses-and-master-data-management/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Walking the Tightrope Between Quick Analysis and Solid Enterprise BI Systems</title>
		<link>http://www.bireview.org/bireviewblogs/archives/walking-the-tightrope-between-quick-analysis-and-solid-enterprise-bi-systems</link>
		<comments>http://www.bireview.org/bireviewblogs/archives/walking-the-tightrope-between-quick-analysis-and-solid-enterprise-bi-systems#comments</comments>
		<pubDate>Sun, 06 Sep 2009 20:21:31 +0000</pubDate>
		<dc:creator>Matthias Blume</dc:creator>
				<category><![CDATA[Analytical CRM]]></category>
		<category><![CDATA[analytical crm]]></category>
		<category><![CDATA[business intelligence]]></category>
		<category><![CDATA[data analysis]]></category>
		<category><![CDATA[process]]></category>

		<guid isPermaLink="false">http://www.bireview.org/bireviewblogs/?p=14</guid>
		<description><![CDATA[Many people active in analytical CRM got used to the situation of sitting on the fence between marketing and BI technology / DWH. In many organisations...]]></description>
			<content:encoded><![CDATA[<p>Many people active in analytical CRM got used to the situation of sitting on the fence between marketing and BI technology / DWH. In many organisations marketing with its particular and extensive analytical needs created dedicated teams assuring a large autonomy. These people sometimes ask the BI teams to adapt the DWH and sometimes cannot wait and help themselves. The aCRM analysts are mostly driven by the analytical results and relatively short project cycles. They will e.g. process additional intermediary transformations with the tools they know and have at hand. IT architecture, standards and policies are considered as of secondary priority. Let the IT folks take care of that.</p>
<p>Over time their sandbox databases, development pools and other workspaces become crucial components of the aCRM set of data sources, suddenly requiring enterprise support, maintenance and reliability. However, due to the historically grown anarchical structure only very adventurous IT departments would be willing to conclude an SLA for such systems, which are hard to disentangle and maintain.</p>
<p>At first sight there seems to be no escape from such a road block. Depending on the balance of power, either BI will insist on a solid evolution of its systems and even enforce an internal monopoly, which imposes their rhythm on all internal customers. Or the power users extend the corporate BI systems with the many little things they need and value while progressing steadily towards an unmanageable situation. Compliance becomes a sheer nightmare. Collaboration between marketing and IT often unpleasant.</p>
<p>Either way, be it the <em>freedom for power users approach</em> or the <em>IT rules approach, </em> it is not sustainable. Flexible processes need to enable the business to obtain results at a speed acceptable for them and handle the transition between prototyping systems towards solid ones whenever appropriate. Throwaway extensions which have been used once or twice and left alone afterwards, should be flagged as such, archived and phased out of the workspace in order to avoid that such dead bodies prevent you from taking care of the extensions still being used.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.bireview.org/bireviewblogs/archives/walking-the-tightrope-between-quick-analysis-and-solid-enterprise-bi-systems/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Data Mining and Presentable Graphics</title>
		<link>http://www.bireview.org/bireviewblogs/archives/data-mining-and-presentable-graphics</link>
		<comments>http://www.bireview.org/bireviewblogs/archives/data-mining-and-presentable-graphics#comments</comments>
		<pubDate>Sun, 06 Sep 2009 20:15:27 +0000</pubDate>
		<dc:creator>Matthias Blume</dc:creator>
				<category><![CDATA[Business Analytics]]></category>
		<category><![CDATA[business intelligence]]></category>
		<category><![CDATA[data mining]]></category>
		<category><![CDATA[presentation graphics]]></category>

		<guid isPermaLink="false">http://www.bireview.org/bireviewblogs/?p=10</guid>
		<description><![CDATA[In early phases of a data mining project, when selecting, discovering and preparing the data the analyst looks at quite a few graphs to get a quick understanding of what can be used and where further preparation and transformation is needed. Those graphs...]]></description>
			<content:encoded><![CDATA[<p>In early phases of a data mining project, when selecting, discovering and preparing the data the analyst looks at quite a few graphs to get a quick understanding of what can be used and where further preparation and transformation is needed. Those graphs are disposable in most cases and don&#8217;t need fancy axis labels or even alignment with corporate design. What matters here is speed, ease of use and readability for the informed power user. The leading data mining suites such as PASW Modeler aka Clementine, the SAS Enterprise Miner and also the open source workbench KNIME<a  id="_ftnref1" name="_ftnref1" href="#_ftn1"><sup>[1]</sup></a> from University of Konstanz are well prepared for this kind of graph mass production and disposal. This phase of data preparation and step by step building up of understanding is often quite time consuming. The ergonomics and efficiency gains provided here are one of the reasons d&#8217;être of the mining suites as opposed to isolated tools</p>
<p>But from time to time one or more of these disposable graphs turn out to be nuggets &#8212; valuable discoveries or insight that deserve being shared with a wider audience. So the graph &#8212; say a histogram comparing numbers of contacts for a few segments &#8212; is already there. Capturing this by-product should not slow us down on our way towards the model we are heading for. Unfortunately the histogram’s layout, axes and captions are not in compliance with the visualisation standards and rules we have been preaching and teaching before<a id="_ftnref2" name="_ftnref2" href="#_ftn2"><sup>[2]</sup></a> . The histogram with tick marks for 0.8  1.8 and so on until 19.8 for these difficult to be more integer data is simply not presentation ready. Too bad.</p>
<p>Now starting up another tool reproducing the selection, the little transformation and the histo itself will likely take a few minutes and distract us from our core task, the model to be built.</p>
<p>So after all, we might be inclined to limit the celebration of the nugget to showing it to their colleague who happens to take a coffee 2 meters away. But anything more would be too much hassle and not as target oriented as we ought to be. At least as long as data mining suites and presentable graphics remain two different stories.</p>
<p>PS: This hold actually more for Clementine and KNIME which have the aspiration of being exhaustive and fully integrated. SAS will often require the use of two tools (Guide and Miner) anyway, depending on where the data come from.</p>
<hr size="1" /><a  id="_ftn1" name="_ftn1" href="#_ftnref1">[1]</a> The three tools do not make up an exhaustive list but simple happen to be the ones that I happen to have used</p>
<p><a id="_ftn2" name="_ftn2" href="#_ftnref2">[2]</a> See for example <a href="http://www.perceptualedge.com/blog/">http://www.perceptualedge.com/blog/</a> or various publications on data visualisation for guidelines how presentable graphs should look like.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.bireview.org/bireviewblogs/archives/data-mining-and-presentable-graphics/feed</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
	</channel>
</rss>
