<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Business Intelligence Review &#187; architecture</title>
	<atom:link href="http://www.bireview.org/bireviewblogs/archives/tag/architecture/feed" rel="self" type="application/rss+xml" />
	<link>http://www.bireview.org/bireviewblogs</link>
	<description>All things Business Intelligence related. Read and participate!</description>
	<lastBuildDate>Thu, 27 May 2010 11:32:04 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.0</generator>
		<item>
		<title>Battle of the Open Source/Free Data Modelling Tools</title>
		<link>http://www.bireview.org/bireviewblogs/archives/battle-of-the-open-sourcefree-data-modelling-tools</link>
		<comments>http://www.bireview.org/bireviewblogs/archives/battle-of-the-open-sourcefree-data-modelling-tools#comments</comments>
		<pubDate>Tue, 09 Mar 2010 14:25:54 +0000</pubDate>
		<dc:creator>Christopher Shortt</dc:creator>
				<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[architecture]]></category>
		<category><![CDATA[data modelling]]></category>
		<category><![CDATA[open source]]></category>
		<category><![CDATA[software]]></category>

		<guid isPermaLink="false">http://www.bireview.org/bireviewblogs/?p=239</guid>
		<description><![CDATA[I am going to get this started soon with the first data modelling tool to be reviewed...]]></description>
			<content:encoded><![CDATA[<p>I am going to get this started soon with the first data modelling tool to be reviewed being DBDesigner Fork. This application is a development off-shoot of the fabForce DBDesigner v4. The sourceForge website is <a href="http://dbdesigner-fork.sourceforge.net/">here</a>. Before I dive in though, it might be useful to set the criteria for the battle. That being said, here they are:</p>
<ol>
<li>The ability to create ERD diagrams or at least a facsimile thereof using UML class diagrams. Extra points awarded if it does both or if the UML class diagrams can be customized for ERD type diagrams</li>
<li>The ability to generate web-sites from the final diagrams and/or any other supplementary information that can be added to the diagram objects</li>
<li>Speaking of which, extra points awarded for being able to attach significant amounts of meta-data and/or reference external files</li>
<li>Naturally, ease-of-use will be a priority</li>
<li>As will a short learning curve</li>
<li>Ease of implementation is a bonus as will be the ability to install the tool on multiple platforms (in my case, I will test some version of windows, Snow Leopard (Macs) and Linux (likely Ubuntu).</li>
</ol>
<p>I will also include whatever pains I went through during the installation as well, if there was any. Since I run mainly Snow Leopard and will need to virtualize environments for Windows (probably running Wine under OS X) and Linux, I am guessing there might be some. I will keep the pain of installing in the virtual environments separate from the pain of installing the tool itself (which there really shouldn&#8217;t be any).</p>
<p>If you have any particular data modelling tool that you would like me to test, please leave a comment below. Also, soon, I will present a compiled list of contenders, so speak up quickly <img src='http://www.bireview.org/bireviewblogs/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' /> .</p>
<span class="sfforumlink"><a href="http://www.bireview.org/bireviewblogs/forum/open-source-tools/battle-of-the-open-sourcefree-data-modelling-tools/"><p><img src="http://www.bireview.org/bireviewblogs/wp-content/plugins/simple-forum/styles/icons/default/bloglink.png" alt="" /> Join the forum discussion on this post</p>
</a></span>]]></content:encoded>
			<wfw:commentRss>http://www.bireview.org/bireviewblogs/archives/battle-of-the-open-sourcefree-data-modelling-tools/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Business Intelligence 2.0, Is anyone doing it yet?</title>
		<link>http://www.bireview.org/bireviewblogs/archives/business-intelligence-2-0-is-anyone-doing-it-yet</link>
		<comments>http://www.bireview.org/bireviewblogs/archives/business-intelligence-2-0-is-anyone-doing-it-yet#comments</comments>
		<pubDate>Sat, 16 Jan 2010 17:59:32 +0000</pubDate>
		<dc:creator>Christopher Shortt</dc:creator>
				<category><![CDATA[Enterprise Data Architecture]]></category>
		<category><![CDATA[Enterprise Data Warehousing]]></category>
		<category><![CDATA[architecture]]></category>
		<category><![CDATA[business intelligence]]></category>
		<category><![CDATA[data warehouse]]></category>
		<category><![CDATA[process]]></category>

		<guid isPermaLink="false">http://www.bireview.org/bireviewblogs/?p=207</guid>
		<description><![CDATA[In the early part of 2007, Neil Raden wrote an article entitled "Business Intelligence 2.0: Simpler, More Accessible, Inevitable". In this article, he argues that the a data warehouse (BI 1.0) is a big, slow, expensive machine for cranking out analytical data, and when it does, the data is highly-structured, only allowing a rigidly defined set of of questions to be answered. He continues that in the day of doing Google searches and social network...]]></description>
			<content:encoded><![CDATA[<p>In the early part of 2007, <a href="http://www.intelligententerprise.com/experts/raden;jsessionid=A5PASNQLWP2AZQE1GHPCKHWATMY32JVN">Neil Raden</a> wrote an article entitled &#8220;<a href="http://www.intelligententerprise.com/channels/infomanagement/showArticle.jhtml?articleID=197002610">Business Intelligence 2.0: Simpler, More Accessible, Inevitable</a>&#8220;. In this article, he argues that the a data warehouse (BI 1.0) is a big, slow, expensive machine for cranking out analytical data, and when it does, the data is highly-structured, only allowing a rigidly defined set of of questions to be answered. He continues that in the day of doing Google searches and social networks, that finding the information you want is a matter of &#8220;mashing up&#8221; data from everywhere. That is, BI 2.0 needs to be able to get data from where ever it is, and do the integration on the fly.</p>
<p>He also mentions, as I have in a <a href="http://www.bireview.org/bireviewblogs/archives/data-integration-and-the-data-warehouse-who-does-what" target="_self">previous article here</a>, that master data management (MDM) systems can be used to remove from the traditional data warehouse the complex work of data integration. I see this happening more and more for the reasons I expressed earlier, mainly a reduced total cost of ownership and an increased usability of the information. MDM systems are perfectly suited to the &#8220;data integration&#8221; part of the overall business intelligence world, and as such, helps to allow the traditional data warehouse evolve to a smaller, quicker, and less expensive machine to operate. Also, sourcing a data warehouse from an MDM system means that analytical data can be provided far easier and quicker than before, and this supports Neil Radan&#8217;s BI 2.0.</p>
<p>I have worked with organizations that have come to the conclusion that the types, amounts and sources of information needed to run the business properly are constantly changing as the organization tries to keep up with or keep ahead of the competition in a more and more volatile business environment. What resulted was the organization stepping backward in the maturity of its data warehouse (according to the &#8221; <a href="http://www.tdwi.org/publications/display.aspx?ID=7199">Data Warehouse Maturity Model</a>&#8221; I mentioned in a previous article). One organization in particular, to get the data the business needs in front of them faster, did an end-run around the data warehouse and started pulling data from whatever source system they thought had what they needed. I have to admit that the web-based interface they built was very impressive, and they could indeed deliver information quickly.</p>
<p>The problem I had with it was simply that they had ignored all the &#8220;best practices&#8221; about doing good data integration, good data governance and good data management. Because they built their solution outside the normal project path of the organization, in order to avoid the lengthly project schedules and extra costs, they side-stepped everything the organization had put in place to mitigate just the kinds of problems that this new development brought up again. And the organization had suffered the same problems in the past! I was amazed that the business unit in question was allowed to do this, but in the end, you had to sympathize with them. All they wanted was the data in a form they could use, and quickly. The solution should have been to change the processes to allow for these kinds of tools, but still keeping the principles of proper data management in place.</p>
<p>So, they were trying to do Business Intelligence 2.0 by using Web 2.0 type features on top of an immature approach to enterprise data management. In the end, I think it will hurt them more than help them, but who knows. These days things change so fast inside and outside business&#8217;s that even problems like this can disappear amongst everything else that goes on.</p>
<p>I would like to know if anyone is or has worked somewhere where a proper implementation of &#8220;Business Intelligence 2.0&#8243; is or  has happened. Please share your experiences in the comments section.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.bireview.org/bireviewblogs/archives/business-intelligence-2-0-is-anyone-doing-it-yet/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Why Build a Data Warehouse When You Can Lease One?</title>
		<link>http://www.bireview.org/bireviewblogs/archives/why-build-a-data-warehouse-when-you-can-lease-one</link>
		<comments>http://www.bireview.org/bireviewblogs/archives/why-build-a-data-warehouse-when-you-can-lease-one#comments</comments>
		<pubDate>Tue, 05 Jan 2010 13:41:09 +0000</pubDate>
		<dc:creator>Christopher Shortt</dc:creator>
				<category><![CDATA[DWH Data Modeling]]></category>
		<category><![CDATA[Enterprise Data Architecture]]></category>
		<category><![CDATA[Enterprise Data Warehousing]]></category>
		<category><![CDATA[architecture]]></category>
		<category><![CDATA[business intelligence]]></category>
		<category><![CDATA[data warehouse]]></category>
		<category><![CDATA[data warehouse as a service]]></category>
		<category><![CDATA[process]]></category>

		<guid isPermaLink="false">http://www.bireview.org/bireviewblogs/?p=210</guid>
		<description><![CDATA[Back in 2008, I was working for a telco in the Dominican Republic, consulting for their data warehouse team. I proposed a complete re-architecture of the data warehouse and the reporting platforms that the data warehouse supported. When I had to cost out the project, I had to include some hardware, primarily for the Ab Initio ETL tool we were going to migrate to, but also for the development and test servers, and more tape backup machines. All in all, the tab for the entire project grew to almost 2 million USD. It was just another reminder to me of how expensive a data warehouse can be...]]></description>
			<content:encoded><![CDATA[<p>Back in 2008, I was working for a telco in the Dominican Republic, consulting for their data warehouse team. After a review of their data warehouse and several discussion with stakeholders, I proposed a complete re-architecture of the data warehouse and the reporting platforms that the data warehouse supported. When I had to cost out the project, I included some hardware, primarily for the Ab Initio ETL tool we were going to migrate to, but also for the development and test servers, and more tape backup machines. All in all, the tab for the entire project grew to almost 2 million USD. It was just another reminder to me of how expensive a data warehouse can be. In trying to convince the management board of the project, I conducted interviews with the major stakeholders to try and gauge the value of the data warehouse to the organization. Of course, a data warehouse supporting business intelligence was of enough value to the organization to justify it, but the numbers can be hard to swallow never the less.</p>
<p>All of that reminded me of the <a href="http://www.tdwi.org/publications/display.aspx?ID=7199">Data Warehouse Maturity Model</a> I have seen time and again at conferences I have attended and presented at. It shows that a data warehouse has reached its ultimate maturity when it achieves its potential for the return on the investment of building a data warehouse. That is, the business uses the data warehouse and the information within it to drive the business. I started to think about this while I was putting the business case together for this project. Too often, I have found that the decision makers in an organization are not always aware of the value of the data warehouse, and see it as a big, expensive, slow moving giant that they have to live with rather than really benefit from. It is beyond the scope of this article to discuss all the reasons why this might become the perception of the data warehouse, but it was sure the case here, which is why I needed to include in my business case the feedback from all the users of the data warehouse.</p>
<p>The organization in question was looking for ways to reduce the costs of the data warehouse and it occurred to me that one way to reduce the cost of it was to remove it all together from the business. I don&#8217;t mean throw it away (that would be insane), but why not get a little creative? Take all the hardware, the software, and the people that build, operate and maintain it all, and use it to spin off a separate business? This business would be initially funded by the parent organization, contracted by the parent to provide all the data warehouse services it did before but at a more favourable cost. It would operate as an independent business with the sole core competency of doing data warehousing, and be allowed to also sell its services to other organizations as well. By being wholly owned by the parent company and being able to expand its customer base beyond the parent, allows it to become a revenue centre for the parent. Eventually the separation would probably need to be more complete, I suppose, but still, it goes from being a cost centre to a revenue centre. Sweet!</p>
<p>Afterwards, I thought, hey, I&#8217;ve been doing software design and development for over 20 years, data warehousing for almost 10, and ran my own small consulting firm. Why don&#8217;t I just start up a business that provides a data warehouse as a service? Why, I thought to myself, that&#8217;s a great idea and would be a lot of fun! I know a couple of organizations that are moving in the direction of having other &#8220;systems as a service&#8221; anyway (at least outsourcing other major business support systems), so I could probable lock up a customer or two within a couple of years, and who knows? I started to seriously investigate the possibility. I would need to build/buy/rent a data center and build/rent/lease a bunch of servers and disk, not really a problem, but I would (ideally) need to come up with a data architecture that could support multiple clients, possibly across different industries. That would mean, design it, build it, test it, implement it, and then go sell it. Now that&#8217;s a lot of work. And I know that you can buy such industry specific data models. Teradata did, or maybe still does, have one, Oracle has one to, and I am sure there are others. Possible still, but a lot of work, and it would mean I might miss the opportunities I had because of the timing.</p>
<p>In continuing my investigation into setting all this up, I came across a company that already provides Data Warehousing as a Service (DaaS), called <a href="http://kognitio.com">Kognitio</a>. They have their own data centres, they have their own data models, and although they are the only ones I could find, but there might be others by now. So, someone had my genius idea before me (no surprise really). I still think that for an organization that is trying become more mature in their data warehousing, this is a good solution to get off the ground fast and jump the &#8220;gulf between &#8216;Inform Exectutives&#8217; and &#8216;Empower Knowledge Workers&#8217;&#8221; in the afore-mentioned data warehouse maturity model. If you are looking to implement a new data warehouse, or re-architect one, or try to out-source you existing one, I suggest you give them a call. I think I might, just to see if they need an affiliate in Switzerland, where I live.</p>
<p>What do you thing of Data Warehouse as a Service? Good idea or bad? Easy or difficult to implement? Too big a change to your organization or not? Let us know your thoughts in a comment!</p>
<span class="sfforumlink"><a href="http://www.bireview.org/bireviewblogs/forum/general/why-building-a-data-warehouse-when-you-can-lease-one/"><p><img src="http://www.bireview.org/bireviewblogs/wp-content/plugins/simple-forum/styles/icons/default/bloglink.png" alt="" /> Join the forum discussion on this post</p>
</a></span>]]></content:encoded>
			<wfw:commentRss>http://www.bireview.org/bireviewblogs/archives/why-build-a-data-warehouse-when-you-can-lease-one/feed</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>Data Integration and the Data Warehouse, Who Does What?</title>
		<link>http://www.bireview.org/bireviewblogs/archives/data-integration-and-the-data-warehouse-who-does-what</link>
		<comments>http://www.bireview.org/bireviewblogs/archives/data-integration-and-the-data-warehouse-who-does-what#comments</comments>
		<pubDate>Mon, 21 Sep 2009 20:24:54 +0000</pubDate>
		<dc:creator>Christopher Shortt</dc:creator>
				<category><![CDATA[Enterprise Data Architecture]]></category>
		<category><![CDATA[Enterprise Data Warehousing]]></category>
		<category><![CDATA[architecture]]></category>
		<category><![CDATA[business intelligence]]></category>
		<category><![CDATA[data governance]]></category>
		<category><![CDATA[master data management]]></category>

		<guid isPermaLink="false">http://www.bireview.org/bireviewblogs/?p=95</guid>
		<description><![CDATA[In a <a title="Previous Post" href="http://www.bireview.org/bireviewblogs/archives/enterprise-data-warehouses-and-master-data-management">previous post</a>, I mentioned that one of the impacts of implementing a master data management system was removing some or all of the data integration responsibilities from the data warehouse. Since the data warehouse is the usual place for data integration, in particular customer data integration, I would like to explain myself and see if you agree, or at least see my point of view...]]></description>
			<content:encoded><![CDATA[<p>In a <a title="Previous Post" href="http://www.bireview.org/bireviewblogs/archives/enterprise-data-warehouses-and-master-data-management">previous post</a>, I mentioned that one of the impacts of implementing a master data management system was removing some or all of the data integration responsibilities from the data warehouse. Since the data warehouse is the usual place for data integration, in particular customer data integration, I would like to explain myself and see if you agree, or at least see my point of view.</p>
<p>What I mean is that in the big picture of enterprise data management, there are data applications that produce data, and data applications that consume data. In a lot of cases, a data application can (and must) play both roles. I make the distinction here to show one of the great promises of master data management: the reduction in the amount of duplication of work being done by software processes that must transform data between the applications that produce that data, and the applications that consume it. In the following diagram, we can see what can happen if several data consumer applications (right side) must connect directly to other data producing applications (left side) to get (and transform) the data they need.</p>
<p style="text-align: center;"><img class="aligncenter" title="Not a Pretty Picture.." src="/images/blogpics/mixedbag.png" alt="Not a Pretty Picture." width="600" height="200" /></p>
<h6 style="text-align: center;">Not a Pretty Picture.</h6>
<p>As you can see, it can get pretty messy, pretty quickly. There are things that exist in the IT world to try and reduce this complexity, SOA, EAI, etc. and some do a reasonable job, but in the end, there is more needed to really remove the duplication of effort. This is master data management (MDM). If we have an MDM hub in the middle of the above diagram, to act as a level of abstraction between data producers and data consumers, we can significantly simplify the picture and remove all redundancy.</p>
<p style="text-align: center;"><img class="aligncenter" title="Much better." src="/images/blogpics/mdmmiddle.png" alt="Much better." width="600" height="200" /></p>
<h6 style="text-align: center;">Much Better.</h6>
<p>The MDM hub works the most effectively, where there is an actual data store implemented within it. The efficiency arises from being able to separate source-system specific data transformations from source-system independent data transformations. It enables this by maintaining, at its core, a enterprise-level business entity view of the data held in the department level (purpose-specific) data applications. This business view of the data, is populated from the source system&#8217;s data using source-system specific transformations, and thereby transforming the data into a data model that represents how the business sees the data entities, rather than how the purpose-specific applications see the data. When the data is transformed in this way, into an enterprise level business view of the data, the enterprise as a whole can understand the data and use the data by doing further transformations of that data into purpose-specific data (i.e. data to improve marketing, customer care, logistics efficiency, etc.).  The beauty of the whole system is that the end-users of the data do not need to worry about directly transforming the source system data into their own purpose-specific data, especially when they might need the same type data from multiple source systems (each with their own way of storing that information). They only need to go to one place, where the structure of the data is oriented to how they see the data, and where the data means the same thing thing to everyone (enterprise level). They are then isolated from the changes underneath, that is if a source system is replaced, decommissioned, or radically changed, the impact is kept to a minimum. The only changes required are the source-system specific transformations, and nothing else.</p>
<p>To get back to my original argument, I believe that an MDM system is mandatory for efficient operation of any medium to large-sized organization, and gets more important as the enterprise gets bigger. And I like to keep the main roles of the data managers separate from one another (at least on a personnel level, it not at an organizational structure level). That is, I prefer to have those individuals who use the data to run the business (data consumer enterprise roles) focused on using the data, and those individuals who are responsible for producing the data, focused on the applications that produce the data, regardless of where they fit into the organization&#8217;s operational structure. Therefore, I maintain that those individual who are responsible for the MDM system focus on doing that to the best of their abilities, and those that use the MDM system, such as those who would use the MDM system as the main source for the data warehouse, focused on building and maintaining the data warehouse.</p>
<p>So, at the end of the day, the bulk (possibly) of your data integration would happen in the MDM system, using source-system specific transformations, which would then allow the data warehouse to focus on generating analytical data for downstream use , without having the added burden of being responsible for the data integration as well, having only to worry about using source-system independent transformations, like many other downstream applications.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.bireview.org/bireviewblogs/archives/data-integration-and-the-data-warehouse-who-does-what/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>
