Chapter 1. Getting Started

This is a very interesting time for open source. As soon as a novel concept is put forth by enthusiasts, new functionality is included into software that changes our lives, and a lot of it is built on open source technology. Having been an open source advocate for some time now, I have seen a phenomenal amount of change and progress in the quality and quantity of Open Source Software (OSS) projects. From the thoughtful minds of professional software developers, engineers, and hobbyists, tools have sprung up to support a number of disciplines, including programmers, authors, office staff, teachers, students, media, and graphic designers. While there was a time when there was only expensive proprietary commercial software available to use to perform particular tasks, now there are a lot of new and free alternatives based on OSS.

Open source projects start, and also die, all the time. Each project starts to address what a user, or group of users, perceives as a relative shortcoming in the current computing landscape. While OpenOffice.org was derived from StarOffice to address the lack of an open source office suite, Mozilla has grown from the ashes of Netscape to compete with Internet Explorer, leading to the creation of Firefox, which not only has provided an alternative to IE but has revitalized the browser wars, even garnering attention to its commercial competitor Opera. And there is no end to the innovations that PERL and PHP have brought about.

Even non-free software benefits from OSS. Many different projects take portions of fully functional open source software implementations to use in their products. Commercial routers from companies such as Linksys have embedded Linux in them, and even gamers are affected as the Sony PlayStation 3 and Nintendo Wii were both designed to run using Linux. Mono, the open source implementation of the .Net framework, has helped with growth in the .Net community.

But there has always been one area that has been severely lacking, and that is the area of business intelligence. While there are solutions such as writing PERL or PHP scripts, these really don't leverage full fledged Business Intelligence, the idea that reports and tools can be used by businesses to make strategic decisions based on short term and long term data and trend analysis. There has not been an open source tool that really addresses this shortcoming. Crafty developers can take the long approach and write scripts and programs that automate data reporting tasks, but this is a long and complicated process. Proprietary software for doing reporting tasks do exist, such as the report developer inside Microsoft Access for reporting off Access databases, Crystal Reports, and larger offerings such as Business Objects. These are tools that have been built to automate reporting tasks such as data retrieval, sorting, aggregation, and presentation into a format that is meaningful to the user. Such tools have been lacking in the open source community, and have only begun to gain speed in the last few years.

Introduction to BI

There are two major questions that need to be answered at this point of the book—what is Business Intelligence and why do we need it? Business Intelligence is a lot like many other technology buzzwords that get thrown around. Many people are just aware of this word and say it, and many people will give a complicated definition of it. But the answer is really quite simple.

If I had to give it a formal definition, I would say that Business Intelligence, as it relates to information technology, is any tool or method that allows developers to take data or information, process it, manipulate it, associate it with related information, and present it to decision makers. As for a simplified definition, Business Intelligence involves presenting information to decision makers in a way that helps them make informed decisions.

Consider this scenario. You are the manager for a chain of retail stores. You need to figure out what products you should push to the forefront for the upcoming holiday season. You have a two method approach. First, you get the latest buzz from your marketers, who tell you what the new and upcoming products are. Your second approach is to look at products that have traditionally sold very well. You could also look at web tracking data, usually from large analytics companies such as Omniture or Google Analytics.

In order to project how the current year may go, you need to look at the data you have at your disposal. What product categories have traditionally sold well over a five-year period or even a ten-year period? Has there been any sort of pattern to these sales figures. What individual items have sold well in that period? Are there any trends? When you look at these figures, can you give a projection of how well the products will do? What items should be put on sale in order to push for higher sales? These are the kinds of questions that a decision maker may need answered.

So, the first step in the process is getting to this data, which is usually located someplace such as a data warehouse. If you are looking at a regional level, you may have this information stored separately in a localized data mart, with specific data. If you are looking forward to working on the minutest of data, you may need access to transactional data. Either way, the first hurdle is getting access to the data.

What do you do once you have access to the data? How do you format and present the data in some meaningful fashion that can be used to assist in the BI process? In other words, what is the best way to use this data to tell a good story, one that is easy to understand and also gets you the answers you are looking for? This is where BI tools come into play. Using BI tools, you have the ability to write reports that can present this often time sporadic and confusing data into some sort of format that is useful to decision makers. Once this data is presented, trends can be identified, total figures can be aggregated, and decisions can be made.

..................Content has been hidden....................

You can't read the all page of ebook, please click here login for view all page.
Reset