Info > Data Mining3-1. DM and DW > The Architecture of Data


¢¹¢º The Architecture of Data

 

¢º Data is the heart of both DW and DM

 

¢º Description way

¢¹ simply differentiation between internal data and external data
¢¹ a more useful way of looking at data is in terms of abstraction

¡Ø operational data - who, what, where and when
¡Ø summary data - summaries by who, what, where, when
¡Ø database schema - physical layout of the data, tables, fields.
¡Ø metadata - logical model and mapping to physical layout and sources.
¡Ø business rule - what's been learned from the data.

 

¢º Operation Data

¡Ø the most basic form of data

¢¹ Example : every product purchased by a customer, every bank transaction, every credit card purchases, every flight segment, every package, every telephone call.
¢¹ Changes to operational data are quite common
¢¹ The fact that operational data changes has to be part of any robust data warehousing approach.
¢¹ the amount of data : enormous

¡Ø a typical fast food restaurant sells hundreds of thousands of meal over the course of a year.
¡Ø a chain of food markets

 

¢º Summary Data

¡Ø the first level of abstraction
¢¹ derived from operational data and the most common way that users interact with data
¢¹ summary data is the data in reports that provide static views of the business.
¢¹ today's summary data may not be important in the future.

¢º Database Schema

¡Ø data about the data itself
¢¹ This most basic level is the physical layout of the operational data and the summary data ¡æ tell us what we have and what we can know.
¢¹ Do we know the telephone numbers of customer ?
¢¹ If there is some telephone number field somewhere in the data. Can we determine sales trends for products in different regions?
¢¹ If there are tables with historical sales data for products in different regions or tables with operational data that can be used to create the appropriate summaries.

 

¢º Metadata

¡Ø a logical data model that defines the data in terms of entities (the product of hierarchy), attributes, and relationships (business and its various partners) meaningful on the business level.
¢¹ The importence of metadata

¡Ø provides the key link between the business users and the data
¡Ø a good metadata system gives users the ability to browse through the metadata on their desktop.
¡Ø accurate and accessible

 

¢º The business rules

¡Ø not only describe the structure of the data(metadata) but they also describe why relationship exist and how they are applied.
¢¹ a close relationship to DM
¢¹ Example

¡Ø market basket analysis
¡Ø decision tree
¡Ø produce explicit rules ¡æ business ¡æ DM technique

¢¹ Even more generally, the process of using DM should be captured as business rules as well as specific results


Info > Data Mining3-1. DM and DW > The Architecture of Data