A data warehouse (DW) is a repository of an organization's electronically stored data. Data warehouses are designed to manage and store data, whereas business intelligence (BI) focuses on using that data for reporting and analysis.
The purpose of a data warehouse is to house standardized, structured, consistent, integrated, correct, cleansed and timely data extracted from the various operational systems in an organization. The extracted data is integrated in the data warehouse environment to provide an enterprise-wide perspective: one version of the truth. The data is structured specifically to address reporting and analytic requirements. Designing common structural metadata with a data modeling method such as entity-relationship diagramming is very important in any data warehouse development effort.
An essential component of a data warehouse/business intelligence system is the metadata, together with the tools to manage and retrieve it. Ralph Kimball describes metadata as the DNA of the data warehouse, because metadata defines the elements of the data warehouse and how they work together.
Kimball et al. refer to three main categories of metadata: technical metadata, business metadata and process metadata. Technical metadata is primarily definitional, while business metadata and process metadata are primarily descriptive. Keep in mind that the categories sometimes overlap.
Data warehousing has specific metadata requirements. Metadata that describes tables typically includes:
Metadata that describes columns within tables typically includes:
Metadata is divided into three categories: descriptive, administrative, and structural. It can be further classified into 14 types. The first type is relational database metadata, which describes relational databases. It may include data about the tables, such as their names, sizes, and number of rows, and about the columns in each table. Another type is data warehouse metadata, which can be viewed in two separate sections: the back room and the front room. Business intelligence metadata explains how corporate finances are reported. General IT metadata is another type, as is file system metadata, which is used by different file systems.
Image metadata describes image files, and audio metadata is an important element of many audio files. Program metadata supports executable files. The list also includes existing software metadata, document metadata, and geospatial metadata. Digital library metadata is another type, and it may itself be separated into three categories: descriptive, administrative, and structural. The remaining types are metamodels and meta-metadata.
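The relational database metadata mentioned above (table names, row counts, and the columns in each table) can usually be read from the database's own catalog. The following is a minimal sketch, assuming an in-memory SQLite database and Python's standard `sqlite3` module; the `customer` table and the `describe_tables` helper are hypothetical names chosen only for illustration, and other database engines expose their catalogs differently.

```python
import sqlite3


def describe_tables(conn):
    """Collect simple table-level and column-level metadata from SQLite.

    Returns {table_name: {"row_count": int, "columns": [(name, type), ...]}}.
    """
    catalog = {}
    # sqlite_master is SQLite's built-in catalog of schema objects.
    tables = conn.execute(
        "SELECT name FROM sqlite_master WHERE type = 'table' ORDER BY name"
    ).fetchall()
    for (table,) in tables:
        # PRAGMA table_info rows are (cid, name, type, notnull, dflt_value, pk).
        columns = [
            (row[1], row[2])
            for row in conn.execute(f'PRAGMA table_info("{table}")')
        ]
        row_count = conn.execute(f'SELECT COUNT(*) FROM "{table}"').fetchone()[0]
        catalog[table] = {"row_count": row_count, "columns": columns}
    return catalog


# Hypothetical example table, created only to have something to describe.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE customer (id INTEGER PRIMARY KEY, name TEXT)")
conn.executemany("INSERT INTO customer (name) VALUES (?)", [("Ada",), ("Grace",)])

catalog = describe_tables(conn)
print(catalog)
```

The point of the sketch is that this information is data about the data: none of it comes from the customer rows themselves, only from the schema and a count.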
There are three main types of metadata:
• Descriptive metadata describes a resource for purposes such as discovery and identification. It can include elements such as title, abstract, author, and keywords.
• Structural metadata indicates how compound objects are put together, for example, how pages are ordered to form chapters.
• Administrative metadata provides information to help manage a resource, such as when and how it was created, its file type and other technical information, and who can access it. There are several subsets of administrative metadata; two that are sometimes listed as separate metadata types are:
− Rights management metadata, which deals with intellectual property rights.
− Preservation metadata, which contains the information needed to archive and preserve a resource.
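As a rough illustration of how the three types above might sit side by side on a single resource, here is a minimal sketch in Python. The `ResourceMetadata` class, the `categories_of` helper, and every field value are hypothetical and exist only for this example; real metadata schemas define their own element names.

```python
from dataclasses import dataclass, field


@dataclass
class ResourceMetadata:
    """Hypothetical container grouping metadata elements by type."""
    descriptive: dict = field(default_factory=dict)
    structural: dict = field(default_factory=dict)
    administrative: dict = field(default_factory=dict)


def categories_of(record, element):
    """Return the names of the metadata types that contain the given element."""
    groups = {
        "descriptive": record.descriptive,
        "structural": record.structural,
        "administrative": record.administrative,
    }
    return [name for name, elements in groups.items() if element in elements]


# Illustrative values only; none of them come from a real resource.
report = ResourceMetadata(
    descriptive={
        "title": "Annual Report",
        "author": "Finance Team",
        "keywords": ["finance", "reporting"],
    },
    # Structural: how pages are ordered to form chapters.
    structural={"chapters": {"Summary": [1, 2], "Results": [3, 4, 5]}},
    administrative={
        "created": "2020-01-15",
        "file_type": "application/pdf",
        # Rights management and preservation are subsets of administrative metadata.
        "rights": {"holder": "Example Corp"},
        "preservation": {"checksum_algorithm": "SHA-256"},
    },
)

print(categories_of(report, "title"))
print(categories_of(report, "rights"))
```

Nesting rights and preservation under the administrative group mirrors the list above; a schema that treats them as separate types would simply promote them to their own fields.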