University: Visvesvaraya Technological University
Data Mining and Data Warehousing (18CS641)
Department of Information Science and Engineering, SDMIT, Ujire.
Module-1
DATA WAREHOUSING & MODELLING
1.1 Introduction
Data warehouses generalize and consolidate data in multidimensional space. The construction of
data warehouses involves data cleaning, data integration, and data transformation, and can be viewed
as an important preprocessing step for data mining. Moreover, data warehouses provide online
analytical processing (OLAP) tools for the interactive analysis of multidimensional data of varied
granularities, which facilitates effective data generalization and data mining.
Many other data mining functions, such as association, classification, prediction, and clustering,
can be integrated with OLAP operations to enhance interactive mining of knowledge at multiple
levels of abstraction. Hence, the data warehouse has become an increasingly important platform for
data analysis and OLAP and will provide an effective platform for data mining. Therefore, data
warehousing and OLAP form an essential step in the knowledge discovery process.
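The construction steps named above (data cleaning, data integration, and data transformation) and the idea of analysing the result at varied granularities can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration: the two source systems, their field names, the sales figures, and the helper functions `integrate`, `clean`, `transform`, and `aggregate` are hypothetical and do not come from any particular warehouse product. Integration is applied first here only so that cleaning can work on a single schema.

```python
from collections import defaultdict

# Hypothetical raw records from two source systems (all values invented).
store_sales = [
    {"date": "2024-01-15", "city": " mumbai", "amount": "120.50"},
    {"date": "2024-01-20", "city": "Delhi", "amount": "80.00"},
    {"date": "2024-02-03", "city": "Mumbai", "amount": None},  # missing value
]
web_sales = [
    {"day": "2024-02-10", "town": "DELHI", "total": "45.25"},
    {"day": "2024-02-11", "town": "Mumbai", "total": "30.00"},
]

def integrate(store, web):
    """Data integration: map both sources onto one shared schema."""
    unified = list(store)
    for r in web:
        unified.append({"date": r["day"], "city": r["town"], "amount": r["total"]})
    return unified

def clean(records):
    """Data cleaning: drop records whose amount is missing."""
    return [r for r in records if r["amount"] is not None]

def transform(records):
    """Data transformation: normalize city names and convert amounts to floats."""
    return [
        {"date": r["date"], "city": r["city"].strip().title(), "amount": float(r["amount"])}
        for r in records
    ]

def aggregate(records, key):
    """Sum the amount measure over the grouping produced by `key`."""
    totals = defaultdict(float)
    for r in records:
        totals[key(r)] += r["amount"]
    return dict(totals)

warehouse = transform(clean(integrate(store_sales, web_sales)))

# Finer granularity: total per (city, month).
by_city_month = aggregate(warehouse, lambda r: (r["city"], r["date"][:7]))
# Coarser granularity: total per city, with the time dimension rolled up.
by_city = aggregate(warehouse, lambda r: r["city"])

print(by_city_month)
print(by_city)
```

The same `aggregate` helper serves both views because only the grouping key changes. That is the sense in which one consolidated data set can be examined at several levels of abstraction.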
1.2 Data Warehouse: Basic Concepts
What Is a Data Warehouse?
Data warehousing provides architectures and tools for business executives to systematically
organize, understand, and use their data to make strategic decisions. Data warehouse systems are
valuable tools in today’s competitive, fast-evolving world. In the last several years, many firms have
spent millions of dollars on building enterprise-wide data warehouses. Many people feel that, with
competition mounting in every industry, data warehousing is the latest must-have marketing
weapon: a way to retain customers by learning more about their needs.
A data warehouse refers to a data repository that is maintained separately from an
organization’s operational databases. Data warehouse systems allow for integration of a variety of
application systems. They support information processing by providing a solid platform of
consolidated historic data for analysis.
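The difference between an operational database and a separately maintained store of historic data can be sketched as follows. This is only an illustration under stated assumptions: the customer record, the `load_snapshot` helper, and the idea of loading dated snapshots at month end are hypothetical choices, not a description of how any specific warehouse is loaded.

```python
from datetime import date

# Operational store: holds only the current state and is updated in place.
operational_db = {"cust-1": {"plan": "basic"}}

# Warehouse: a separate, append-only store of dated snapshots (hypothetical design).
warehouse_history = []

def load_snapshot(snapshot_date):
    """Copy the current operational state into the warehouse with a date stamp."""
    for customer_id, row in operational_db.items():
        warehouse_history.append(
            {"customer_id": customer_id, "plan": row["plan"], "as_of": snapshot_date}
        )

load_snapshot(date(2024, 1, 31))
operational_db["cust-1"]["plan"] = "premium"  # the operational copy loses the old value
load_snapshot(date(2024, 2, 29))

# The warehouse still holds both states, so the change can be analysed over time.
plans_over_time = [(r["as_of"].isoformat(), r["plan"]) for r in warehouse_history]
print(plans_over_time)
```

After the update, the operational store knows only the current plan, while the warehouse retains the earlier value as well. This is what allows it to act as a platform of consolidated historic data.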
Syllabus
Basic Concepts: Data Warehousing: A Multitier Architecture; Data Warehouse Models:
Enterprise Warehouse, Data Mart, and Virtual Warehouse; Extraction, Transformation, and
Loading; Data Cube: A Multidimensional Data Model; Stars, Snowflakes, and Fact Constellations:
Schemas for Multidimensional Data Models; Dimensions: The Role of Concept Hierarchies;
Measures: Their Categorization and Computation; Typical OLAP Operations.