Zum Hauptinhalt springen
Dekorationsartikel gehören nicht zum Leistungsumfang.
Building the Data Warehouse
Taschenbuch von W H Inmon
Sprache: Englisch

55,40 €*

inkl. MwSt.

Versandkostenfrei per Post / DHL

Lieferzeit 1-2 Wochen

Kategorien:
Beschreibung
* The new edition of the classic bestseller that launched the data warehousing industry covers new approaches and technologies, many of which have been pioneered by Inmon himself
* In addition to explaining the fundamentals of data warehouse systems, the book covers new topics such as methods for handling unstructured data in a data warehouse and storing data across multiple storage media
* Discusses the pros and cons of relational versus multidimensional design and how to measure return on investment in planning data warehouse projects
* Covers advanced topics, including data monitoring and testing
* Although the book includes an extra 100 pages worth of valuable content, the price has actually been reduced from [...] to [...]
* The new edition of the classic bestseller that launched the data warehousing industry covers new approaches and technologies, many of which have been pioneered by Inmon himself
* In addition to explaining the fundamentals of data warehouse systems, the book covers new topics such as methods for handling unstructured data in a data warehouse and storing data across multiple storage media
* Discusses the pros and cons of relational versus multidimensional design and how to measure return on investment in planning data warehouse projects
* Covers advanced topics, including data monitoring and testing
* Although the book includes an extra 100 pages worth of valuable content, the price has actually been reduced from [...] to [...]
Über den Autor

William H. Inmon is the acknowledged "Father of Data Warehousing" and a partner in [...] a Web site featuring information on data warehousing and related technologies. He has written more than 40 books on database and data warehousing technologies, and is a frequent speaker (and often the keynote) at major conferences.

Inhaltsverzeichnis

Preface xix

Acknowledgments xxvii

Chapter 1 Evolution of Decision Support Systems 1

The Evolution 2

The Advent of DASD 4

PC/4GL Technology 4

Enter the Extract Program 5

The Spider Web 6

Problems with the Naturally Evolving Architecture 7

Lack of Data Credibility 7

Problems with Productivity 9

From Data to Information 12

A Change in Approach 14

The Architected Environment 16

Data Integration in the Architected Environment 18

Who Is the User? 20

The Development Life Cycle 20

Patterns of Hardware Utilization 22

Setting the Stage for Re-engineering 23

Monitoring the Data Warehouse Environment 25

Summary 28

Chapter 2 The Data Warehouse Environment 29

The Structure of the Data Warehouse 33

Subject Orientation 34

Day 1 to Day n Phenomenon 39

Granularity 41

The Benefits of Granularity 42

An Example of Granularity 43

Dual Levels of Granularity 46

Exploration and Data Mining 50

Living Sample Database 50

Partitioning as a Design Approach 53

Partitioning of Data 53

Structuring Data in the Data Warehouse 56

Auditing and the Data Warehouse 61

Data Homogeneity and Heterogeneity 61

Purging Warehouse Data 64

Reporting and the Architected Environment 64

The Operational Window of Opportunity 65

Incorrect Data in the Data Warehouse 67

Summary 69

Chapter 3 The Data Warehouse and Design 71

Beginning with Operational Data 71

Process and Data Models and the Architected Environment 78

The Data Warehouse and Data Models 79

The Data Warehouse Data Model 81

The Midlevel Data Model 84

The Physical Data Model 88

The Data Model and Iterative Development 91

Normalization and Denormalization 94

Snapshots in the Data Warehouse 100

Metadata 102

Managing Reference Tables in a Data Warehouse 103

Cyclicity of Data - The Wrinkle of Time 105

Complexity of Transformation and Integration 108

Triggering the Data Warehouse Record 112

Events 112

Components of the Snapshot 113

Some Examples 113

Profile Records 114

Managing Volume 115

Creating Multiple Profile Records 117

Going from the Data Warehouse to the Operational Environment 117

Direct Operational Access of Data Warehouse Data 118

Indirect Access of Data Warehouse Data 119

An Airline Commission Calculation System 119

A Retail Personalization System 121

Credit Scoring 123

Indirect Use of Data Warehouse Data 125

Star Joins 126

Supporting the ODS 133

Requirements and the Zachman Framework 134

Summary 136

Chapter 4 Granularity in the Data Warehouse 139

Raw Estimates 140

Input to the Planning Process 141

Data in Overflow 142

Overflow Storage 144

What the Levels of Granularity Will Be 147

Some Feedback Loop Techniques 148

Levels of Granularity - Banking Environment 150

Feeding the Data Marts 157

Summary 157

Chapter 5 The Data Warehouse and Technology 159

Managing Large Amounts of Data 159

Managing Multiple Media 161

Indexing and Monitoring Data 162

Interfaces to Many Technologies 162

Programmer or Designer Control of Data Placement 163

Parallel Storage and Management of Data 164

Metadata Management 165

Language Interface 166

Efficient Loading of Data 166

Efficient Index Utilization 168

Compaction of Data 169

Compound Keys 169

Variable-Length Data 169

Lock Management 171

Index-Only Processing 171

Fast Restore 171

Other Technological Features 172

DBMS Types and the Data Warehouse 172

Changing DBMS Technology 174

Multidimensional DBMS and the Data Warehouse 175

Data Warehousing across Multiple Storage Media 182

The Role of Metadata in the Data Warehouse Environment 182

Context and Content 185

Three Types of Contextual Information 186

Capturing and Managing Contextual Information 187

Looking at the Past 187

Refreshing the Data Warehouse 188

Testing 190

Summary 191

Chapter 6 The Distributed Data Warehouse 193

Types of Distributed Data Warehouses 193

Local and Global Data Warehouses 194

The Local Data Warehouse 197

The Global Data Warehouse 198

Intersection of Global and Local Data 201

Redundancy 206

Access of Local and Global Data 207

The Technologically Distributed Data Warehouse 211

The Independently Evolving Distributed Data Warehouse 213

The Nature of the Development Efforts 213

Completely Unrelated Warehouses 215

Distributed Data Warehouse Development 217

Coordinating Development across Distributed Locations 218

The Corporate Data Model - Distributed 219

Metadata in the Distributed Warehouse 223

Building the Warehouse on Multiple Levels 223

Multiple Groups Building the Current Level of Detail 226

Different Requirements at Different Levels 228

Other Types of Detailed Data 232

Metadata 234

Multiple Platforms for Common Detail Data 235

Summary 236

Chapter 7 Executive Information Systems and the Data Warehouse 239

EIS - The Promise 240

A Simple Example 240

Drill-Down Analysis 243

Supporting the Drill-Down Process 245

The Data Warehouse as a Basis for EIS 247

Where to Turn 248

Event Mapping 251

Detailed Data and EIS 253

Keeping Only Summary Data in the EIS 254

Summary 255

Chapter 8 External Data and the Data Warehouse 257

External Data in the Data Warehouse 260

Metadata and External Data 261

Storing External Data 263

Different Components of External Data 264

Modeling and External Data 265

Secondary Reports 266

Archiving External Data 267

Comparing Internal Data to External Data 267

Summary 268

Chapter 9 Migration to the Architected Environment 269

A Migration Plan 270

The Feedback Loop 278

Strategic Considerations 280

Methodology and Migration 283

A Data-Driven Development Methodology 283

Data-Driven Methodology 286

System Development Life Cycles 286

A Philosophical Observation 286

Summary 287

Chapter 10 The Data Warehouse and the Web 289

Supporting the eBusiness Environment 299

Moving Data from the Web to the Data Warehouse 300

Moving Data from the Data Warehouse to the Web 301

Web Support 302

Summary 302

Chapter 11 Unstructured Data and the Data Warehouse 305

Integrating the Two Worlds 307

Text - The Common Link 308

A Fundamental Mismatch 310

Matching Text across the Environments 310

A Probabilistic Match 311

Matching All the Information 312

A Themed Match 313

Industrially Recognized Themes 313

Naturally Occurring Themes 316

Linkage through Themes and Themed Words 317

Linkage through Abstraction and Metadata 318

A Two-Tiered Data Warehouse 320

Dividing the Unstructured Data Warehouse 321

Documents in the Unstructured Data Warehouse 322

Visualizing Unstructured Data 323

A Self-Organizing Map (SOM) 324

The Unstructured Data Warehouse 325

Volumes of Data and the Unstructured Data Warehouse 326

Fitting the Two Environments Together 327

Summary 330

Chapter 12 The Really Large Data Warehouse 331

Why the Rapid Growth? 332

The Impact of Large Volumes of Data 333

Basic Data-Management Activities 334

The Cost of Storage 335

The Real Costs of Storage 336

The Usage Pattern of Data in the Face of Large Volumes 336

A Simple Calculation 337

Two Classes of Data 338

Implications of Separating Data into Two Classes 339

Disk Storage in the Face of Data Separation 340

Near-Line Storage 341

Access Speed and Disk Storage 342

Archival Storage 343

Implications of Transparency 345

Moving Data from One Environment to Another 346

The CMSM Approach 347

A Data Warehouse Usage Monitor 348

The Extension of the Data Warehouse across Different Storage Media 349

Inverting the Data Warehouse 350

Total Cost 351

Maximum Capacity 352

Summary 354

Chapter 13 The Relational and the Multidimensional Models as a Basis for Database Design 357

The Relational Model 357

The Multidimensional Model 360

Snowflake Structures 361

Differences between the Models 362

The Roots of the Differences 363

Reshaping Relational Data 364

Indirect Access and Direct Access of Data 365

Servicing Future Unknown Needs 366

Servicing the Need to Change Gracefully 367

Independent Data Marts 370

Building Independent Data Marts 371

Summary 375

Chapter 14 Data Warehouse Advanced Topics 377

End-User Requirements and the Data Warehouse 377

The Data Warehouse and the Data Model 378

The Relational Foundation 378

The Data Warehouse and Statistical Processing 379

Resource Contention in the Data Warehouse 380

The Exploration Warehouse 380

The Data Mining Warehouse 382

Freezing the Exploration Warehouse 383

External Data and the Exploration Warehouse 384

Data Marts and Data Warehouses in the Same Processor 384

The Life Cycle of Data...

Details
Erscheinungsjahr: 2005
Fachbereich: Anwendungs-Software
Genre: Importe, Informatik
Rubrik: Naturwissenschaften & Technik
Medium: Taschenbuch
Inhalt: 576 S.
ISBN-13: 9780764599446
ISBN-10: 0764599445
Sprache: Englisch
Herstellernummer: 19769944000
Einband: Kartoniert / Broschiert
Autor: Inmon, W H
Auflage: 4th edition
Hersteller: Wiley
John Wiley & Sons
Verantwortliche Person für die EU: Wiley-VCH GmbH, Boschstr. 12, D-69469 Weinheim, amartine@wiley-vch.de
Maße: 235 x 191 x 31 mm
Von/Mit: W H Inmon
Erscheinungsdatum: 01.10.2005
Gewicht: 1,064 kg
Artikel-ID: 102293807
Über den Autor

William H. Inmon is the acknowledged "Father of Data Warehousing" and a partner in [...] a Web site featuring information on data warehousing and related technologies. He has written more than 40 books on database and data warehousing technologies, and is a frequent speaker (and often the keynote) at major conferences.

Inhaltsverzeichnis

Preface xix

Acknowledgments xxvii

Chapter 1 Evolution of Decision Support Systems 1

The Evolution 2

The Advent of DASD 4

PC/4GL Technology 4

Enter the Extract Program 5

The Spider Web 6

Problems with the Naturally Evolving Architecture 7

Lack of Data Credibility 7

Problems with Productivity 9

From Data to Information 12

A Change in Approach 14

The Architected Environment 16

Data Integration in the Architected Environment 18

Who Is the User? 20

The Development Life Cycle 20

Patterns of Hardware Utilization 22

Setting the Stage for Re-engineering 23

Monitoring the Data Warehouse Environment 25

Summary 28

Chapter 2 The Data Warehouse Environment 29

The Structure of the Data Warehouse 33

Subject Orientation 34

Day 1 to Day n Phenomenon 39

Granularity 41

The Benefits of Granularity 42

An Example of Granularity 43

Dual Levels of Granularity 46

Exploration and Data Mining 50

Living Sample Database 50

Partitioning as a Design Approach 53

Partitioning of Data 53

Structuring Data in the Data Warehouse 56

Auditing and the Data Warehouse 61

Data Homogeneity and Heterogeneity 61

Purging Warehouse Data 64

Reporting and the Architected Environment 64

The Operational Window of Opportunity 65

Incorrect Data in the Data Warehouse 67

Summary 69

Chapter 3 The Data Warehouse and Design 71

Beginning with Operational Data 71

Process and Data Models and the Architected Environment 78

The Data Warehouse and Data Models 79

The Data Warehouse Data Model 81

The Midlevel Data Model 84

The Physical Data Model 88

The Data Model and Iterative Development 91

Normalization and Denormalization 94

Snapshots in the Data Warehouse 100

Metadata 102

Managing Reference Tables in a Data Warehouse 103

Cyclicity of Data - The Wrinkle of Time 105

Complexity of Transformation and Integration 108

Triggering the Data Warehouse Record 112

Events 112

Components of the Snapshot 113

Some Examples 113

Profile Records 114

Managing Volume 115

Creating Multiple Profile Records 117

Going from the Data Warehouse to the Operational Environment 117

Direct Operational Access of Data Warehouse Data 118

Indirect Access of Data Warehouse Data 119

An Airline Commission Calculation System 119

A Retail Personalization System 121

Credit Scoring 123

Indirect Use of Data Warehouse Data 125

Star Joins 126

Supporting the ODS 133

Requirements and the Zachman Framework 134

Summary 136

Chapter 4 Granularity in the Data Warehouse 139

Raw Estimates 140

Input to the Planning Process 141

Data in Overflow 142

Overflow Storage 144

What the Levels of Granularity Will Be 147

Some Feedback Loop Techniques 148

Levels of Granularity - Banking Environment 150

Feeding the Data Marts 157

Summary 157

Chapter 5 The Data Warehouse and Technology 159

Managing Large Amounts of Data 159

Managing Multiple Media 161

Indexing and Monitoring Data 162

Interfaces to Many Technologies 162

Programmer or Designer Control of Data Placement 163

Parallel Storage and Management of Data 164

Metadata Management 165

Language Interface 166

Efficient Loading of Data 166

Efficient Index Utilization 168

Compaction of Data 169

Compound Keys 169

Variable-Length Data 169

Lock Management 171

Index-Only Processing 171

Fast Restore 171

Other Technological Features 172

DBMS Types and the Data Warehouse 172

Changing DBMS Technology 174

Multidimensional DBMS and the Data Warehouse 175

Data Warehousing across Multiple Storage Media 182

The Role of Metadata in the Data Warehouse Environment 182

Context and Content 185

Three Types of Contextual Information 186

Capturing and Managing Contextual Information 187

Looking at the Past 187

Refreshing the Data Warehouse 188

Testing 190

Summary 191

Chapter 6 The Distributed Data Warehouse 193

Types of Distributed Data Warehouses 193

Local and Global Data Warehouses 194

The Local Data Warehouse 197

The Global Data Warehouse 198

Intersection of Global and Local Data 201

Redundancy 206

Access of Local and Global Data 207

The Technologically Distributed Data Warehouse 211

The Independently Evolving Distributed Data Warehouse 213

The Nature of the Development Efforts 213

Completely Unrelated Warehouses 215

Distributed Data Warehouse Development 217

Coordinating Development across Distributed Locations 218

The Corporate Data Model - Distributed 219

Metadata in the Distributed Warehouse 223

Building the Warehouse on Multiple Levels 223

Multiple Groups Building the Current Level of Detail 226

Different Requirements at Different Levels 228

Other Types of Detailed Data 232

Metadata 234

Multiple Platforms for Common Detail Data 235

Summary 236

Chapter 7 Executive Information Systems and the Data Warehouse 239

EIS - The Promise 240

A Simple Example 240

Drill-Down Analysis 243

Supporting the Drill-Down Process 245

The Data Warehouse as a Basis for EIS 247

Where to Turn 248

Event Mapping 251

Detailed Data and EIS 253

Keeping Only Summary Data in the EIS 254

Summary 255

Chapter 8 External Data and the Data Warehouse 257

External Data in the Data Warehouse 260

Metadata and External Data 261

Storing External Data 263

Different Components of External Data 264

Modeling and External Data 265

Secondary Reports 266

Archiving External Data 267

Comparing Internal Data to External Data 267

Summary 268

Chapter 9 Migration to the Architected Environment 269

A Migration Plan 270

The Feedback Loop 278

Strategic Considerations 280

Methodology and Migration 283

A Data-Driven Development Methodology 283

Data-Driven Methodology 286

System Development Life Cycles 286

A Philosophical Observation 286

Summary 287

Chapter 10 The Data Warehouse and the Web 289

Supporting the eBusiness Environment 299

Moving Data from the Web to the Data Warehouse 300

Moving Data from the Data Warehouse to the Web 301

Web Support 302

Summary 302

Chapter 11 Unstructured Data and the Data Warehouse 305

Integrating the Two Worlds 307

Text - The Common Link 308

A Fundamental Mismatch 310

Matching Text across the Environments 310

A Probabilistic Match 311

Matching All the Information 312

A Themed Match 313

Industrially Recognized Themes 313

Naturally Occurring Themes 316

Linkage through Themes and Themed Words 317

Linkage through Abstraction and Metadata 318

A Two-Tiered Data Warehouse 320

Dividing the Unstructured Data Warehouse 321

Documents in the Unstructured Data Warehouse 322

Visualizing Unstructured Data 323

A Self-Organizing Map (SOM) 324

The Unstructured Data Warehouse 325

Volumes of Data and the Unstructured Data Warehouse 326

Fitting the Two Environments Together 327

Summary 330

Chapter 12 The Really Large Data Warehouse 331

Why the Rapid Growth? 332

The Impact of Large Volumes of Data 333

Basic Data-Management Activities 334

The Cost of Storage 335

The Real Costs of Storage 336

The Usage Pattern of Data in the Face of Large Volumes 336

A Simple Calculation 337

Two Classes of Data 338

Implications of Separating Data into Two Classes 339

Disk Storage in the Face of Data Separation 340

Near-Line Storage 341

Access Speed and Disk Storage 342

Archival Storage 343

Implications of Transparency 345

Moving Data from One Environment to Another 346

The CMSM Approach 347

A Data Warehouse Usage Monitor 348

The Extension of the Data Warehouse across Different Storage Media 349

Inverting the Data Warehouse 350

Total Cost 351

Maximum Capacity 352

Summary 354

Chapter 13 The Relational and the Multidimensional Models as a Basis for Database Design 357

The Relational Model 357

The Multidimensional Model 360

Snowflake Structures 361

Differences between the Models 362

The Roots of the Differences 363

Reshaping Relational Data 364

Indirect Access and Direct Access of Data 365

Servicing Future Unknown Needs 366

Servicing the Need to Change Gracefully 367

Independent Data Marts 370

Building Independent Data Marts 371

Summary 375

Chapter 14 Data Warehouse Advanced Topics 377

End-User Requirements and the Data Warehouse 377

The Data Warehouse and the Data Model 378

The Relational Foundation 378

The Data Warehouse and Statistical Processing 379

Resource Contention in the Data Warehouse 380

The Exploration Warehouse 380

The Data Mining Warehouse 382

Freezing the Exploration Warehouse 383

External Data and the Exploration Warehouse 384

Data Marts and Data Warehouses in the Same Processor 384

The Life Cycle of Data...

Details
Erscheinungsjahr: 2005
Fachbereich: Anwendungs-Software
Genre: Importe, Informatik
Rubrik: Naturwissenschaften & Technik
Medium: Taschenbuch
Inhalt: 576 S.
ISBN-13: 9780764599446
ISBN-10: 0764599445
Sprache: Englisch
Herstellernummer: 19769944000
Einband: Kartoniert / Broschiert
Autor: Inmon, W H
Auflage: 4th edition
Hersteller: Wiley
John Wiley & Sons
Verantwortliche Person für die EU: Wiley-VCH GmbH, Boschstr. 12, D-69469 Weinheim, amartine@wiley-vch.de
Maße: 235 x 191 x 31 mm
Von/Mit: W H Inmon
Erscheinungsdatum: 01.10.2005
Gewicht: 1,064 kg
Artikel-ID: 102293807
Sicherheitshinweis