What to do with bad data?

Tom Breur
September 2009

Introduction

Everybody hates poor-quality data. But let’s face it: in most organizations you will run into data quality issues, at least from time to time. So instead of arguing against poor data quality, a more useful question is: how do you deal with it?

For business intelligence (BI) applications to add value to the corporation, some level of data integration invariably takes place. This might happen in a data warehouse (DWH), an operational data store (ODS), an enterprise resource planning (ERP) or customer relationship management (CRM) application, or some other system. We’ll assume that the data quality problems originate in the upstream (primary) systems that generate your source data. In this article, I’ll focus exclusively on DWH solutions.

There are two fundamentally different ways of dealing with bad data: either you load all of the data “as is” (and deal with the errors later), or you clean/scrub the data on the way into the DWH. The former is the approach advocated by Data Vault architects; the latter I will label the “Ralph Kimball” approach, in honor of his extensive writing on this subject. This article focuses on the pros and cons of both approaches.
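
To make the contrast concrete, here is a minimal sketch of the two loading strategies in Python. It is an illustration under stated assumptions, not a description of any Data Vault or Kimball tooling: the `Warehouse` class, the `is_valid` rule, the field names, and the function names are all hypothetical.

```python
from dataclasses import dataclass, field
from datetime import date


@dataclass
class Warehouse:
    """Hypothetical stand-in for a DWH: loaded rows plus a reject area."""
    rows: list = field(default_factory=list)
    rejected: list = field(default_factory=list)


def is_valid(record: dict) -> bool:
    """Assumed quality rule: a non-empty customer id and an ISO-format date."""
    if not record.get("customer_id"):
        return False
    try:
        date.fromisoformat(record.get("signup_date") or "")
    except ValueError:
        return False
    return True


def load_as_is(source: list, dwh: Warehouse) -> None:
    """Load every source record unchanged; errors are dealt with later."""
    for record in source:
        dwh.rows.append(dict(record))


def usable_rows(dwh: Warehouse) -> list:
    """Downstream step for the as-is load: apply the quality rule at read time."""
    return [row for row in dwh.rows if is_valid(row)]


def load_cleansed(source: list, dwh: Warehouse) -> None:
    """Apply the quality rule on the way in; failing records are set aside."""
    for record in source:
        if is_valid(record):
            dwh.rows.append(dict(record))
        else:
            dwh.rejected.append(dict(record))


source = [
    {"customer_id": "C1", "signup_date": "2009-09-01"},
    {"customer_id": "", "signup_date": "2009-09-02"},    # missing id
    {"customer_id": "C3", "signup_date": "2009-13-01"},  # impossible month
]

as_is = Warehouse()
load_as_is(source, as_is)

cleansed = Warehouse()
load_cleansed(source, cleansed)
```

In this sketch both routes can end up presenting the same usable rows to reports. The difference lies in where the quality rule is applied and in whether the failing records are stored in the warehouse itself or set aside at load time.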
