what are common data cleaning tasks that all can perform using DQS
-
Tuesday, August 28, 2012 7:51 PM
i know a few:
i) address validate (needs connection to external data, right? like mellissa data?)
ii) merge "similar" looking names like NBA or basketball
iii) validate email using regular expressions
What else? Can we share what we are using DQS for?
All Replies
-
Thursday, August 30, 2012 11:05 AM
Hello,
I would say DQS broadly enables you to do the following in terms of "data cleansing":
- Data Correction/validation: Using domain rules and values. Check out the AdventureWorks Sample to have hands on.
- Data Enrichment: Using Reference Data from Windows Azure Marketplace. For example, when you cleanse address records using reference data, the cleansed data returned from the provider has additional info. Look at point 6 in this topic: http://msdn.microsoft.com/en-us/library/hh510392.
- Data Standardization: You can specify the format of the data to be output in a domain for each data type. For more information, see http://msdn.microsoft.com/en-us/library/gg524800#Standardize. Further, you can specify term-based relations to standardize your data.
- Cleanse simple (single-value) and complex (multiple-value) data. An example of DQS usage to cleanse complex source data: Cleansing complex data using composite domains.
- Data Profiling and Notifications: Last but not the least, DQS provides data profiling inofrmation that helps you assess the effectiveness of the data cleansing activity. Notifications provide you handy tips on how you can improve the cleansing activity. For more info, see here.
Thanks,
Vivek
SQL Server Documentation- Marked As Answer by SSISQ Saturday, September 22, 2012 5:00 PM
-
Saturday, September 22, 2012 5:01 PMThanks vivek.

