Data Profiling

A talk by Greg Larsen

About the Talk

January 31, 2013 2:00 PM

Tacoma SQL Server User Group, University of Washington Tacoma

Are you receiving data from external sources? Would you like a quick and easy way to profile this incoming data? Profiling lets you understand various aspects of a new data source and confirm that it meets your data quality standards before you load it into your environment. During this session I will explore the SSIS Data Profiling task as a way to quickly analyze your data. I will review the different profiles available and explain the limitations of the Data Profiling task. I will show you how to use the different profiles to identify candidate keys, null value counts by column, dependencies between columns, length distributions, data formats, and more.
I will perform a number of demos that show how to configure and use the different profiles. Knowing how to use the Data Profiling task and interpret its output helps you quickly determine how best to migrate newly received data into your environment. I will also show you how to use the Data Profiling task output to control your SSIS data flow. By controlling the data flow based on profiling output, you can write processing rules that allow only data meeting your requirements to be loaded into your environment. Finally, I will show you how to store profiling output in SQL Server so you can produce trend reports from it.
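
As a rough illustration of the kinds of statistics mentioned above (null counts, candidate keys, length distribution) and of gating a load on them, here is a minimal Python sketch. It is not the SSIS Data Profiling task or its output format; the function names profile_column and passes_quality_gate, the sample rows, and the 10% null threshold are all assumptions made for illustration.

```python
from collections import Counter


def profile_column(rows, column):
    """Compute a few simple profile statistics for one column (illustrative only)."""
    values = [row.get(column) for row in rows]
    non_null = [v for v in values if v is not None]

    # Fraction of rows where the column is missing.
    null_ratio = (len(values) - len(non_null)) / len(values) if values else 0.0

    # Treated as a candidate key only if there are no nulls and no repeats.
    is_candidate_key = (
        bool(values)
        and len(non_null) == len(values)
        and len(set(non_null)) == len(values)
    )

    # How many non-null values have each string length.
    length_distribution = dict(Counter(len(str(v)) for v in non_null))

    return {
        "null_ratio": null_ratio,
        "is_candidate_key": is_candidate_key,
        "length_distribution": length_distribution,
    }


def passes_quality_gate(profile, max_null_ratio=0.1):
    """Hypothetical load rule: reject a column with too many nulls."""
    return profile["null_ratio"] <= max_null_ratio


# Made-up sample data standing in for a newly received file.
rows = [
    {"id": 1, "zip": "98402"},
    {"id": 2, "zip": None},
    {"id": 3, "zip": "98402-1234"},
    {"id": 4, "zip": "98101"},
]

id_profile = profile_column(rows, "id")
zip_profile = profile_column(rows, "zip")
```

In this sample, the id column qualifies as a candidate key, while the zip column fails the assumed null threshold and shows two different value lengths, which is the sort of signal a format or length profile is meant to surface.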
