SQL Server Data Quality Services – First Look

This is a discussion on SQL Server Data Quality Services – First Look within the BI Monkey forums, part of the CORTEX Blogs category.
So I have managed to get SSDQS up and running, and here’s what I get when I open the SSDQS Client:

[Image: SQL Server Data Quality Services Client front page]

For those without a microscope, the client has 3 areas of activity:
Knowledge Base Management

Knowledge Bases are at the heart of SSDQS functionality. Here you define the rules that SSDQS will apply when validating data, and the action taken when those rules are breached. The idea behind calling them “Knowledge Bases” is that they capture knowledge about the data in your organisation; effectively they are catalogues of rules. (I rather hope this name changes, as Microsoft already uses Knowledge Base heavily as a term in its support activities, so it’s a bit of a confusing choice.)

Anyway, nomenclature moans aside, the Knowledge Base is the most important thing to grasp here. It’s the container for the rules that the SSDQS engine, either through the client or SSIS, will use when validating data. The Knowledge Base contains Domains, each of which represents an area of Data Quality. In an Address style Knowledge Base, for example, a Domain could be a single field, such as City, or a compound selection of fields such as Full Address, made up of Address, City and Postal Code. Hence the term Domain: it’s important to note that SSDQS is not a single-field-only validation engine. It can deal with collections of fields, and you can also create different Domains relating to the same fields, which allows Data Quality to be applied with context. Sticking with the Address model, this means a Knowledge Base can subject Addresses from different countries to different rules by creating two Domains over the same set of fields and calling whichever Domain is appropriate. There also appear to be fuzzy matching capabilities.

Data Quality Projects

Projects are actual applications of Knowledge Bases across sets of data. From what I can see so far, there are two modes of operation: Cleansing and Matching.

Cleansing is the application of the Knowledge Base against a set of data to test and refine the rules you have in place.
Matching is applying the Knowledge Base against a set of data and running what appear to be Fuzzy Matching algorithms over it to match it to existing records.

I’ll be able to flesh this out more once I have some data to play with (I’m going to have to create some as there don’t appear to be any samples to work with). The thing I like about this is that these projects are not an isolated process with an end result, but feed back into the Knowledge Base so they become part of an ongoing process, which is what dealing with Data Quality is.

Administration

Well, there’s always some Admin. In this case the console covers Activity Monitoring and Configuration. Activity Monitoring covers usage and activities against Knowledge Bases. Configuration manages some default matching parameters and connections to 3rd party Reference databases. Thrilling stuff.

Summary & Links

So far, SSDQS looks like a powerful and user-friendly solution to Data Quality problems. In subsequent posts I’ll dig into the capabilities and start picking it apart. In the meantime, here are some useful resources to get you started: Get More from the original blog...
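
To make the Knowledge Base and Domain ideas above a little more concrete, here is a minimal Python sketch of how I picture them. It is not the SSDQS API or its actual algorithms: the `Domain` and `KnowledgeBase` classes, the `cleanse` and `best_match` helpers, the simplified postal code rules and the sample records are all hypothetical, and the fuzzy matching uses Python's standard `difflib` purely as a stand-in.

```python
import difflib
from dataclasses import dataclass, field
from typing import Callable, Dict, List, Optional, Tuple

Record = Dict[str, str]


@dataclass
class Domain:
    """Hypothetical Domain: a rule over one field (single) or several (compound)."""
    name: str
    fields: Tuple[str, ...]
    rule: Callable[..., bool]  # receives the field values, returns True if valid

    def is_valid(self, record: Record) -> bool:
        return self.rule(*(record.get(f, "") for f in self.fields))


@dataclass
class KnowledgeBase:
    """Hypothetical Knowledge Base: a catalogue of Domains, looked up by name."""
    domains: Dict[str, Domain] = field(default_factory=dict)

    def add(self, domain: Domain) -> None:
        self.domains[domain.name] = domain

    def cleanse(self, records: List[Record], domain_name: str) -> List[Record]:
        """Return the records that breach the named Domain's rule."""
        domain = self.domains[domain_name]
        return [r for r in records if not domain.is_valid(r)]


def best_match(value: str, known: List[str], cutoff: float = 0.8) -> Optional[str]:
    """Stand-in fuzzy match: closest known value, or None if nothing is close."""
    hits = difflib.get_close_matches(value, known, n=1, cutoff=cutoff)
    return hits[0] if hits else None


kb = KnowledgeBase()
# Two Domains over the same fields, so the caller's context picks the rule.
# The rules are deliberately simplified: 4-digit vs 5-digit postal codes.
kb.add(Domain("Address (AU)", ("City", "PostalCode"),
              lambda city, pc: bool(city) and pc.isdigit() and len(pc) == 4))
kb.add(Domain("Address (US)", ("City", "PostalCode"),
              lambda city, pc: bool(city) and pc.isdigit() and len(pc) == 5))

records = [
    {"City": "Sydney", "PostalCode": "2000"},
    {"City": "Sydney", "PostalCode": "20000"},
]

failed_au = kb.cleanse(records, "Address (AU)")  # records breaching the AU rule
failed_us = kb.cleanse(records, "Address (US)")  # records breaching the US rule
suggestion = best_match("Sydny", ["Sydney", "Melbourne", "Perth"])
```

The same two records pass or fail depending on which Domain is called, which is the "Data Quality with context" point. A real tool would presumably also offer corrections and feed results back into the Knowledge Base, which this sketch leaves out.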