Matching Policy in DQS

    Question

  • Hi,
    I'm new to MDS and DQS.
    I'm working on a DQS matching policy.
    Can a matching policy be created on only a single data source?
    My scenario is:
    I have two data sources, say PDMS and PNO.
    Can we set a trust score on the data sources? If so, can we set it so that only the record with the highest trust score wins?
    For example:
    I have data samples like
    Name       Code     DataSource
    ABC        1        PNO
    ABD        2        PNO
    ABC        3        PDMS
    ABD        4        PDMS
    If I set the highest trust score for PNO, can we get the PNO record as the leading record?
    Is there any way to achieve the above criteria using DQS? Thanks!

    Saturday, July 05, 2014 1:35 PM

All replies

  • I have not worked with multiple sources, but one way I can think of is to bring the data from both sources into a common table in your database and use that table for DQS. When you load it, add an extra column that records the source (PNO, PDMS, ...). Then use the DQS task in SSIS, create a domain on the source system column, and set a matching policy in which you specify a higher weight for the PNO source than for PDMS for that domain. Then set the similarity and other values, and it should choose PNO as the leading record.
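
The staging step described above (one common table with an extra source column) can be sketched roughly as follows. This is only an illustration in plain Python, not DQS or SSIS code; the rows are the sample data from the question, and the `tag_source` helper and the `Source` column name are hypothetical.

```python
# Sample rows from the question; in practice they would be read from
# the PNO and PDMS systems.
pno_rows = [{"Name": "ABC", "Code": 1}, {"Name": "ABD", "Code": 2}]
pdms_rows = [{"Name": "ABC", "Code": 3}, {"Name": "ABD", "Code": 4}]


def tag_source(rows, source):
    """Return copies of the rows with a Source column added (hypothetical helper)."""
    return [{**row, "Source": source} for row in rows]


# One common table: every record now carries the system it came from.
combined = tag_source(pno_rows, "PNO") + tag_source(pdms_rows, "PDMS")

for row in combined:
    print(row)
```

The combined table is what would then be loaded into the database and mapped to domains for matching.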

    Please Mark This As Answer if it helps to solve the issue Visakh ---------------------------- http://visakhm.blogspot.com/ https://www.facebook.com/VmBlogs

    Saturday, July 05, 2014 2:17 PM
  • Hi,

    Usually we combine the data from both sources and add a source detail column to identify which source each record comes from.

    If we add the source detail column to a matching rule (Exact or Fuzzy), it behaves just like the other fuzzy columns (for example, Name).

    Please tell me where we can specify the trust score for PNO or PDMS in the SSIS tool. Please help me with this.

    Thank you!

    Tuesday, July 08, 2014 6:18 AM
  • No, this should not be done inside the SSIS DQS task.

    What I described needs to be done in the DQS client tool, by opening and editing the knowledge base and adding a matching policy with the weights and other settings specified.


    Please Mark This As Answer if it helps to solve the issue Visakh ---------------------------- http://visakhm.blogspot.com/ https://www.facebook.com/VmBlogs

    Tuesday, July 08, 2014 6:35 AM
  • Hi,

    1) In the DQS client tool, suppose I add the matching policy as

    Column_name          Match_Type   Weight
    Name                 Similar      50%
    Source_Identifier    Similar      50%

    The Source_Identifier column contains PNO and PDMS. If I assign a weight to the Source_Identifier column, it works like the Name column: DQS searches for a fuzzy match on PNO or PDMS.

    2) If I create separate domains for PNO and PDMS, then I'll have the columns

    Name    PNO_Source    PDMS_Source
    ABC     PNO           Null
    ABD     PNO           Null
    ABC     Null          PDMS
    ABD     Null          PDMS

    I'm setting the matching rule as

    Column_name    Match_Type   Weight
    Name           Similar      20%
    PNO            Similar      50%
    PDMS           Similar      30%

    Again, DQS takes the weight as a contribution to the matching score and searches for matches according to the rule.

    It is not treated as a trust score.
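
To make the distinction concrete: what I'm after is a rule that, once records have already been matched into clusters, picks the leading record by source trust rather than by match weight. A minimal sketch of that rule in plain Python, applied outside DQS, might look like this. The `TRUST` ranking, the `cluster_id` column, and the `pick_leaders` helper are all assumptions for illustration, not DQS settings or API.

```python
from collections import defaultdict

# Assumed trust ranking per source: a higher number wins. Not a DQS setting.
TRUST = {"PNO": 2, "PDMS": 1}

# Records that have already been matched; cluster_id is a hypothetical
# column standing in for the cluster assignment produced by matching.
matched = [
    {"cluster_id": 1, "Name": "ABC", "Code": 1, "Source": "PNO"},
    {"cluster_id": 1, "Name": "ABC", "Code": 3, "Source": "PDMS"},
    {"cluster_id": 2, "Name": "ABD", "Code": 4, "Source": "PDMS"},
    {"cluster_id": 2, "Name": "ABD", "Code": 2, "Source": "PNO"},
]


def pick_leaders(records, trust):
    """Return, per cluster, the record whose source has the highest trust."""
    clusters = defaultdict(list)
    for record in records:
        clusters[record["cluster_id"]].append(record)
    return {
        cluster_id: max(rows, key=lambda r: trust.get(r["Source"], 0))
        for cluster_id, rows in clusters.items()
    }


leaders = pick_leaders(matched, TRUST)
for cluster_id, leader in sorted(leaders.items()):
    print(cluster_id, leader["Name"], leader["Code"], leader["Source"])
```

With the sample data, the PNO record leads in both clusters regardless of the order in which the records arrive, which is the behaviour the weights above do not give me.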

    I don't know what I'm missing. Please help me with this.

    Thank you!

    Tuesday, July 08, 2014 9:40 AM