
Question: Mining Structure Design

  • May 27, 2009, 16:20, Mr.QuestionMark
     
    My task is to find exceptions in sales data that has an area# column and a sales column. For example:
    area#,sales
    1,2
    1,100
    1,23
    1,11
    2,12
    2,13
    4,12
    ..,..

    For each area there are around 200 rows.

    I think this is a typical clustering task: I can create a clustering model and use the PredictCaseLikelihood function to find the exceptions. I tried it on one area first. I created a small table for Area# 1 with the columns RowNo (Key) and sales (Input, Predict), and used the Microsoft Clustering algorithm (CLUSTER_COUNT set to 0). It works well: the result is exactly the same as what I get from Highlight Exceptions in the Excel Table Analysis Tools.

    Then I tried to build a mining structure for all my data (i.e. all areas). I created a table with the columns RowNo (Key), AreaNo (text, Input), and sales (Input, Predict), and built the model the same way as before. The issue is that, for the same Area# 1, the result is different from the first model's.

    My guess is that in the second structure AreaNo is added as an input, and that affects the calculation. But how can I avoid this effect? I believe the first result is more accurate, but with that approach I have to loop over every area to create a model and query the result. That does not seem sensible.
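
    To make clear what I mean by scoring each row only against its own area, here is a minimal Python sketch. It is only an illustration under my own assumptions: it uses a median/MAD score as a stand-in for the clustering model and PredictCaseLikelihood, and the function name `flag_exceptions` and the 3.5 threshold are hypothetical choices, not part of Analysis Services.

```python
from collections import defaultdict
from statistics import median


def flag_exceptions(rows, threshold=3.5):
    """Return (area, sales) pairs whose sales value is unusual within its own area.

    Stand-in scoring: modified z-score based on the median absolute deviation.
    This is NOT the Microsoft Clustering algorithm; it only shows per-area scoring.
    """
    # Group the sales values by area so each area is scored independently.
    by_area = defaultdict(list)
    for area, sales in rows:
        by_area[area].append(sales)

    flagged = []
    for area, values in by_area.items():
        center = median(values)
        mad = median(abs(v - center) for v in values)
        if mad == 0:
            # No spread to score against in this area; skip it in this sketch.
            continue
        for v in values:
            score = 0.6745 * abs(v - center) / mad
            if score > threshold:
                flagged.append((area, v))
    return flagged


# Sample rows from the data shown above.
rows = [(1, 2), (1, 100), (1, 23), (1, 11), (2, 12), (2, 13), (4, 12)]
print(flag_exceptions(rows))
```

    Here the area is used only to partition the rows and never enters the score itself, which is the behaviour I would like from a single mining structure.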

    I am wondering whether I can use a nested table. I tried it, but I could not get it to work.

    Any suggestions?

    Thanks,
    :)
    • Moved by Darren Gosbell (MVP), May 28, 2009, 3:08: is a data mining question (From: SQL Server Analysis Services)

All replies