Poser une questionPoser une question
 

Traitéehow to predict PK attribute?

  • jeudi 2 juillet 2009 08:35Guennadiy Vanine Médailles de l'utilisateurMédailles de l'utilisateurMédailles de l'utilisateurMédailles de l'utilisateurMédailles de l'utilisateur
     
    I am creating the DM model (engaging DT and Clustering Algorithms).

    The Key attribute cannot be marked also as Input or Predictable.

    And if I need to predict and to input on key column?
    Should I duplicate all this column(s)?




    Why cannot I predict PK


    Guennadi Vanine -- Gennady Vanin -- Геннадий Ванин

Réponses

  • mardi 7 juillet 2009 04:00Bogdan CrivatModérateurMédailles de l'utilisateurMédailles de l'utilisateurMédailles de l'utilisateurMédailles de l'utilisateurMédailles de l'utilisateur
     Traitée
    PK is typically not interestin for predictions. Primary keys have distinct occurrences in a data set (unique values for each row), and a single occurrence is not enough to learn patterns about a column. Also, in general primary keys do not hold any significance in their values -- a primary key may be a unique integer or a Guid.

    However, if your primary key actually holds significant information (numerical values, I would guess?) then you may duplicate it in your mining model. For example, you could create a model which uses the PK column as a key, then right click the Mining Structure object and select Add Column, and add the PK column once more.


    bogdan crivat [sql server data mining] / http://www.bogdancrivat.net/dm

Toutes les réponses

  • mardi 7 juillet 2009 04:00Bogdan CrivatModérateurMédailles de l'utilisateurMédailles de l'utilisateurMédailles de l'utilisateurMédailles de l'utilisateurMédailles de l'utilisateur
     Traitée
    PK is typically not interestin for predictions. Primary keys have distinct occurrences in a data set (unique values for each row), and a single occurrence is not enough to learn patterns about a column. Also, in general primary keys do not hold any significance in their values -- a primary key may be a unique integer or a Guid.

    However, if your primary key actually holds significant information (numerical values, I would guess?) then you may duplicate it in your mining model. For example, you could create a model which uses the PK column as a key, then right click the Mining Structure object and select Add Column, and add the PK column once more.


    bogdan crivat [sql server data mining] / http://www.bogdancrivat.net/dm