
Answered: Efficient way to cluster points on line y=ax+b? And now extend the idea to higher-dimensional data?

Answers

  • Friday, 17 August 2007 17:32, Shuvro Mitra (Answerer)
     Answered

    Are you using SQL Server Data Mining for clustering?

     

    For points (x, y) on the line y = ax + b, you'll detect clusters only using attribute x since y is a dependent variable. Any clustering algorithm will be able to handle this efficiently. When you mention higher-dimensional data, I assume you mean more independent variables, but the idea is the same.
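As a rough illustration of clustering on x alone, here is a minimal sketch. It does not reproduce SQL Server Data Mining's clustering algorithm; it swaps in a much simpler stand-in (sort the x values and cut at the k-1 largest gaps). The function name `cluster_1d`, the parameter `k`, and the sample values are assumptions made up for this example.

```python
def cluster_1d(values, k):
    """Assign each value one of k labels by cutting at the k-1 largest gaps.

    Hypothetical helper for illustration: labels are numbered 0..k-1 in
    increasing order of value and returned in the input order.
    """
    if not 1 <= k <= len(values):
        raise ValueError("k must be between 1 and the number of values")

    # Indices of the values, ordered from smallest to largest value.
    order = sorted(range(len(values)), key=lambda i: values[i])

    # gaps[j] is the distance between the j-th and (j+1)-th sorted values.
    gaps = [values[order[j + 1]] - values[order[j]]
            for j in range(len(order) - 1)]

    # Positions (in sorted order) of the k-1 widest gaps.
    widest = sorted(range(len(gaps)), key=lambda j: gaps[j], reverse=True)
    cut_positions = set(widest[:k - 1])

    labels = [0] * len(values)
    label = 0
    for rank, idx in enumerate(order):
        labels[idx] = label
        if rank in cut_positions:
            label += 1  # the next sorted value starts a new cluster
    return labels


# Points on y = 2x + 1: only x is needed, because y carries no extra information.
xs = [0.0, 0.2, 0.1, 5.0, 5.3, 9.9, 10.0]
print(cluster_1d(xs, 3))  # [0, 0, 0, 1, 1, 2, 2]
```

With more independent variables the same reasoning applies: drop the dependent column and run a general-purpose clustering algorithm on the remaining ones.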

     

    Please let me know if you have any specific questions about SQL Server Data Mining or clustering algorithms in general.

All replies

  • Wednesday, 1 July 2009 06:17, Guennadiy Vanine
     
    "For points (x, y) on the line y = ax + b, you'll detect clusters only using attribute x since y is a dependent variable. Any clustering algorithm will be able to handle this efficiently. When you mention higher-dimensional data, I assume you mean more independent variables, but the idea is the same.

    Please let me know if you have any specific questions about SQL Server Data Mining or clustering algorithms in general."

    Can this be described in more detail?

    I could not grasp who does what, and how. That is, how should I distinguish between "you'll detect" and "Any clustering algorithm will be able..."?

    Also, I could not understand why "only using attribute x since y is a dependent variable".
    To me, the relationship is quite symmetric:
    x is equally dependent on y, since x = (y - b)/a (provided a is not zero).
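The symmetry raised here can be checked numerically. The sketch below is only an illustration under assumed values: the helper names `sorted_gaps` and `gaps_scale_by_slope`, the slope, the intercept, and the sample points are all made up. It shows that, for a non-zero a, the gaps between neighbouring y values are exactly |a| times the gaps between neighbouring x values, so any method that depends only on relative spacing would group the points the same way from either coordinate.

```python
import math


def sorted_gaps(values):
    """Distances between consecutive values after sorting."""
    ordered = sorted(values)
    return [hi - lo for lo, hi in zip(ordered, ordered[1:])]


def gaps_scale_by_slope(xs, a, b):
    """Return True if neighbour gaps in y = a*x + b equal |a| times those in x."""
    if a == 0:
        raise ValueError("a must be non-zero, otherwise every y equals b")
    ys = [a * x + b for x in xs]
    gaps_x = sorted_gaps(xs)
    gaps_y = sorted_gaps(ys)
    if a < 0:
        gaps_y.reverse()  # a negative slope reverses the ordering of the points
    return all(math.isclose(gy, abs(a) * gx, abs_tol=1e-9)
               for gx, gy in zip(gaps_x, gaps_y))


# Hypothetical line y = -2x + 1 with three visible groups of x values.
print(gaps_scale_by_slope([0.0, 0.5, 1.0, 6.0, 6.5, 12.0], a=-2.0, b=1.0))  # True
```

In that sense either coordinate could serve as the single clustering attribute; what matters is that one of the two is redundant once the other is known.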


    Guennadi Vanine -- Gennady Vanin -- Геннадий Ванин