column names of DM model (BOL2008)<p>I follow BOL2008 <br/>(SQL Server 2008 Books Online (April 2009))<br/> Tutorials...<br/><br/>Data Mining Tutorials and How-to Topics &gt; Tutorials: Using DMX &gt; Bike Buyer DMX Tutorial  <br/><br/>1)<br/>Why do we have to explicitly determine columns for [Decision Tree] Model creation <br/>1a)<br/>ALTER MINING STRUCTURE [Bike Buyer]<br/>ADD MINING MODEL [Decision Tree]<br/>(<br/>   CustomerKey,<br/>   [Age],<br/>   [Bike Buyer] PREDICT,<br/>   [Commute Distance],<br/>   [Education],<br/>   [Gender],<br/>   [House Owner Flag],<br/>   [Marital Status],<br/>   [Number Cars Owned],<br/>   [Number Children At Home],<br/>   [Occupation],<br/>   [Region],<br/>   [Total Children],<br/>   [Yearly Income]<br/>) USING Microsoft_Decision_Trees<br/>WITH DRILLTHROUGH<br/><br/>and  NOT for  [Clustering] model<br/>1b)<br/>ALTER MINING STRUCTURE [Bike Buyer]<br/>ADD MINING MODEL [Clustering]<br/>USING Microsoft_Clustering <br/>?<br/><br/>1c)<br/>Is it possible to omit column names in 1a) too? <br/><br/>Anyway it is impossible to use the different column names from those indicated during cration of mining structure.<br/><br/>2)<br/>How can one change the names to differing from those (created in mining structure)? <br/>by script?</p> <hr class=sig> Guennadi Vanine -- Gennady Vanin -- Геннадий Ванин© 2009 Microsoft Corporation. All rights reserved.Fri, 03 Jul 2009 07:51:33 Z9fc0aea0-9d79-4707-8137-48b274d2dec9http://social.msdn.microsoft.com/Forums/en-US/sqldatamining/thread/9fc0aea0-9d79-4707-8137-48b274d2dec9#9fc0aea0-9d79-4707-8137-48b274d2dec9http://social.msdn.microsoft.com/Forums/en-US/sqldatamining/thread/9fc0aea0-9d79-4707-8137-48b274d2dec9#9fc0aea0-9d79-4707-8137-48b274d2dec9Guennadiy Vaninehttp://social.msdn.microsoft.com/Profile/en-US/?user=Guennadiy%20Vaninecolumn names of DM model (BOL2008)<p>I follow BOL2008 <br/>(SQL Server 2008 Books Online (April 2009))<br/> Tutorials...<br/><br/>Data Mining Tutorials and How-to Topics &gt; Tutorials: Using DMX &gt; Bike Buyer DMX Tutorial  <br/><br/>1)<br/>Why do we have to explicitly determine columns for [Decision Tree] Model creation <br/>1a)<br/>ALTER MINING STRUCTURE [Bike Buyer]<br/>ADD MINING MODEL [Decision Tree]<br/>(<br/>   CustomerKey,<br/>   [Age],<br/>   [Bike Buyer] PREDICT,<br/>   [Commute Distance],<br/>   [Education],<br/>   [Gender],<br/>   [House Owner Flag],<br/>   [Marital Status],<br/>   [Number Cars Owned],<br/>   [Number Children At Home],<br/>   [Occupation],<br/>   [Region],<br/>   [Total Children],<br/>   [Yearly Income]<br/>) USING Microsoft_Decision_Trees<br/>WITH DRILLTHROUGH<br/><br/>and  NOT for  [Clustering] model<br/>1b)<br/>ALTER MINING STRUCTURE [Bike Buyer]<br/>ADD MINING MODEL [Clustering]<br/>USING Microsoft_Clustering <br/>?<br/><br/>1c)<br/>Is it possible to omit column names in 1a) too? <br/><br/>Anyway it is impossible to use the different column names from those indicated during cration of mining structure.<br/><br/>2)<br/>How can one change the names to differing from those (created in mining structure)? <br/>by script?</p> <hr class=sig> Guennadi Vanine -- Gennady Vanin -- Геннадий ВанинWed, 01 Jul 2009 10:59:13 Z2009-07-01T11:02:16Zhttp://social.msdn.microsoft.com/Forums/en-US/sqldatamining/thread/9fc0aea0-9d79-4707-8137-48b274d2dec9#8e8c9517-962e-4cba-a1bf-7f25d34df89ahttp://social.msdn.microsoft.com/Forums/en-US/sqldatamining/thread/9fc0aea0-9d79-4707-8137-48b274d2dec9#8e8c9517-962e-4cba-a1bf-7f25d34df89aAllan Mitchellhttp://social.msdn.microsoft.com/Profile/en-US/?user=Allan%20Mitchellcolumn names of DM model (BOL2008)<span style="font-family:monospace;font-size:13px;white-space:pre-wrap">Hi 1b)</span> <div><span style="font-family:monospace;font-size:13px;white-space:pre-wrap">So I guess you didn't try to create a model from a structure without specifying the column names. Here is what happens when I did </span> <pre lang=x-sql>ALTER MINING STRUCTURE [Targeted Mailing] ADD MINING MODEL ZZY USING Microsoft_Decision_Trees</pre> <div><span style="font-family:monospace;font-size:13px;white-space:pre-wrap"> <pre lang=x-sql>Executing the query ... Error (Data mining): Error validating attributes for the 'ZZY' mining model. Error (Data mining): The algorithm requires at least one predictable attribute. None found in mining model, ZZY. Execution complete</pre> 2.</span></div> <div><span style="font-family:monospace, Arial, Helvetica, sans-serif;font-size:small"><span style="font-size:13px;white-space:pre-wrap"><br/></span></span></div> <div><span style="font-family:monospace, Arial, Helvetica, sans-serif;font-size:small"><span style="font-size:13px;white-space:pre-wrap">Not sure this is possible in the UI (Couldn't see it anyway) but yes your model column names can be different</span></span></div> <div><span style="font-family:monospace, Arial, Helvetica, sans-serif;font-size:small"><span style="font-size:13px;white-space:pre-wrap"><br/></span></span></div> <div><span style="font-family:monospace, Arial, Helvetica, sans-serif;font-size:small"><span style="font-size:13px;white-space:pre-wrap"> <div> <pre lang=x-sql>ALTER MINING STRUCTURE [Targeted Mailing] ADD MINING MODEL ZZY2 ([Customer Key], [Gender] as [____], Region as [Where I live]) USING Microsoft_Clustering</pre> <br/></div> <div><br/></div> </span></span></div> </div>Wed, 01 Jul 2009 21:18:10 Z2009-07-01T21:18:10Zhttp://social.msdn.microsoft.com/Forums/en-US/sqldatamining/thread/9fc0aea0-9d79-4707-8137-48b274d2dec9#4f6aa1f1-b443-4323-bc2a-3d398d359378http://social.msdn.microsoft.com/Forums/en-US/sqldatamining/thread/9fc0aea0-9d79-4707-8137-48b274d2dec9#4f6aa1f1-b443-4323-bc2a-3d398d359378Allan Mitchellhttp://social.msdn.microsoft.com/Profile/en-US/?user=Allan%20Mitchellcolumn names of DM model (BOL2008)Gender has a weird alias there because I think the UI to the forum may have taken exception to my naming.  You can guess what it was.Wed, 01 Jul 2009 21:19:23 Z2009-07-01T21:19:23Zhttp://social.msdn.microsoft.com/Forums/en-US/sqldatamining/thread/9fc0aea0-9d79-4707-8137-48b274d2dec9#0cc6314a-fa93-47ef-b38b-92ac591a4de8http://social.msdn.microsoft.com/Forums/en-US/sqldatamining/thread/9fc0aea0-9d79-4707-8137-48b274d2dec9#0cc6314a-fa93-47ef-b38b-92ac591a4de8Guennadiy Vaninehttp://social.msdn.microsoft.com/Profile/en-US/?user=Guennadiy%20Vaninecolumn names of DM model (BOL2008)<blockquote> <div><span style="font-family:monospace;font-size:13px;white-space:pre-wrap">So I guess you didn't try to create a model from a structure without specifying the column names. Here is what happens when I did </span> <div style="color:#000000;background-color:#ffffff"> <pre><span style="color:#0000ff">ALTER</span> MINING STRUCTURE [Targeted Mailing] <span style="color:#0000ff">ADD</span> MINING MODEL ZZY USING Microsoft_Decision_Trees </pre> </div> </div> </blockquote> I guess I underspecified the question (though it seems clear what I implicitly asked).<br/><br/>While creating DM model (or sructure with model) in BIDS AS project the wizard does not permit to pass without marking at least one checkbox in Predictable column (this tep and Window have heading &quot; Data Mining Wizard/ Specify the Training Data&quot; ), this is during creation <br/><br/>Meanwhile DMX script permits creation of model without predictable attributes in case of Microsoft Clustering Algoritm (but not in case of DT Algorithm).<br/><br/>I also doublechecked it. I scripted the model created through AS project (Basic DM Tutorial) and through script.<br/>The former script contains &quot;predict&quot; (&lt;Usage&gt;Predictonly&lt;/Usage&gt; ) and the latter script of model creatin does not have any &quot;predict&quot; strings. <br/><br/>Making all the procedures through AS project in BIDS does no distinction between Decision Trees and MS Clustering Algorithm model creation.<br/>While the creatiion of model through DMX script does make the difference.<br/><br/>Here are few questions... that make a lot of combinations...<br/>Who is wrong - the DMX script compiler permitting to mit predictable attributes at all or AS project wizard?<br/>Am I confusing something?<br/><br/><br/> <hr class=sig> Guennadi Vanine -- Gennady Vanin -- Геннадий ВанинThu, 02 Jul 2009 08:12:18 Z2009-07-02T08:12:53Zhttp://social.msdn.microsoft.com/Forums/en-US/sqldatamining/thread/9fc0aea0-9d79-4707-8137-48b274d2dec9#65f3be4c-99c0-4669-add7-17fd839e7ef0http://social.msdn.microsoft.com/Forums/en-US/sqldatamining/thread/9fc0aea0-9d79-4707-8137-48b274d2dec9#65f3be4c-99c0-4669-add7-17fd839e7ef0Allan Mitchellhttp://social.msdn.microsoft.com/Profile/en-US/?user=Allan%20Mitchellcolumn names of DM model (BOL2008)Personally speaking, I kind of like the way the wizard does it.  A wizard is a lot easier for people to see visually what is going on and that will make it easier for people to understand which in the case of DM is no bad thing (let's get rid of this mentality that DM is for the few).   <div><br/></div> <div>The fact that using RAW DMX allows you to shortcut this process i think is no bad thing either.  if you are happier with the DMX &quot;Shortcut&quot; then that works for me as well.</div> <div><br/></div> <div>In short then I do not think either option is wrong it is just the wizard if it allowed for &quot;Shortcut&quot; then it may be off-putting for some.</div> <div><br/></div> <div><br/></div> <div>HTH</div> <div><br/></div> <div>allan</div> <div><br/></div> <div>   <div><br/></div> <div><br/></div> </div>Thu, 02 Jul 2009 08:22:18 Z2009-07-02T08:22:18Zhttp://social.msdn.microsoft.com/Forums/en-US/sqldatamining/thread/9fc0aea0-9d79-4707-8137-48b274d2dec9#3a8ed138-87b5-4e1f-bb09-6c05981e40dchttp://social.msdn.microsoft.com/Forums/en-US/sqldatamining/thread/9fc0aea0-9d79-4707-8137-48b274d2dec9#3a8ed138-87b5-4e1f-bb09-6c05981e40dcGuennadiy Vaninehttp://social.msdn.microsoft.com/Profile/en-US/?user=Guennadiy%20Vaninecolumn names of DM model (BOL2008)<blockquote>Personally speaking <div> <div><br/></div> </div> </blockquote> Speaking from the name of my client, the latter insists on pure scripting solutions...<br/><br/>But my question is not on that. I am interested what is going under the hood.<br/>Should the noted diferrences underline some kind of error in part of scripting in tutorial?<br/>or this underlines some profound difference in configuration of the modeling based on MS Clustering Algorithm?<br/>i.e. possibility to make predictions on any attributes without marking as Predictable any of them?<br/><hr class="sig">Guennadi Vanine -- Gennady Vanin -- Геннадий ВанинThu, 02 Jul 2009 09:03:01 Z2009-07-02T09:03:01Zhttp://social.msdn.microsoft.com/Forums/en-US/sqldatamining/thread/9fc0aea0-9d79-4707-8137-48b274d2dec9#8fdb401c-d2c0-4cff-ad85-7c30e186630ahttp://social.msdn.microsoft.com/Forums/en-US/sqldatamining/thread/9fc0aea0-9d79-4707-8137-48b274d2dec9#8fdb401c-d2c0-4cff-ad85-7c30e186630aAllan Mitchellhttp://social.msdn.microsoft.com/Profile/en-US/?user=Allan%20Mitchellcolumn names of DM model (BOL2008)<div><br/></div> <div>Clustering does not require you make anything predictable.  You can, but it is not required.  DTs require that you have something that is predictable and there is the difference.</div> <div><br/></div> <div><br/></div> <div><br/></div>Thu, 02 Jul 2009 09:07:05 Z2009-07-02T09:07:05Zhttp://social.msdn.microsoft.com/Forums/en-US/sqldatamining/thread/9fc0aea0-9d79-4707-8137-48b274d2dec9#596e7a05-f16c-425e-9d4c-01d0099b07fahttp://social.msdn.microsoft.com/Forums/en-US/sqldatamining/thread/9fc0aea0-9d79-4707-8137-48b274d2dec9#596e7a05-f16c-425e-9d4c-01d0099b07faAllan Mitchellhttp://social.msdn.microsoft.com/Profile/en-US/?user=Allan%20Mitchellcolumn names of DM model (BOL2008)And this page helps <div><br/></div> <div><span style="font-family:Tahoma;font-size:12px;white-space:pre">http://technet.microsoft.com/en-us/library/ms132066.aspx</span></div> <div><span style="font-family:Tahoma, Arial, Helvetica, sans-serif;font-size:small"><span style="font-size:12px;white-space:pre"><br/></span></span></div> <div><span style="font-family:Tahoma, Arial, Helvetica, sans-serif;font-size:small"><span style="font-size:12px;white-space:pre"><br/></span></span></div> <div><span style="font-family:Tahoma, Arial, Helvetica, sans-serif;font-size:small"><span style="font-size:12px;white-space:pre"><span style="font-family:Verdana, Arial, Helvetica, sans-serif;font-size:9px;white-space:normal;line-height:12px">&quot;If the model does not require a predictable column, for example, models that are built by using the Microsoft Clustering and Microsoft Sequence Clustering algorithms, you do not have to include a column definition in the statement. All the attributes in the resulting model will be treated as inputs.&quot;</span></span></span></div> <div><span style="line-height:12px"><br/></span></div> <div><span style="line-height:12px"><br/></span></div>Thu, 02 Jul 2009 09:12:22 Z2009-07-02T09:12:22Z