I just posted a templated reduction class backed by a C++ AMP kernel on CodePlex. I don't know whether it's optimal or faster than other implementations...