User281315223 posted
You may have already come across this related discussion, which covers the built-in Microsoft.Speech library and has a few ideas on how to accomplish this using the built-in dictation grammars in the Microsoft.Speech.Recognition namespace.
However, I cannot personally vouch for that approach or its accuracy.
In my opinion, if you want production-level accuracy, you will likely need a third-party software package or service to handle the transcription.
There are quite a few APIs out there that you may be able to take advantage of. Additionally, if you are handling any kind of critical data (e.g. medical records, financial data, etc.), you should probably have a stringent QA process to evaluate the accuracy of the conversions, or perhaps use a professional transcriptionist to handle this process.
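
As a rough illustration of what such a QA check could look like, here is a minimal sketch that scores a machine transcript against a human-verified one using word error rate (WER). WER is my own choice of metric here, not something any particular transcription service prescribes, and the function name and sample sentences are made up for the example.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level edit distance divided by the number of reference words.

    Assumption: simple lowercase + whitespace tokenization is good enough;
    real transcripts usually also need punctuation and number normalization.
    """
    ref = reference.lower().split()
    hyp = hypothesis.lower().split()
    if not ref:
        raise ValueError("reference transcript must not be empty")

    # prev[j] holds the edit distance between the reference words processed
    # so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]
        for j, hyp_word in enumerate(hyp, start=1):
            cost = 0 if ref_word == hyp_word else 1
            curr.append(min(
                prev[j] + 1,         # reference word missing from hypothesis
                curr[j - 1] + 1,     # extra word in hypothesis
                prev[j - 1] + cost,  # match or substitution
            ))
        prev = curr
    return prev[-1] / len(ref)


if __name__ == "__main__":
    verified = "the patient takes ten milligrams daily"
    machine = "the patient takes two milligrams daily"
    print(f"WER: {word_error_rate(verified, machine):.2%}")
```

You could run something like this over a sample of recordings that a person has transcribed by hand and flag any file whose score exceeds whatever threshold you are comfortable with. Keep in mind that a low overall score can still hide a single critical error (a wrong dosage, for example), so it complements human review rather than replacing it.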