Word HTML Cleaner: Dean Allen has always done a great job with his Word HTML Cleaner. You save a Word document as HTML, upload it, and you get HTML back that’s stripped of all the Word HTML funkiness.
“This utility strips proprietary Microsoft tags and other cruft from Word HTML documents, leaving basic formatting intact.
Typographic quotes, proper dashes and other special characters, if they exist, will be converted to HTML entities to increase their portability among browsers and platforms.”
With this version, he’s added a subscription option for $20 a year whereby you can upload documents greater than 20KB. Now he just needs to make it a Web service so I can access it from script, and we’d be set.