See http://textism.com/wordcleaner/
"A tool to strip Microsoft's proprietary tags and other superfluous noise from Word-generated HTML documents, leaving all the basic goodness intact."
A perfect example of why closed source sucks!!!
We end up trying to fix the badness in one program by using other programs.
Sounds like somebody should fix the HTML generation part of MS Word instead.