Microsoft Word and HTML Don’t mix!
I was reminded again this week why Microsoft Office and html do not work together whilst assisting a client with their Wordpress blog. They asked me to find out why their blog was performing poorly in the search engines and I was looking for answers.
There are obviously many on-page and off-page factors that impact on a website or blog’s SEO (search engine optimization) strategy and in my experience the best place to start is with the html on your pages. The good news is that if you’ve chosen to use a CMS such as Wordpress then it is designed to produce nice, clean html. The bad news is that you will spoil it if you are careless when adding your posts or pages.
It is extremely important to bear in mind when using word processors such as Microsoft Word, that despite appearances to the contrary, they add lots of unnecessary formatting code such as font tags. Your website or blog should reference all of its styling etc. via its external Cascading Style Sheet (CSS) and this code is unnecessary.
When I began to view the html of the posts and pages, I saw examples such as this …
<p class=”MsoNormal” style=”margin: 0in 0in 0pt;”> </p>
<p class=”MsoNormal” style=”margin: 0in 0in 0pt;”><span style=”font-size: small; font-family: Times New Roman;”>Hi there,</span></p>
<p class=”MsoNormal” style=”margin: 0in 0in 0pt;”>
<p class=”MsoNormal” style=”margin: 0in 0in 0pt;”><span style=”font-size: small; font-family: Times New Roman;”>Thank you for visiting this page!</span></p>
<p class=”MsoNormal” style=”margin: 0in 0in 0pt;”>
<p class=”MsoNormal” style=”margin: 0in 0in 0pt;”><span style=”font-size: small; font-family: Times New Roman;”>I’ve been working really hard this week.</span></p>;
All this to display less than 20 words, I think not
So why should you care? You want your website or blog to gain a good search engine ranking right! Well it’s important to always remember that when search engine spiders crawl your website or blog pages/posts, one of their main tasks is to analyse the text and index the content. One aspect that affects how your content is indexed is the keyword density and this makes it extremely important to avoid the use of code such as this.
What Can I Do About It?
So what’s the solution? I’ve always found the simplest way to avoid this issue is to copy and paste from Microsoft Word into the Notepad program which comes bundled with Microsoft Windows. This copies the text in ASCII format (just the text without the unnecessary clutter) and I then paste it into my file or Wordpress editor. Yes, you’ll still have some work to do with formatting etc. but you’ll have nice clean code.
If you are using a CMS such as Wordpress that uses the Tiny MCE editor, then you’ll be pleased to know that they have already provided an easy to use inbuilt solution. To paste content from a Microsoft Word document into Wordpress via the editor, do as follows:-
- Select the text that you want to paste into your Wordpress from MS Word and copy it ready for pasting.
- Makes sure that your Wordpress editor is in Visual mode and click on the button
appearing on the toolbar with the Microsoft Word logo which says “Paste from Word” when you hover the mouse over it. - This loads the Paste from Word window into which you paste your text.
- Click on the Insert button and the text is inserted free of any Microsoft Word formatting.
I’ve put together the following brief video that demonstrates these steps.
Like this post? Subscribe to my RSS feed and get loads more!


4 comments
Sylvia Dickens on January 3, 2009 at 1:53 pm
This is great to know. Thank you. I wondered why my posts kept weirding out on me when I copied from Word. I’ve since learned that this issue causes problems when you publish an article on some article sites, too.
Which text editor for no "msonormal" on April 2, 2009 at 6:57 am
[...] Re: Which text editor for no "msonormal" Take a look at this - one of many if you search Wordpress - Microsoft Word and HTML Don’t mix! | Jono’s Jottings [...]
Chris on April 16, 2009 at 3:40 am
This needs to be highlighted somewhere. This copy and paste secret sauce that word puts in there is obnoxious.
Sometimes I use the Windows Live writer to compose my posts — that seems to strip it automatically.
But when I’m in the Wordpress admin screen, all that HTML styling code comes.
Thanks for sharing the solution. It’s not intuitively obvious if I had never seen that button before.
Chris
Ruth Archer on June 27, 2009 at 9:11 am
Thanks. This helps a lot. I’m trying to learn Wordpress on the fly, and some things I just can’t figure out. Like, how do you put some space around a captioned picture that you want positioned on the side of your text? I’ve just been deleting the captions and using style=”margin-left: 0px;margin-right: 20px”, because I can’t get hspace to work, either. Frustrating.