Differences between revisions 3 and 4

Deletions are marked like this. Additions are marked like this.
Line 30: Line 30:
I don't know what those last two are. I don't know what those last two are, but I am now to learn that odt files are of open document format, which is used by, for example, OpenOffice.

I just wanted to mention an amazing OS X utility called textutil because someone gave me some data as a bunch of word files! But I needed to keep the data on jones, so I had to be able to work with it in the command line. I did a bunch of searching, and I was reading discouraging reports and starting to lose heart, but then I happened upon the mention of textutil.

All I had to do was type in:

textutil -convert txt -encoding mac *.doc,

and all of my horrible doc files were instantly txt files! It was amazing! Then I discovered this utility is standard with OS X starting with 10.4. So if you too are searching and finding only extremely old converter applications that probably do not work with modern versions of word--since they keep changing their format in order to break converter applications--I hope you find this page and it helps you decide to use textutil for this purpose.

It also has other functions, which I may soon detail in its very own wiki entry.


The simplest use is

textutil -convert txt Some\ File.doc

This produces Some File.txt. You can also specify an output encoding besides UTF-8, though this isn't usually needed.

textutil -convert txt -encoding mac Some\ File.doc

You can convert to

  • txt -- plain text
  • html -- web pages (no images)
  • rtf -- rich text (different fonts, etc, much like html)
  • doc -- Word 97-03
  • docx -- Word 07-10
  • webarchive -- Safari web page package (includes images)
  • wordml, odt

I don't know what those last two are, but I am now to learn that odt files are of open document format, which is used by, for example, OpenOffice.

How to convert MS doc to txt (last edited 2009-07-08 17:04:13 by 140-182-171-152)