This program will extract the text even from damaged or corrupted Microsoft Office and Open Office files 2.X and 3.X files with the extensions .doc, docx, xls, xlsx, ppt, pptx, odt, ods and odp as well as possibly the template and macro variants of these extensions such as dot, xlt and pps if they are changed to the correct corresponding extensions mentioned. It may succeed at doing so where MS Office Open Office itself fails to salvage text. It can also attempt to recover formatting in the form of a full Open Office file with a regular, odt, ods or odp extension. At this time unfortunately there is no facility for recovering anything but basic formatting for MS Office files through the previously mentioned text extractions. This program can be used as a viewer of text within healthy MS Office and Open Office files without having Open Office installed.
The text extraction is accomplished with the use of the command line application, SILVERCODERS DocToText. The program also uses command line tools from The Chicago Project and ReadText, rt.exe to extract data and text from MS Office version 97-2003 format files. The reconstructed version of the Open Office file is accomplished by unzipping the Open Office file with the somewhat zip corruption immune CakeCMD unzipper. Once unzipped, the manifest/manifest.xml file is replaced with a greatly simplified version as described here: http://www.oooforum.org/forum/viewtopic.phtml?t=57600.
If this application doesn't work, there are other things worth trying as summarized here: http://s2services.com/open_office.htm