Caution: This documentation is for eZ Publish legacy, from version 3.x to 6.x.
For 5.x documentation covering Platform see eZ Documentation Center, for difference between legacy and Platform see 5.x Architecture overview.

TextExtractionTool

Summary

Sets the text extraction tool used to fetch information from Word files.

Description

This setting sets the text extraction tool used to fetch information from Word files. This text extraction tool is an external program which returns the content of the MS word file as plain text.
This feature requires the Word View (wv) program to work properly.
WV or Word View is a library and set of command-line programs that understands Microsoft Word. It is able to convert MS Word documents into HTML (wvHtml), Abiword (wvAbw), plain text (wvText), RTF, PostScript, PDF, and other formats.

By default wvText is used by eZ Publish, the default settings for using this program are:

 

[HandlerSettings]
 
MetaDataExtractor[application/msword]=ezword
 
[WordHandlerSettings]
 
TextExtractionTool=wvWare -x /usr/local/wv/wvText.xml


This setting is case sensitive.

Examples

TextExtractionTool=wvWare -x /usr/local/wv/wvText.xml

This is the default setting.

Ester Heylen (08/10/2009 12:15 pm)

Ester Heylen (24/06/2010 7:31 am)


Comments

There are no comments.