 get plain text from office files without office?

p2bl
Yak Posting Veteran

54 Posts

Posted - 2002-09-18 : 06:46:23
Is it possible to convert Office documents to plain text without MS Office installed? I need the text content of each document for searching. (I will save the text content in the database, and I will not keep the document file on the machine where SQL Server is installed, so I cannot use Index Server.)
Another question: can CHM files be indexed, that is, searched using the full-text search engine?

========================
look!

robvolk
Most Valuable Yak

15732 Posts

Posted - 2002-09-18 : 12:42:22
CHM files are encrypted and compressed; without expanding them, their contents won't be available for a full-text search. I'm not aware of a filter for Index Server that can read CHM files.

You should reconsider the approach of storing the files in the database itself. You will only further complicate an already complicated process, with little or no added benefit. You will also end up wasting a lot of disk space keeping copies of files both in the database and on disk, especially if you have Word and text versions. (Yes, you WILL have to have a copy on disk somewhere; how else will you import it into SQL Server? And what happens when the document gets edited?)

You can store the file elsewhere (it doesn't have to be on the SQL Server machine) and keep a link to that file in the database instead. You can then configure Index Server to index those files for text searching.
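
To make the "keep a link in the database" idea concrete, here is a minimal sketch. It is only an illustration under some assumptions: it uses Python's built-in sqlite3 module as a stand-in for SQL Server, and the table name `documents`, its columns, the helper functions, and the UNC path are all hypothetical. The point is just that the row holds the file's location, not its contents.

```python
import sqlite3

def create_catalog(conn):
    # Hypothetical catalog table: one row per document, storing only
    # a title and the path to the file, never the file body itself.
    conn.execute(
        "CREATE TABLE IF NOT EXISTS documents ("
        " doc_id INTEGER PRIMARY KEY,"
        " title TEXT NOT NULL,"
        " file_path TEXT NOT NULL UNIQUE)"
    )

def register_document(conn, title, file_path):
    # Record where the file lives and return the generated id.
    cur = conn.execute(
        "INSERT INTO documents (title, file_path) VALUES (?, ?)",
        (title, file_path),
    )
    conn.commit()
    return cur.lastrowid

def lookup_path(conn, doc_id):
    # Resolve a document id back to its file location, or None if unknown.
    row = conn.execute(
        "SELECT file_path FROM documents WHERE doc_id = ?", (doc_id,)
    ).fetchone()
    return row[0] if row else None

# In-memory database for the example; the share path is made up.
conn = sqlite3.connect(":memory:")
create_catalog(conn)
doc_id = register_document(conn, "Quarterly report", r"\\fileserver\docs\report.doc")
print(lookup_path(conn, doc_id))
```

With a layout like this, editing the document only changes the file on the share; the database row stays valid as long as the path does, and the text search itself is left to whatever indexes that share.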
