Zophar's Message Domain

Go Back   Zophar's Message Domain > General Chat > Talk of the Town

Reply
 
Thread Tools Display Modes
Old 12-29-2005, 06:24 AM   #1
Freaker
Senior Member
 
Join Date: Jul 2002
Posts: 120
Default Looking for a document indexer

I'm looking for a program that will go though a directory of MS Word documents, and then create an index of each word and which documents it occurs in, and has to be compatible with cyrilic characters.
Any suggestions?
<P ID="signature"></P>
Freaker is offline   Reply With Quote
Old 12-29-2005, 09:27 AM   #2
GhettoFabulous
Senior Member
 
Join Date: Nov 2004
Posts: 277
Default Re: Looking for a document indexer

> Any suggestions?
>

No idea what you're looking for really. The closest I can think of is outdated software called Agent Ransack that finds keywords in plain text files (doesn't work with MS Word). Check attached screenshot for example.

http://www.mythicsoft.com/agentransack/default.aspx
<P ID="signature"></P>

C:Serverxampphtdocszopharoldwwwthrea...entsagentr.jpg
GhettoFabulous is offline   Reply With Quote
Old 12-29-2005, 05:41 PM   #3
Freaker
Senior Member
 
Join Date: Jul 2002
Posts: 120
Default Re: Looking for a document indexer

I have a directory containing russian documents, and I want an index file telling me which files each word occurs in.
For example, I want the index to say да occurs in(Document1, Document3, Document5), and нет does in(Document2, Document4).
I don't want to search, I just want a human readable index file.
<P ID="signature"></P>
Freaker is offline   Reply With Quote
Old 12-29-2005, 05:59 PM   #4
SwampGas
Senior Member
 
Join Date: Apr 2000
Posts: 6,915
Default Re: Looking for a document indexer

desktop.google.com
<P ID="signature"><marquee direction=right scrollamount=10></marquee></P>
SwampGas is offline   Reply With Quote
Old 12-29-2005, 06:01 PM   #5
Audigy
Senior Member
 
Audigy's Avatar
 
Join Date: Apr 2004
Location: Houston, TX
Posts: 755
Default Re: Looking for a document indexer


> I don't want to search, I just want a human readable index
> file.
>

hehe...

grep (args) > index.txt ?

i have no idea.
<P ID="signature">--

http://www.oddigy.com
beadsprites and PSFs, oh my!</P>
__________________
http://www.oddigy.com
Audigy is offline   Reply With Quote
Old 12-29-2005, 07:14 PM   #6
RaiBlastoise
Senior Member
 
Join Date: Sep 2004
Posts: 322
Default Re: Looking for a document indexer

> cyrilic characters.

In Soviet Russia, documents index you.
<P ID="signature"></P>
RaiBlastoise is offline   Reply With Quote
Old 12-29-2005, 08:10 PM   #7
Isildur
Senior Member
 
Isildur's Avatar
 
Join Date: Nov 2004
Posts: 1,339
Default Re: Looking for a document indexer

> No idea what you're looking for really. The closest I can
> think of is outdated software called Agent Ransack that
> finds keywords in plain text files (doesn't work with MS
> Word).

It can find raw ASCII text in any file, Word or otherwise, although it doesn't recognize line breaks other than plain ASCII CRs and LFs, or any sort of special formating that is specific to a particular file format. For stuff like that you need the shareware version (FileLocator Pro), which allows interpreters for particular file types like PDF or Word.
<P ID="signature"><center>
<a href=http://1001insomniacnights.com><img src=http://pages.nyu.edu/~jc73/misc/1k1IN.gif border=0>
1k1IN:</a><font color=#903030> A Dark Comedy About 2 Roomates</font></center></P>
__________________
Holding out for Hostess Snack Cakes...
Isildur is offline   Reply With Quote
Old 12-29-2005, 11:05 PM   #8
Dan
Senior Member
 
Join Date: Jan 2002
Posts: 499
Default Re: Looking for a document indexer

> I'm looking for a program that will go though a directory of
> MS Word documents, and then create an index of each word and
> which documents it occurs in, and has to be compatible with
> cyrilic characters.
> Any suggestions?

http://www.copernic.com/Copernic</a>?
<P ID="signature"><div align="center"><font size=2>http://dan.panicus.orgDan's Space</a></font></div></P>
Dan is offline   Reply With Quote
Old 12-30-2005, 01:41 AM   #9
MegaManJuno
Senior Member
 
Join Date: Jan 2003
Location: WV
Posts: 626
Default Re: Looking for a document indexer

> I'm looking for a program that will go though a directory of
> MS Word documents, and then create an index of each word and
> which documents it occurs in, and has to be compatible with
> cyrilic characters.
> Any suggestions?
>
Don't know what OS you're running, but the Indexing Service in XP seems to do full text indexing...

If you have it installed, you can add a directory to the catalog for indexing. Once it's indexed, right-click My Computer and select Manage. Go to the Services and Applications section. From there you'll see a System tree item, under which there's a Query the Catalog entry. I think with the advanced search capabilities, you can even use regular expression queries. I assume it would support any character set that Windows does, but I've never tried it with foreign characters so I don't know for certain.
<P ID="signature"></P>
MegaManJuno is offline   Reply With Quote
Old 12-30-2005, 06:00 AM   #10
Freaker
Senior Member
 
Join Date: Jul 2002
Posts: 120
Default Re: Looking for a document indexer

Many of the listed programs would be perfect, but I'm doing this for computer illiterate people, and rather then searching the index I want to print it out.
I'm going to take a stab at making one myself, I can't find anything to do what I want.
<P ID="signature"></P>
Freaker is offline   Reply With Quote
Reply

Thread Tools
Display Modes

Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is On
HTML code is Off

Forum Jump


All times are GMT. The time now is 07:11 PM.

Contact Us - Zophar's Domain - Archive - Top

Powered by vBulletin® Version 3.8.4
Copyright ©2000 - 2020, Jelsoft Enterprises Ltd.