FreePint: Join Join FreePint and receive the Newsletter every two weeks for free. Join now »
|
|
|
If you find FreePint useful, please supply a testimonial »
|
|
FreePint Family:
 Monthly magazine reviewing business information products »
 Articles, tools, and a monthly magazine, to give you practical help with information skills »
 Recruit for information-related roles, or find your next challenge. »
 Daily update of web-based resources »
 Daily update of free, full-text reports »
|
|
|
|
Home > Forum > Bar > Message |
|
|
Join FreePint and receive the twice-weekly Bar Digest and twice-monthly tip-packed Newsletter. It's free. [ more ]
|
|
|
|
 The FreePint Bar is generously sponsored by Dow Jones Factiva.
|
|
|
| Start New | Message Index  | Flat View |
| Indexing files |
| Author: | Stuart |
| Date: | Wednesday, 25th Feb 2004 15:37 |
| Views: | 1,997 (excluding Digests and RSS feeds) |
| Category: | Computers and Software | | URL: | http://www.freepint.com/go/b27774 |
|
Suggestions please for software able to take a document file and extract unique words and phrases, ignoring common expressions and words (how / when / where / why etc).
I'm preparing a master index that will contain links to documents themselves, but for space and -in some cases- copyright reasons should not duplicate the content of those files.
The master index will have inbuilt search capability, but can't 'see' the content of referenced files. Our workaround so far is to batch index the files and include a (much shorter) list of indexed words with each link.
As an example, searching for 'fly + fishing' should then turn up a respectably short but relevant listing of possible links even if one of the documents is compleatangler.pdf
Thanks in advance for any suggestions...
Stuart |
|
| Start New | |
| Topic |
Author |
Date |
ID |
| Indexing files | | Suggestions please for software able to take a document file and extract unique words and phrases, ignoring common expressions ... |
|
Stuart |
25/02/04 15:37 |
27774 |
|
Stuart |
04/03/04 18:47 |
27871 |
 | Re: Indexing files | | HI,
let me see if I get your request right :
You want to have a search engine kind a thing and ... |
|
S.W.Schilke |
27/02/04 09:40 |
27795 |
Please note: The reply form is not showing because the posting is older than six months or the thread is locked. Please start a new topic or contact the forum administrator.
|
|