IBM Omnifind

Discussion in 'Hardware and Software Tips and Suggestions' started by Graham, Dec 16, 2006.

  1. Graham

    Graham Developer Staff Member

    Ok, I installed this to test it out.

    Here's the article: http://www.dailytech.com/Article.aspx?newsid=5341

    I added my PDF directory (cache-listener) for it to index, but since all my PDFs are scans without OCR, it could not make any sense of their contents. It was able to index the file names, though. I should probably OCR all of them somehow and create a separate text file for each PDF, or else create PDFs with searchable text.
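
    If I go the separate-text-file route, something like this rough sketch could list which PDFs still have no text file next to them. It assumes the OCR output is saved beside each PDF under the same name with a .txt extension; that naming scheme and the function name pdfs_missing_text are my own assumptions, not anything Omnifind requires.

```python
from pathlib import Path


def pdfs_missing_text(root):
    """Return the PDFs under root that have no same-named .txt file beside them.

    Assumption: OCR output for "scan.pdf" is stored as "scan.txt" in the
    same directory. Adjust if your OCR tool names its output differently.
    """
    missing = []
    # rglob walks subdirectories too; the "*.pdf" match is case-sensitive
    # on case-sensitive file systems.
    for pdf in sorted(Path(root).rglob("*.pdf")):
        if not pdf.with_suffix(".txt").exists():
            missing.append(pdf)
    return missing
```

    Running it over the indexed directory gives a to-do list for the OCR pass, and the .txt files can then be indexed alongside the scans.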


    I then added http://www.synapsedirect.com to the external websites to index, and that works better. If I were a user, I could easily search synapsedirect for help files.



  2. Jason

    Jason Developer / Handyman Staff Member

    I have Adobe 7 Pro and it can batch OCR entire directories.

    The problem is that it changes the file dates.

    Another problem: it also changes each file's checksum.
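
    One possible workaround for the dates, as a minimal sketch: record each PDF's checksum and timestamps before the batch OCR, then put the timestamps back afterwards. The function names snapshot and restore_times are made up for illustration. The checksum will still change, since OCR rewrites the file, so the recorded SHA-256 only serves as a reference to the original scan. Note that this restores access and modification times only, not the creation date.

```python
import hashlib
import os
from pathlib import Path


def snapshot(root):
    """Record SHA-256 and timestamps for every PDF under root, before OCR."""
    records = {}
    for pdf in Path(root).rglob("*.pdf"):
        st = pdf.stat()
        records[str(pdf)] = {
            # Checksum of the original scan, kept for reference only.
            "sha256": hashlib.sha256(pdf.read_bytes()).hexdigest(),
            "atime_ns": st.st_atime_ns,
            "mtime_ns": st.st_mtime_ns,
        }
    return records


def restore_times(records):
    """Put the original access and modification times back after OCR."""
    for path, rec in records.items():
        if os.path.exists(path):
            os.utime(path, ns=(rec["atime_ns"], rec["mtime_ns"]))
```

    Usage would be: call snapshot on the directory, run the batch OCR, then pass the saved records to restore_times.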

  3. Graham

    Graham Developer Staff Member

    Yeah, so it's better to OCR your docs at the time of scanning, not as a retrofit.

  4. Graham

    Graham Developer Staff Member

    I had only 512 MB of ECC RAM on my Windows 2003 server, and it was not enough. Omnifind ate up too much of my resources, so I had to kill it.
