Skip to main content

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index] [List Home]
[smila-user] File formats supported by SMILA (file) crawlers?

Hello there,

found the note that "currently only plain text and html files are crawled and indexed correctly by SMILA crawlers" in your 5 minutes to success documentation. Is this information still up to date? I guess, it was written based upon 0.5 M1.

I also did some qucik tests today, trying to crawl .pdf, .doc and so on via the file crawler, SMILA "only" found plain text and html files, .zip and images. Or did I something wrong? Within configuration/org.eclipse.smila.connectivity.framework/file.xml I included all file extensions I wanted SMILA to find, set a new <BaseDir> but changed/configured nothing else.

Keep up the great work and thank you very much.
-- 
GRATIS für alle GMX-Mitglieder: Die maxdome Movie-FLAT!
Jetzt freischalten unter http://portal.gmx.net/de/go/maxdome01


Back to the top