[Owncloud] ownCloud 5.0.10 - lucene fails to index .txt files?

Jörn Friedrich Dreyer jfd at owncloud.com
Thu Aug 22 21:49:29 UTC 2013


The warnings about pdf and word are from getid3 lib and can be ignored if you are using search lucene. It comes with special indexers for these filetypes.

The error about not beeing able to determine the file format for txt files also is from getid3 and might be caused by empty txt files.

Can you check if the reported txt file has 0 bytes? Can you search for a text in the pdf or word files and see if you get any results?

So long

Jörn



Stefan Vollmar <vollmar at nf.mpg.de> schrieb:
>Hello,
>
>we seem to have problems with indexing files - this apparently works
>well for some files and does not for others - so far we have not worked
>out a pattern.
>
>uname -a
>Linux owncloud 3.5.0-39-generic #60~precise1-Ubuntu 
>
>ownCloud 5.0.10
>
>Error messages in /owncloud/data/owncloud.log (see below) seem to
>suggest that the file type of simple ".txt" files could not be
>determined? These days, I would also expect indexing of PDF data - but
>a failure to index ".txt"-files definitely sound like a bug, right? 
>
>Many thanks in advance.
>
>Best regards,
> Stefan
>
>{"app":"PHP","message":"iconv(): Detected an illegal character in input
>string at
>\/var\/www\/owncloud\/apps\/search_lucene\/3rdparty\/Zend\/Search\/Lucene\/Analysis\/Analyzer\/Common\/TextNum.php#58","level":2,"time":"2013-08-22T20:00:07+00:00"}
>{"app":"PHP","message":"Only variables should be passed by reference at
>\/var\/www\/owncloud\/apps\/search_lucene\/lib\/indexer.php#163","level":2,"time":"2013-08-22T20:02:33+00:00"}
>{"app":"search_lucene","message":"failed to extract meta information
>for \/stefan\/files\/x.pdf: PDF parsing not enabled in this version of
>getID3()
>[1.9.3-20111213]","level":2,"time":"2013-08-22T20:02:34+00:00"}
>{"app":"search_lucene","message":"failed to extract meta information
>for \/stefan\/files\/y.doc: MS Office (.doc, .xls, etc) parsing not
>enabled in this version of getID3()
>[1.9.3-20111213]","level":2,"time":"2013-08-22T20:02:55+00:00"}
>{"app":"search_lucene","message":"failed to extract meta information
>for \/stefan\/files\/z.txt: unable to determine file
>format","level":2,"time":"2013-08-22T20:03:22+00:00"}
>{"app":"search_lucene","message":"failed to extract meta information
>for \/stefan\/files\/z (2).txt: unable to determine file
>format","level":2,"time":"2013-08-22T20:03:42+00:00"}
>-- 
>Dr. Stefan Vollmar, Dipl.-Phys.
>Head of IT group
>Max-Planck-Institut für neurologische Forschung
>Gleuelerstr. 50, 50931 Köln, Germany
>Tel.: +49-221-4726-213  FAX +49-221-4726-298
>Tel.: +49-221-478-5713  Mobile: 0160-93874279
>Email: vollmar at nf.mpg.de   http://www.nf.mpg.de
>
>
>
>
>
>
>
>
>------------------------------------------------------------------------
>
>_______________________________________________
>Owncloud mailing list
>Owncloud at kde.org
>https://mail.kde.org/mailman/listinfo/owncloud

-- 
Diese Nachricht wurde von meinem Android-Mobiltelefon mit K-9 Mail gesendet.
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://mail.kde.org/pipermail/owncloud/attachments/20130822/499ea6db/attachment.html>


More information about the Owncloud mailing list