<html><head></head><body>The warnings about the PDF and Word files come from the getID3 library and can be ignored if you are using search_lucene, which ships with its own indexers for these file types.<br>
<br>
The error about not being able to determine the file format of the txt files also comes from getID3 and might be caused by empty txt files.<br>
<br>
Can you check whether the reported txt file has 0 bytes? And can you search for some text that occurs in the PDF or Word files and see whether you get any results?<br>
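<br>
If there are many files, the 0-byte check can be scripted. This is only a rough sketch, assuming Python is available on the server; the function name is made up for illustration and the directory to scan (for example the user's files folder inside the ownCloud data directory) is passed as an argument:<br>
<pre>
```python
import os
import sys


def find_empty_txt_files(root):
    """Return the sorted paths of all zero-byte .txt files below root."""
    empty = []
    for dirpath, _dirnames, filenames in os.walk(root):
        for name in filenames:
            if not name.lower().endswith(".txt"):
                continue
            path = os.path.join(dirpath, name)
            try:
                size = os.path.getsize(path)
            except OSError:
                # Unreadable entry (e.g. a broken symlink): skip it.
                continue
            if size == 0:
                empty.append(path)
    return sorted(empty)


if __name__ == "__main__":
    # Directory to scan; falls back to the current directory.
    root = sys.argv[1] if len(sys.argv) == 2 else "."
    for path in find_empty_txt_files(root):
        print(path)
```
</pre>
Any path it prints is a txt file with 0 bytes, which would fit the "unable to determine file format" message.<br>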
<br>
So long<br>
<br>
Jörn<br><br><div class="gmail_quote"><br>
<br>
Stefan Vollmar &lt;vollmar@nf.mpg.de&gt; wrote:<blockquote class="gmail_quote" style="margin: 0pt 0pt 0pt 0.8ex; border-left: 1px solid rgb(204, 204, 204); padding-left: 1ex;">
<pre class="k9mail">Hello,<br /><br />we seem to have problems with indexing files - this apparently works well for some files and does not for others - so far we have not worked out a pattern.<br /><br />uname -a<br />Linux owncloud 3.5.0-39-generic #60~precise1-Ubuntu <br /><br />ownCloud 5.0.10<br /><br />Error messages in /owncloud/data/owncloud.log (see below) seem to suggest that the file type of simple ".txt" files could not be determined? These days, I would also expect indexing of PDF data - but a failure to index ".txt"-files definitely sound like a bug, right? <br /><br />Many thanks in advance.<br /><br />Best regards,<br />Stefan<br /><br />{"app":"PHP","message":"iconv(): Detected an illegal character in input string at \/var\/www\/owncloud\/apps\/search_lucene\/3rdparty\/Zend\/Search\/Lucene\/Analysis\/Analyzer\/Common\/TextNum.php#58","level":2,"time":"2013-08-22T20:00:07+00:00"}<br />{"app":"PHP","message":"Only variables should be passed by reference at
\/var\/www\/owncloud\/apps\/search_lucene\/lib\/indexer.php#163","level":2,"time":"2013-08-22T20:02:33+00:00"}<br />{"app":"search_lucene","message":"failed to extract meta information for \/stefan\/files\/x.pdf: PDF parsing not enabled in this version of getID3() [1.9.3-20111213]","level":2,"time":"2013-08-22T20:02:34+00:00"}<br />{"app":"search_lucene","message":"failed to extract meta information for \/stefan\/files\/y.doc: MS Office (.doc, .xls, etc) parsing not enabled in this version of getID3() [1.9.3-20111213]","level":2,"time":"2013-08-22T20:02:55+00:00"}<br />{"app":"search_lucene","message":"failed to extract meta information for \/stefan\/files\/z.txt: unable to determine file format","level":2,"time":"2013-08-22T20:03:22+00:00"}<br />{"app":"search_lucene","message":"failed to extract meta information for \/stefan\/files\/z (2).txt: unable to determine file format","level":2,"time":"2013-08-22T20:03:42+00:00"}</pre></blockquote></div><br>
</body></html>