[Owncloud] Best architecture for a new app for showing web previews for any kind of files including office docs( word, power point, spreadsheet )

Lloyd Harischandra ayubowan.ll at gmail.com
Wed Jun 26 00:53:48 UTC 2013


Hi all,

We are developing an OwnCloud application for use in a University setting (
https://github.com/uws-eresearch/apps/tree/master/crate_it and
https://github.com/uws-eresearch/apps/tree/master/file_previewer).

One of the functions of our app is being able to show file-previews for any
kind of file, not just images. At the moment we are doing this via a
back-end service which creates web-ready previews for a variety of document
types including office docs (word, powerpoint, spreadsheets) for both the
Microsoft and open document formats as well as a variety of other file
formats.

We are wondering about the best way to integrate this functionality, and
what plans there might be for OC6.
* Are there events in the system which are triggered whenever there are new
files via the web UI AND via the sync client? We want to be able to
generate web-ready views of files as they are added, probably via a message
queue
* Is there a standard interface for previews/thumbnails, like the current
image viewer that can be registered somewhere so other plugins and the
files app can access them? Currently we have registered the actions to view
previews with fileactions registry.
* What are the plans for the Lucene search service? It should be possible
to combine some services. For example, if you're rendering a Word document
into HTML then the rendering process can extract a full-text version of the
doc, and also some metadata such as title and author, that could be put in
the Lucene index and possibly also the main database.

We'd love to contribute in this area and are wondering how best to get
involved.

Our current application is using this approach:

* A crude file-watching service crawls the OC data directory directly
(TODO: tie this to OC events or use the main DB)
* Each file is placed on a message queue and dealt with by a conversion
service which generates HTML and/or image thumbails, down-samples video etc
(this uses the Integrated Content Environment (ICE)
https://code.google.com/p/integrated-content-environment/)
* The converted assets are kept in a repository - this is based on The
Fascinator, a repository platfom used in a number of Australian
universities for managing research data:
https://sites.google.com/site/fascinatorhome/
* Our OC app hooks in to the Files app, so when a user clicks on a file it
looks for an HTML version of it in the repository.

The Fascinator also includes an Apache Solr index, so could provide both
full-text and faceted search, but we need to evaluate whether this is
necessary, given that OC has Lucene.

Dr Peter Sefton
Lloyd Harischandra
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://mail.kde.org/pipermail/owncloud/attachments/20130626/9df7371b/attachment.html>


More information about the Owncloud mailing list