LogicalSpark is pleased to announce the release of the Apache Tika Server OpenShift Cartridge. Created by LogicalSpark for a talk at JBUG Scotland, the cartridge allows users to get up and running with an Apache Tika Server instance in their environment with minimal effort.
If you would like to try out the Apache Tika Server, the commands below use a deployed instance that runs this cartridge.
To extract the contents of a file, you can use the following:
curl -T <file> http://tikaserver-logicalspark.rhcloud.com/tika
To extract the metadata of a file, you can use the following:
curl -T <file> http://tikaserver-logicalspark.rhcloud.com/meta
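If you run these commands often, the two calls above can be wrapped in small shell functions. The following is only a sketch: the function names (`tika_put`, `tika_text`, `tika_meta`) and the `TIKA_URL` variable are our own illustrative choices, not part of the cartridge or of Tika itself. It assumes `curl` is installed and that the server accepts uploads at `/tika` and `/meta`, as in the commands above.

```shell
#!/bin/sh
# Base URL of the Tika Server instance. Defaults to the deployed instance
# used in the commands above; set TIKA_URL to point at your own deployment.
# (TIKA_URL is a name chosen for this sketch, not something Tika defines.)
TIKA_URL="${TIKA_URL:-http://tikaserver-logicalspark.rhcloud.com}"

# tika_put ENDPOINT FILE
# Uploads FILE to $TIKA_URL/ENDPOINT, exactly as `curl -T` does above.
#   -sS     hides the progress meter but still prints errors
#   --fail  makes curl exit non-zero when the server returns an HTTP error
tika_put() {
    if [ ! -f "$2" ]; then
        echo "tika_put: no such file: $2" >&2
        return 1
    fi
    curl -sS --fail -T "$2" "$TIKA_URL/$1"
}

# Extract the contents of a file.
tika_text() { tika_put tika "$1"; }

# Extract the metadata of a file.
tika_meta() { tika_put meta "$1"; }
```

After sourcing the script, `tika_text report.pdf > report.txt` would save the extracted contents and `tika_meta report.pdf` would print the metadata, where `report.pdf` stands in for any local file.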
You can read more about the Tika JAX-RS server and its commands here.
All of the code is hosted on GitHub, and spinning up your own instance takes only a couple of minutes. Happy Parsing!