SearchStax Help Center


Can we use Apache Tika?

Some of our SearchStax Managed Search service clients use Apache Tika with their deployments.

SearchStax deployments include the following Tika .jar files by default:

tika-core-*.jar
tika-java7-*.jar
tika-parsers-*.jar
tika-xmp-*.jar

If there are other Tika jar files you would like to add, our support staff is ready to assist you. However, we cannot host the Tika server for you. That is out-of-scope for our support-level agreements (SLAs).

That said, we have noted some issues from our Tika clients:

  • Solr can issue a timeout error when Tika encounters a 100MB PDF file. We increased the timeout limits.
  • Tika had difficulty parsing Excel files due to a problem with Solr 8.6. We helped the client upgrade to Solr 8.8.1.

SearchStax has Solr Architects who provide Solr Advisory Services to premium clients on a contract basis. All others are advised to consult the Solr user community.

Questions?

Do not hesitate to contact the SearchStax Support Desk.


Return to Frequently Asked Questions.