SearchStax Named to the 2024 Deloitte Technology Fast 500 | LEARN MORE
Jul. 03, 2019
Karan Jeet Singh
|
Some of our SearchStax clients index websites that use multiple languages. We were recently asked how to enable Solr indexing of Mandarin on a cloud platform. (This post describes indexing Traditional Chinese characters. It is also possible to use Simplified Chinese by following a similar series of steps. Contact us at support@searchstax.com for an example.)
Update solrconfig.xml file by adding following line after all the lib declarations.
This library comes with Solr, so you don’t have to alter your deployment in any way to make it work.
A. Create a new field type in the managed-schema file with the SmartChineseAnalyzer.
<fieldType name="text_mandarin" class="solr.TextField"> <analyzer> <tokenizer class="solr.ICUTokenizerFactory"/> <filter class="solr.CJKWidthFilterFactory"/> <filter class="solr.LowerCaseFilterFactory"/> </analyzer> </fieldType>
B. Create a field that uses this field type.
<field name=”text_man” type=”text_mandarin” multiValued=”true” indexed=”true” stored=”true”/>
The Stack is delivered bi-monthly with industry trends, insights, products and more
Copyrights © SearchStax Inc.2014-2024. All Rights Reserved.
SearchStax Site Search solution is engineered to give marketers the agility they need to optimize site search outcomes. Get full visibility into search analytics and make real-time changes with one click.
close
SearchStax Managed Search service automates, manages and scales hosted Solr infrastructure in public or private clouds. Free up developers for value-added tasks and reduce costs with fewer incidents.
close
close