Solr Performance Monitoring - SearchStax


Overview

SearchStax® Managed Solr provides real-time and historical statistical graphs of critical Solr server and JVM metrics and statistics. These statistics provide a birds-eye view of Solr cluster health, enabling you to tune server performance. This can be crucial when performing Solr shard management and setting up your collections for best performance.

Contents of this page:

How to Access the Pulse Graphs

You can access the graphs from the Managed Solr Dashboard.

To access the graphs:

  1. From the Managed Solr Dashboard, select Pulse.
  2. Using the droplist in the upper left corner, select a deployment.
  3. Using the second droplist, check the servers you want to inspect.

Multiple pages of graphs may be accessed from the horizontal menu bar at the top of the display.

SearchStax Pulse Monitoring

Pulse offers several pages of diagnostic displays, accessible through the menu bar at the top of the screen. These are:

Menu Description
CPU, Memory, JVM Shows server-related metrics
Collections List of collections in this deployment
Garbage Collection Garbage-collection metrics for Solr cluster
Throughput, Latency Search and indexing performance for the Solr cluster
Caches Cache-performance metrics for the Solr cluster
Alerts See Solr Service Alerting to manage alerts.
Service This graph reports the number of requests received by a cluster's load balancer.

Click on any menu to view the associated graphs.

About the Graphs

Managed Solr currently provides graphs for the following server and search metrics.

CPU, Memory, JVM

These are monitoring graphs for a server that is running out of memory.

SearchStax Solr Pulse monitoring SearchStax Solr Pulse monitoring SearchStax Solr Pulse monitoring

The following server-performance metrics are displayed as graphs by Pulse:

Metric Definition
CPU Usage (%) CPU load as a percent of maximum. (See Is 100% CPU a bad thing?)
System Load Average Number of processes per CPU over one-minute intervals. Values below 1.0 indicate that processes are not waiting.
Memory Total memory size and used memory size plotted on a single graph. (See How do I recognize a Solr Out-of-Memory Error?)
Swap Usage Total disc swap space available and used over time.
JVM Heap Memory Usage Heap memory usage over time. Heap memory is used for storing Java objects. When heap memory reaches 100%, Solr stops running.
Non-Heap Memory Non-heap memory usage over time. Non-heap memory is used by Java to store loaded classes and other metadata.
JVM Thread Count Server thread count usage over time.
1 min 5XX Error Rate 5XX errors summed over one-minute intervals.
Disk Usage Disk space usage in gigabytes. The dashed line is the maximum available.

Collections

The Collections table shows the health of replicas across servers.

SearchStax Pulse Collections Monitoring

Garbage Collection

Screenshot

The following garbage-collection metrics are reported by Managed Solr:

Metric Definition
ConcurrentMarkSweep CollectionTime & CollectionCount Collection time and count for the ConcurrentMarkSweep approach.
ParNew CollectionTime & CollectionCount Collection time and count for the ParNew approach.

Search Throughput and Latency

Screenshot

Managed Solr provides graphs that report on throughput and latency.

Metric Definition
Average Requests/second (ms) Reports average search requests per second.
Average Time/Request (ms) Reports average time for search request to fulfill the query.
5-minute Request Rate Reports search-request rate over a 5-minute interval.
15-minute Request Rate Reports search-request rate over 15-minute interval.
Timeouts Reports server timeouts, where the server did not respond to the request fast enough.
Errors Numbers of search-request errors.

Indexing Throughput and Latency

ScreenshotScreenshot

Managed Solr provides graphs that report on throughput and latency.

Metric Definition
Average Requests/Second Reports average indexing requests per second.
Average Time/Request (ms) Reports average time for indexing request to fulfill the query.
Timeouts Reports server timeouts, where the server did not respond to the request fast enough.
Errors Numbers of indexing request errors.
Update Handler graphs Reports various Update Handler metrics over a period of time, such as commits, autoCommits, deletesById, etc.

Caches

ScreenshotScreenshot

Managed Solr provides graphs that report on Caches.

Metric Definition
QueryResultCache This cache stores ordered sets of document IDs — the top N results of a query ordered by some criterion. Multiple metrics report how the query result cache is being utilized to fulfill search requests. Metrics include hitratio, evictions, warmup time, etc.
FilterCache This cache stores unordered sets of document IDs that match the queries. Multiple metrics report how the filter cache is being utilized to fulfill search requests. Metrics include itratio, evictions, warmup time, etc.
DocumentCache The documentCache stores Lucene Document objects that have been fetched from disk. Multiple metrics report how the document cache is being utilized. Metrics include itratio, evictions, warmup time, etc.
FieldValueCache The fieldValueCache is similar to fieldCache, and supports multiple values per document. It is primarily used for faceting. Metrics include warmup time, size.

Alerting

The Managed Solr alerting features are described on the Solr Service Alerting page.

Questions?

Do not hesitate to contact the SearchStax Support Desk.