JSON Metrics
Routinator’s monitoring service provides comprehensive
metrics in JSON format /api/v1/status endpoint. Here you can find an
overview of all metrics and their meaning.
The JSON metrics consists of an object with the following members:
versionThe version of Routinator.
serialThe current serial number for data served to RTR clients.
nowThe date and time in UTC when this report was created.
lastUpdateStartThe date and time in UTC when the last validation run started.
lastUpdateDoneThe date and time in UTC when the last validation run completed.
lastUpdateDurationThe duration of the last validation run in seconds.
talsMetrics for each configured trust anchor. In most cases these will be the five Regional Internet Registries, but will include the trust anchors of any configured testbeds as well.
Each element of this object contains a publication metrics value as described below.
repositoriesMetrics for each repository encountered during validation. Note that the data given here relates to the repository content used during validation. If the repository failed to update, then these numbers are from the stored old data.
Each element of this object contains a publication metrics value as described below. In addition, there is a member
typethat describes whether the repository is an RRDP or rsync repository.vrpsAddedLocallyThe number of VRPs added to the final data set from local exceptions.
rsyncMetrics for updates via rsync.
This is an object with one element for each repository that was updated via rsync during the last validation run. Each element contains an rsync update metrics value as described below.
rrdpMetrics for updates via RRDP.
This is an object with one element for each repository that was updated via rsync during the last validation run. Each element contains an RRDP update metrics value as described below.
rtrMetrics for the built-in RTR server. See RTR metrics below.
httpMetrics for the built-in HTTP server. See HTTP metrics below.
Publication Metrics
Publication metrics are provided both for all trust anchors and for each RPKI repository. They contain the following information:
vrpsTotalThe total number of VRPs found to be present and valid.
vrpsUnsafeThe number of VRPs that are considered unsafe. Depending on configuration, these may be included in the final set or dropped from it.
vrpsLocallyFilteredThe number of VRPs that are filtered as the result of a local exception.
vrpsDuplicateThe number of duplicate VRPs resulting from ROAs containing the same authorisation.
Note that if a VRP appears in multiple trust anchors or repositories, which occurrence is considered the duplicate depends on the order of processing which may change between validation runs. Thus, this number may change unexpectedly.
vrpsFinalThe number of VRPs that are contributed by this trust anchor or repository to the final set provided to your routers. This is the total number of VRPs, minus the ones that are locally filtered, duplicate, and, if configured to be dropped, unsafe.
validPublicationPointsThe number of valid publication points.
rejectedPublicationPointsThe number of rejected publication points.
A publication point is rejected if its manifest is invalid or if any objects listed on the manifest are missing or have a different content hash.
validManifestsThe number of valid manifests.
invalidManifestsThe number of invalid manifests.
A manifest is invalid if it is not correctly encoded, has expired or is not correctly signed by the issuing CA.
staleManifestsThe number of stale manifests.
A manifest is stale if the current time is past the time an update to the manifest should have been issued. Whether a stale manifest is valid or invalid depends on configuration. By default it is considered invalid.
missingManifestsThe number of missing manifests.
validCRLsThe number of valid certificate revocation lists.
invalidCRLsThe number of invalid certificate revocation lists.
A CRL is invalid if it is not correctly encoded or is not correctly signed by the issuing CA.
staleCRLsThe number of stale certificate revocation lists.
A CRL is stale if the current time is past the time an update should have been issued. Whether a stale CRL is valid or invalid depends on configuration. By default it is considered invalid.
strayCRLsThe number of stray certificate revocation lists.
Each CA should only issue one CRL. This CRL should both be listed on the manifest and used by the manifest’s certificate itself. Any manifest listed on the manifest that is not also the manifest’s own CRL is considered a stray.
validCACertsThe number of Certificate Authority (CA) certificates found to be present and valid.
validEECertsThe number of End Entity (EE) certificates found to be present and valid.
This only refers to such certificates included as stand-alone files which are BGPsec router certificates.
invalidCertsThe number of invalid stand-alone certificates, either CA or EE certificates.
validROAsThe number of valid Route Origin Attestations
invalidROAsThe number of invalid Route Origin Attestations.
validGBRsThe number of valid Ghostbusters Records.
Note that currently the content of a Ghostbuster Record is not checked.
InvalidGBRsThe number of invalid Ghostbusters Records.
otherObjectsThe number of objects found that are not certificates (.cer), Certificate Revocation Lists (.crl), manifests (.mft), ROAs (.roa), or Ghostbuster Records (.gbr).
Rsync Update Metrics
For each repository updated via rsync the following values are given.
statusThe status code returned by the rsync process. A value of 0 means the process has finished successfully. The meaning of other values depends on the rsync client used. Please refer to its documentation for further details.
durationThe duration the rsync process was running in seconds.
RRDP Update Metrics
For each repository updated via RRDP the following values are given.
statusThe overall status of the update. This will be 200 if the updated succeeded, 304 if no update was necessary because the data was already current, and any other value for a failed update. If the value is -1, it was not possible to reach the HTTPS server at all.
notifyStatusThe status of retrieving the notification file. This is the first step of an RRDP update. A value of 200 indicates that the file was successfully retrieved. A value of 304 indicates that the file hasn’t changed since last update and no actual update is necessary. Any other value represents an error.
payloadStatusThe status of retrieving the actual payload. This is the second step of an RRDP update and may either represent a single HTTPS request for the snapshot file or a series of HTTPS request for the sequence of delta files necessary to update from the last known state.
A value of 0 means that no payload retrieval was necessary. A value of 200 means that the update was successful. Any other value indicates an error. In case of a sequence of delta updates, this error may have been preceded by one or more successful requests.
durationThe overall duration of the RRDP update in seconds.
serialThe serial number stated by the RRDP server for the current data set. With each update the serial number is increased by one.
sessionThe identifier of the current session of the RRDP server. Serial numbers are only valid within the same session. If the server needs to restart its sequence for whatever reason, it needs to choose a new session ID and all data will have to be updated through a snapshot.
deltaWhether data was updated via a sequence of deltas (
true) or a full snapshot had to be retrieved (false).snapshotReasonIf this is not
null, it provides a reason why a snapshot was used instead of a delta as a short explanatory string.
RTR Server Metrics
A number of metrics are provided describing the state of the included RTR server. These metrics are available whether the RTR server is actually enabled or not.
currentConnectionsThe number of currently open RTR connections.
bytesReadThe total number of bytes read from RTR connections. In other words, describes how much data has been sent by clients.
bytesWrittenThe total number of bytes written to RTR connections. In other words, describes how much data has been sent to clients.
If rtr-client-metrics are enabled via configuration or command line,
an additional object clients will appear that list the IP addresses of
clients seen by the RTR server providing the following information for them.
connectionsThe number of currently open connections from that address. The number should normally be 0 or 1 but can be higher if the address is the public side of a NAT.
updatedContains the time of the last successful update by the client.
lastResetContains the time of the last successful cache reset by the client.
resetQueriesContains the number of reset queries by the client.
serialQueriesContains the number of serial queries by the client.
serialThe highest serial of the data provided to a client from that address. This can be used to determine when the client has last updated.
readandwrittenBytes read from and written to clients from that address.
HTTP Server Metrics
A number of metrics are provided describing the state of the included HTTP server.
totalConnectionsThe total number of connections made with the HTTP server.
currentConnectionsThe number of currently open connections. This should at least be 1 as there is a connection open when requesting the JSON metrics.
requestsThe total number of requests received and answered by the HTTP server.
bytesReadandbytesWrittenThe number of bytes read from and written to HTTP clients.