Basic Info
Name | prometheus |
---|---|
Description | Prometheus itself exports metrics about scraping, storage etc. |
Default port | 9090 |
Endpoint | /metrics |
Metrics
Also see Go Exporter for generic metrics.
Type | Metric | Description |
---|---|---|
gauge | prometheus_build_info | A metric with a constant '1' value labeled by version, revision, branch, and goversion from which prometheus was built. |
gauge | prometheus_config_last_reload_success_timestamp_seconds | Timestamp of the last successful configuration reload. |
gauge | prometheus_config_last_reload_successful | Whether the last configuration reload attempt was successful. |
gauge | prometheus_engine_queries | The current number of queries being executed or waiting. |
gauge | prometheus_engine_queries_concurrent_max | The max number of concurrent queries. |
summary | prometheus_evaluator_duration_seconds | The duration of rule group evaluations. |
counter | prometheus_evaluator_iterations_skipped_total | The total number of rule group evaluations skipped due to throttled metric storage. |
summary | prometheus_local_storage_checkpoint_duration_seconds | The duration in seconds taken for checkpointing open chunks and chunks yet to be persisted |
gauge | prometheus_local_storage_checkpoint_last_duration_seconds | The duration in seconds it took to last checkpoint open chunks and chunks yet to be persisted. |
gauge | prometheus_local_storage_checkpoint_last_size_bytes | The size of the last checkpoint of open chunks and chunks yet to be persisted |
summary | prometheus_local_storage_checkpoint_series_chunks_written | The number of chunk written per series while checkpointing open chunks and chunks yet to be persisted. |
gauge | prometheus_local_storage_checkpointing | 1 if the storage is checkpointing, 0 otherwise. |
counter | prometheus_local_storage_chunk_ops_total | The total number of chunk operations by their type. |
gauge | prometheus_local_storage_chunks_to_persist | The current number of chunks waiting for persistence. |
counter | prometheus_local_storage_fingerprint_mappings_total | The total number of fingerprints being mapped to avoid collisions. |
counter | prometheus_local_storage_inconsistencies_total | A counter incremented each time an inconsistency in the local storage is detected. If this is greater zero, restart the server as soon as possible. |
summary | prometheus_local_storage_indexing_batch_duration_seconds | Quantiles for batch indexing duration in seconds. |
summary | prometheus_local_storage_indexing_batch_sizes | Quantiles for indexing batch sizes (number of metrics per batch). |
gauge | prometheus_local_storage_indexing_queue_capacity | The capacity of the indexing queue. |
gauge | prometheus_local_storage_indexing_queue_length | The number of metrics waiting to be indexed. |
counter | prometheus_local_storage_ingested_samples_total | The total number of samples ingested. |
summary | prometheus_local_storage_maintain_series_duration_seconds | The duration in seconds it took to perform maintenance on a series. |
gauge | prometheus_local_storage_max_chunks_to_persist | The maximum number of chunks that can be waiting for persistence before sample ingestion will stop. |
gauge | prometheus_local_storage_max_memory_chunks | The configured maximum number of chunks that can be held in memory |
gauge | prometheus_local_storage_memory_chunkdescs | The current number of chunk descriptors in memory. |
gauge | prometheus_local_storage_memory_chunks | The current number of chunks in memory, excluding cloned chunks (i.e. chunks without a descriptor). |
gauge | prometheus_local_storage_memory_dirty_series | The current number of series that would require a disk seek during crash recovery. |
gauge | prometheus_local_storage_memory_series | The current number of series in memory. |
counter | prometheus_local_storage_non_existent_series_matches_total | How often a non-existent series was referred to during label matching or chunk preloading. This is an indication of outdated label indexes. |
counter | prometheus_local_storage_out_of_order_samples_total | The total number of samples that were discarded because their timestamps were at or before the last received sample for a series. |
counter | prometheus_local_storage_persist_errors_total | The total number of errors while persisting chunks. |
gauge | prometheus_local_storage_persistence_urgency_score | A score of urgency to persist chunks, 0 is least urgent, 1 most. |
counter | prometheus_local_storage_queued_chunks_to_persist_total | The total number of chunks queued for persistence. |
gauge | prometheus_local_storage_rushed_mode | 1 if the storage is in rushed mode, 0 otherwise. In rushed mode, the system behaves as if the persistence_urgency_score is 1. |
histogram | prometheus_local_storage_series_chunks_persisted | The number of chunks persisted per series. |
counter | prometheus_local_storage_series_ops_total | The total number of series operations by their type. |
gauge | prometheus_local_storage_started_dirty | Whether the local storage was found to be dirty (and crash recovery occurred) during Prometheus startup. |
counter | prometheus_notifications_dropped_total | Total number of alerts dropped due to errors when sending to Alertmanager. |
gauge | prometheus_notifications_queue_capacity | The capacity of the alert notifications queue. |
gauge | prometheus_notifications_queue_length | The number of alert notifications in the queue. |
summary | prometheus_rule_evaluation_duration_seconds | The duration for a rule to execute. |
counter | prometheus_rule_evaluation_failures_total | The total number of rule evaluation failures. |
summary | prometheus_sd_azure_refresh_duration_seconds | The duration of a Azure-SD refresh in seconds. |
counter | prometheus_sd_azure_refresh_failures_total | Number of Azure-SD refresh failures. |
summary | prometheus_sd_consul_rpc_duration_seconds | The duration of a Consul RPC call in seconds. |
counter | prometheus_sd_consul_rpc_failures_total | The number of Consul RPC call failures. |
counter | prometheus_sd_dns_lookup_failures_total | The number of DNS-SD lookup failures. |
counter | prometheus_sd_dns_lookups_total | The number of DNS-SD lookups. |
summary | prometheus_sd_ec2_refresh_duration_seconds | The duration of a EC2-SD refresh in seconds. |
counter | prometheus_sd_ec2_refresh_failures_total | The number of EC2-SD scrape failures. |
counter | prometheus_sd_file_read_errors_total | The number of File-SD read errors. |
summary | prometheus_sd_file_scan_duration_seconds | The duration of the File-SD scan in seconds. |
summary | prometheus_sd_gce_refresh_duration | The duration of a GCE-SD refresh in seconds. |
counter | prometheus_sd_gce_refresh_failures_total | The number of GCE-SD refresh failures. |
counter | prometheus_sd_kubernetes_events_total | The number of Kubernetes events handled. |
summary | prometheus_sd_marathon_refresh_duration_seconds | The duration of a Marathon-SD refresh in seconds. |
counter | prometheus_sd_marathon_refresh_failures_total | The number of Marathon-SD refresh failures. |
summary | prometheus_sd_triton_refresh_duration_seconds | The duration of a Triton-SD refresh in seconds. |
counter | prometheus_sd_triton_refresh_failures_total | The number of Triton-SD scrape failures. |
summary | prometheus_target_interval_length_seconds | Actual intervals between scrapes. |
counter | prometheus_target_scrape_pool_sync_total | Total number of syncs that were executed on a scrape pool. |
counter | prometheus_target_scrapes_exceeded_sample_limit_total | Total number of scrapes that hit the sample limit and were rejected. |
counter | prometheus_target_skipped_scrapes_total | Total number of scrapes that were skipped because the metric storage was throttled. |
summary | prometheus_target_sync_length_seconds | Actual interval to sync the scrape pool. |
gauge | prometheus_treecache_watcher_goroutines | The current number of watcher goroutines. |
counter | prometheus_treecache_zookeeper_failures_total | The total number of ZooKeeper failures. |