summaryrefslogtreecommitdiff
path: root/monitoring
Commit message (Collapse)AuthorAgeFilesLines
* mgr/dashboard: UI fixesAvan Thakkar2020-02-102-2/+2
| | | | | | Fixes: https://tracker.ceph.com/issues/42914 Signed-off-by: Avan Thakkar <athakkar@redhat.com>
* mgr/grafana: sum pg states for clusterAleksei Zakharov2020-01-291-15/+14
| | | | | | Also, revert table formatting. Signed-off-by: Aleksei Zakharov <zaharov@selectel.ru>
* monitoring/grafana,prometheus: add per-pool pg states supportAleksei Zakharov2020-01-292-18/+19
| | | | Signed-off-by: Aleksei Zakharov <zaharov@selectel.ru>
* monitoring: add details to Prometheus' alertsPatrick Seidensal2020-01-241-28/+105
| | | | | | Fixes: https://tracker.ceph.com/issues/43764 Signed-off-by: Patrick Seidensal <pseidensal@suse.com>
* Merge pull request #32325 from Kriechi/fix-42982Jan Fajerski2020-01-201-1/+1
|\ | | | | monitoring: fix prometheus alert for full pools
| * monitoring: fix prometheus alert for full poolsThomas Kriechbaumer2019-12-181-1/+1
| | | | | | | | | | | | | | | | | | The existing alert (introduced via https://tracker.ceph.com/issues/24977) already triggers when still 50% of storage space are available. Fixes: https://tracker.ceph.com/issues/42982 Signed-off-by: Thomas Kriechbaumer <thomas@kriechbaumer.name>
* | Switch spelling of utilizationBryan Stillwell2020-01-071-2/+2
| | | | | | | | | | | | | | Prefer the non-British spelling of utilization since that's what the majority of the code base seems to use. Signed-off-by: Bryan Stillwell <bstillwell@godaddy.com>
* | mgr/dashboard: grafana charts match time picker selection. (#31964)Lenz Grimmer2019-12-033-6/+6
|\ \ | | | | | | | | | | | | | | | | | | mgr/dashboard: grafana charts match time picker selection. Reviewed-by: Jan Fajerski <jfajerski@suse.com> Reviewed-by: Laura Paduano <lpaduano@suse.com> Reviewed-by: Patrick Seidensal <pnawracay@suse.com>
| * | mgr/dashboard: grafana charts match time picker selection.Alfonso Martínez2019-12-033-6/+6
| | | | | | | | | | | | | | | Fixes: https://tracker.ceph.com/issues/43097 Signed-off-by: Alfonso Martínez <almartin@redhat.com>
* | | mgr/dashboard,grafana: remove shortcut menuErnesto Puerta2019-12-031-12/+2
|/ / | | | | | | | | | | | | Remove shortcut menu (links) and add check in grafana CI script. Fixes: https://tracker.ceph.com/issues/43091 Signed-off-by: Ernesto Puerta <epuertat@redhat.com>
* | monitoring: fix indentation of ceph default alertsPatrick Seidensal2019-11-181-4/+4
| | | | | | | | Signed-off-by: Patrick Seidensal <pseidensal@suse.com>
* | monitoring: wait before firing osd full alertPatrick Seidensal2019-11-181-0/+1
| | | | | | | | | | | | Fixes: https://tracker.ceph.com/issues/42862 Signed-off-by: Patrick Seidensal <pseidensal@suse.com>
* | mgr/dashboard: fix grafana dashboardsRadu Toader2019-10-308-72/+76
|/ | | | | | | | | | | | | Fixes: https://tracker.ceph.com/issues/42542 Sort order was wrong for some dashboards, fixed empty / buggy Top 3 clients IOPS by pool / Throughput - in Pools Overall performance fixed Avg utilization Multiple series found - in Host Overall performance Fixed invalid dimensions for plot - in OSD Overall performance Signed-off-by: Radu Toader <radu.m.toader@gmail.com>
* monitoring: SNMP OID per every Prometheus alert ruleVolker Theile2019-05-281-1/+17
| | | | | | | | | | | Use the Ceph enterprise OID 50495 (https://www.iana.org/assignments/enterprise-numbers/enterprise-numbers) and create OIDs for every Prometheus alert rule according to the schema at https://github.com/SUSE/prometheus-webhook-snmp/blob/master/README.md. Example OID: 1.3.6.1.4.1.50495.15.1.2.2.1 All alert rule OIDs are located below the object identifier 15 (15 for p which is the first character of prometheus). Check out the MIB at https://github.com/SUSE/prometheus-webhook-snmp/blob/master/PROMETHEUS-ALERT-CEPH-MIB.txt for more details. Signed-off-by: Volker Theile <vtheile@suse.com>
* monitoring: update Grafana dashboardsJan Fajerski2019-05-1411-648/+1119
| | | | | | | | | Fix various panels that used outdated metric names, cluncky or unnecessary label_replace calls. Also unify the style of many panels. Fixes: http://tracker.ceph.com/issues/39652 Signed-off-by: Jan Fajerski <jfajerski@suse.com>
* monitoring: add a few prometheus alertsJan Fajerski2019-04-262-0/+161
| | | | | | | | Alerts are from https://github.com/SUSE/DeepSea/blob/SES5/srv/salt/ceph/monitoring/prometheus/files/ses_default_alerts.yml but updated for the mgr module and node_exporter >= 0.15. Signed-off-by: Jan Fajerski <jfajerski@suse.com>
* monitoring/grafana: fix typo in READMEJan Fajerski2019-04-161-1/+1
| | | | Signed-off-by: Jan Fajerski <jfajerski@suse.com>
* mgr/dashboard: Fixed performance details context for host list row selectionNeha Gupta2019-01-182-2/+2
| | | | | | Fixes: http://tracker.ceph.com/issues/37854 Signed-off-by: Neha Gupta <gnehapk@gmail.com>
* monitoring/grafana: new RBD overview dashboard pageJason Dillaman2019-01-111-0/+685
| | | | | | This page pulls RBD stats from the Natuatilus prometheus exporter. Signed-off-by: Jason Dillaman <dillaman@redhat.com>
* cmake: Support grafana dashboard installationBoris Ranto2018-10-251-0/+8
| | | | | | | We are currently hosting the grafana dashboards in our repo but we do not install them. This patch adds the cmake support. Signed-off-by: Boris Ranto <branto@redhat.com>
* Merge pull request #24314 from rhcs-dashboard/dashboardsLenz Grimmer2018-10-1921-1/+6109
|\ | | | | | | | | mgr/dashboard: Grafana dashboard updates and additions Reviewed-by: Boris Ranto <branto@redhat.com>
| * MGR/dashboard: make grafana datasource selectablePaul Cuzner2018-10-091-7/+22
| | | | | | | | | | | | | | Grafana dashboard updated to use a templating variable for the datasource Signed-off-by: Paul Cuzner <pcuzner@redhat.com>
| * MGR/dashboard: make grafana datasource selectablePaul Cuzner2018-10-091-5/+20
| | | | | | | | | | | | | | Grafana dashboard updated to use a templating variable for the datasource Signed-off-by: Paul Cuzner <pcuzner@redhat.com>
| * MGR/dashboard: make grafana datasource selectablePaul Cuzner2018-10-091-7/+22
| | | | | | | | | | | | | | Grafana dashboard updated to use a templating variable for the datasource Signed-off-by: Paul Cuzner <pcuzner@redhat.com>
| * MGR/dashboard: make grafana datasource selectablePaul Cuzner2018-10-091-4/+19
| | | | | | | | | | | | | | Grafana dashboard updated to use a templating variable for the datasource Signed-off-by: Paul Cuzner <pcuzner@redhat.com>
| * MGR/dashboard: make grafana datasource selectablePaul Cuzner2018-10-091-10/+26
| | | | | | | | | | | | | | Grafana dashboard updated to use a templating variable for the datasource Signed-off-by: Paul Cuzner <pcuzner@redhat.com>
| * MGR/dashboard: make grafana datasource selectablePaul Cuzner2018-10-091-8/+23
| | | | | | | | | | | | | | Grafana dashboard updated to use a templating variable for the datasource Signed-off-by: Paul Cuzner <pcuzner@redhat.com>
| * MGR/dashboard: make grafana datasource selectablePaul Cuzner2018-10-091-12/+27
| | | | | | | | | | | | | | Grafana dashboard updated to use a templating variable for the datasource Signed-off-by: Paul Cuzner <pcuzner@redhat.com>
| * MGR/dashboard: make grafana datasource selectablePaul Cuzner2018-10-091-12/+27
| | | | | | | | | | | | | | Grafana dashboard updated to use a templating variable for the datasource Signed-off-by: Paul Cuzner <pcuzner@redhat.com>
| * MGR/dashboard: make grafana datasource selectablePaul Cuzner2018-10-091-3/+18
| | | | | | | | | | | | | | Grafana dashboard updated to use a templating variable for the datasource Signed-off-by: Paul Cuzner <pcuzner@redhat.com>
| * Fix linewidth issue in pools overview dashboardPaul Cuzner2018-10-091-2/+2
| | | | | | | | | | | | | | | | Linewidth was set to two, but the idea is that a linewidth of >1 is reserved for eye-catcher plot lines like maximums Signed-off-by: Paul Cuzner <pcuzner@redhat.com>
| * Refresh of the dashboardsPaul Cuzner2018-10-097-207/+80
| | | | | | | | | | | | | | Fixes some minor anomalies and tested against node_exporter 0.15 and 0.16 Signed-off-by: Paul Cuzner <pcuzner@redhat.com>
| * Updated requirements informationPaul Cuzner2018-10-091-1/+3
| | | | | | | | Signed-off-by: Paul Cuzner <pcuzner@redhat.com>
| * Added new Overview dashboardsPaul Cuzner2018-10-092-0/+1682
| | | | | | | | | | | | | | | | These new dashboard definitions provide the high level views for the hosts in the cluster and the OSDs. Signed-off-by: Paul Cuzner <pcuzner@redhat.com>
| * Screenshots added for all dashboardsPaul Cuzner2018-10-0910-0/+0
| | | | | | | | Signed-off-by: Paul Cuzner <pcuzner@redhat.com>
| * Add Host level details dashboardPaul Cuzner2018-10-092-4/+1142
| | | | | | | | | | | | | | | | | | | | | | | | | | | | The host-details.json file provides a view of host level metrics. The panels are arranged in two rows; Overview : Cpu/RAM/Network related stats OSD Performance: OSD physical drive stats The overview row is shown by default. Click on the OSD Performance row to show the remaining graphs Signed-off-by: Paul Cuzner <pcuzner@redhat.com>
| * Document the current state of the dashboardsPaul Cuzner2018-10-091-0/+25
| | | | | | | | Signed-off-by: Paul Cuzner <pcuzner@redhat.com>
| * Initial grafana dashboard definitionsPaul Cuzner2018-10-096-0/+3252
| | | | | | | | Signed-off-by: Paul Cuzner <pcuzner@redhat.com>
* | [grafana] Fix OSD Capacity Utlization graphMaxime2018-10-041-1/+1
|/ | | | Signed-off-by: Maxime <maxime@root314.com>
* add monitoring subdir and Grafana cluster dashboardJan Fajerski2018-05-074-0/+1262
Signed-off-by: Jan Fajerski <jfajerski@suse.com>