Systems Health Map
BVQ System health map is the game changer feature in BVQ v6.0. It delivers the answer to the main question: "Are my systems healthy?". We have broken this down into details like:
- "Do my systems hit any configuration limitations?"
- "Is my system configuration according to best practices?"
- "Are my licenses in compliance?"
- "Is the status of all my objects healthy?"
- "Is there enough capacity left to ensure safe operation?"
- "Is my performance at risk or imbalanced?"
- "Are there any network congestion issues?"
The BVQ Systems health map is a compact visualization of your systems' health status in form of a sunburst chart. Each section of this layered donut chart shows either a group of states in the inner rings or system checks on the outer edge ring.
The status level of a group or check is visualized in colors similar to traffic lights:
A group's status level is always propagated from the subordinated groups or checks with the worst status level.
All system checks are based on BVQ Alert rules and their calculated result delivers the status level for the map. Therefore the Systems health map is a valuable extension of our cherished Alerting feature and awards its maximum flexibility and a familiar user experience.
You can interactively drill down from groups to their details like alert rules and affected objects. Click on a group section to see more details of subsections and the contained Alert rules. You can also move backwards.
Drill down capabilities
On the right side of the map, there are two additional areas:
- The top one shows a pie chart with the distribution of status levels across all contained objects of a group or Alert rules. The more red you see here the more critical your issue is.
The lower one shows a button row and a table containing affected objects. This area is only filled when an Alert rule is selected on the outer edge ring.
The buttons in the upper row lead to the following functions:
Info Single Alert rule overview ✎ Edit Alert rule editor Help text of the Alert rule
The table shows all objects and their levels affected by the selected Alert rule. The inline buttons lead to:
Info Single Object Alert level overview Eye Exclude this Object from all maps
Current state is not enough! How were the states yesterday? Last week? Did I miss a critical state change?
On top of the map panel you can choose a time period of interest.
- The maps always show the aggregated worst state in the selected time period.
- “Autoplay” will keep the maps up to date with respect to the prior selected timeframe.
- The default timeframe is set to one hour.
Alert rule grouping
Two grouping views are available:
Components are preconfigured and not customizable. Alert rules are grouped according to their object types by major system functions or hardware components.
In this view, System and User categories are displayed:
- System categories are preconfigured and not customizable. Alert rules are grouped according to their system type, criticality and check types (i.e. Limits, Configuration health, License compliance, Component status, Capacity usage, Performance and Transmission health)
- User categories can be created in an individual hierarchy in the Alerting configuration.
Systems health maps are configured as part of the BVQ Alerting configuration:
Three types of rules available
1. Predefined rules:
- Preconfigured set of system checks based on best practices
- Limited customization potential
- Ready to go: in most cases you just need to turn them on.
2. Custom rules:
- Template rules (introduced in BVQ v5.7):
- Preconfigured rule templates
- Medium customization potential
- Easy to adjust: Wizard based configuration
- Advanced rules (introduced in BVQ v5.5):
- Use all options: Maximum customization flexibility
- Configured via advanced Web editor or Expert GUI
Predefined alert rules
This type of rules is new in BVQ 6.0. Internally it is still based on classic Alert rules introduced with BVQ v5.5 but they are simplified with the objective to be most easy to activate and adjustable with an acceptable reduction of flexibility.
~ 150 of them come preconfigured with your BVQ installation and are ready to go: In most cases you just need to turn them on.
You can set an optional notification method for all checks in a given category.
We have chosen the preconfigured thresholds according to best practices but you can adjust them individually.
Custom alert rules integration
For a maximum of configuration flexibility, you can optionally select a category for any of your custom alert rules, no matter if they are created by the template or advanced editor. With that, you can integrate your own rules at any place in the maps or extend them with your own categories.
Object Status Exclusion
Is your Health map red due to known problems? To get it green, you can exclude unwanted Alert results for any object in question.
If you want an object to be effective in the health maps again, just go to the Alerting results > Excluded Tab and re-include this object again:
Systems health map in Expert GUI
Health map states are also present in the Expert GUI, to support deeper interactive
analysis and drill down:
- Use Treemap Coloring > Alerting > Object specific > Overall to color the Treemap's plates.
This will give you a quick and filterable overview of affected objects.
- Open an object’s detail pane for detailed status information.
Current states of all System checks and other active Alert rules will show up here.
Worst Alert level history charts
No longer miss a critical event!
Du to the new worst level aggregation capabilities of the new System health maps feature, we can now offer a major enhancement for the classic Alert rule history charts shown in the Alerting results overviews.
Alert level history charts “per Alert rule” or “per Object” now respect all state changes in a displayed interval.
Example: Chart showing the past 4 hours Alert level history of an Alert rule:
- Here, one bar is shown all 30 minutes representing
the objects’ worst Alert level of the past 30 minutes before.
- The counter values show the number of objects
affected by this Alert rule and level.
- The bar at 13:25 shows the time from 12:55 to 13:25:
- 45 objects affected by this Alert rule in total
- 6 objects reached or stayed at a worst level of “Error”
- 2 objects reached or stayed at a worst level of “Warn”
- 37 objects stayed at level of “OK”
In older BVQ versions, a bar only represented the last Alert level change before its point in time and ignored all other changes since its forgone bar. So an “Error” in “OK → Error → OK” transitions was hidden between two green bars.
Redesigned report template creation
Report template creation in the BVQ Web UI has been completely redesigned and is now easier to use.
Go to Reporting → Templates → Create new template to start the configuration:
Fill your report with content by adding Snippets:
- Use General Layout elements to structure the report by adding headers, page breaks or informational text
Add individual charts or topology tables that are saved in BVQ Favorites - all System favorites and those stored in the All Users-Folder are available for reporting. Favorites stored in the User-Folder are not listed.
System Favorites are subject to change
Please keep in mind that System Favorites are maintained by BVQ and their content might change between different BVQ versions without notice. Therefore, please use them with care or use a copy of the corresponding favorite in the All users folder to ensure compatibility with your applications!
- Select Other chart types if you want to add tables with alerts or events to your report or if you want to add a pie chart.
- Snippet loops can be used in reports once a Snippet loop object type has been selected and added to the report.
Fill in the OBJECT TYPE and add further Snippets to your loop.
Guidance through the WebUI: Server-Tours
The BVQ Web UI now offers Server-Tours which automatically show up if the BVQ server is started for the first time. Those tours quickly explain the usage of the application and inform about news and changes.
There are currently three tours available:
- Sidebar - Learn which main features the Web UI offers and how to use them
- Alerting Configuration - Learn how to configure the new System Health Map feature using Alert rules
- Alerting Results - Learn how the results of the Alert rules are visualized and displayed
Each tour can be turned on and off individually on a per user basis by clicking on the User's Home screen:
Show context object type in performance charts
The behaviour which objects types are shown in a performance chart has been improved.
While in previous releases only the selected object type was part of the performance chart, now the context object type is included as well.
This is best explained in an example:
The screenshot below shows the treemap of path BVQ/Cluster/MDisk group/VDisk. Three MDisk groups have been selected to analyze their VDisk performance.
In earlier releases, this chart would initially only show one aggregate line which is the BVQ object containing all VDisks that belong to the previously selected MDisk groups. The information about the selected MDisk groups themselves in this content was lost. It was difficult to tell which VDisk is in which MDisk group.
Now one line per content object type is drawn. In this case this is the MDisk group which was the selection base in the above screenshot.
The System favorite Events is now much more comfortable and flexible to filter. In addition, all of those filtering capabilites are available for performance charts as well.
While in earlier BVQ releases only a few filter options were available, much more functionality has been added in this release:
- More filter options are available. Select one, more or all options in a category or search for a particular Object name or description:
- Specify which table columns are displayed:
- Enable Live monitoring to be informed about new entries immediately:
- Profit from the same functionality in all performance charts:
Enabling Events in the Drawing-Tab of the performance property panel, adds an additional tab Event Filters to the pane which offers the same options as the Event table system favorite.
Requirements and restrictions
Requirements of the HW/SW environment
Minimum BVQ version required for an update
Update from a previous MongoDB Installation step by step
- Check for the correct function of the whole environment, in particular the function of the scanner.
- Manual stop of BVQ Scanner- and Server-services and also close all GUIs
- Backup the BVQ installation directory and BVQ DB (e.g. DB backup with
mongodumpor VMware snapshots)
- Restart the BVQ Scanner
- Update BVQ Install Package Repository:
- MongoDB update
- BVQ GUI update
- BVQ Server update
- Test the function of the BVQ environment.
- If necessary inform colleagues about the need to update locally installed BVQ GUI clients.
|BVQ-14578||Scanner: VMware||Change error handling in case of errors during performance scane|
|BVQ-14605||WebUI: Notifications||Create notification configuration to call an executable|
|Major||BVQ-14601||SVC Perf Persist||Folder access denied|
|Minor||BVQ-14609||GUI: Event Filter||Invisible object types are shown|
|Critical||BVQ-14589||SVC: Performance Persist||Port speed information is missing|
|Major||BVQ-14587||MongoDB||Exceeded memory limit for $group|
|BVQ-14469||General||Change semantic of subelementcount|
Allow floating point values for system checks
|BVQ-14467||WebUI: Reporting: Template Creation||Remove interval for "Aggregated capacity charts"|
|BVQ-14255||Server||Integrate Metric/PI/Attribute Store as html reference|
|BVQ-14538||REST Interface||Accept smaller intervals down to 1min|
|BVQ-14354||Reporting: Datasources||Enable CSV output for topo tables|
|BVQ-14532||Scanner: SVC||Add on/off switch for htfcs-scan|
|BVQ-14523||Server & Scanner||check filesystem privileges on startup|
|BVQ-14554||Brocade SAN: Persister||Exception when trying to persist topology data with Fabrics connected to many SANs|
|BVQ-14546||Brocade SAN||Added duplicate Node Port WWPN handling|
|Critical||BVQ-14463||GUI||Exception when trying to open capacity view on a VMware Object|
|Critical||BVQ-14520||Scanner||SVC Scanner persist too slow|
|Major||BVQ-14552||Alerting||Disabling a custom alert rule doesn't update the valid dates|
|Major||BVQ-14524||Alerting||Valid_until setting for disabled rule could be overwritten|
|Major||BVQ-14500||GUI: Event-Filter||Only SVC cluster are shown for BVQ Alerting - System selection|
|Major||BVQ-14513||HFTCS||set hftcs scan regardless of licensed capacity|
|Minor||BVQ-14567||Server||Reference: Custom PI attributes shouldn't be visible|
|Minor||BVQ-14539||Server||Web UI freezes when hitting ESC in guided tours view|
|Minor||BVQ-14540||Grafana||Fix typo for Capacity used|
|BVQ-14210||GUI: Detail pane: All tab||Adjusted column width should survive an Object change|
|BVQ-14389||GUI Favorites||New BVQ Favorite structure|
|BVQ-14135||Scanner||Zip perf folder|
|BVQ-14280||Scanner: VMware: PerfScan||Add timeout when scan is not finished within a certain timeframe|
|BVQ-14291||WebUI: Reporting||New system reports for SAN Performance & VMware Capacity|
|BVQ-14319||Common||Make number parsing lenient|
|Major||BVQ-14211||VMware||Null Object: VM Cluster: Undef VM Clusters are not linked to an Undef VM DC|
|Major||BVQ-14343||WebUI: Alerting: Custom rules||Selected Folder not respected|
|Minor||BVQ-14429||GUI: Property Panel: Table columns:||Linked and Foreign Objects cannot be selected as group|
|Minor||BVQ-14425||WebUI: Alerting||Unable to disable "SHOW IN HEALTH MAPS" in complex editor|
|Minor||BVQ-14421||WebUI: Predefined Alert Rules||Incorrect conditions in alert rule "Port Logins per SVC FC Node port"|
|Minor||BVQ-14377||WebUI: Predefined Alert Rules||Remove incorrect predefined rules|
|Minor||BVQ-14436||WebUI: Reporting||Table favourite with multi OT filters is ignored by select system during report creation|
|Minor||BVQ-14378||WebUI: Scanner overview||Logs not always accessible|
|BVQ-14048||Common||Change vmVM CPU ms per state to CPU % per state|
|BVQ-14259||GUI||Cache files should be deleted after update|
|BVQ-13970||GUI||Node/SVC Host port should use the same path as IO group/SVC Host port|
|BVQ-14330||Grafana||Add new system dashboards for SVC capacity|
|Critical||BVQ-14386||GUI||Exception on Performance chart reload (Ctrl-R)|
|Critical||BVQ-14346||SVC||Cluster CPU% aggregation shows sum of node instead of mean|
|Critical||BVQ-14340||Server||DB Backup download corrupt|
|Major||BVQ-14344||Alerting: Systems health map||VM Memory Usage Alert seems to check CPU|
|Major||BVQ-14230||GUI: Editor: MDisk||Drive array no longer has an Add and Newest Button|
|Major||BVQ-14350||Reporting||Cannot add object name as header to snippet loop|
|Major||BVQ-14376||Grafana||Filter in annotations ignored|
|Minor||BVQ-14325||Alerting: Systems health map||Timeframe is set to 1 minute if autoplay is enabled and timeframe deleted|
|Minor||BVQ-14373||Alerting: Predefined alert rules||Incorrect pre-filters in rule "VDisk copies on same Site or MDisk group"|
|Minor||BVQ-14349||Alerting: Predefined alert rules||Max number of extents-rule must ignore FS840/FS900|
|Minor||BVQ-14335||Alerting: Custom alert rules||Drag & Drop of folders and alert rules not possible|
|Minor||BVQ-14323||Alerting||Change initial value for results to Unknown|
|Minor||BVQ-14324||Reporting||Warning about not found snippet.mimicryPie.cost_center.dataObjectTable|
|Minor||BVQ-14375||GUI: Favorites||Performance Overview favorites open with change-flag|
|Minor||BVQ-14216||GUI: L&F Black eye: Table view||Link Texts unreadable when selected|
|Minor||BVQ-14336||GUI: Table view||Column width in tree mode not correct|
|Minor||BVQ-14260||GUI: Table view||Refresh clears selection from table column filters|
|Minor||BVQ-14353||DataGrid: SVC IOGrp||"Logged in Host port count" counts same SVC Host Port multiple times|
|BVQ-14310||WebUI: Alerting||Rework existing Alert templates and add additional ones|
|BVQ-13453||GUI: Capacity charts||Improve Y-Axis annotation|
|BVQ-14320||Grafana||Update SAN - Fabric overview Dashboard|
|BVQ-14321||Grafana||Update VMware - Cluster overview Dashboard|
|BVQ-14315||Web UI||Adapt router names to match alerting routes|
|Major||BVQ-14328||GUI: Table||Exception when adding attribute Special > Alerting > Level in a VDisk table|
|Major||BVQ-14332||GUI: Performance view||No Undo of Timing change possible|
|Minor||BVQ-14200||GUI: Performance view||Inconsistent fonts in a charts|
|Minor||BVQ-14329||GUI: Performance view||Setting filters causes Nullpointer exception|
|Minor||BVQ-14331||GUI: Capacity Charts||Exception if clusters are not loaded|
|Minor||BVQ-14327||GUI: Filtering||Attributes overall_aggregated_alert_level and overall_alert_level are not working in filters|
|Minor||BVQ-14337||GUI: Alerting||Alert rule name is UNKNOWN for predefined rules in Event Details|
|Minor||BVQ-14317||WebUI: Notifications||'Required user role' in 'Web configuration' should be a mandatory field|
|Minor||BVQ-14326||WebUI: Predefined alert rules||Incorrect pre-filters in some rules|
|Minor||BVQ-14239||WebUI: DB Backups||Backup download invisible for user and running in background|
|BVQ-13892||System Health Map||Visualize alerting results in system health maps|
|BVQ-14077 & BVQ-14106||Reporting||Improve report template creation and usability|
|BVQ-14158||WebUI||Tour through UI|
|BVQ-14098||GUI||Adjust GUI on Different Screen Resolutions|
|BVQ-14027||GUI: Topology view||Object Context Menu takes long to open with many objects selected|
|BVQ-9959||GUI: Perf view||Allow filtering of events|
|BVQ-13997||GUI: Event view||Enhanced filtering|
|BVQ-13994||GUI: Event view||Live view|
|BVQ-13844||GUI: PBC||Keep Context Object Type in Path of Object Context Menu initiated PBCs|
|BVQ-13846||Common||Add additional descriptor to non-unique objects|
|BVQ-12837||Notification config||Handling multi receivers email|
|BVQ-13729||Reporting: Templates||Add Dates "Last changed" & "Last rendered" to table|
|BVQ-14192||Scanner||Prevent duplicate scanners|
|Major||BVQ-14302||GUI||Capacity / Topology Chart throws Exception|
|Major||BVQ-14205||GUI: Favorites||Some views are desigend for Total but show all objects|
|Minor||BVQ-14236||Grafana||Node lower cache fullness max and node upper cache fullnes data is not accessible in Grafana|