Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.
Section
Column
width50%
Panel
borderColor#CCCCCC
bgColor#FFFFFF
titleBGColor#F0F0F0
borderStylesolid
titleOn the page:

Table of Contents

 

Column

 

General information

The

To view the system status data, you need to:

...

web interface of the self-diagnostics service is available for monitoring the system statuses and analyzing its performance.

Access to the self-diagnostics service

To go to the monitoring interface, do the following:

  1. Open the web browser.
  2. In the address line, enter: http://127.0.0.1:20040/.
  3. Image Removed
  4. Click the Enter button.

Interface and queries execution

The service interface allows viewing metrics as a table or graphs. To run a query, do the following:

  1. Select the

  2. required
  3. metric

  4. in
  5. from the drop-down list

  6. (
  7. 1

  8. )
  9. or enter the query manually in the Expression field.

  10. Expandtitle Description of useful metrics
  11. You can:

    1. Use several metrics at a time. The system has the following available metrics:

      Metric

      Description

      ALERTS_FOR_STATE

  12. Troubleshooting by the self-diagnostics service
    1. Found and fixed malfunctions. Contains the alertname parameter with the problem type.

      Code Block
      languagego
      titleExample
      ALERTS_FOR_STATE{alertname="ipint_is_not_activated",ep_name="hosts/Server1/DeviceIpint.99",instance="127.0.0.1:20108",job="ngp_exporter",ngp_alert="true"}
  13. Possible values
    1. Decryption of the alertname

  14. parameter
    1. values (see

  15. What is
    1. General information about the self-diagnostics service) for the ALERTS_FOR_STATE metric:

      • low_os_memory
  16. out
      • —out of RAM
  17. ;
      • .
      • ipint_is_not_activated
  18. camera
      • —camera is connected
  19. but does not send data;
      • , but there is no data from it.
      • no_samples_in_detector
  20. no
      • —no events from
  21. a detection tool;
      • the detector.
      • restart_services_when_archive_source_not_activated
  22.  the archive is not working;
      • —the archive recording isn't working.
      • restart_services_when_no_samples_in_archive
  23. recording
      • —recording to archive with 0
  24. FPS;
      • fps.
      • restart_services_when_no_ping_from_detector_to_archive
  25.  no
      • —no recording to the archive
  26. on detection tool triggering;
      • at the event from the detector.
      • logs_disk_space_is_low / db_disk_space_is_low
  27. out
      • —out of system disk space.

      ngp_archive_channel_fps

      The frame rate of all

  28. video
    1. cameras when recording to the archive

      ngp_archive_volume_size

      The current total size of the archive (in bytes)

      ngp_cpu_total_usage

      The

  29. percentage of
    1. CPU load

  30. on a Server
    1. of the server

      ngp_fps

      The frame rate of all

  31. Server video
    1. server cameras,

  32. all detection tools
    1. detectors and

  33. their
    1. decoders

  34. The request allows for:

  35. Using multiple metrics.

  36. Using expressions to find problems. For example, a query like ngp_fps <17 will return all metrics, where FPS was less than 17. For a complete list of logical and arithmetic operators, see the official Prometheus documentation.  

  37. Filtering by any of the parameters. For example, a query like
    1. ngp_people_count

      The last captured number of people in the frame by the Crowd estimation VA detector

      ngp_errors

      Number of errors in the detectors' operation:

      ngp_skipped_pp

      Number of missed frames by the Crowd estimation VA detector due to the lack of resources for processing

    2. Apply logic and arithmetic operators for anomaly searching. The full list of logic and arithmetic operators is specified in the official Prometheus documentation.
      Code Block
      languagego
      titleExample. All metrics where fps is less than 17
      ngp_fps < 17
    3. Faltering by metrics parameters using curly brackets.
      Code Block
      languagego
      titleExample. Fps values only for the specified source
    1. ngp_fps{ep_name=~"hosts/
  38. V-BELYAKOV
    1. TEST/DeviceIpint.2/SourceEndpoint.video:0:0"}
  39. will return FPS values only for the specified source.
    1. Image Added
  40. If necessary, set the time range for the data.
  41. Click the Execute button.

Viewing results:

  • The Console tab displays the current metrics values in the table format.
    Image Added
    When you specify the date and time in the calendar, the data is updated.
    Image Added
  • On the Graph tab you can create the graph of selected metrics at the specified period.
    • The 1 field—sets the graph time interval.
    • The 2 field—specifies the end graph point.
    • The 3 field—sets the interval between data points.
    • The 4 checkbox—enables the display mode with accumulation (filling the areas under the graph).
      Image Added

Examples of useful queries for Windows

...

OS

  1. The CPU loading graph
  2. similar to
  3. (analog of the
  4. Windows
  5. System monitor):
    Code Block
    languagego
    sum by (process_id) (100 / scalar(wmi_cs_logical_processors) * (irate(wmi_process_cpu_time_total{
  6. job="os_export",
  7. process="AppHost"}[10m]))) or ngp_cpu_total_usage
  8. The graph of
  9. RAM usage by
  10. the
  11. the AppHost processes and a total memory space:
    Code Block
    languagego
    sum by (process_id) (avg_over_time(wmi_process_working_set{
  12. job=~"os_export",
  13. process="AppHost"}[5m])) / 1024 or avg_over_time(wmi_os_virtual_memory_bytes
  14. {job=~"os_export"}
  15. [5m]) / 1024
  16. The percentage of RAM usage:
    Code Block
    100.0 - 100 * avg_over_time(wmi_os_virtual_memory_free_bytes
  17. {job=~"os_export"}
  18. [5m]) / avg_over_time(wmi_os_virtual_memory_bytes
  19. {job=~"os_export"}
  20. [5m])

Examples of useful queries for Linux

...

OS

  1. The
  2. graph of
  3. total RAM usage by
  4. the
  5. the AppHost processes
  6. and a total memory space in bytes
  7. :
    Code Block
    languagego
    sum by (groupname) (namedprocess_namegroup_memory_bytes{memtype="resident"})
  8. The percentage of RAM usage:
    Code Block
    languagego
    100 - node_memory_MemAvailable_bytes * 100 / node_memory_MemTotal_bytes
  9. The
  10. graph of the
  11. CPU load by
  12. the
  13. the AppHost processes as a percentage:
    Code Block
    sum by (
  14. groupname
  15. object_id) (rate(namedprocess_namegroup_cpu_seconds_total{groupname="AppHost"}[1m])) * 100
  16. The
  17. graph of the
  18. total CPU load
  19.  as
  20. as a percentage:
    Code Block
    languagego
    100 * avg without (cpu) (1 - rate(node_cpu_seconds_total{mode="idle"}[1m]))
  21. Click the Execute button (2).
    The Console tab will display all possible values of all elements at the time the query is completed.
    Image Removed

  22. When you set a date and time in your calendar, the data will be updated.
    Image Removed

  23. RAM usage by the AppHost processes to detect the memory leak:
    Code Block
    languagego
    namedprocess_namegroup_memory_bytes{object_id=~"APP_HOST.*",memtype="proportionalResident"}
  24. To build a graph, go to the Graph tab.
    Image Removed
    In the field 1, set the time interval of the graph. In the field 2, set its end point. In the filed 3, set the interval between the data points. To fill the chart, set the stacked (4) checkbox.