Collector for Background Jobs Monitoring

The technical name of the Collector for Background Job Monitoring is /DVD/MON_CL_COL_BG_JOBS.

This collector monitors the performance and health of background jobs. It collects KPIs and details about problematic jobs:

  • Failed background jobs

  • Jobs running too long

  • Not started jobs: Jobs that should be already started but are still in a delayed state

Default KPIs Delivered with This Collector

The following default KPIs are delivered with this collector:

KPI name Description Unit Detail table
BTC_JOB_RUN_DUR_LONGEST_JOB BTC: Duration of longest running job Min No
BTC_JOB_RUN_RUNTIME_TOTAL BTC: Total runtime of currently running jobs Min No
BTC_MON_JOB_NUM_FAILED BTC: Number of failed monitoring collector jobs Count No
BTC_NUM_DELAYED BTC: Number of delayed background jobs (>= 60 seconds) Count Yes
BTC_NUM_FAILED BTC: Number of failed background jobs Count Yes
BTC_NUM_LONGRUN BTC: Number of BG jobs running too long (>= 10 minutes) Count Yes
BTC_SLR_DELAY_RUNTIME_PERC BTC: Delay/runtime for all jobs since last run % No
BTC_SLR_DELAY_S_PER_MIN BTC: Total delay of all jobs per min. since last run Sec No
BTC_SLR_DELAY_S_TOTAL BTC: Total delay of all jobs since last run Sec No
BTC_SLR_JOBS_NUM_COMPLETED BTC: Total number of completed jobs since last run Count No
BTC_SLR_JOBS_NUM_NOT_STARTED BTC: Total number of not started jobs since last run Count No
BTC_SLR_JOBS_NUM_STARTED BTC: Total number of started jobs since last run Count No
BTC_SLR_JOBS_NUM_TOTAL BTC: Total number of released and started jobs since last run Count No
BTC_SLR_JOBS_NUM_TOT_PER_MIN BTC: Number of released and started jobs per min. since last run Count No
BTC_SLR_RUNTIME_S_PER_MIN BTC: Total runtime of all jobs per min. since last run Sec No
BTC_SLR_RUNTIME_S_TOTAL BTC: Total runtime of all jobs since last run Sec No

Input Table

This collector uses the input table /DVD/MON_BGJOBI1.

The rules in this input table define the monitoring of custom and default KPIs. You can specify a job name pattern to filter specific jobs for KPI calculations. In addition, you have the following options:

  • Define the time threshold for delayed or long-running job

  • Define various check types for jobs based on the field CHECK_TYPE (see the table below)

  • Disable or enable the collection of job details into the corresponding detail table

The input table contains the following fields:

Technical name Column name Description
KPI KPI name Name of the KPI to which the rule applies
KPI_RULE_NUMBER KPI rule number KPI rule Number
JOBNAME_PATTERN BG job name pattern Pattern for job names to which the rule applies
CHECK_TYPE Background job check type

Check type for background jobs. Possible values:

  • FINISHED

  • FAILED

  • RUNNING

  • DELAYED

  • LONGRUN

TIMELIMIT Time limit for delayed and long-running BG jobs [s] Time threshold in seconds for background jobs to be considered as delayed and long-running.
SID System ID System ID to which the rule applies
ADD_TO_DETAILS Add relevant records to details If set to X, details for given KPI-related background jobs are reported in the detail table. Otherwise, they are not reported.
ACTIVE KPI rule active If set to X, the KPI rule is active.

Parameters

The following parameters are used for this collector:

Parameter name Default value Description
ONLY_NOT_STARTED_AS_DELAYED X

This parameter defines which jobs are processed as delayed jobs for default KPI BTC_NUM_DELAYED.

If this parameter is enabled by setting it to value X, only jobs that have not started yet are included into this KPI. If it’s disabled by setting into space value, also running or finished jobs within the last snapshot with delayed time greater than predefined threshold can be included into this KPI.

Detail Table

The collector reports the details for specified background jobs in the detail table Monitored BTC jobs. The technical name of the detail table is /DVD/MON_S_BTC_JOB_DETAILS. It contains the following fields:

Technical name Description
TIMESTAMP Time when the records are saved to the detail table
SID System ID
JOBNAME Background job name
JOBCOUNT Background job ID number
CLIENT_CODE Client
SERVER Server name
JOBSTATUS Background job status
DELAYED BG job delay time [s]
LONGRUN BG job long runtime [s]
JOB_START BG job start time
JOB_END BG job end time
CREATED_BY BG job creator username
ERROR_MSG Last error message for job log
KPI KPI name
TIMELIMIT Time limit for delayed and long-running BG jobs in seconds