List of built-in playbooks¶

Application Visibility and Troubleshooting¶

Restart loop reporter¶

Playbook Action

Description

When a pod is in restart loop, debug the issue, fetch the logs, and send useful information on the restart

Automating it

This action can be run automatically.

Add this to your Robusta configuration (values.yaml when installing with Helm):

actions:
- restart_loop_reporter: {}
triggers:
- on_pod_all_changes: {}

The above is an example. Try customizing the trigger and parameters.

Parameters

optional:

rate_limit (int) = 3600: Rate limit the execution of this action (Seconds).

restart_reason (str): Limit restart loops for this specific reason. If omitted, all restart reasons will be included.

Supported Triggers

on_pod_create

on_pod_all_changes

on_pod_delete

on_prometheus_alert

on_pod_update

This action can be manually triggered using the Robusta CLI:

robusta playbooks trigger restart_loop_reporter name=POD_NAME namespace=POD_NAMESPACE 

Pod ps¶

Playbook Action

Description

Fetch the list of running processes in a pod.

Automating it

This action can be run automatically.

Add this to your Robusta configuration (values.yaml when installing with Helm):

actions:
- pod_ps: {}
triggers:
- on_pod_all_changes: {}

The above is an example. Try customizing the trigger and parameters.

Parameters

No action parameters

Supported Triggers

on_pod_create

on_pod_all_changes

on_pod_delete

on_prometheus_alert

on_pod_update

This action can be manually triggered using the Robusta CLI:

robusta playbooks trigger pod_ps name=POD_NAME namespace=POD_NAMESPACE 

Kubernetes Events¶

Event report¶

Playbook Action

Description

Create finding based on the kubernetes event

Automating it

This action can be run automatically.

Add this to your Robusta configuration (values.yaml when installing with Helm):

actions:
- event_report: {}
triggers:
- on_kubernetes_warning_event: {}

The above is an example. Try customizing the trigger and parameters.

Parameters

optional:

rate_limit (int) = 3600: Rate limit the execution of this action (Seconds).

finding_key (str) = DEFAULT: Specify the finding identifier, to reference it in other actions.

Supported Triggers

on_kubernetes_warning_event
on_event_all_changes
on_event_create
on_event_delete
on_event_update

Event resource events¶

Playbook Action

Description

Enrich the finding with the kubernetes events of the involved resource specified in the event

Automating it

This action can be run automatically.

Add this to your Robusta configuration (values.yaml when installing with Helm):

actions:
- event_resource_events: {}
triggers:
- on_kubernetes_warning_event: {}

The above is an example. Try customizing the trigger and parameters.

Parameters

optional:

finding_key (str) = DEFAULT: Specify the finding identifier, to reference it in other actions.

Supported Triggers

on_kubernetes_warning_event
on_event_all_changes
on_event_create
on_event_delete
on_event_update

Integrations¶

Argo app sync¶

Playbook Action

Description

Sync a specified Argo CD application. Send a finding notifying the sync was performed

Automating it

This action can be run automatically.

Add this to your Robusta configuration (values.yaml when installing with Helm):

actions:
- argo_app_sync:
    argo_app_name: string
    argo_token: '********'
    argo_url: https://my-argo-cd.com
triggers:
- on_pod_create: {}

The above is an example. Try customizing the trigger and parameters.

Parameters

required:

argo_url (str): http(s) Argo CD server url.

argo_token (str): Argo CD authentication token.

argo_app_name (str): Argo CD application that needs syncing.

optional:

argo_verify_server_cert (bool) = True: verify Argo CD server certificate. Defaults to True.

rate_limit_seconds (int) = 1800: this playbook is rate limited. Defaults to 1800 seconds.

Supported Triggers

Any trigger

This action can be manually triggered using the Robusta CLI:

robusta playbooks trigger argo_app_sync  argo_url=ARGO_URL argo_token=ARGO_TOKEN argo_app_name=ARGO_APP_NAME

Kubernetes Optimization¶

Config ab testing¶

Playbook Action

Description

Apply YAML configurations to Kubernetes resources for limited periods of time.

Adds adds grafana annotations showing when each configuration was applied.

The execution schedule is defined by the playbook trigger. (every X seconds)

Commonly used for:

Troubleshooting - Finding the first version a production bug appeared by iterating over image tags Cost/performance optimization - Comparing the cost or performance of different deployment configurations

Note:

Only changing attributes that already exists in the active configuration is supported.

For example, you can change resources.requests.cpu, if that attribute already exists in the deployment.

Automating it

This action can be run automatically.

Add this to your Robusta configuration (values.yaml when installing with Helm):

actions:
- config_ab_testing:
    configuration_sets:
    - config_items: '"spec.template.spec.containers[0].resources.requests.cpu": 250m,

        "spec.template.spec.containers[0].resources.requests.memory": 128Mi'
      config_set_name: string
    - config_items: '"spec.template.spec.containers[0].resources.requests.cpu": 250m,

        "spec.template.spec.containers[0].resources.requests.memory": 128Mi'
      config_set_name: string
    grafana_api_key: '********'
    grafana_dashboard_uid: 09ec8aa1e996d6ffcd6817bbaff4db1b
    grafana_url: http://grafana.namespace.svc
    kind: string
    name: string
triggers:
- on_schedule: {}

The above is an example. Try customizing the trigger and parameters.

Parameters

required:

grafana_api_key (str): grafana key with write permissions.

grafana_dashboard_uid (str): dashboard ID as it appears in the dashboard's url

kind (str): The kind of the tested resource. Kind can be 'Deployment'/'StatefulSet' etc

name (str): The name of the tested resource.

configuration_sets (complex list)

List of test configurations.

each entry contains:

required:

config_set_name (str): The name of this configuration set. .

optional:

config_items (str dict): The yaml attributes values for this configuration set.

optional:

grafana_url (str): http(s) url of grafana or None for autodetection of an in-cluster grafana

api_version (str) = v1: The api version of the tested resource.

namespace (str) = default: The namespace of the tested resource.

Supported Triggers

on_schedule

Disk benchmark¶

Playbook Action

Description

Run disk benchmark in your cluster. The benchmark creates a PVC, using the configured storage class, and runs the benchmark using fio. For more details: https://fio.readthedocs.io/en/latest/

Automating it

This action can be run automatically.

Add this to your Robusta configuration (values.yaml when installing with Helm):

actions:
- disk_benchmark:
    storage_class_name: string
triggers:
- on_pod_create: {}

The above is an example. Try customizing the trigger and parameters.

Parameters

required:

storage_class_name (str): Pvc storage class, From the available cluster storage classes. standard/fast/etc.

optional:

pvc_name (str) = robusta-disk-benchmark: Name of the pvc created for the benchmark.

test_seconds (int) = 20: The benchmark duration.

namespace (str) = robusta: Namespace used for the benchmark.

disk_size (str) = 10Gi: The size of pvc used for the benchmark.

Supported Triggers

Any trigger

This action can be manually triggered using the Robusta CLI:

robusta playbooks trigger disk_benchmark  storage_class_name=STORAGE_CLASS_NAME

List of built-in playbooks¶

Application Visibility and Troubleshooting¶

Restart loop reporter¶

Pod ps¶

Kubernetes Error Handling¶

Node health watcher¶

Alert on hpa reached limit¶

Scale hpa callback¶

Kubernetes Events¶

Event report¶

Event resource events¶

Kubernetes Monitoring¶

Git change audit¶

Deployment status report¶

Resource babysitter¶

Incluster ping¶

Integrations¶

Argo app sync¶

Kubernetes Optimization¶

Config ab testing¶

Disk benchmark¶

Stress Testing and Chaos Engineering¶

Generate high cpu¶

Http stress test¶

Prometheus alert¶