Redaction processor
This processor deletes span, log, and metric datapoint attributes that don't match a list of allowed
attributes. It also masks attribute values that match a blocked value
list. Attributes that aren't on the allowed list are removed before any
value checks are done.
Use Cases
Typical use-cases:
- Prevent sensitive fields from accidentally leaking into traces
- Ensure compliance with legal, privacy, or security requirements
For example:
- EU General Data Protection Regulation (GDPR) prohibits the transfer of any
personal data like birthdates, addresses, or ip addresses across borders
without explicit consent from the data subject. Popular trace aggregation
services are located in US, not in EU. You can use the redaction processor
to scrub personal data from your data.
- PRC legislation prohibits the transfer of geographic coordinates outside of
the PRC. Popular trace aggregation services are located in US, not in the
PRC. You can use the redaction processor to scrub geographic coordinates
from your data.
- Payment Card Industry (PCI) Data Security Standards prohibit logging certain
things or storing them unencrypted. You can use the redaction processor to
scrub them from your traces.
The above is written by an engineer, not a lawyer. The redaction processor is
intended as one line of defence rather than the only compliance measure in
place.
Processor Configuration
Please refer to config.go for the config spec.
Examples:
processors:
redaction:
# allow_all_keys is a flag which when set to true, which can disables the
# allowed_keys list. The list of blocked_values is applied regardless. If
# you just want to block values, set this to true.
allow_all_keys: false
# allowed_keys is a list of span/log/datapoint attribute keys that are kept on the span/log/datapoint and
# processed. The list is designed to fail closed. If allowed_keys is empty,
# no attributes are allowed and all span attributes are removed. To
# allow all keys, set allow_all_keys to true.
allowed_keys:
- description
- group
- id
- name
# Ignore the following attributes, allow them to pass without redaction.
# Any keys in this list are allowed so they don't need to be in both lists.
ignored_keys:
- safe_attribute
# blocked_values is a list of regular expressions for blocking values of
# allowed span attributes. Values that match are masked
blocked_values:
- "4[0-9]{12}(?:[0-9]{3})?" ## Visa credit card number
- "(5[1-5][0-9]{14})" ## MasterCard number
# summary controls the verbosity level of the diagnostic attributes that
# the processor adds to the spans/logs/datapoints when it redacts or masks other
# attributes. In some contexts a list of redacted attributes leaks
# information, while it is valuable when integrating and testing a new
# configuration. Possible values:
# - `debug` includes both redacted key counts and names in the summary
# - `info` includes just the redacted key counts in the summary
# - `silent` omits the summary attributes
summary: debug
Refer to config.yaml for how to fit the configuration
into an OpenTelemetry Collector pipeline definition.
Ignored attributes are processed first so they're always allowed and never
blocked. This field should only be used where you know the data is always
safe to send to the telemetry system.
Only span/log/datapoint attributes included on the list of allowed keys list are retained.
If allowed_keys
is empty, then no attributes are allowed. All
attributes are removed in that case. To keep all span attributes, you should
explicitly set allow_all_keys
to true.
blocked_values
applies to the values of the allowed keys. If the value of an
allowed key matches the regular expression for a blocked value, the matching
part of the value is then masked with a fixed length of asterisks.
For example, if notes
is on the list of allowed keys, then the notes
attribute is retained. However, if there is a value such as a credit card
number in the notes
field that matched a regular expression on the list of
blocked values, then that value is masked.