Try to match all ANSI escape sequences, not just CSI (i.e. `\e[`) - also the ones that are followed by additional bytes (e.g. CSI, OSC).
24 lines
730 B
Bash
Executable File
24 lines
730 B
Bash
Executable File
#!/bin/sh
|
|
# SPDX-License-Identifier: MIT
|
|
# Copyright (c) 2025 Julian Prein
|
|
#
|
|
# Remove ANSI escape sequences.
|
|
#
|
|
# Additionally to the general form `\e[ -/]*[0-~]` this also tries to cover
|
|
# sequences that are followed by additional bytes (e.g. CSI, OSC). This script
|
|
# covers no C1 sequences.
|
|
#
|
|
# See:
|
|
# - https://en.wikipedia.org/wiki/ANSI_escape_code
|
|
# - https://www.ecma-international.org/wp-content/uploads/ECMA-35_6th_edition_december_1994.pdf
|
|
#
|
|
# TODO: which bytes do DCS, OSC, SOS, PM & APC (second pattern) actually allow?
|
|
# Find some documentation, since `[ -~]` is guessed.
|
|
|
|
env LC_ALL=C sed -E "$(printf "%b" \
|
|
's/' \
|
|
'\033\\[[0-?]*[ -/]*[@-~]' '|' \
|
|
'\033[]PX^_][ -~]*\033\\\\' '|' \
|
|
'\033[ -/]*[0-~]' \
|
|
'//g')"
|