Cluster-Wide Catalog Table Query Aggregation

We learned in a previous YugabyteDB Tip that we can Consolidate Data From YB Catalog Tables From All Nodes To One Node.

That solution required a lot of tedious setup using foreign data wrappers. In today’s tip, we explore how a shell script can be used to achieve the same result.

Here is the script:

				
					#!/bin/bash

# Script:      yb_cluster_query.sh
# Description: This script will execute provided query on all nodes in a cluster and
#              dump the output to a file.
# Created:     04/11/2025
# Last update: 04/11/2025

# Call Spec:
#   ./yb_cluster_query.sh <<TServer IP>> <<Database Name>> <<Database Username>> <<SQL File>> <<Output File>> <<Delimeter>>

# Example Use:
#   ./yb_cluster_query.sh 127.0.0.1 yugabyte yugabyte pg_stat_statements.sql cluster_output.csv "|"

# Important Notes:
#  1. This script is for YSQL only
#  2. ysqlsh must be in the PATH
#  3. The SQL File must contain only one query
#  4. The delimiter is optional, and will default to "|"

# Expected input parameters
MASTER_HOST=$1
DATABASE=$2
USER=$3
SQL_FILE=$4
OUTPUT_FILE=$5
DELIM=${6:-"|"}

# Setup
PORT=5433

if [[ $# -lt 4 ]]; then
  echo "Usage: $0 <master_host> <port> <sql_file> <output_file> [delimiter]"
  exit 1
fi

# Get the query from the SQL file, strip the trailing semicolon if present
QUERY=$(<"$SQL_FILE")
QUERY=${QUERY%;}

# Get host list using ysqlsh in tuples-only and aligned-off mode
HOSTS=$(ysqlsh -h "$MASTER_HOST" -p "$PORT" -U "$USER" -d "$DATABASE" -Atc "SELECT host FROM yb_servers();")

# Create a temp dir for output
TMP_DIR=$(mktemp -d)

# Get header using LIMIT 0
HEADER_LINE=$(ysqlsh -h "$MASTER_HOST" -p "$PORT" -U "$USER" -d "$DATABASE" -F "$DELIM" -Ac "SELECT * FROM (${QUERY}) AS subq LIMIT 0;" 2>/dev/null | head -n 1)
echo "host${DELIM}${HEADER_LINE}" > "$TMP_DIR/header"

# Function to run query on a single host
run_query() {
  local HOST=$1
  local OUT_FILE="$TMP_DIR/${HOST}.out"
  ysqlsh -h "$HOST" -p "$PORT" -U "$USER" -d "$DATABASE" -AtF "$DELIM" -c "$QUERY" 2>/dev/null | sed "s/^/${HOST}${DELIM}/" > "$OUT_FILE"
  if [[ ! -s "$OUT_FILE" ]]; then
    echo "Warning: No output for $HOST" >&2
  fi
}

export -f run_query
export QUERY PORT DELIM TMP_DIR

# Run all queries in parallel
echo "$HOSTS" | xargs -n1 -P 10 -I {} bash -c 'run_query "$@"' _ {}

# Write header
cat "$TMP_DIR/header" > "$OUTPUT_FILE"

# Sort data rows and append
find "$TMP_DIR" -name '*.out' -exec cat {} + | sort >> "$OUTPUT_FILE"

# Cleanup
rm -rf "$TMP_DIR"

echo -e "\n✅ Results written to $OUTPUT_FILE"

Below is a brief demonstration of the script in use…

On a three node cluster I’ll create a user named lucy and a database named lucy_db, assigning ownership of the database to the new user:

				
					yugabyte=# SELECT host, cloud, region, zone FROM yb_servers() ORDER BY host;
   host    | cloud |  region   |      zone
-----------+-------+-----------+----------------
 127.0.0.1 | aws   | us-east-1 | us-east-1-zone
 127.0.0.2 | aws   | us-east-2 | us-east-2-zone
 127.0.0.3 | aws   | us-west-2 | us-west-2-zone
(3 rows)

yugabyte=# CREATE USER lucy;
CREATE ROLE

yugabyte=# CREATE DATABASE lucy_db WITH OWNER lucy;
CREATE DATABASE

Next, we’ll have the new user do some work on nodes 127.0.0.1 and 127.0.0.2:

				
					[root@cloud-server-0 ~]# ysqlsh -h 127.0.0.1 -U lucy -d lucy_db -c "CREATE TABLE tab (id INT PRIMARY KEY);"
CREATE TABLE

[root@cloud-server-0 ~]# ysqlsh -h 127.0.0.1 -U lucy -d lucy_db -c "INSERT INTO tab SELECT generate_series(1, 100000);"
INSERT 0 100000

[root@cloud-server-0 ~]# ysqlsh -h 127.0.0.1 -U lucy -d lucy_db -c "SELECT COUNT(*) FROM tab;"
 count
--------
 100000
(1 row)

[root@cloud-server-0 ~]# ysqlsh -h 127.0.0.2 -U lucy -d lucy_db -c "SELECT COUNT(*) FROM tab WHERE id BETWEEN 10000 AND 100000;"
 count
-------
 90001
(1 row)

I want to check the pg_stat_statements statistics for this user across the cluster. While this can be done through YugabyteDB Anywhere, it’s also easy to accomplish using the new script!

First, I’ll create a file containing the query I want to run on each node. In this example, the query joins three catalog tables: pg_stat_statements, pg_user, and pg_database.

				
					[root@cloud-server-0 ~]# cat pg_stat_statements.sql
SELECT datname, LEFT(query, 25) query, calls, rows, mean_exec_time FROM pg_stat_statements ss JOIN pg_user u ON u.usesysid = userid JOIN pg_database d ON d.oid = dbid WHERE usename = 'lucy'

Now I’m ready to run the script, passing in all the required parameters, including my SQL file.

				
					[root@cloud-server-0 ~]# ./yb_cluster_query.sh 127.0.0.1 lucy_db yugabyte pg_stat_statements.sql cluster_output.csv "|"
Warning: No output for 127.0.0.3

✅ Results written to cluster_output.csv

[root@cloud-server-0 ~]# cat cluster_output.csv
host|datname|query|calls|rows|mean_exec_time
127.0.0.1|lucy_db|CREATE TABLE tab (id INT |1|0|178.813274
127.0.0.1|lucy_db|INSERT INTO tab SELECT ge|1|100000|838.076101
127.0.0.1|lucy_db|SELECT COUNT(*) FROM tab|1|1|14.20426
127.0.0.2|lucy_db|SELECT COUNT(*) FROM tab |1|1|39.662449

Note: The initial output from the script includes a warning indicating that no results were returned from node 127.0.0.3. In this case, it simply means that the user Lucy hasn’t executed any SQL statements on that node.

For easy analysis, I can load the data from the file into a YugbayteDB temp table:

				
					yugabyte=# CREATE TEMP TABLE pg_stat_statements_lucy(host TEXT, datname TEXT, query TEXT, calls
CREATE TABLE

yugabyte=# \COPY pg_stat_statements_lucy FROM 'cluster_output.csv' WITH (FORMAT 'csv', HEADER TRUE, DELIMITER '|');
WARNING:  batched COPY is not supported on temporary tables
DETAIL:  Defaulting to using one transaction for the entire copy.
HINT:  Either copy onto non-temporary table or set rows_per_transaction option to `0` to disable batching and remove this warning.
COPY 4

yugabyte=# SELECT * FROM pg_stat_statements_lucy;
   host    | datname |           query           | calls |  rows  | mean_exec_time
-----------+---------+---------------------------+-------+--------+----------------
 127.0.0.1 | lucy_db | CREATE TABLE tab (id INT  |     1 |      0 |     178.813274
 127.0.0.1 | lucy_db | INSERT INTO tab SELECT ge |     1 | 100000 |     838.076101
 127.0.0.1 | lucy_db | SELECT COUNT(*) FROM tab  |     1 |      1 |       14.20426
 127.0.0.2 | lucy_db | SELECT COUNT(*) FROM tab  |     1 |      1 |      39.662449
(4 rows)

yugabyte=# SELECT * FROM pg_stat_statements_lucy WHERE mean_exec_time > 500;
   host    | datname |           query           | calls |  rows  | mean_exec_time
-----------+---------+---------------------------+-------+--------+----------------
 127.0.0.1 | lucy_db | INSERT INTO tab SELECT ge |     1 | 100000 |     838.076101
(1 row)

Have Fun!