Categories Above the Average of Averages

The problem

Brightlane's product team wants to find categories whose average product price is above the cross-category average — that is, whose category-level average sits above the average of every category's average price.

Write a query to return the category ID and average price for every category whose average exceeds the cross-category average.

Assumptions:

A category's average price is the average of every price value linked to that category_id.
The cross-category average is the average of those category-level averages, computed across every category.
Only categories whose average price exceeds the cross-category average should appear.

Output:

One row per qualifying category, with columns category_id and avg_price.

Schema · ecommerce (5 tables)

categories
  id integer
  name text
  parent_id? integer

products
  id integer
  name text
  category_id integer
  price numeric
  stock_qty integer
  attributes? jsonb

order_items
  id integer
  order_id integer
  product_id integer
  quantity integer
  unit_price numeric

customers
  id integer
  name text
  email text
  city? text
  country text
  created_at timestamptz
  is_active boolean

orders
  id integer
  customer_id integer
  ordered_at timestamptz
  status text
  total_amount numeric

Worked solution

Solution query

WITH
  category_stats AS (
    SELECT
      category_id,
      AVG(price) AS avg_price
    FROM
      products
    GROUP BY
      category_id
  ),
  above_avg AS (
    SELECT
      category_id,
      avg_price
    FROM
      category_stats
    WHERE
      avg_price > (
        SELECT
          AVG(avg_price)
        FROM
          category_stats
      )
  )
SELECT
  category_id,
  avg_price
FROM
  above_avg

The shape

The second CTE reads category_stats twice in the same query: once row by row to source each category's average, and once through a scalar subquery (SELECT AVG(avg_price) FROM category_stats) that reduces the layer to a single number for comparison. Naming the per-category averages in a CTE is what makes the same set available on both sides of the threshold check.

Clause by clause

The first CTE produces one row per category with its average price:

WITH category_stats AS (
  SELECT category_id, AVG(price) AS avg_price
  FROM products
  GROUP BY category_id
)

GROUP BY category_id produces one row per category, and AVG(price) is computed inside each group.
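To see the grouping step in isolation, here is a minimal sketch, assuming PostgreSQL. The sample_products set and its seven rows are hypothetical stand-ins for the real products table, chosen only so the averages are easy to check by hand.

```sql
-- sample_products is a hypothetical inline stand-in for the products table.
WITH sample_products (category_id, price) AS (
  VALUES
    (1, 10.00),
    (1, 20.00),
    (2, 100.00),
    (3, 45.00),
    (3, 45.00),
    (3, 45.00),
    (3, 45.00)
)
SELECT
  category_id,
  AVG(price) AS avg_price  -- computed separately inside each category_id group
FROM
  sample_products
GROUP BY
  category_id
ORDER BY
  category_id;
-- Seven input rows collapse to three output rows:
-- category 1 averages 15, category 2 averages 100, category 3 averages 45.
```

Note that category 3 contributes four rows to the input but only one row to the output, which matters once those output rows are averaged again.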

The second CTE compares each row's avg_price to the cross-category average:

above_avg AS (
  SELECT category_id, avg_price
  FROM category_stats
  WHERE avg_price > (SELECT AVG(avg_price) FROM category_stats)
)

FROM category_stats reads the per-category rows. The scalar subquery (SELECT AVG(avg_price) FROM category_stats) reads the same CTE again and aggregates it down to a single number, then WHERE avg_price > ... compares each row's value to that single number. The subquery is uncorrelated (it references nothing from the outer row), so it needs to be evaluated only once for the whole query, and that one result is reused on every row.
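One way to watch the comparison happen is to move the scalar subquery into the SELECT list so the threshold shows up next to each row. This is only a sketch, assuming PostgreSQL. The sample_stats set and its three values are hypothetical, standing in for the output of category_stats.

```sql
-- sample_stats is a hypothetical stand-in for the category_stats CTE.
WITH sample_stats (category_id, avg_price) AS (
  VALUES
    (1, 15.00),
    (2, 100.00),
    (3, 45.00)
)
SELECT
  category_id,
  avg_price,
  -- The same single value appears on every row: (15 + 100 + 45) / 3, about 53.33.
  (SELECT AVG(avg_price) FROM sample_stats) AS cross_category_avg,
  -- The boolean the WHERE clause in above_avg would filter on.
  avg_price > (SELECT AVG(avg_price) FROM sample_stats) AS is_above
FROM
  sample_stats
ORDER BY
  category_id;
-- Only category 2 (average 100) has is_above = true.
```

Moving the second expression into a WHERE clause, as above_avg does, keeps just the rows where is_above would be true.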

SELECT category_id, avg_price FROM above_avg returns the three categories whose average price exceeds the cross-category average.

Why a CTE referenced twice and not the raw `products` table

The comparison value is the average of the per-category averages, not the average of every product price. Writing WHERE avg_price > (SELECT AVG(price) FROM products) would compute a different number, weighted by how many products each category has. Computing the per-category averages first and then averaging those averages is what the prompt actually asks for, and putting them in a named CTE means the two references to that intermediate set agree by definition. The same set is on both sides of the comparison.
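The gap between the two thresholds can be checked on a small example. The sketch below assumes PostgreSQL and uses a hypothetical sample_products set (seven made-up rows, not the real table) in which one category has four products and another has only one.

```sql
-- sample_products is a hypothetical inline stand-in for the products table.
WITH sample_products (category_id, price) AS (
  VALUES
    (1, 10.00),
    (1, 20.00),
    (2, 100.00),
    (3, 45.00),
    (3, 45.00),
    (3, 45.00),
    (3, 45.00)
),
category_stats AS (
  SELECT
    category_id,
    AVG(price) AS avg_price
  FROM
    sample_products
  GROUP BY
    category_id
)
SELECT
  -- Each category counts once: (15 + 100 + 45) / 3, about 53.33.
  (SELECT AVG(avg_price) FROM category_stats) AS avg_of_averages,
  -- Each product counts once: 310 / 7, about 44.29.
  (SELECT AVG(price) FROM sample_products) AS avg_of_all_prices;
```

With these rows, category 3 (average 45) would pass the product-weighted threshold of about 44.29 but fails the cross-category threshold of about 53.33, so the two versions of the WHERE clause return different result sets.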

You practiced layering two WITH stages where the second references the first twice — once row-by-row, and once through a scalar subquery that aggregates the layer to a single comparison value.
