N030-M1 Tier 3 · Intermediate · medium ecommerce · Brightlane

Return the category ID and average price for every category meeting that threshold

Part of Common Table Expressions (CTEs) in SQL

The problem

Brightlane's product team wants to identify high-price-point categories — those whose average product price exceeds $200.

Write a query to return the category ID and average price for every category meeting that threshold.

Assumptions:

  • The products table has one row per product with a category_id and a price.
  • A category's average price is the average of every product's price linked to that category_id.
  • Only categories whose average price exceeds $200 should appear.

Output:

  • One row per qualifying category, with columns category_id and avg_price.
Schema · ecommerce 5 tables
categories
id integer
name text
parent_id? integer
products
id integer
name text
category_id integer
price numeric
stock_qty integer
attributes? jsonb
order_items
id integer
order_id integer
product_id integer
quantity integer
unit_price numeric
customers
id integer
name text
email text
city? text
country text
created_at timestamptz
is_active boolean
orders
id integer
customer_id integer
ordered_at timestamptz
status text
total_amount numeric

Run previews · Check grades

Write a query, then run it to see results here.

Worked solution Try it yourself first
Solution query
WITH
  category_prices AS (
    SELECT
      category_id,
      AVG(price) AS avg_price
    FROM
      products
    GROUP BY
      category_id
  )
SELECT
  category_id,
  avg_price
FROM
  category_prices
WHERE
  avg_price > 200

The shape

The aggregation and the threshold check live in two named layers. The WITH layer category_prices computes AVG(price) per category; the main query reads that result and applies the WHERE avg_price > 200 filter against the aggregate as a named column.

Clause by clause

  • The WITH clause defines category_prices:
WITH category_prices AS (
  SELECT category_id, AVG(price) AS avg_price
  FROM products
  GROUP BY category_id
)

GROUP BY category_id partitions products by category, and AVG(price) collapses each partition to a single average price. Every category gets one row in the named layer, including the ones whose averages are below the threshold.

  • SELECT category_id, avg_price FROM category_prices WHERE avg_price > 200 is the main query. It reads category_prices and keeps only the rows where the aggregate exceeds 200. Categories 5, 6, and 7 survive with averages of 782.33, 1459, and 882.33.

Why this and not a derived table in FROM

A derived table would put the aggregation inside the main query's FROM and apply the threshold in the same WHERE. Both shapes return identical results. The WITH version pulls the aggregation out, names it, and lets the main query read top to bottom in the order the work happens: compute the per-category average first, then keep the ones above the cutoff. The derived-table version nests the same logic inside a single statement and forces the reader to look inward to find the aggregation. For "find groups whose aggregate exceeds X," WITH is the cleaner spelling.

The trap

The filter has to run on the aggregate, not on individual product prices. Putting WHERE price > 200 inside the layer would keep only products priced above 200 and then average those, which is a different calculation entirely. The threshold belongs in the main query, against the named aggregate column avg_price, which only exists once the layer's grouping is complete.

You practiced computing a per-category average in a WITH layer and applying a threshold check in the main query — the canonical shape for 'find groups whose aggregate exceeds X'.

How you actually get good at SQL

Reading explains SQL. Writing it, over and over with instant feedback, is what makes you fluent.

That's the whole SQLMaxx loop: 600+ real problems, instant AI feedback, mastery you can actually see, and spaced review that won't let you forget.

A stack of SQL practice problem cards, the top card showing an employees table.
615 problems · 66 concepts

Real problems. Not toy examples.

615 hand-built problems spanning all 66 concepts, from basic SELECTs to window functions, built on real schemas and real business questions, the kind you'll actually get asked on the job. Enough reps to make SQL automatic.

A retro computer showing a SQL query marked correct with a green checkmark.
Instant AI feedback

Write a query. Know if it's right in one second.

No copying an answer and hoping it clicked. The AI grader checks your real query against real data, catches exactly what's wrong, and explains the fix in plain English, like a senior analyst reading over your shoulder on every problem.

A circular mastery progress dial filling from blue to green, the SQLMaxx diamond at its center.
Mastery tracking

Stop guessing whether you actually know it.

SQLMaxx tracks every concept and shows you what you've mastered and what's still shaky. Your skills fill in one concept at a time, so 'I think I get joins' becomes something you can prove.

A SQL query editor circled by a blue return arrow with a clock, scheduled to come back for review.
Spaced review

Learn it once. Keep it for good.

Most of what you learn this week fades by next week. So when a concept comes due for review, SQLMaxx hands you a fresh problem to solve from a blank editor, not a flashcard to re-read. A research-backed spaced-repetition algorithm (FSRS) times each return for right before you'd forget, so your SQL is still there months later, when the interview or the job actually needs it.

Practice, feedback, mastery, review. That's the loop that turns reading into real skill.

Start free

No account, no credit card. Start solving in under a minute.