N061-M4 Tier 5 · Expert · medium ecommerce · Brightlane

Return each `product_id` and the average price of all `products` in that same category

Part of Query Structure Patterns for Performance in SQL

The problem

Scenario: Brightlane's merchandising team is benchmarking each product against its category's average price.

Task: Write a query to return each product_id and the average price of all products in that same category.

Assumptions:

  • The products table holds one row per product, with id identifying it, category_id recording its category, and price storing its current price.
  • The category_avg_price for a product is the average price across every product sharing the same category_id.
  • The result covers every product in the catalog.

Output:

  • One row per product.
  • Columns in this order: product_id, category_avg_price.
Schema · ecommerce 5 tables
categories
id integer
name text
parent_id? integer
products
id integer
name text
category_id integer
price numeric
stock_qty integer
attributes? jsonb
order_items
id integer
order_id integer
product_id integer
quantity integer
unit_price numeric
customers
id integer
name text
email text
city? text
country text
created_at timestamptz
is_active boolean
orders
id integer
customer_id integer
ordered_at timestamptz
status text
total_amount numeric

Run previews · Check grades

Write a query, then run it to see results here.

Worked solution Try it yourself first
Solution query
SELECT
  p.id AS product_id,
  cat_avg.avg_price AS category_avg_price
FROM
  products p
  JOIN (
    SELECT
      category_id,
      AVG(price) AS avg_price
    FROM
      products
    GROUP BY
      category_id
  ) AS cat_avg ON cat_avg.category_id = p.category_id

The shape

Every product's row needs a value computed across the whole category that product belongs to. A derived table computes the per-category average once per category, then the main query joins each product to its category's row to attach the benchmark.

Clause by clause

  • SELECT p.id AS product_id, cat_avg.avg_price AS category_avg_price returns the product's identifier and the per-category average pulled from the derived table.
  • FROM products p is the parent set — every catalog product appears once.
  • The derived table:
SELECT category_id, AVG(price) AS avg_price
FROM products
GROUP BY category_id

One row per category, with the category's average price already computed. - AS cat_avg ON cat_avg.category_id = p.category_id joins each product to its category's row. Every product has a category, and every category that contains products appears in the derived table, so the inner join loses nothing.

Why this and not a correlated subquery per row

The same numbers come out if the average is written as a correlated subquery in the SELECT list: SELECT p.id, (SELECT AVG(price) FROM products p2 WHERE p2.category_id = p.category_id) AS category_avg_price FROM products p. The result is identical. The cost is not: the correlated form re-runs the subquery once per outer row, computing the same category average over and over. The derived-table form computes each category's average exactly once and broadcasts it to every product in that category through the join. On a large catalog this is the structural difference between a quadratic and a linear pass over the data.

The trap

AVG(price) averages over the rows actually in the group. Products with a null price would be excluded from the average automatically — AVG ignores nulls. That is the standard behavior, and it is the right one here. The trap is assuming AVG divides by every row including nulls; it does not, so the average reflects only the priced rows.

You practiced precomputing per-category averages in a derived table before reattaching to each product — separating the per-category calculation from the per-product output.

How you actually get good at SQL

Reading explains SQL. Writing it, over and over with instant feedback, is what makes you fluent.

That's the whole SQLMaxx loop: 600+ real problems, instant AI feedback, mastery you can actually see, and spaced review that won't let you forget.

A stack of SQL practice problem cards, the top card showing an employees table.
615 problems · 66 concepts

Real problems. Not toy examples.

615 hand-built problems spanning all 66 concepts, from basic SELECTs to window functions, built on real schemas and real business questions, the kind you'll actually get asked on the job. Enough reps to make SQL automatic.

A retro computer showing a SQL query marked correct with a green checkmark.
Instant AI feedback

Write a query. Know if it's right in one second.

No copying an answer and hoping it clicked. The AI grader checks your real query against real data, catches exactly what's wrong, and explains the fix in plain English, like a senior analyst reading over your shoulder on every problem.

A circular mastery progress dial filling from blue to green, the SQLMaxx diamond at its center.
Mastery tracking

Stop guessing whether you actually know it.

SQLMaxx tracks every concept and shows you what you've mastered and what's still shaky. Your skills fill in one concept at a time, so 'I think I get joins' becomes something you can prove.

A SQL query editor circled by a blue return arrow with a clock, scheduled to come back for review.
Spaced review

Learn it once. Keep it for good.

Most of what you learn this week fades by next week. So when a concept comes due for review, SQLMaxx hands you a fresh problem to solve from a blank editor, not a flashcard to re-read. A research-backed spaced-repetition algorithm (FSRS) times each return for right before you'd forget, so your SQL is still there months later, when the interview or the job actually needs it.

Practice, feedback, mastery, review. That's the loop that turns reading into real skill.

Start free

No account, no credit card. Start solving in under a minute.