N024-H1 Tier 2 · Core SQL · hard ecommerce · Brightlane

Return the product name, price, and that product's own-category average price for every product

Part of Scalar Subqueries in SQL

The problem

Brightlane's category-management team wants to see each product alongside its price and the average price across all products in the same category (not the catalogue-wide average).

Write a query to return the product name, price, and that product's own-category average price for every product.

Assumptions:

  • The products table contains every product in the catalogue.
  • The third column is the average for the category that this row's product belongs to — different rows from different categories will see different averages.
  • A product's own-category average includes that product itself in the calculation.

Output:

  • One row per product, with columns name, price, and category_avg.
Schema · ecommerce 5 tables
categories
id integer
name text
parent_id? integer
products
id integer
name text
category_id integer
price numeric
stock_qty integer
attributes? jsonb
order_items
id integer
order_id integer
product_id integer
quantity integer
unit_price numeric
customers
id integer
name text
email text
city? text
country text
created_at timestamptz
is_active boolean
orders
id integer
customer_id integer
ordered_at timestamptz
status text
total_amount numeric

Run previews · Check grades

Write a query, then run it to see results here.

Worked solution Try it yourself first
Solution query
SELECT
  p.name,
  p.price,
  (
    SELECT
      AVG(p2.price)
    FROM
      products p2
    WHERE
      p2.category_id = p.category_id
  ) AS category_avg
FROM
  products p

The shape

The inner query references the outer row's category_id. PostgreSQL re-runs the subquery once per outer row, substituting that row's category_id each time, so the third column is a different average for each category. Apex Titan 15 sees its electronics category average (782.33); Crest Pro 14" sees its computers category average (1459). One query, sixty-three different category-specific averages.

Clause by clause

  • SELECT p.name, p.price reads the current product's name and price. The alias p is what lets the subquery refer back to this row.
  • The scalar subquery in the SELECT list:
(SELECT AVG(p2.price)
 FROM products p2
 WHERE p2.category_id = p.category_id) AS category_avg

This is a correlated subquery — p.category_id is a value from the outer query, used inside the inner one. The inner FROM products p2 aliases the same table separately so the two references don't collide. For each outer row, PostgreSQL plugs in that row's category_id, computes the average price across all products sharing it, and returns the single resulting number. - FROM products p is the outer source — every product, processed one row at a time. For every product in this table, the subquery runs once with that product's category_id filled in.

The trap

Three rows in the result have category_avg as NULL: the two gift cards and the mystery bundle. These are products with no category_id on file — the column is NULL for those rows. The subquery's filter becomes WHERE p2.category_id = NULL, which never matches anything in SQL: NULL = NULL is not true, it's unknown. The inner query returns zero rows, and a scalar subquery that returns zero rows resolves to NULL.

No error fires. The query runs cleanly and silently delivers a NULL in the third column for any outer row whose correlated key was itself NULL. The rule: a correlated subquery on a nullable key produces NULL for any outer row missing that key, not a zero or an error. If those rows need a different default — 0, the catalogue-wide average, an explicit "uncategorised" bucket — the query has to handle the missing key directly.

You practiced a correlated scalar subquery — one whose result depends on the outer row. The recurring shape: when each row needs a per-group statistic of its own, the inner query references the outer row and PostgreSQL re-evaluates it once per outer row.

How you actually get good at SQL

Reading explains SQL. Writing it, over and over with instant feedback, is what makes you fluent.

That's the whole SQLMaxx loop: 600+ real problems, instant AI feedback, mastery you can actually see, and spaced review that won't let you forget.

A stack of SQL practice problem cards, the top card showing an employees table.
615 problems · 66 concepts

Real problems. Not toy examples.

615 hand-built problems spanning all 66 concepts, from basic SELECTs to window functions, built on real schemas and real business questions, the kind you'll actually get asked on the job. Enough reps to make SQL automatic.

A retro computer showing a SQL query marked correct with a green checkmark.
Instant AI feedback

Write a query. Know if it's right in one second.

No copying an answer and hoping it clicked. The AI grader checks your real query against real data, catches exactly what's wrong, and explains the fix in plain English, like a senior analyst reading over your shoulder on every problem.

A circular mastery progress dial filling from blue to green, the SQLMaxx diamond at its center.
Mastery tracking

Stop guessing whether you actually know it.

SQLMaxx tracks every concept and shows you what you've mastered and what's still shaky. Your skills fill in one concept at a time, so 'I think I get joins' becomes something you can prove.

A SQL query editor circled by a blue return arrow with a clock, scheduled to come back for review.
Spaced review

Learn it once. Keep it for good.

Most of what you learn this week fades by next week. So when a concept comes due for review, SQLMaxx hands you a fresh problem to solve from a blank editor, not a flashcard to re-read. A research-backed spaced-repetition algorithm (FSRS) times each return for right before you'd forget, so your SQL is still there months later, when the interview or the job actually needs it.

Practice, feedback, mastery, review. That's the loop that turns reading into real skill.

Start free

No account, no credit card. Start solving in under a minute.