N041-H1 Tier 3 · Intermediate · hard ecommerce · Brightlane

Return each qualifying category's ID, product count, combined price, and average product price

Part of Temp Tables and CREATE TABLE AS SELECT in SQL

The problem

Brightlane's pricing pipeline materializes category-level metrics into a temp table for categories with 3 or more products on record. For each qualifying category the temp table carries the product count, the combined price across the category, and the average product price.

Write a query to return each qualifying category's ID, product count, combined price, and average product price.

Assumptions:

  • The products table has one row per product with a category_id and a price.
  • A category's product count is the number of products in that category_id. A category's combined price is the sum of price across those products. A category's average product price is the combined price divided by the product count.
  • Only categories whose product count is greater than 2 should appear.

Output:

  • One row per qualifying category, with columns category_id, product_count, total_price, and avg_price.
Schema · ecommerce 5 tables
categories
id integer
name text
parent_id? integer
products
id integer
name text
category_id integer
price numeric
stock_qty integer
attributes? jsonb
order_items
id integer
order_id integer
product_id integer
quantity integer
unit_price numeric
customers
id integer
name text
email text
city? text
country text
created_at timestamptz
is_active boolean
orders
id integer
customer_id integer
ordered_at timestamptz
status text
total_amount numeric

Run previews · Check grades

Write a query, then run it to see results here.

Worked solution Try it yourself first
Solution query
WITH
  cat_totals AS (
    SELECT
      category_id,
      COUNT(*) AS product_count,
      SUM(price) AS total_price
    FROM
      products
    GROUP BY
      category_id
  )
SELECT
  category_id,
  product_count,
  total_price,
  total_price / product_count AS avg_price
FROM
  cat_totals
WHERE
  product_count > 2

The shape

The CTE computes the two raw aggregates that depend on the underlying rows — the product count and the combined price per category. The outer SELECT then derives the average by dividing those two numbers, and applies the count threshold. Doing the division in the outer layer rather than inside the CTE means the materialized table can carry the components, not only the derived metric, which is what downstream reports comparing categories on either total spend or average price will want.

Clause by clause

  • WITH cat_totals AS (SELECT category_id, COUNT(*) AS product_count, SUM(price) AS total_price FROM products GROUP BY category_id) groups products by category_id and computes the per-category count and total price. The result is one row per category with both raw aggregates as named columns.
  • SELECT category_id, product_count, total_price, total_price / product_count AS avg_price FROM cat_totals reads the CTE and adds a fourth column. The average price is computed by dividing the two aggregates that already exist on each row. So category 8's total_price of 364.95 divided by product_count of 5 gives the avg_price of 72.99.
  • WHERE product_count > 2 filters the result to qualifying categories. The threshold compares against product_count, which is a real column on the CTE result, so the comparison runs at the row level after aggregation.

Why derive the average from the raw components and not call AVG(price)

AVG(price) and SUM(price) / COUNT(*) agree on this data because every product has a recorded price, but they are not the same expression. AVG skips rows where price is NULL; the explicit SUM(price) / COUNT(*) divides the sum-over-non-null-prices by the full row count. When the materialized table needs to expose both the components and the derived metric so reports can recompute either separately, computing them as two distinct aggregates is the right shape.

The trap

The category 8 example divides 364.95 by 5 and lands on 72.99 cleanly because both operands are decimals. Watch what happens when both operands are integers: in PostgreSQL, integer division truncates, so COUNT(*) divided into another COUNT(*) would silently drop the fractional part. Here SUM(price) is numeric because price is numeric, so the division promotes to numeric automatically and no truncation happens. If you ever derive a ratio from two COUNT(*) aggregates, one side has to carry a decimal point or the result loses precision before it ever reaches the outer SELECT.

You practiced computing two raw aggregates first (count and sum) and deriving the average from them by division in the outer SELECT — the right shape when the materialized table needs the underlying components rather than just the derived metric.

How you actually get good at SQL

Reading explains SQL. Writing it, over and over with instant feedback, is what makes you fluent.

That's the whole SQLMaxx loop: 600+ real problems, instant AI feedback, mastery you can actually see, and spaced review that won't let you forget.

A stack of SQL practice problem cards, the top card showing an employees table.
615 problems · 66 concepts

Real problems. Not toy examples.

615 hand-built problems spanning all 66 concepts, from basic SELECTs to window functions, built on real schemas and real business questions, the kind you'll actually get asked on the job. Enough reps to make SQL automatic.

A retro computer showing a SQL query marked correct with a green checkmark.
Instant AI feedback

Write a query. Know if it's right in one second.

No copying an answer and hoping it clicked. The AI grader checks your real query against real data, catches exactly what's wrong, and explains the fix in plain English, like a senior analyst reading over your shoulder on every problem.

A circular mastery progress dial filling from blue to green, the SQLMaxx diamond at its center.
Mastery tracking

Stop guessing whether you actually know it.

SQLMaxx tracks every concept and shows you what you've mastered and what's still shaky. Your skills fill in one concept at a time, so 'I think I get joins' becomes something you can prove.

A SQL query editor circled by a blue return arrow with a clock, scheduled to come back for review.
Spaced review

Learn it once. Keep it for good.

Most of what you learn this week fades by next week. So when a concept comes due for review, SQLMaxx hands you a fresh problem to solve from a blank editor, not a flashcard to re-read. A research-backed spaced-repetition algorithm (FSRS) times each return for right before you'd forget, so your SQL is still there months later, when the interview or the job actually needs it.

Practice, feedback, mastery, review. That's the loop that turns reading into real skill.

Start free

No account, no credit card. Start solving in under a minute.