N062-M4 Tier 5 · Expert · medium ecommerce · Brightlane

Return each `category_name` and its `revenue` — the combined line-item revenue across its products

Part of Choosing Between Subqueries, CTEs, and Joins in SQL

The problem

Scenario: Brightlane's merchandising team needs each product category's name paired with the total revenue it has generated from line items.

Task: Write a query to return each category_name and its revenue — the combined line-item revenue across its products.

Assumptions:

  • A line item's revenue is quantity multiplied by unit_price.
  • A category's revenue is the combined line-item revenue across every product in that category.
  • The result covers only categories with at least one line item across their products.

Output:

  • One row per qualifying category.
  • Columns in this order: category_name, revenue.
Schema · ecommerce 5 tables
categories
id integer
name text
parent_id? integer
products
id integer
name text
category_id integer
price numeric
stock_qty integer
attributes? jsonb
order_items
id integer
order_id integer
product_id integer
quantity integer
unit_price numeric
customers
id integer
name text
email text
city? text
country text
created_at timestamptz
is_active boolean
orders
id integer
customer_id integer
ordered_at timestamptz
status text
total_amount numeric

Run previews · Check grades

Write a query, then run it to see results here.

Worked solution Try it yourself first
Solution query
WITH
  category_revenue AS (
    SELECT
      p.category_id,
      SUM(oi.quantity * oi.unit_price) AS revenue
    FROM
      order_items oi
      JOIN products p ON oi.product_id = p.id
    GROUP BY
      p.category_id
  )
SELECT
  c.name AS category_name,
  cr.revenue
FROM
  categories c
  JOIN category_revenue cr ON cr.category_id = c.id

The shape

A CTE aggregates line-item revenue by category id once, and an inner JOIN to categories attaches the category name. The CTE makes the aggregation step its own named layer; the outer join handles the labeling step.

Clause by clause

  • WITH category_revenue AS (SELECT p.category_id, SUM(oi.quantity * oi.unit_price) AS revenue FROM order_items oi JOIN products p ON oi.product_id = p.id GROUP BY p.category_id) joins each line item to its product to reach category_id, then totals line-item revenue per category. One row per category that has at least one line item across its products.
  • SELECT c.name AS category_name, cr.revenue FROM categories c JOIN category_revenue cr ON cr.category_id = c.id brings the category name in. The inner JOIN is deliberate — categories with no line items are excluded, which matches "covers only categories with at least one line item across their products."

Why pre-aggregate in a CTE and not aggregate in the final query

SELECT c.name, SUM(oi.quantity * oi.unit_price) FROM categories c JOIN products p ON p.category_id = c.id JOIN order_items oi ON oi.product_id = p.id GROUP BY c.name produces the same numbers. The shapes differ in how they handle row counts:

-- Pre-aggregate in CTE, then join for the label
WITH category_revenue AS (
    SELECT p.category_id, SUM(oi.quantity * oi.unit_price) AS revenue
    FROM order_items oi JOIN products p ON oi.product_id = p.id
    GROUP BY p.category_id
)
SELECT c.name AS category_name, cr.revenue
FROM categories c JOIN category_revenue cr ON cr.category_id = c.id

-- Single flat query: join everything, then aggregate
SELECT c.name AS category_name, SUM(oi.quantity * oi.unit_price) AS revenue
FROM categories c JOIN products p ON p.category_id = c.id
JOIN order_items oi ON oi.product_id = p.id
GROUP BY c.name

In the flat shape, the join produces one row per line item before the GROUP BY collapses it. The pre-aggregated shape produces one row per category before the final join, keeping the join's row count exactly equal to the number of qualifying categories. When the many side is wide, pre-aggregating is the cleaner separation between "compute the metric" and "label the result."

The trap

The aggregation has to happen by category_id, not by category name. If two categories share a name (rare but possible in a real catalog), grouping on name silently merges their revenue. Grouping on category_id first and joining for the label keeps each category distinct regardless of name collisions.

You practiced precomputing per-category revenue in a CTE before pairing it with the category lookup — separating the per-category calculation from the name attachment.

How you actually get good at SQL

Reading explains SQL. Writing it, over and over with instant feedback, is what makes you fluent.

That's the whole SQLMaxx loop: 600+ real problems, instant AI feedback, mastery you can actually see, and spaced review that won't let you forget.

A stack of SQL practice problem cards, the top card showing an employees table.
615 problems · 66 concepts

Real problems. Not toy examples.

615 hand-built problems spanning all 66 concepts, from basic SELECTs to window functions, built on real schemas and real business questions, the kind you'll actually get asked on the job. Enough reps to make SQL automatic.

A retro computer showing a SQL query marked correct with a green checkmark.
Instant AI feedback

Write a query. Know if it's right in one second.

No copying an answer and hoping it clicked. The AI grader checks your real query against real data, catches exactly what's wrong, and explains the fix in plain English, like a senior analyst reading over your shoulder on every problem.

A circular mastery progress dial filling from blue to green, the SQLMaxx diamond at its center.
Mastery tracking

Stop guessing whether you actually know it.

SQLMaxx tracks every concept and shows you what you've mastered and what's still shaky. Your skills fill in one concept at a time, so 'I think I get joins' becomes something you can prove.

A SQL query editor circled by a blue return arrow with a clock, scheduled to come back for review.
Spaced review

Learn it once. Keep it for good.

Most of what you learn this week fades by next week. So when a concept comes due for review, SQLMaxx hands you a fresh problem to solve from a blank editor, not a flashcard to re-read. A research-backed spaced-repetition algorithm (FSRS) times each return for right before you'd forget, so your SQL is still there months later, when the interview or the job actually needs it.

Practice, feedback, mastery, review. That's the loop that turns reading into real skill.

Start free

No account, no credit card. Start solving in under a minute.