N059-H3 Tier 5 · Expert · hard ecommerce · Brightlane

Return each customer's `customer_id`, the total number of line items purchased across all their `orders`, and the total revenue across those line items

Part of Join Fanout and Aggregate Correctness in SQL

The problem

Scenario: Brightlane's CRM team is profiling each customer's purchasing depth — both the number of line items they have purchased and the revenue those line items represent.

Task: Write a query to return each customer's customer_id, the total number of line items purchased across all their orders, and the total revenue across those line items.

Assumptions:

  • A line item's revenue is quantity multiplied by unit_price.
  • A customer's total_line_items is the combined count of line items across every order they have placed; their total_item_revenue is the combined revenue across the same set.
  • The result covers only customers who have placed at least one order containing line items.

Output:

  • One row per qualifying customer.
  • Columns in this order: customer_id, total_line_items, total_item_revenue.
Schema · ecommerce 5 tables
categories
id integer
name text
parent_id? integer
products
id integer
name text
category_id integer
price numeric
stock_qty integer
attributes? jsonb
order_items
id integer
order_id integer
product_id integer
quantity integer
unit_price numeric
customers
id integer
name text
email text
city? text
country text
created_at timestamptz
is_active boolean
orders
id integer
customer_id integer
ordered_at timestamptz
status text
total_amount numeric

Run previews · Check grades

Write a query, then run it to see results here.

Worked solution Try it yourself first
Solution query
SELECT
  o.customer_id,
  COUNT(oi.id) AS total_line_items,
  SUM(oi.quantity * oi.unit_price) AS total_item_revenue
FROM
  orders o
  JOIN order_items oi ON oi.order_id = o.id
GROUP BY
  o.customer_id

The shape

Both COUNT(oi.id) and SUM(oi.quantity * oi.unit_price) aggregate columns that live on order_items — the child side of the join — so the per-line-item row shape is the correct shape for both. Grouping by o.customer_id rolls those line-item values up through their parent orders to one row per customer in a single pass.

Clause by clause

  • SELECT o.customer_id, COUNT(oi.id) AS total_line_items, SUM(oi.quantity * oi.unit_price) AS total_item_revenue returns each customer's ID alongside two metrics on their line items. COUNT(oi.id) totals the line-item rows inside the customer group; SUM totals the per-line revenue values across those same rows.
  • FROM orders o reads the orders, carrying the customer_id that the customer-level grouping needs.
  • JOIN order_items oi ON oi.order_id = o.id pairs each order with its line items. Customers with multiple orders, and orders with multiple line items, both contribute their full row counts to the joined result.
  • GROUP BY o.customer_id collapses the joined rows down to one per customer. Both aggregates run inside each group against the same row set.

Why both metrics can share the same query

Because every aggregated column lives on order_items, the row-multiplied shape that the join produces is the right shape for both COUNT and SUM. Adding a third metric that came from orders — say, the number of distinct orders, or the sum of an order-level total — would require either COUNT(DISTINCT o.id) or a separate pre-aggregation pass, because those parent-side values would inflate by the line-item count under the current shape. The rule that lets these two metrics coexist is "the column being aggregated comes from the table that is fanning out."

The trap

The instinct that fails here is reaching for SUM(o.order_total) to get total revenue instead of computing it from line items. An order with three line items appears in the joined result three times, and its order_total would be counted three times — a customer with a single $500 order and three line items on it would report $1500 in revenue. The line-item revenue formula SUM(oi.quantity * oi.unit_price) avoids the trap entirely because every value being summed is unique to its row. Any time a query needs both a child-row count and a revenue total, the safest shape is the one that computes both from the child table itself; reach for parent-side aggregates only when the join shape is already one-to-one.

You practiced totaling row counts and revenue across the same child layer — a count and a sum drawn from one line-item set, rolled up to the customer.

How you actually get good at SQL

Reading explains SQL. Writing it, over and over with instant feedback, is what makes you fluent.

That's the whole SQLMaxx loop: 600+ real problems, instant AI feedback, mastery you can actually see, and spaced review that won't let you forget.

A stack of SQL practice problem cards, the top card showing an employees table.
615 problems · 66 concepts

Real problems. Not toy examples.

615 hand-built problems spanning all 66 concepts, from basic SELECTs to window functions, built on real schemas and real business questions, the kind you'll actually get asked on the job. Enough reps to make SQL automatic.

A retro computer showing a SQL query marked correct with a green checkmark.
Instant AI feedback

Write a query. Know if it's right in one second.

No copying an answer and hoping it clicked. The AI grader checks your real query against real data, catches exactly what's wrong, and explains the fix in plain English, like a senior analyst reading over your shoulder on every problem.

A circular mastery progress dial filling from blue to green, the SQLMaxx diamond at its center.
Mastery tracking

Stop guessing whether you actually know it.

SQLMaxx tracks every concept and shows you what you've mastered and what's still shaky. Your skills fill in one concept at a time, so 'I think I get joins' becomes something you can prove.

A SQL query editor circled by a blue return arrow with a clock, scheduled to come back for review.
Spaced review

Learn it once. Keep it for good.

Most of what you learn this week fades by next week. So when a concept comes due for review, SQLMaxx hands you a fresh problem to solve from a blank editor, not a flashcard to re-read. A research-backed spaced-repetition algorithm (FSRS) times each return for right before you'd forget, so your SQL is still there months later, when the interview or the job actually needs it.

Practice, feedback, mastery, review. That's the loop that turns reading into real skill.

Start free

No account, no credit card. Start solving in under a minute.