N014-M4 Tier 2 · Core SQL · medium ecommerce · Brightlane

Return each `status` and the number of unique customers who have placed at least one order in that status

Part of GROUP BY in SQL

The problem

Brightlane's CRM team is assessing how broadly each order status touches the customer base.

Write a query to return each status and the number of unique customers who have placed at least one order in that status.

Assumptions:

  • The orders table contains every order Brightlane has processed.
  • A customer with multiple orders in the same status should be counted once for that status (not once per order).

Output:

  • One row per status value, with columns status and unique_customers.
Schema · ecommerce 5 tables
categories
id integer
name text
parent_id? integer
products
id integer
name text
category_id integer
price numeric
stock_qty integer
attributes? jsonb
order_items
id integer
order_id integer
product_id integer
quantity integer
unit_price numeric
customers
id integer
name text
email text
city? text
country text
created_at timestamptz
is_active boolean
orders
id integer
customer_id integer
ordered_at timestamptz
status text
total_amount numeric

Run previews · Check grades

Write a query, then run it to see results here.

Worked solution Try it yourself first
Solution query
SELECT
  status,
  COUNT(DISTINCT customer_id) AS unique_customers
FROM
  orders
GROUP BY
  status

The shape

COUNT(DISTINCT customer_id) deduplicates customers inside each status bucket before counting them. A buyer with three delivered orders contributes one to the delivered count, not three. The delivered row shows 59 unique customers across 161 total delivered orders, which is exactly the spread between an order count and a customer count.

Clause by clause

  • SELECT status, COUNT(DISTINCT customer_id) AS unique_customers returns the status label and the deduplicated customer count for that status. status is in GROUP BY; the count is an aggregate.
  • FROM orders is the input population.
  • GROUP BY status partitions the orders by status before the count runs. COUNT(DISTINCT customer_id) then runs inside each partition, and DISTINCT only deduplicates within the partition it is running in.

Why this and not COUNT(*)

COUNT(*) per status answers "how many orders are in each status," which double-counts a buyer whose three orders all landed in the same status. The CRM team wants reach, not volume. COUNT(DISTINCT customer_id) strips the repeats inside each bucket, so the result is the number of unique buyers touched by each status. The two answers differ by exactly the amount of repeat business inside each pipeline stage.

The trap

The same customer can appear in the count for multiple statuses, and that is correct, not a bug. DISTINCT deduplicates within a group, not across groups. A buyer with one delivered order and one cancelled order adds one to the delivered count and one to the cancelled count. Summing the four unique_customers values across statuses therefore does not give you the total number of distinct customers in the orders table. For that, you would need COUNT(DISTINCT customer_id) over the whole table with no GROUP BY.

You practiced combining COUNT(DISTINCT col) with GROUP BY. The aggregate's per-group behavior compounds: DISTINCT deduplicates within each group independently, so the same customer can appear in the count of two different statuses without contradiction.

How you actually get good at SQL

Reading explains SQL. Writing it, over and over with instant feedback, is what makes you fluent.

That's the whole SQLMaxx loop: 600+ real problems, instant AI feedback, mastery you can actually see, and spaced review that won't let you forget.

A stack of SQL practice problem cards, the top card showing an employees table.
615 problems · 66 concepts

Real problems. Not toy examples.

615 hand-built problems spanning all 66 concepts, from basic SELECTs to window functions, built on real schemas and real business questions, the kind you'll actually get asked on the job. Enough reps to make SQL automatic.

A retro computer showing a SQL query marked correct with a green checkmark.
Instant AI feedback

Write a query. Know if it's right in one second.

No copying an answer and hoping it clicked. The AI grader checks your real query against real data, catches exactly what's wrong, and explains the fix in plain English, like a senior analyst reading over your shoulder on every problem.

A circular mastery progress dial filling from blue to green, the SQLMaxx diamond at its center.
Mastery tracking

Stop guessing whether you actually know it.

SQLMaxx tracks every concept and shows you what you've mastered and what's still shaky. Your skills fill in one concept at a time, so 'I think I get joins' becomes something you can prove.

A SQL query editor circled by a blue return arrow with a clock, scheduled to come back for review.
Spaced review

Learn it once. Keep it for good.

Most of what you learn this week fades by next week. So when a concept comes due for review, SQLMaxx hands you a fresh problem to solve from a blank editor, not a flashcard to re-read. A research-backed spaced-repetition algorithm (FSRS) times each return for right before you'd forget, so your SQL is still there months later, when the interview or the job actually needs it.

Practice, feedback, mastery, review. That's the loop that turns reading into real skill.

Start free

No account, no credit card. Start solving in under a minute.