N026-M2 Tier 2 · Core SQL · medium ecommerce · Brightlane

Return the category ID and product count for every assigned category that contains **more than five** products

Part of Derived Tables (Subqueries in FROM) in SQL

The problem

Brightlane's product team wants to identify well-stocked categories. Products that have not been assigned to any category should not factor into the analysis.

Write a query to return the category ID and product count for every assigned category that contains more than five products.

Assumptions:

  • The products table contains every product in the catalogue.
  • Some products have a missing category_id (unassigned). These rows must be excluded before the per-category count runs — otherwise they would form a missing-category_id group that the report intends to leave out.
  • The threshold (> 5) applies to the per-category product count, after the unassigned rows are removed.

Output:

  • One row per qualifying category, with columns category_id and product_count.
Schema · ecommerce 5 tables
categories
id integer
name text
parent_id? integer
products
id integer
name text
category_id integer
price numeric
stock_qty integer
attributes? jsonb
order_items
id integer
order_id integer
product_id integer
quantity integer
unit_price numeric
customers
id integer
name text
email text
city? text
country text
created_at timestamptz
is_active boolean
orders
id integer
customer_id integer
ordered_at timestamptz
status text
total_amount numeric

Run previews · Check grades

Write a query, then run it to see results here.

Worked solution Try it yourself first
Solution query
SELECT
  category_id,
  product_count
FROM
  (
    SELECT
      category_id,
      COUNT(*) AS product_count
    FROM
      products
    WHERE
      category_id IS NOT NULL
    GROUP BY
      category_id
  ) AS category_counts
WHERE
  product_count > 5

The shape

Two filters at two different layers do two different jobs. The inner WHERE drops unassigned products before the per-category count runs, so the missing-category_id group never forms. The outer WHERE then narrows the per-category counts to categories with more than five products.

Clause by clause

  • The inner block computes a per-category product count, but only over categorised products:
SELECT category_id, COUNT(*) AS product_count
FROM products
WHERE category_id IS NOT NULL
GROUP BY category_id

The inner WHERE runs before GROUP BY. Rows where category_id is NULL are gone before grouping starts, so no NULL group lands in the derived table. - FROM (...) AS category_counts materialises those per-category counts as a derived table. - WHERE product_count > 5 is the outer filter. It operates on the per-category count, not on individual product rows. Five categories survive — 1, 4, 5, 11, 12 — each holding more than five products. - SELECT category_id, product_count returns the two columns the product team needs for the well-stocked shortlist.

Why this and not put both filters at the same layer

The two filters target different things. WHERE category_id IS NOT NULL is a row filter — it cares about individual product rows and removes uncategorised ones. WHERE product_count > 5 is a group filter — it cares about per-category totals and removes thinly-populated categories.

Moving the row filter to the outer layer wouldn't work cleanly. By that point the rows are already grouped, and the missing-category_id group exists as one row with a NULL category and a real product count. Excluding it after the fact is possible but leaves a footprint: the count of unassigned products was computed and then thrown away. Filtering inside the inner query keeps the unassigned products out of the aggregation entirely, which matches the prompt's intent.

You practiced narrowing inside the derived table before aggregation. WHERE inside the inner query removes rows before the inner grouping runs; WHERE outside narrows the per-group results.

How you actually get good at SQL

Reading explains SQL. Writing it, over and over with instant feedback, is what makes you fluent.

That's the whole SQLMaxx loop: 600+ real problems, instant AI feedback, mastery you can actually see, and spaced review that won't let you forget.

A stack of SQL practice problem cards, the top card showing an employees table.
615 problems · 66 concepts

Real problems. Not toy examples.

615 hand-built problems spanning all 66 concepts, from basic SELECTs to window functions, built on real schemas and real business questions, the kind you'll actually get asked on the job. Enough reps to make SQL automatic.

A retro computer showing a SQL query marked correct with a green checkmark.
Instant AI feedback

Write a query. Know if it's right in one second.

No copying an answer and hoping it clicked. The AI grader checks your real query against real data, catches exactly what's wrong, and explains the fix in plain English, like a senior analyst reading over your shoulder on every problem.

A circular mastery progress dial filling from blue to green, the SQLMaxx diamond at its center.
Mastery tracking

Stop guessing whether you actually know it.

SQLMaxx tracks every concept and shows you what you've mastered and what's still shaky. Your skills fill in one concept at a time, so 'I think I get joins' becomes something you can prove.

A SQL query editor circled by a blue return arrow with a clock, scheduled to come back for review.
Spaced review

Learn it once. Keep it for good.

Most of what you learn this week fades by next week. So when a concept comes due for review, SQLMaxx hands you a fresh problem to solve from a blank editor, not a flashcard to re-read. A research-backed spaced-repetition algorithm (FSRS) times each return for right before you'd forget, so your SQL is still there months later, when the interview or the job actually needs it.

Practice, feedback, mastery, review. That's the loop that turns reading into real skill.

Start free

No account, no credit card. Start solving in under a minute.