N030-M4 Tier 3 · Intermediate · medium hr · Helix Systems

Return the department ID and employee count for every department that has `3` or more employees on record

Part of Common Table Expressions (CTEs) in SQL

The problem

The HR team at Helix Systems is auditing team size across the organization.

Write a query to return the department ID and employee count for every department that has 3 or more employees on record.

Assumptions:

  • The employees table has one row per employee with a department_id.
  • A department's employee count is the number of employees records linked to that department_id.
  • Only departments with 3 or more employees should appear.

Output:

  • One row per qualifying department, with columns department_id and headcount.
Schema · hr 4 tables
departments
id integer
name text
location text
budget numeric
salaries
id integer
employee_id integer
amount numeric
effective_date date
end_date? date
employees
id integer
name text
email text
department_id integer
manager_id? integer
hire_date date
title text
is_active boolean
job_history
id integer
employee_id integer
title text
department_id integer
start_date date
end_date? date

Run previews · Check grades

Write a query, then run it to see results here.

Worked solution Try it yourself first
Solution query
WITH
  dept_headcount AS (
    SELECT
      department_id,
      COUNT(*) AS headcount
    FROM
      employees
    GROUP BY
      department_id
  )
SELECT
  department_id,
  headcount
FROM
  dept_headcount
WHERE
  headcount >= 3

The shape

The WITH layer computes a headcount per department; the main query filters that named result with WHERE headcount >= 3 against the aggregate column. The threshold check happens after the grouping, in a separate named layer.

Clause by clause

  • The WITH clause defines dept_headcount:
WITH dept_headcount AS (
  SELECT department_id, COUNT(*) AS headcount
  FROM employees
  GROUP BY department_id
)

GROUP BY department_id partitions employees by department; COUNT(*) counts each partition. Every department with at least one employee gets a row in the layer, headcount attached.

  • SELECT department_id, headcount FROM dept_headcount WHERE headcount >= 3 is the main query. It reads the named layer and keeps the rows whose count is 3 or more. Department 1 stays in with 17; department 3 stays in with 11; the other six departments also clear the threshold on this data.

Why this and not a derived table in FROM

A derived table would put the per-department aggregation inside the main query's FROM and apply the same threshold in the same WHERE. The result set is identical either way. The WITH version pulls the aggregation out, names it, and lets the main query read as two clear steps: compute the per-department count, then keep the ones at the cutoff. The threshold against an aggregate is the recurring shape that named layers were built to make legible.

The trap

COUNT(*) only produces headcount after the grouping runs. The named column does not exist on employees itself, only on the layer's output. Trying to write WHERE COUNT(*) >= 3 directly against employees without the layer fails, because the aggregate has not been computed yet. The two-stage shape — group first, filter the named aggregate second — is the structural answer to that ordering problem.

You practiced computing a per-category count in a WITH layer and applying a threshold check in the main query.

How you actually get good at SQL

Reading explains SQL. Writing it, over and over with instant feedback, is what makes you fluent.

That's the whole SQLMaxx loop: 600+ real problems, instant AI feedback, mastery you can actually see, and spaced review that won't let you forget.

A stack of SQL practice problem cards, the top card showing an employees table.
615 problems · 66 concepts

Real problems. Not toy examples.

615 hand-built problems spanning all 66 concepts, from basic SELECTs to window functions, built on real schemas and real business questions, the kind you'll actually get asked on the job. Enough reps to make SQL automatic.

A retro computer showing a SQL query marked correct with a green checkmark.
Instant AI feedback

Write a query. Know if it's right in one second.

No copying an answer and hoping it clicked. The AI grader checks your real query against real data, catches exactly what's wrong, and explains the fix in plain English, like a senior analyst reading over your shoulder on every problem.

A circular mastery progress dial filling from blue to green, the SQLMaxx diamond at its center.
Mastery tracking

Stop guessing whether you actually know it.

SQLMaxx tracks every concept and shows you what you've mastered and what's still shaky. Your skills fill in one concept at a time, so 'I think I get joins' becomes something you can prove.

A SQL query editor circled by a blue return arrow with a clock, scheduled to come back for review.
Spaced review

Learn it once. Keep it for good.

Most of what you learn this week fades by next week. So when a concept comes due for review, SQLMaxx hands you a fresh problem to solve from a blank editor, not a flashcard to re-read. A research-backed spaced-repetition algorithm (FSRS) times each return for right before you'd forget, so your SQL is still there months later, when the interview or the job actually needs it.

Practice, feedback, mastery, review. That's the loop that turns reading into real skill.

Start free

No account, no credit card. Start solving in under a minute.