N015-M4 Tier 2 · Core SQL · medium analytics · Streamhub

Return the user ID and session count for every user who has recorded **more than five** sessions

Part of HAVING in SQL

The problem

Streamhub's platform team is identifying highly active users for an outreach campaign.

Write a query to return the user ID and session count for every user who has recorded more than five sessions.

Assumptions:

  • The sessions table contains every session ever recorded on the Streamhub platform.
  • user_id links each session to the user who initiated it.
  • The threshold (> 5) applies to the per-user session count.

Output:

  • One row per qualifying user, with columns user_id and session_count.
Schema · analytics 5 tables
users
id integer
name text
email text
country text
plan text
signed_up_at timestamptz
is_active boolean
conversions
id integer
user_id integer
converted_at timestamptz
plan text
amount numeric
sessions
id integer
user_id integer
started_at timestamptz
ended_at? timestamptz
event_count integer
events
id integer
user_id integer
session_id? integer
event_type text
occurred_at timestamptz
properties? jsonb
periods
id integer
name text
start_month integer
end_month integer

Run previews · Check grades

Write a query, then run it to see results here.

Worked solution Try it yourself first
Solution query
SELECT
  user_id,
  COUNT(*) AS session_count
FROM
  sessions
GROUP BY
  user_id
HAVING
  COUNT(*) > 5

The shape

Same canonical shape as orders-per-customer, in a different domain. GROUP BY user_id builds the per-user session set; HAVING COUNT(*) > 5 keeps only the users with more than five sessions. The outreach campaign's eligible cohort comes out as user_id 1 and user_id 3, both with 9 sessions each.

Clause by clause

  • SELECT user_id, COUNT(*) AS session_count returns each qualifying user's id with their session count. COUNT(*) counts the rows in each user's group — every session that landed in the group contributes 1.
  • FROM sessions is the source set: every session ever recorded on Streamhub.
  • GROUP BY user_id partitions the rows into one group per user. After this clause, each row in the working set represents one user with their total session count aggregated behind it.
  • HAVING COUNT(*) > 5 filters those user rows by their session count. Users with five or fewer sessions drop out; six or more survive.

Why this and not WHERE COUNT(*) > 5

The per-user count doesn't exist until grouping has finished. WHERE runs row by row before any grouping happens, so it has no COUNT(*) to compare against. The condition has to wait until the groups have been built, which is exactly the moment HAVING runs.

The structural shape generalises. Anywhere a fact table records repeated events keyed to an entity — sessions per user here, orders per customer in N015-E1, plays per song, opens per email, logins per device — the question "which entities have more than N events" is GROUP BY entity plus HAVING COUNT(*) > N. The schema names change; the shape doesn't.

You practiced applying the HAVING COUNT(*) > N pattern in a different domain (sessions per user). The same structural shape recurs anywhere a fact table records repeated events keyed to an entity — orders per customer, plays per song, opens per email.

How you actually get good at SQL

Reading explains SQL. Writing it, over and over with instant feedback, is what makes you fluent.

That's the whole SQLMaxx loop: 600+ real problems, instant AI feedback, mastery you can actually see, and spaced review that won't let you forget.

A stack of SQL practice problem cards, the top card showing an employees table.
615 problems · 66 concepts

Real problems. Not toy examples.

615 hand-built problems spanning all 66 concepts, from basic SELECTs to window functions, built on real schemas and real business questions, the kind you'll actually get asked on the job. Enough reps to make SQL automatic.

A retro computer showing a SQL query marked correct with a green checkmark.
Instant AI feedback

Write a query. Know if it's right in one second.

No copying an answer and hoping it clicked. The AI grader checks your real query against real data, catches exactly what's wrong, and explains the fix in plain English, like a senior analyst reading over your shoulder on every problem.

A circular mastery progress dial filling from blue to green, the SQLMaxx diamond at its center.
Mastery tracking

Stop guessing whether you actually know it.

SQLMaxx tracks every concept and shows you what you've mastered and what's still shaky. Your skills fill in one concept at a time, so 'I think I get joins' becomes something you can prove.

A SQL query editor circled by a blue return arrow with a clock, scheduled to come back for review.
Spaced review

Learn it once. Keep it for good.

Most of what you learn this week fades by next week. So when a concept comes due for review, SQLMaxx hands you a fresh problem to solve from a blank editor, not a flashcard to re-read. A research-backed spaced-repetition algorithm (FSRS) times each return for right before you'd forget, so your SQL is still there months later, when the interview or the job actually needs it.

Practice, feedback, mastery, review. That's the loop that turns reading into real skill.

Start free

No account, no credit card. Start solving in under a minute.