N038-M3 Tier 3 · Intermediate · medium analytics · Streamhub

Return the ID, user ID, and event count of every session, plus the average event count across that user's sessions on each row

Part of Window Functions Introduction (OVER, PARTITION BY) in SQL

The problem

Streamhub's engagement team needs to compare each session's event count to the average event count across that session's user.

Write a query to return the ID, user ID, and event count of every session, plus the average event count across that user's sessions on each row.

Assumptions:

  • The sessions table has one row per session with an id, a user_id, and an event_count.
  • A user's average event count is the average of event_count across every session linked to that user_id. The same value should appear on every row that shares a user_id.

Output:

  • One row per session, with columns id, user_id, event_count, and user_avg_events.
Schema · analytics 5 tables
users
id integer
name text
email text
country text
plan text
signed_up_at timestamptz
is_active boolean
conversions
id integer
user_id integer
converted_at timestamptz
plan text
amount numeric
sessions
id integer
user_id integer
started_at timestamptz
ended_at? timestamptz
event_count integer
events
id integer
user_id integer
session_id? integer
event_type text
occurred_at timestamptz
properties? jsonb
periods
id integer
name text
start_month integer
end_month integer

Run previews · Check grades

Write a query, then run it to see results here.

Worked solution Try it yourself first
Solution query
SELECT
  id,
  user_id,
  event_count,
  AVG(event_count) OVER (
    PARTITION BY
      user_id
  ) AS user_avg_events
FROM
  sessions

The shape

AVG(event_count) OVER (PARTITION BY user_id) computes a separate average for each user_id and writes that user's average onto every session belonging to the user. Two sessions from the same user share an average; sessions from different users see different averages. Every session row stays in the output.

Clause by clause

  • SELECT id, user_id, event_count returns each session's identifier, the user who owned it, and the session's individual event count, one row per session.
  • The window column is:
AVG(event_count) OVER (PARTITION BY user_id) AS user_avg_events

PARTITION BY user_id splits the row set into one group per distinct user_id. AVG(event_count) runs inside each group independently, so the value attached to a given session is the average of event_count across every session that shares its user_id. All sessions belonging to the same user see the same user_avg_events.

  • FROM sessions reads every session. The engagement team is comparing each session to its user's average, so every row stays in.

Why this and not GROUP BY user_id

GROUP BY user_id would collapse the table to one row per user holding the per-user average, and the individual session rows would be gone. The comparison the engagement team is making, this session's event count versus this user's typical event count, requires both numbers on the same row. PARTITION BY is the construct that produces the per-group aggregate without throwing away the rows it summarises.

You practiced partitioning an AVG window by a foreign-key column — every record sees its own user's per-user average alongside the individual value.

How you actually get good at SQL

Reading explains SQL. Writing it, over and over with instant feedback, is what makes you fluent.

That's the whole SQLMaxx loop: 600+ real problems, instant AI feedback, mastery you can actually see, and spaced review that won't let you forget.

A stack of SQL practice problem cards, the top card showing an employees table.
615 problems · 66 concepts

Real problems. Not toy examples.

615 hand-built problems spanning all 66 concepts, from basic SELECTs to window functions, built on real schemas and real business questions, the kind you'll actually get asked on the job. Enough reps to make SQL automatic.

A retro computer showing a SQL query marked correct with a green checkmark.
Instant AI feedback

Write a query. Know if it's right in one second.

No copying an answer and hoping it clicked. The AI grader checks your real query against real data, catches exactly what's wrong, and explains the fix in plain English, like a senior analyst reading over your shoulder on every problem.

A circular mastery progress dial filling from blue to green, the SQLMaxx diamond at its center.
Mastery tracking

Stop guessing whether you actually know it.

SQLMaxx tracks every concept and shows you what you've mastered and what's still shaky. Your skills fill in one concept at a time, so 'I think I get joins' becomes something you can prove.

A SQL query editor circled by a blue return arrow with a clock, scheduled to come back for review.
Spaced review

Learn it once. Keep it for good.

Most of what you learn this week fades by next week. So when a concept comes due for review, SQLMaxx hands you a fresh problem to solve from a blank editor, not a flashcard to re-read. A research-backed spaced-repetition algorithm (FSRS) times each return for right before you'd forget, so your SQL is still there months later, when the interview or the job actually needs it.

Practice, feedback, mastery, review. That's the loop that turns reading into real skill.

Start free

No account, no credit card. Start solving in under a minute.