N056-M4 Tier 4 · Advanced · medium analytics · Streamhub

Return each user's `user_id`, calendar month, total event count in that month, and event count in the immediately preceding month within that same user's own activity history

Part of Period-over-Period Analysis in SQL

The problem

Scenario: Streamhub's engagement team flags users whose monthly activity is shifting and needs each user's monthly event count shown alongside their own prior-month count.

Task: Write a query to return each user's user_id, calendar month, total event count in that month, and event count in the immediately preceding month within that same user's own activity history.

Assumptions:

  • A user is present in a month only if they have at least one event recorded in that month.
  • Each user's prev_month_count is drawn solely from that same user's earlier activity history.
  • The earliest month in each user's own history has no preceding month within that user; its prev_month_count value is missing.

Output:

  • One row per (user_id, month) pair present in the data.
  • Columns in this order: user_id, month (the first day of the calendar month), event_count, prev_month_count.
  • Sorted by user_id ascending, then month ascending.
Schema · analytics 5 tables
users
id integer
name text
email text
country text
plan text
signed_up_at timestamptz
is_active boolean
conversions
id integer
user_id integer
converted_at timestamptz
plan text
amount numeric
sessions
id integer
user_id integer
started_at timestamptz
ended_at? timestamptz
event_count integer
events
id integer
user_id integer
session_id? integer
event_type text
occurred_at timestamptz
properties? jsonb
periods
id integer
name text
start_month integer
end_month integer

Run previews · Check grades

Write a query, then run it to see results here.

Worked solution Try it yourself first
Solution query
SELECT
  user_id,
  DATE_TRUNC('month', occurred_at)::date AS MONTH,
  COUNT(*) AS event_count,
  LAG(COUNT(*)) OVER (
    PARTITION BY
      user_id
    ORDER BY
      DATE_TRUNC('month', occurred_at)
  ) AS prev_month_count
FROM
  events
GROUP BY
  user_id,
  DATE_TRUNC('month', occurred_at)
ORDER BY
  user_id,
  MONTH

The shape

Each user's monthly activity is its own series, and the prior-month count has to come from that user's history rather than from whoever happened to come before them in the data. PARTITION BY user_id is what isolates the lookback to a single user at a time.

Clause by clause

  • SELECT user_id, DATE_TRUNC('month', occurred_at)::date AS month, COUNT(*) AS event_count, LAG(COUNT(*)) OVER (PARTITION BY user_id ORDER BY DATE_TRUNC('month', occurred_at)) AS prev_month_count returns each user's month bucket, that month's event count, and the same user's prior-month count. PARTITION BY user_id resets the lookback at every user boundary; ORDER BY DATE_TRUNC('month', occurred_at) sets the chronological sequence inside each user's partition.
  • FROM events reads the event stream.
  • GROUP BY user_id, DATE_TRUNC('month', occurred_at) reduces the events to one row per (user, month) pair so the count is a per-user monthly total.
  • ORDER BY user_id, month prints each user's history as a contiguous block, in time order.

The trap

The first month of every user's history returns NULL for prev_month_count because there is no prior row inside that user's partition. That NULL is correct — there is no prior-month measurement for the user's first appearance — but the count of NULL rows scales with the number of users. A multi-entity period-over-period report can have a large fraction of its rows reporting NULL in the prior-period column, one per partition, and a downstream aggregate that doesn't account for them will quietly under-count. The behavior is the right one; the consumer needs to know it is there.

You practiced partitioning LAG by user_id so each user's prior-month count is drawn from their own history, never from another user's row.

How you actually get good at SQL

Reading explains SQL. Writing it, over and over with instant feedback, is what makes you fluent.

That's the whole SQLMaxx loop: 600+ real problems, instant AI feedback, mastery you can actually see, and spaced review that won't let you forget.

A stack of SQL practice problem cards, the top card showing an employees table.
615 problems · 66 concepts

Real problems. Not toy examples.

615 hand-built problems spanning all 66 concepts, from basic SELECTs to window functions, built on real schemas and real business questions, the kind you'll actually get asked on the job. Enough reps to make SQL automatic.

A retro computer showing a SQL query marked correct with a green checkmark.
Instant AI feedback

Write a query. Know if it's right in one second.

No copying an answer and hoping it clicked. The AI grader checks your real query against real data, catches exactly what's wrong, and explains the fix in plain English, like a senior analyst reading over your shoulder on every problem.

A circular mastery progress dial filling from blue to green, the SQLMaxx diamond at its center.
Mastery tracking

Stop guessing whether you actually know it.

SQLMaxx tracks every concept and shows you what you've mastered and what's still shaky. Your skills fill in one concept at a time, so 'I think I get joins' becomes something you can prove.

A SQL query editor circled by a blue return arrow with a clock, scheduled to come back for review.
Spaced review

Learn it once. Keep it for good.

Most of what you learn this week fades by next week. So when a concept comes due for review, SQLMaxx hands you a fresh problem to solve from a blank editor, not a flashcard to re-read. A research-backed spaced-repetition algorithm (FSRS) times each return for right before you'd forget, so your SQL is still there months later, when the interview or the job actually needs it.

Practice, feedback, mastery, review. That's the loop that turns reading into real skill.

Start free

No account, no credit card. Start solving in under a minute.