N057-H1 Tier 4 · Advanced · hard analytics · Streamhub

Return each month-of-year and the total number of `events` recorded in that month-of-year across every year in the data

Part of Grouping by Date Periods in SQL

The problem

Scenario: Streamhub's product team wants to identify seasonal patterns by looking at which months of the year draw the highest event volume, ignoring which year each event happened in.

Task: Write a query to return each month-of-year and the total number of events recorded in that month-of-year across every year in the data.

Assumptions:

  • The month_num value is the calendar month of the event, expressed as a number from 1 (January) through 12 (December), with no year component.
  • An event from January 2023 and an event from January 2024 contribute to the same month-of-year row.

Output:

  • One row per month-of-year present in the data, between 1 and 12 inclusive.
  • Columns in this order: month_num, event_count.
Schema · analytics 5 tables
users
id integer
name text
email text
country text
plan text
signed_up_at timestamptz
is_active boolean
conversions
id integer
user_id integer
converted_at timestamptz
plan text
amount numeric
sessions
id integer
user_id integer
started_at timestamptz
ended_at? timestamptz
event_count integer
events
id integer
user_id integer
session_id? integer
event_type text
occurred_at timestamptz
properties? jsonb
periods
id integer
name text
start_month integer
end_month integer

Run previews · Check grades

Write a query, then run it to see results here.

Worked solution Try it yourself first
Solution query
SELECT
  EXTRACT(
    MONTH
    FROM
      occurred_at
  ) AS month_num,
  COUNT(*) AS event_count
FROM
  events
GROUP BY
  EXTRACT(
    MONTH
    FROM
      occurred_at
  )

The shape

EXTRACT(month FROM occurred_at) returns the month number alone — a value from 1 to 12 with no year attached. Grouping on that single number collapses January 2022, January 2023, and January 2024 events all into the same row, which is exactly the seasonal view the product team wants.

Clause by clause

  • SELECT EXTRACT(month FROM occurred_at) AS month_num, COUNT(*) AS event_count returns one row per distinct month number. The output is twelve numeric rows at most, one per calendar month of the year, with the total event volume across every year in the data.
  • FROM events reads every recorded event.
  • GROUP BY EXTRACT(month FROM occurred_at) uses the same extraction as the grouping key. Every event whose timestamp falls in March, regardless of year, extracts to the number 3 and lands in the same group. The count of 32 on month 3 is the total of every March event in the table.

Why EXTRACT and not date_trunc

date_trunc('month', occurred_at) would return one row per calendar month per year, separating March 2022 from March 2023 from March 2024. That is the right shape for a time-series report but the wrong shape here. The product team is looking for a seasonal pattern that ignores year entirely, and EXTRACT(month FROM ...) is the only construct in this node that strips the year off and returns just the month number. The difference between the two functions is exactly the difference between a calendar grouping and a seasonal one: date_trunc preserves the year axis, EXTRACT(month FROM ...) collapses it.

The trap

The result has at most twelve rows, and the natural reading is to scan them in January-through-December order. The query does not sort them, and GROUP BY alone does not guarantee any output order. The rows can come back in any sequence, which is fine for a seasonal pivot the team will plot or join to a month-name table downstream, but a reader expecting the numbers to land in calendar order will read the values out of sequence. The values themselves are correct; only the row order is undefined.

You practiced extracting a month-of-year value with no year component so events from different years collapse together by season rather than by date.

How you actually get good at SQL

Reading explains SQL. Writing it, over and over with instant feedback, is what makes you fluent.

That's the whole SQLMaxx loop: 600+ real problems, instant AI feedback, mastery you can actually see, and spaced review that won't let you forget.

A stack of SQL practice problem cards, the top card showing an employees table.
615 problems · 66 concepts

Real problems. Not toy examples.

615 hand-built problems spanning all 66 concepts, from basic SELECTs to window functions, built on real schemas and real business questions, the kind you'll actually get asked on the job. Enough reps to make SQL automatic.

A retro computer showing a SQL query marked correct with a green checkmark.
Instant AI feedback

Write a query. Know if it's right in one second.

No copying an answer and hoping it clicked. The AI grader checks your real query against real data, catches exactly what's wrong, and explains the fix in plain English, like a senior analyst reading over your shoulder on every problem.

A circular mastery progress dial filling from blue to green, the SQLMaxx diamond at its center.
Mastery tracking

Stop guessing whether you actually know it.

SQLMaxx tracks every concept and shows you what you've mastered and what's still shaky. Your skills fill in one concept at a time, so 'I think I get joins' becomes something you can prove.

A SQL query editor circled by a blue return arrow with a clock, scheduled to come back for review.
Spaced review

Learn it once. Keep it for good.

Most of what you learn this week fades by next week. So when a concept comes due for review, SQLMaxx hands you a fresh problem to solve from a blank editor, not a flashcard to re-read. A research-backed spaced-repetition algorithm (FSRS) times each return for right before you'd forget, so your SQL is still there months later, when the interview or the job actually needs it.

Practice, feedback, mastery, review. That's the loop that turns reading into real skill.

Start free

No account, no credit card. Start solving in under a minute.