N057-M3 Tier 4 · Advanced · medium analytics · Streamhub

Return the year, the month number from 1 through 12, and the total number of `events` for each year-month combination

Part of Grouping by Date Periods in SQL

The problem

Scenario: Streamhub's data team is producing a report where year and month-of-year are reported as two separate numeric columns rather than a single date, so analysts can slice by either dimension independently.

Task: Write a query to return the year, the month number from 1 through 12, and the total number of events for each year-month combination.

Assumptions:

  • The year value is the four-digit calendar year of the event.
  • The month_num value is the calendar month of the event, expressed as a number from 1 (January) through 12 (December).
  • One result row covers every event whose calendar year and calendar month match.

Output:

  • One row per (year, month_num) combination present in the data.
  • Columns in this order: year, month_num, event_count.
Schema · analytics 5 tables
users
id integer
name text
email text
country text
plan text
signed_up_at timestamptz
is_active boolean
conversions
id integer
user_id integer
converted_at timestamptz
plan text
amount numeric
sessions
id integer
user_id integer
started_at timestamptz
ended_at? timestamptz
event_count integer
events
id integer
user_id integer
session_id? integer
event_type text
occurred_at timestamptz
properties? jsonb
periods
id integer
name text
start_month integer
end_month integer

Run previews · Check grades

Write a query, then run it to see results here.

Worked solution Try it yourself first
Solution query
SELECT
  EXTRACT(
    YEAR
    FROM
      occurred_at
  ) AS YEAR,
  EXTRACT(
    MONTH
    FROM
      occurred_at
  ) AS month_num,
  COUNT(*) AS event_count
FROM
  events
GROUP BY
  EXTRACT(
    YEAR
    FROM
      occurred_at
  ),
  EXTRACT(
    MONTH
    FROM
      occurred_at
  )

The shape

EXTRACT pulls the year and the month-of-year out of occurred_at as two independent numeric values, and the grouping happens on the pair. The query reports time as two columns of numbers, not as a single truncated date, so analysts can slice on either dimension on its own.

Clause by clause

  • SELECT EXTRACT(year FROM occurred_at) AS year, EXTRACT(month FROM occurred_at) AS month_num, COUNT(*) AS event_count returns one row per (year, month) pair. Each EXTRACT returns a number, not a date, so the report shows 2023 and 12 instead of a December 2023 date. COUNT(*) counts the events that fall inside the pair.
  • FROM events reads every event.
  • GROUP BY EXTRACT(year FROM occurred_at), EXTRACT(month FROM occurred_at) uses the same two expressions as the grouping keys. The grouping is on the pair, not on each value independently, so December 2022 and December 2023 are kept apart because their year values differ even though their month_num matches.

Why EXTRACT and not date_trunc

date_trunc('month', occurred_at) would return a single timestamp per month, which is the right shape for time-series charts but the wrong shape here. The prompt asks for year and month-of-year as two separate numeric columns. EXTRACT returns a number for each part, which is exactly that shape. The two functions live in the same node for this reason: date_trunc is for grouping along a continuous calendar axis, and EXTRACT is for pulling calendar parts out as numbers that analysts can pivot on independently.

You practiced extracting calendar parts as separate numeric values so year and month-of-year can be reported as independent dimensions instead of a single truncated date.

How you actually get good at SQL

Reading explains SQL. Writing it, over and over with instant feedback, is what makes you fluent.

That's the whole SQLMaxx loop: 600+ real problems, instant AI feedback, mastery you can actually see, and spaced review that won't let you forget.

A stack of SQL practice problem cards, the top card showing an employees table.
615 problems · 66 concepts

Real problems. Not toy examples.

615 hand-built problems spanning all 66 concepts, from basic SELECTs to window functions, built on real schemas and real business questions, the kind you'll actually get asked on the job. Enough reps to make SQL automatic.

A retro computer showing a SQL query marked correct with a green checkmark.
Instant AI feedback

Write a query. Know if it's right in one second.

No copying an answer and hoping it clicked. The AI grader checks your real query against real data, catches exactly what's wrong, and explains the fix in plain English, like a senior analyst reading over your shoulder on every problem.

A circular mastery progress dial filling from blue to green, the SQLMaxx diamond at its center.
Mastery tracking

Stop guessing whether you actually know it.

SQLMaxx tracks every concept and shows you what you've mastered and what's still shaky. Your skills fill in one concept at a time, so 'I think I get joins' becomes something you can prove.

A SQL query editor circled by a blue return arrow with a clock, scheduled to come back for review.
Spaced review

Learn it once. Keep it for good.

Most of what you learn this week fades by next week. So when a concept comes due for review, SQLMaxx hands you a fresh problem to solve from a blank editor, not a flashcard to re-read. A research-backed spaced-repetition algorithm (FSRS) times each return for right before you'd forget, so your SQL is still there months later, when the interview or the job actually needs it.

Practice, feedback, mastery, review. That's the loop that turns reading into real skill.

Start free

No account, no credit card. Start solving in under a minute.