N042-H2 Tier 4 · Advanced · hard ecommerce · Brightlane

Return every order's ID, customer ID, order amount, the difference between that order's amount and the customer's average order amount across all their orders, and that same customer's previous order amount

Part of LAG and LEAD in SQL

The problem

Brightlane's customer analytics team needs each order annotated with two contextual values: how the order's amount compares to that customer's average across their history, and the previous order amount for that customer.

Write a query to return every order's ID, customer ID, order amount, the difference between that order's amount and the customer's average order amount across all their orders, and that same customer's previous order amount.

Assumptions:

  • A customer's average is the average of total_amount across every order for that customer_id. The same customer-level average appears on every row sharing a customer_id.
  • The diff-from-average at each row is the current total_amount minus the customer's average.
  • A customer's previous order is the order with the largest ordered_at strictly before the current row's ordered_at, restricted to that customer. For a customer's first order, the previous-amount value is missing.
  • The final result is sorted by customer_id ascending, then by ordered_at ascending.

Output:

  • One row per order, with columns id, customer_id, total_amount, diff_from_avg, and prev_order_amount. Sorted by customer_id, then ordered_at.
Schema · ecommerce 5 tables
categories
id integer
name text
parent_id? integer
products
id integer
name text
category_id integer
price numeric
stock_qty integer
attributes? jsonb
order_items
id integer
order_id integer
product_id integer
quantity integer
unit_price numeric
customers
id integer
name text
email text
city? text
country text
created_at timestamptz
is_active boolean
orders
id integer
customer_id integer
ordered_at timestamptz
status text
total_amount numeric

Run previews · Check grades

Write a query, then run it to see results here.

Worked solution Try it yourself first
Solution query
SELECT
  id,
  customer_id,
  total_amount,
  total_amount - AVG(total_amount) OVER (
    PARTITION BY
      customer_id
  ) AS diff_from_avg,
  LAG(total_amount) OVER (
    PARTITION BY
      customer_id
    ORDER BY
      ordered_at
  ) AS prev_order_amount
FROM
  orders
ORDER BY
  customer_id,
  ordered_at

The shape

Two window functions share the same partition (PARTITION BY customer_id) but answer different questions on each row. AVG(total_amount) OVER (PARTITION BY customer_id) computes the customer's lifetime average and duplicates it onto every row; LAG(total_amount) OVER (PARTITION BY customer_id ORDER BY ordered_at) reaches back one chronological step. Both annotations land on the same row without a self-join.

Clause by clause

  • SELECT id, customer_id, total_amount, total_amount - AVG(total_amount) OVER (PARTITION BY customer_id) AS diff_from_avg, LAG(total_amount) OVER (PARTITION BY customer_id ORDER BY ordered_at) AS prev_order_amount returns the order's identifying columns plus two derived annotations. The AVG window has no ORDER BY because an average over the whole partition does not depend on row order; the LAG window does have ORDER BY ordered_at because "previous" only means something against a defined sequence.
  • FROM orders reads every order.
  • ORDER BY customer_id, ordered_at sorts the printed output chronologically within each customer.

Why two separate OVER clauses and not one shared window

The two functions need different window definitions. AVG needs the unordered partition; adding ORDER BY would not change its result here (a partition-wide average is invariant to row order), but LAG strictly requires ORDER BY to know which row is previous. Writing the two OVER clauses out independently makes each function's dependencies explicit. A shared WINDOW alias could deduplicate the PARTITION BY customer_id text but would not change semantics.

The trap

The two window functions both partition by customer_id, which can read as a single shared window. They are not the same window. AVG OVER (PARTITION BY customer_id) and LAG OVER (PARTITION BY customer_id ORDER BY ordered_at) differ in whether ordering is part of the window definition, and that difference is load-bearing. Adding ORDER BY ordered_at to the AVG window would not change its result on this query, but on a frame-sensitive aggregate it absolutely would; an ordered window introduces an implicit running frame, and the average would become a running average instead of a partition average. The cleanest rule when authoring a query with multiple windows is: write each window's clauses to match exactly what that function needs, and do not share unless the sharing is provably semantics-preserving.

You practiced two window functions on the same partition with different roles — AVG OVER (PARTITION BY ...) for a per-customer summary and LAG OVER (PARTITION BY ... ORDER BY ...) for a chronological lookup, both attached to the same row.

How you actually get good at SQL

Reading explains SQL. Writing it, over and over with instant feedback, is what makes you fluent.

That's the whole SQLMaxx loop: 600+ real problems, instant AI feedback, mastery you can actually see, and spaced review that won't let you forget.

A stack of SQL practice problem cards, the top card showing an employees table.
615 problems · 66 concepts

Real problems. Not toy examples.

615 hand-built problems spanning all 66 concepts, from basic SELECTs to window functions, built on real schemas and real business questions, the kind you'll actually get asked on the job. Enough reps to make SQL automatic.

A retro computer showing a SQL query marked correct with a green checkmark.
Instant AI feedback

Write a query. Know if it's right in one second.

No copying an answer and hoping it clicked. The AI grader checks your real query against real data, catches exactly what's wrong, and explains the fix in plain English, like a senior analyst reading over your shoulder on every problem.

A circular mastery progress dial filling from blue to green, the SQLMaxx diamond at its center.
Mastery tracking

Stop guessing whether you actually know it.

SQLMaxx tracks every concept and shows you what you've mastered and what's still shaky. Your skills fill in one concept at a time, so 'I think I get joins' becomes something you can prove.

A SQL query editor circled by a blue return arrow with a clock, scheduled to come back for review.
Spaced review

Learn it once. Keep it for good.

Most of what you learn this week fades by next week. So when a concept comes due for review, SQLMaxx hands you a fresh problem to solve from a blank editor, not a flashcard to re-read. A research-backed spaced-repetition algorithm (FSRS) times each return for right before you'd forget, so your SQL is still there months later, when the interview or the job actually needs it.

Practice, feedback, mastery, review. That's the loop that turns reading into real skill.

Start free

No account, no credit card. Start solving in under a minute.