One-Day Calendar — Availability

"Suppose we want to turn this take-home into a real service that people can call." Worked through interview-style, in order: pin the requirements, meet every component (and why it earns its place), make the three big technology calls, draw the picture, then follow each request start to finish.

1 · Pin the requirements

Everything below is justified by these eight lines — if a component doesn't serve one of them, it doesn't get to exist.

Functional — what it must do

Users can upload / manage calendars (people + events) instead of re-sending a CSV per call.
Callers can query availability: all slots, first slot, free windows — for any set of people and duration.
Calendars are persistent and shared: upload once, query many times, from any client.
Invalid input gets a clear, line-numbered error, never a silent wrong answer.

Non-functional — how well it must do it

Read-heavy: ~100 : 1 queries to uploads. Optimize for query latency, p95 < 100 ms.
Small hot data: one day = 720 bits per person → whole org fits in cache.
Consistency: a finished upload must be visible to the next query (read-your-writes per calendar).
Availability over freshness for reads: serving a mask a few seconds stale beats a 500.

2 · Meet the components

Every box in the diagram, explained like you'd explain it to a new teammate: what it is, why it's there, and what it actually holds.

API Gateway front door

The bouncer with the guest list.

What it is: One managed entry point (nginx / AWS API Gateway / Kong) that every request passes through before touching our code.
Why it's here: So each service behind it doesn't reimplement auth, rate limits and routing. Check the ticket once, at the door — not at every room inside.
What it does: Verifies the JWT, stamps the request with a tenant_id, throttles noisy clients, routes /calendars/… to the right service.

Calendar Service writes

The intake clerk — takes your paperwork, gives you a receipt.

What it is: A small stateless service (this take-home's parser, grown up) that owns uploads and edits.
Why it's here: Writes and reads scale and fail differently. Uploads are rare, chunky and validation-heavy; queries are constant and tiny. Splitting them means a flood of uploads can never slow down "is Alice free at 3?".
What it does: Accepts the CSV, stores the raw file, queues a parse job, returns 202 + uploadId immediately.

Availability Service reads

The calculator — the only box doing actual math.

What it is: MeetingScheduler as a fleet: stateless replicas behind the gateway, autoscaled on traffic.
Why it's here: This is the hot path — 100 queries per upload. It must answer in milliseconds and keep answering even if uploads are on fire.
What it does: Fetches busy masks from Redis, ORs them, scans for slots — the same three lines of BitSet logic this demo runs, just fed from a cache instead of a fresh parse.

Person Registry identity

The company phonebook. Yes, it's a service.

What it is: A tiny service (or a module inside Calendar Service on day one) that owns the person table and hands out canonical personIds.
Why it's here: In the take-home a person is a string — query Alise (typo) and she's silently "free all day". That's a wrong meeting time nobody notices. IDs make typos impossible instead of undetected.
What it does: Resolves names → IDs at upload time, serves GET /people?q=ali autocomplete so the UI can only pick people who exist.

Postgres source of truth

The filing cabinet. If it's not here, it didn't happen.

What it is: The system of record: calendar, person, event tables (DDL below).
Why it's here: Our data is naturally relational (events belong to people belong to calendars), an upload must be all-or-nothing (transactions), and "no duplicate names per calendar" is one UNIQUE constraint. See "why not Mongo/Dynamo" below.
What it holds: Every event ever uploaded, forever. Everything else in this diagram can be lost and rebuilt from here.

Redis cache

The sticky note on the monitor — fast, tiny, disposable.

What it is: An in-memory key-value store sitting between the Availability Service and Postgres.
Why it's here: Answering "is Alice free?" from Postgres means a query per request. Answering from RAM means microseconds — and our hot data is comically small.
What it holds: Exactly one thing: busy masks. Key avail:{personId}:{date} → value = the same 720-bit BitSet this demo builds, ~90 bytes. 100,000 people ≈ 9 MB — the whole company fits in the smallest Redis tier. Lost? Rebuild any mask from Postgres in one query.

Queue (SQS-style) work

The conveyor belt — every job gets done exactly once-ish.

What it is: A simple job queue between "file received" and "file parsed".
Why it's here: Parsing in the request thread means a big file blocks the caller and a crash mid-parse loses everything. Queued jobs retry with backoff; poisoned files land in a dead-letter queue with the line-numbered parse error attached.
What flows through: One message per upload: {uploadId, s3Key, calendarId}. Workers consume, parse, write.

Kafka events · later

The company newspaper — everyone reads their own copy.

What it is: An append-only event log. Not in the day-one design — drawn dashed until it earns its keep.
Why (eventually): A queue message is consumed once, by one worker. When several systems care that a calendar changed — cache invalidation, webhooks, search indexing, analytics — each needs its own copy. That's a log with consumer groups, not a queue.
The trigger: The moment a second consumer wants calendar.updated events, the outbox relay starts publishing to Kafka instead of calling Redis directly. Bonus: replaying the topic rebuilds any corrupted cache.

Object store (S3) raw files

The shoebox of receipts.

What it is: Blob storage for every uploaded file, exactly as it arrived.
Why it's here: When a customer says "your parser ate my calendar", you replay the original bytes. Audit, debugging, reprocessing after a parser fix — all free once the file is kept.

Parser workers async

The kitchen staff — nobody sees them, everything depends on them.

What it is: A pool of processes running this take-home's CsvEventParser, fed by the queue.
What it does: Parse → resolve names to personIds → one Postgres transaction (all rows or none — same fail-fast rule as the demo) → rebuild affected masks in Redis → emit "updated".

3 · The three big technology calls

The questions an interviewer actually asks: why this database, why this cache, why this pipe — and what would make us change the answer.

Why Postgres — and not Mongo or DynamoDB?

The data is a textbook relational shape. Events belong to people, people belong to calendars. Foreign keys and a JOIN express that in one line; in Mongo you'd pick a document nesting and regret it the first time a query cuts across it.
Uploads need transactions. "All 5,000 rows or none" is BEGIN … COMMIT. Doable in Mongo, awkward; in Dynamo, painful beyond 100 items.
Constraints are features. UNIQUE (calendar_id, name) and CHECK (end_min >= start_min) move validation into the database — the same "invalid data cannot exist" philosophy as the Event constructor.
Dynamo's superpower is wasted here. It shines at millions of ops/sec on known key patterns. Our hot reads don't even hit the DB (Redis does), and our write volume is human-scale. We'd pay in flexibility and get nothing back.
Choose boring. Postgres + read replicas carries this to thousands of tenants. The day one table gets too big: partition by calendar_id — clean, because no query ever crosses tenants.

Why Redis — and what exactly is inside it?

One key type, one value type. avail:{personId}:{2026-07-06} → a 720-bit bitmap (Redis SETBIT/GETBIT speak bitmap natively). That's it. No sessions, no objects, no cleverness.
The math: 720 bits ≈ 90 bytes per person-day. 100k people × 365 hot days ≈ a few GB; cache only ±30 days and it's megabytes.
A query never touches Postgres. MGET N masks → OR → scan. The same microsecond BitSet logic as this demo — Station 3 of the Data journey, verbatim.
Cache, not database. Miss → one SQL query rebuilds the mask → SETEX. Redis dies → everything still works, just slower. Never the other way around: Redis is allowed to lie briefly (seconds-stale mask), never to be the only copy.
Invalidation is easy here because writes are chunky: an upload rewrites one calendar → delete that calendar's keys. No distributed-cache horror stories at this shape.

Queue vs Kafka — conveyor belt vs newspaper

A queue is a to-do list: each parse job is picked up by one worker, done, deleted. Retries, backoff, dead-letter queue for poisoned files. This we need on day one.
Kafka is a newspaper: the same "calendar updated" story is read independently by the cache invalidator, the webhook sender, the search indexer — each at its own pace, with replay. This we need the day a second reader shows up, and not before.
Using Kafka as a job queue (or SQS as an event bus) is the classic résumé-driven mistake — the tools aren't interchangeable, they answer different questions.

What stays exactly the same as the take-home

The BitSet mask is untouched — it just moves from a per-request local variable into Redis, keyed per (person, date).
The parser and its five gates run verbatim inside workers — fail-fast with line numbers, now retryable and asynchronous.
The write/read seam was already in the code: CsvEventParser vs MeetingScheduler becomes Calendar Service vs Availability Service. The architecture is the class diagram, blown up.

4 · Data model & API

Core entities

Calendar — id, ownerId, day bounds (07:00–19:00 today; a field, not a constant).
Person — id, calendarId, name. Identity lives here, not in free-text strings.
Event — id, personId, subject, startMin, endMin. Minutes-from-day-start, exactly like the BitSet.

The APIcalendar is the resource, availability is a view on it

POST /calendars                ← CSV body, returns calendarId
GET  /calendars/{id}/people     ← who exists (drives the UI)
GET  /calendars/{id}/availability
     ?people=p1,p2&durationMinutes=60
GET  /calendars/{id}/availability/first?…
GET  /calendars/{id}/free-windows?people=…

Data modelPostgres DDL

-- one row per uploaded calendar
CREATE TABLE calendar (
  id         uuid PRIMARY KEY,
  owner_id   uuid NOT NULL,
  day_start  smallint NOT NULL DEFAULT 420,  -- 07:00 in minutes
  day_end    smallint NOT NULL DEFAULT 1140, -- 19:00
  created_at timestamptz NOT NULL DEFAULT now()
);

CREATE TABLE person (
  id          uuid PRIMARY KEY,
  calendar_id uuid NOT NULL REFERENCES calendar(id),
  name        text NOT NULL,
  UNIQUE (calendar_id, name)
);

CREATE TABLE event (
  id         uuid PRIMARY KEY,
  person_id  uuid NOT NULL REFERENCES person(id),
  subject    text NOT NULL,
  start_min  smallint NOT NULL,  -- minutes from day start
  end_min    smallint NOT NULL,
  CHECK (end_min >= start_min)
);
CREATE INDEX event_by_person ON event(person_id);

The queries that matter

-- upload = one transaction (all-or-nothing,
-- same fail-fast semantics as the parser)
BEGIN;
INSERT INTO calendar …;
INSERT INTO person …;      -- batch
INSERT INTO event …;       -- batch
COMMIT;

-- availability read (cache miss): one round trip
-- fetches every event needed to build the masks
SELECT p.name, e.start_min, e.end_min
FROM person p
LEFT JOIN event e ON e.person_id = p.id
WHERE p.calendar_id = :calId
  AND p.name IN (:people);

-- who exists? (drives the UI + typo detection)
SELECT name FROM person
WHERE calendar_id = :calId ORDER BY name;

The mask math stays in the service, not SQL — ORing bits in Java is microseconds; a generate_series SQL version would be slower and unindexable.
LEFT JOIN keeps people-without-events visible — the "free all day" rule becomes explicit instead of accidental.

5 · The picture

All the components from section 2, wired together. Solid lines are the synchronous request path; the dashed line is the async invalidation that follows every upload.

High-level designclick any box for what it is and why it's there

Click any box in the diagram — this panel explains what it is, why it earned its place, and the trade-off behind it.

Two services, one seam. Writes (parse, validate, store) and reads (mask math) scale differently and fail differently — the same split the code already has: CsvEventParser vs MeetingScheduler.
The calendar becomes a stored resource. Upload once → calendarId → query forever. Today's stateless re-upload-per-query becomes GET with an id.
Cache is an optimization, not a dependency. Miss → rebuild from DB in O(events); Redis down → slower, not down.

6 · Follow a request — the happy paths

Every API, walked start → finish. Green is where the caller gets their answer.

POST/calendars — upload a calendar

The write path. Rare, chunky, validation-heavy — so it's asynchronous.

Clientsends the CSV

→

Gatewaychecks JWT, stamps tenant, rate-limits

→

Calendar Servicestores raw file in S3, enqueues {uploadId, s3Key}

→

202 Acceptedcaller gets uploadId in ~50 ms — before any parsing

Workerpicks the job off the queue

→

Parse + validatethe take-home's five gates, verbatim

→

Person Registrynames → canonical IDs

→

Postgresone transaction: people + events + outbox row

→

Redisrebuild that calendar's masks

→

DoneGET /uploads/{id} flips to ready; webhook fires

If a row is bad: the job lands in the dead-letter queue with the line-numbered parse error; the upload status becomes failed with that exact message. Nothing partial is ever committed.

GET/calendars/{id}/availability?people=…&durationMinutes=60 — the hot path

100× more frequent than uploads. Target: p95 < 100 ms — typically single-digit.

Clientasks "when can these people meet?"

→

Gatewayauth + tenant check

→

Availability ServiceMGET one mask per person from Redis

→

OR + scanthe demo's Station 3, microseconds

→

200 JSON{ availableSlots: […] } — Postgres never touched

Cache miss? One extra hop: SELECT that person's events from a read replica → rebuild the 90-byte mask → SETEX into Redis → continue as above. First query after an upload pays ~10 ms; every one after rides the cache.

GET/availability/first & /free-windows — same road, different last step

Client→ Gateway → Availability Service

→

Same maskssame MGET, same OR

→

Different scanfirst short-circuits on the first hit; free-windows walks the gaps

→

200 JSONone slot, or a list of ranges

GET/people?q=ali — autocomplete

Clienttypes "ali" in the people picker

→

Gatewayauth

→

Person Registryprefix search on the person table (tiny, indexed)

→

200 JSON[{id, name}] — the UI can only submit real IDs, so typos die here

7 · Two deep dives worth having ready

Consistency & the outbox

Upload is transactional: people + events commit together or not at all — matches today's fail-fast parser.
The naïve version — "commit, then delete cache keys" — has a failure window: crash between commit and invalidation leaves stale masks forever.
Fix: transactional outbox. The same DB transaction that writes events also inserts an outbox(calendar_id, 'updated') row. A relay reads the outbox and performs invalidation (and webhook fan-out). Crash-safe: the message exists iff the data does.
Availability answers stay idempotent reads — safe to retry, safe to serve from any replica.

Growth pressure points

Too many events per person (imports with years of history): masks are per-day so query cost is flat, but rebuild cost grows — materialize masks at write time (workers) instead of on cache miss.
Too many people per query ("is anyone in engineering free?", N=2000): ORing 2000 masks is still < 1 ms — the real cost is 2000 cache GETs. Batch with MGET, or maintain group masks updated incrementally on member change.
Too many tenants: partition person/event by calendar_id hash; Redis keys already shard naturally. Nothing crosses tenants, so sharding is embarrassingly clean.
DB write ceiling: events are append-mostly; if a single primary chokes, move ingestion to bulk COPY per upload before reaching for distributed SQL.

The questions this project should be able to answer out loud — asked the way an interviewer would ask them, answered the way I'd answer across the table. Click a question to reveal its answer — try answering first.

If you went over the project right now, what would be your opinion — what would you change?

Honest answer: the core is right and I'd keep it — BitSet masks, fail-fast parsing, the layer seams. But a fresh read finds real things to poke at:

The slot grid is the biggest known trade-off. Candidates step by the meeting duration (MeetingScheduler.java:36), so a 60-minute meeting can only start at 07:00, 08:00, … — exactly matching the README's expected output, which is why I chose it. But a real product wants a sliding scan: a 45-minute meeting should be able to start at 08:30. The fix is small — step by 5–15 minutes, or jump with busy.nextClearBit() — and the README example survives as a special case.
AvailabilityService news up new WorkingDay() per request (line 42). Works, but the 07:00–19:00 day should be injected configuration — the WorkingDay(start, end) constructor already exists and is tested; the service just doesn't use it.
People are strings. Case-sensitive, no identity — alice ≠ Alice, and a typo is silently "free all day" (a logged warning the caller never sees). Fine for the exercise, the first thing to fix on the way to production (see Q6).
Every HTTP query re-parses the CSV. A deliberate choice — stateless server, no storage, no cleanup — and honest for a demo. But it means the API's cost is dominated by parsing, not by the microsecond scheduling math it exists to show off.
Smaller nits: Event could be a Java 17 record (same guarantees, half the code); java.util.logging → SLF4J; scheduler invariants ("first slot == head of all slots" is already tested) would suit property-based testing.

The slot-grid problem, drawn: one 30-minute meeting at the start of the day, and a 60-minute meeting to place.

Verdict: I'd change the slot stepping first — it's the one place where a documented assumption could surprise a user rather than protect them.

Explain the design choices — commons-csv, BitSet, the structure.

Apache Commons CSV, not split(","). The sample data itself contains "Lunch, then a walk" — a quoted comma. split corrupts that row silently; an RFC 4180 parser handles quotes, and the wrapper adds BOM-stripping and header detection because that's what real spreadsheet exports look like. Rule: never hand-roll a parser for a format that has a spec and a library.
BitSet, not interval arithmetic. The day is 720 minutes → 720 bits per person (~90 bytes). Loading an event = set its bits. Overlaps merge for free (setting a bit twice is a no-op). Combining people = one OR each. The free-check is two lines (isFree, lines 87–90). The alternative — sort, merge, subtract intervals — is where off-by-one bugs live. I traded a few bytes for an algorithm with no edge cases left.
The package structure is the SOLID answer. parsing / domain / repository / scheduling / service / controller — each layer has one reason to change. The proof it works: the web API was added later without touching a single core class.
EventParser is an interface with one implementation — normally a smell, justified here because parsing is the system's natural seam: tests inject events directly, and a JSON or ICS parser drops in without the scheduler knowing.
Immutability + constructor validation. An Event with end < start cannot exist. Every layer downstream trusts its inputs instead of re-checking them.
Fail-fast parsing with line numbers. A silently dropped row would produce a silently wrong meeting time — the worst possible failure for a scheduler. So one bad row aborts with "line 3, here's the content, here's what's wrong".
Half-open ranges [start, end). An event ending 09:00 doesn't block a meeting starting 09:00 — how real calendars behave, and it makes back-to-back events compose without fake conflicts.

The engine, drawn: the README's own query — Alice + Jack, 60 minutes — as bits.

Doesn't the BitSet break with conflicting meetings — or when a meeting is added or removed?

Sharp question — this is exactly where the trade-off lives. Two separate problems hide in it: a bit can't count, and a bit can't be safely un-set.

Adding is safe. set() is idempotent — setting a busy minute twice is a no-op. That's why overlapping meetings cost nothing for availability: busy is busy, whether Alice has one meeting at 09:00 or three.
But the merge destroys information. Once two meetings burn into the same bits, the mask can't tell you Alice is double-booked, can't say which meeting owns 09:15, and can't model "the room fits two meetings". Bit = 1 means busy; it doesn't remember why.
Removing is the real trap. Say Alice has a standup 09:00–10:00 and a review 09:30–10:30. Delete the standup and naively clear(120, 180) — you just zeroed 09:30–10:00, minutes the review still occupies. The scheduler now happily books over the review. Silent wrong answer, the worst kind.

Why this project doesn't have the bug: the mask is never treated as the data. Look at EventCalendar — the constructor builds all masks from the full event list, and no method mutates them afterwards. Events are the source of truth; the BitSet is a throw-away index derived from them, rebuilt per request. There is no delete-a-bit code path to get wrong.

And in production, three tiers depending on what you actually need:

Just add/move/delete events? Keep the same rule: mutate the event table, then rebuild that person-day's mask from the remaining events. It's 720 bits and one indexed query — microseconds. Never surgically clear bits; recompute them. (Upgrades tab, card 13.)
Need to detect conflicts, or count capacity? Swap the bit for a small counter: int[720] per person-day. Add = increment, delete = decrement, busy = count > 0, double-booked = count > 1, "room fits 2" = count < capacity. Same build-then-scan pipeline, one type swap hidden behind EventCalendar. (Upgrades tab, card 12.)
Need "which meeting is at 09:15?" That's an interval question, not a bitmap question — keep the events sorted by start time (or an interval tree at scale) next to the mask, and let each structure answer what it's good at.

The delete trap, drawn: Alice's overlapping standup and review — and what each strategy leaves behind when the standup is deleted.

Verdict: the BitSet is the right index for "when is everyone free?" and the wrong system of record. Keep it derived — rebuilt, never mutated — and conflicts and deletions can't hurt it. This codebase already follows that rule.

Explain the bonus features.

The brief said "feel free to go above and beyond". What was added, in order of usefulness:

Two extra scheduling queries on the same engine: findFirstAvailableSlot (short-circuits, returns Optional — tested to always equal the head of the full list) and getFreeWindows (no duration; walks the gaps with nextClearBit/nextSetBit).
A REST API (Spring Boot, 4 endpoints) wrapped around the untouched core — multipart CSV upload, JSON out, domain errors mapped to 400 { error } with the exact parse message.
This site — a live client for the API that doubles as the project's documentation: the day board, the line-by-line data journey, memory-level internals, edge cases demonstrated against your own file. The browser computes nothing; every answer comes from the Java API.
Deployability: a Dockerfile and fly.toml — the demo runs as a container, not just on a laptop.

The point: every bonus reuses MeetingScheduler untouched — the "above and beyond" is proof the core abstraction was right, not a second system bolted on.

Which edge cases did you check, and how did you answer them?

Each answer follows from one of the key assumptions (documented in SOLUTION.md, pinned by tests, demonstrable live on the Edge cases tab):

Edge case	Ruling & the assumption behind it	Where
Quoted comma in subject	Parsed as one field — CSV has a spec (RFC 4180), follow it.	CsvEventParserTest
Header row / BOM / blank lines	Recognised and skipped — real exports have them; they're noise, not errors.	with-header.csv
Malformed row (columns, bad time, end<start)	Fail fast with line number + raw line — a dropped row means a wrong answer, and wrong beats loud never.	invalid-row.csv
Event outside 07:00–19:00	Clipped to the day (06:00–08:00 blocks only 07:00–08:00); fully outside → ignored. The day bounds are law.	MeetingSchedulerTest
Overlapping events	Merge silently — busy minutes are a set, not a sum. (BitSet gives this for free.)	overlapping.csv
Back-to-back events (end == next start)	No conflict — half-open `[start, end)` ranges.	meetingEndingExactlyAt…
Zero-length event (start == end)	Legal input, blocks nothing.	zeroLengthEvent…
Person not in the file	Free all day + logged warning — "all persons available" is satisfiable by someone with no events. (The weakest ruling; Q1 and Q5 both flag it.)	unknownPersonIs…
Duplicate / padded names ("Alice ")	Trimmed and deduped to one person; case stays significant.	duplicateAndWhitespace…
Bad duration (0, negative, 90 s, > 12 h)	Rejected with `IllegalArgumentException` → HTTP 400 — garbage questions get errors, not guesses.	validateDuration tests
Fully booked day	Empty list / `Optional.empty()` — "no slots" is an answer, not an error.	fully-booked.csv
Meeting exactly the whole day	One slot at 07:00 — the scan condition is `<=`, so ending exactly at 19:00 counts.	durationExactlyEqual…
The README example itself	Asserted verbatim, unit + end-to-end — the spec is a pinned regression test.	readme.csv

It works, people are happy — now expose it as a service where people connect or upload whole calendars. How do you design that?

The one-paragraph answer: make the calendar a stored resource instead of a request parameter. Upload once → get a calendarId → query it forever. Postgres holds the truth, Redis holds the 720-bit masks this demo already computes, and uploads become asynchronous jobs so a big file can never block a query.

The shape of it:

UPLOAD (rare, async) Client → Gateway (auth) → Calendar Service → S3 (raw file) + queue └→ 202 + uploadId Worker → parse (the take-home's 5 gates) → Person Registry (names → IDs) → Postgres (one transaction) → Redis (rebuild masks) → webhook "ready" QUERY (constant, fast) Client → Gateway → Availability Service → Redis MGET masks → OR + scan → 200 JSON (miss: 1 SQL read → rebuild → cache)

"Connect their whole calendar" (Google/Outlook sync) is the same pipeline with a different front end: an OAuth connector pulls events and feeds the identical ingestion path — the parser is just one more event source behind the EventParser seam.
Real dates: the unit of computation becomes (person, date) → mask. Events store real timestamps; ingestion slices them into per-date masks in the calendar's timezone. A week query = 7 masks per person. The BitSet doesn't blink.
What deliberately doesn't change: the mask math, the fail-fast validation, the write/read seam — the architecture is the take-home's class structure, scaled out.

The full HLD — every component, why Postgres and not Mongo/Dynamo, what exactly lives in Redis, queue vs Kafka, and the happy path of every API — lives on the Architecture tab, one click left of here.

Calendar file

People

Meeting duration

Actions

Results

Live request log

1 · Pin the requirements

Functional — what it must do

Non-functional — how well it must do it

2 · Meet the components

API Gateway front door

Calendar Service writes

Availability Service reads

Person Registry identity

Postgres source of truth

Redis cache

Queue (SQS-style) work

Kafka events · later

Object store (S3) raw files

Parser workers async

3 · The three big technology calls

Why Postgres — and not Mongo or DynamoDB?

Why Redis — and what exactly is inside it?

Queue vs Kafka — conveyor belt vs newspaper

What stays exactly the same as the take-home

4 · Data model & API

Core entities

The APIcalendar is the resource, availability is a view on it

Data modelPostgres DDL

The queries that matter

5 · The picture

High-level designclick any box for what it is and why it's there

6 · Follow a request — the happy paths

POST/calendars — upload a calendar

GET/calendars/{id}/availability?people=…&durationMinutes=60 — the hot path

GET/availability/first & /free-windows — same road, different last step

GET/people?q=ali — autocomplete

7 · Two deep dives worth having ready

Consistency & the outbox

Growth pressure points

Target high-level designclick any box for what it is and why it's there

See an upgrade in action

01Person identity — stop trusting free-text names

02Real storage — a database instead of a re-parsed file

03Uploads that propagate — ingestion pipeline

04Scaling reads — masks as a cache-native structure

05Security & multi-tenancy

06API hardening

07Time correctness — beyond one hardcoded day

08Observability & operations

09Failure modes, rehearsed

10What if the day isn't 07:00–19:00 — or is the whole day?

11What if it's a real calendar — dates, weeks, months?

12What if colliding events matter — instead of silently merging?

13What if events can be added, moved, deleted?

The big picturewho calls whom

ParsingCsvEventParser, EventParser, CalendarParseException

DomainEvent, WorkingDay, TimeRange

The core data structureEventCalendar

Answering queriesMeetingScheduler

Web layerAvailabilityController, AvailabilityService, ApiExceptionHandler