CASE STUDY · FREECITY RAG

Market research that answers back.

Retrieval-augmented generation · Build-to-Rent · Sydney

Freecity — a Sydney property developer with a growing Build-to-Rent pipeline — was sitting on years of market reports, rent comps, zoning analyses, demographic studies, and post-mortems. All of it trapped in PDFs and SharePoint folders. We designed and built a retrieval-augmented generation system that ingests the corpus, grounds every answer in the source documents, and lets the acquisitions team interrogate it through a chat interface with agent tools for quantitative analysis.

Freecity RAG chat interface — a query about median 2BR rents within 3km of Parramatta station, with a grounded answer, inline source citations, and an expandable tool-trace panel showing the rent-comp pull

APPROACH

Market data, trapped in PDFs.

WK 01 INGEST THE CORPUS MONSharePoint sit-in, 3 hrs TUEcorpus inventory · ~4,200 docs WEDPDF extraction tests · table-heavy THUchunking pass 1 · 512 tokens, 64 overlap FRIfirst retrievals · failure log started

WK 02 GROUND THE ANSWERS MONeval set, 80 real analyst questions TUEembedding bake-off · Voyage-3 won WEDreranker in front · Cohere v3 THUpage-level citation provenance FRIanalyst review · 32/80 still weak

WK 03+ REASON OVER THE DATA ▸agent tools · rent-comp, demo cross-tab ▸site-feasibility + memo-draft skills ▸structured data side-channel for SQL ▸tool traces visible in the UI

ONGOING GUARD THE OUTPUT ▸red-team set · 40 adversarial prompts ▸no-source → no-answer policy ▸full query + retrieval audit log

01 · INGEST

We started with their mess, not a clean corpus.

Around 4,200 documents spread across SharePoint, email exports, and analyst laptops. Table-heavy PDFs needed a layout-aware extractor before chunking. 512-token chunks with 64-token overlap, parent-document metadata preserved so every retrieval could walk back to its page.

Expanded tool-trace view — a site-feasibility tool call on a Parramatta parcel, with retrieved chunks, reranker scores, and page-level citations from a 2025 demographic report

OUTCOMES

The morning hunt
disappeared.

The system rolled out to the acquisitions team first, then BD and investment. What changed, in order of how the team talked about it:

ANALYST HOURS RECOVERED

Roughly 11 hours per analyst per week returned to higher-value work — the morning "hunt for the number" disappeared. Queries that used to mean trawling three shared drives now resolve in one chat turn.

RETRIEVAL PRECISION

92% on the 80-question eval set at top-5, up from 58% on the first-pass pipeline. Adversarial red-team pass rate locked above 95% before each release.

QUERY LATENCY

1.8 seconds p50 from question to first citation on the screen, tool calls included. Fast enough that analysts query mid-meeting instead of taking notes and circling back later.

SOURCE-CITATION RATE

100% of answers carry page-level source citations — every claim clickable back to the PDF it came from. Ungrounded responses are refused rather than hallucinated.

AUDIT & PROVENANCE

Full query, retrieval, and tool-call logs with user, timestamp, and document provenance. The answer to "where did that figure come from" is a query, not an archaeology dig.

Agent-tool panel — a rent-comp pull on 2BR units within 3km of a nominated station, returning a sorted table with sample sizes, median rent, and links back to the source listings

“It reads the research so we can actually use it.”

— Acquisitions lead, Freecity

Started as a pre-engagement website audit; escalated into an AI system build as the data problem surfaced.

Freecity is a Sydney property developer with an expanding Build-to-Rent book. We began with a website audit, watched the acquisitions team work, and built a retrieval-augmented generation system end-to-end — corpus ingestion, grounding, agent tools, guardrails, and the chat UI — and continue to ship improvements under a care retainer.

ONGOING

Living corpus.
Living system.

The RAG system is a living artifact — the corpus grows each week as new market reports, zoning updates, and project post-mortems land. We ship retrieval and prompt improvements iteratively, expand the agent-tool set in response to analyst asks, and hold the eval and red-team sets as the quality bar. The version live today is meaningfully different from the version six months ago, and it will be different again in six.