OfferWise
About Methodology Pricing Sign in
Methodology

How we turn 50 pages of paperwork into an offer strategy.

OfferWise is built on a proprietary ML pipeline trained on real California disclosure findings. Here's what's in the corpus, how the models work, and what they can and can't do.

01 — The corpus

What we trained on.

Every prediction OfferWise makes leans on a labeled training corpus. We didn't scrape a generic real estate dataset — we built ours from the ground up, focused on the California buyer.

257,804
Labeled findings
1,324
Contradiction pairs
13
Active municipal portals
40
Finding-category baskets

The corpus combines three data sources:

We do not buy or scrape inspection reports from any third-party provider. Findings in the corpus come from publicly available documents, code-enforcement records, and user-uploaded reports analyzed with the user's permission.

02 — The models

What runs on every analysis.

An OfferWise analysis is not a single model. It's a pipeline of specialized models, each doing one job well. The result is then synthesized into the buyer report.

Each model is trained separately and evaluated against held-out portions of the corpus. We don't ask a single language model to do everything — and we don't trust a single language model with any of it.

03 — What we measure

Honest accuracy numbers.

These are the numbers from our most recent training cycle, measured on held-out data:

85.8%
Category accuracy
0.82
Cost R² (MAE $1,915)
97.1%
Contradiction precision
65.4%
Severity accuracy

The numbers we are most proud of, and the numbers we are honest about:

We retrain the corpus on a rolling schedule. As more properties are analyzed, the training data grows and accuracy compounds. The numbers above reflect the most recent training cycle.

04 — What we don't do

Honest limitations.

OfferWise is a strong tool for what it does. We try to be clear about what it doesn't do.

Run an analysis on a real property.
Free to try. No credit card required.
Analyze a Property →