What does a 90-day engagement look like?
Three acts. Each act has me doing one kind of work and your team doing another.
By the end, your team is running the agent and I'm gone.
Act 01
Anchor and Ground
Weeks 1 to 3
What I'm doing
Scoping the workflow in writing, establishing the baseline (what the workflow costs you today), and building the eval harness against thirty to fifty real historical examples. By the end of week three, the harness runs and produces a scorecard.
What your team is doing
Surfacing the historical examples, validating the workflow spec, and learning to run the harness. The eval harness ships in week three and stays with you. Your team can score the agent before there is an agent.
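For readers who want a concrete picture, here is a minimal sketch of a harness of this shape. It is illustrative only, not the tooling used in the engagement. It assumes each historical example is an input paired with an expected output and that scoring is exact match; the names (Example, run_harness, placeholder_agent) and the two sample records are hypothetical.

```python
from dataclasses import dataclass
from typing import Callable


@dataclass(frozen=True)
class Example:
    """One historical case: what came in, and what the right answer was."""
    example_id: str
    input_text: str
    expected: str


def run_harness(examples: list[Example], agent: Callable[[str], str]) -> dict:
    """Run every example through `agent` and return a scorecard.

    Scoring here is exact match after trimming whitespace (an assumption;
    a real workflow may need a looser or domain-specific comparison).
    """
    rows = []
    for ex in examples:
        actual = agent(ex.input_text)
        rows.append({
            "id": ex.example_id,
            "passed": actual.strip() == ex.expected.strip(),
        })
    passed = sum(1 for row in rows if row["passed"])
    return {
        "total": len(rows),
        "passed": passed,
        "pass_rate": passed / len(rows) if rows else 0.0,
        "rows": rows,
    }


def placeholder_agent(input_text: str) -> str:
    """Stands in for the agent before one exists; it always answers nothing."""
    return ""


# Made-up records for illustration; the real set is 30 to 50 historical examples.
EXAMPLES = [
    Example("ex-001", "Invoice total: 120 USD", "120"),
    Example("ex-002", "Invoice total: 75 USD", "75"),
]

scorecard = run_harness(EXAMPLES, placeholder_agent)
print(f"{scorecard['passed']}/{scorecard['total']} passed")
```

Because the harness only needs something callable, the same scorecard that measures the placeholder in week three measures the real agent later, which is what makes the before and after comparable.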
Act 02
Engineer
Weeks 4 to 9
What I'm doing
Building the agent, iterating against the eval, keeping the architecture as small as the eval will tolerate. Traces are visible for every run. Weekly progress is real, not performative: each week's build is scored against the eval.
What your team is doing
Reviewing weekly progress, flagging edge cases as they surface, and beginning to read traces. The engineering counterpart on your team is in the loop on architectural decisions, not handed a finished system at the end.
Act 03
iNstrument and Transfer
Weeks 10 to 13
What I'm doing
Shipping the dashboard, tuning alert rules against real production samples, writing the runbook, and walking your team through the operating protocols.
What your team is doing
Taking ownership. By week thirteen, your engineering counterpart can read traces, extend the eval set with new examples, and respond to alerts without me. If they can't, the engagement isn't done.
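As one illustration of what tuning an alert rule against production samples can mean, the sketch below picks a threshold from observed values so that the rule would have fired on no more than a chosen fraction of them. This is an assumed approach for illustration, not a description of the dashboard or rules delivered in the engagement; the function names, the metric (per-run latency in seconds), and the sample values are all hypothetical.

```python
import math


def tune_threshold(samples: list[float], target_alert_rate: float) -> float:
    """Return a threshold that at most `target_alert_rate` of samples exceed.

    `samples` are observed metric values from normal production runs
    (hypothetically, per-run latency in seconds).
    """
    if not samples:
        raise ValueError("need at least one production sample")
    if not 0.0 <= target_alert_rate < 1.0:
        raise ValueError("target_alert_rate must be in [0, 1)")
    ordered = sorted(samples)
    # Number of samples that are allowed to sit above the threshold.
    allowed = math.floor(target_alert_rate * len(ordered))
    return ordered[len(ordered) - 1 - allowed]


def should_alert(value: float, threshold: float) -> bool:
    """The alert rule itself: fire when a run exceeds the tuned threshold."""
    return value > threshold


# Made-up latency samples, for illustration only.
latencies = [1.0, 2.0, 3.0, 4.0, 5.0, 6.0, 7.0, 8.0, 9.0, 10.0]
threshold = tune_threshold(latencies, target_alert_rate=0.2)
fired = sum(should_alert(v, threshold) for v in latencies)
print(f"threshold={threshold}, would have fired on {fired} of {len(latencies)} runs")
```

Deriving the threshold from real samples rather than a guess keeps the alert volume at a level the team can actually respond to, and rerunning the function on fresh samples is a small enough task to belong in the runbook.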