05. Assignment 15 — Capstone Project¶
Week 15. Build the first shippable version of your capstone. Use this with 01_weekly_plan.md, 02_explainer.md, and 03_study_material.md.
Goal¶
Ship an end-to-end AI system that a senior engineer can review seriously. It does not need final polish yet. It does need clear scope, working integration, first-pass evals, and a credible deployment story.
Required deliverables¶
1. Product spec¶
A short document covering: - user persona, - problem statement, - success metrics, - non-goals, - failure risks, - what the demo will show.
2. Architecture pack¶
- Component diagram.
- Data flow diagram.
- API / interface contracts.
- One paragraph on failure isolation.
- Build-vs-buy decisions.
3. Working MVP¶
At least one happy-path flow must work end-to-end. A stranger should be able to see the value in two minutes.
4. Evaluation pack¶
- 10-30 gold queries or scenarios.
- Pass/fail or scored rubric.
- One system-level latency budget.
- One cost tracking sheet or log.
- One failure-mode list with mitigations.
5. Packaging¶
- Reproducible run instructions.
- Container or equivalent environment definition.
- README skeleton with architecture, evals, and future work.
- Rough demo recording or script.
Suggested repository structure¶
capstone-project/
├── README.md
├── app/
├── prompts/
├── evals/
│ ├── gold_set.jsonl
│ └── run_eval.py
├── docs/
│ ├── architecture.md
│ ├── latency_budget.md
│ └── failure_modes.md
├── Dockerfile
└── Makefile or run.sh
Milestone checklist¶
- [ ] Idea locked and scoped.
- [ ] Architecture diagram written.
- [ ] MVP works end-to-end.
- [ ] System eval suite exists.
- [ ] Latency and cost numbers captured.
- [ ] README skeleton written.
- [ ] Rough demo recorded.
- [ ] External review list prepared for Module 16.
Rubric¶
| Category | Weight | What strong looks like |
|---|---|---|
| Scope judgment | 20% | Narrow problem, clear success definition |
| System design | 20% | Good component boundaries and contracts |
| Integration quality | 20% | Happy path works reliably, failures considered |
| Evaluation discipline | 20% | System-level metrics, not just vibes |
| Packaging and communication | 20% | Demo, README, and diagrams are reviewer-friendly |
Non-negotiables¶
- Do not hide behind slides without a working path.
- Do not claim production readiness without instrumentation.
- Do not use multi-agent complexity unless it earns its keep.
- Do not skip the eval suite.
- Do not let scope creep steal demo quality.
Demo script template¶
- State the user and problem in one sentence.
- Show the input.
- Show system steps quickly.
- Show the output and evidence.
- Show one metric for quality.
- Show one metric for latency or cost.
- End with the next production improvement.
External review preparation¶
Prepare this message now, even if you send it next week:
Hi [Name] — I have built the first version of my AI engineering capstone. It tackles [problem] for [user]. I would value 15 minutes of feedback on the architecture and eval approach. Repo and short demo are here: [link].
Completion gate¶
Before moving to 06_revision.md, confirm: - [ ] I can explain why I chose this architecture. - [ ] I know the slowest step in the system. - [ ] I know the most expensive step in the system. - [ ] I know the top three failure modes. - [ ] I know what Module 16 must polish next.