yuanzonghao 9cedfa66e4 feat: prefetch, vision split, provider adapter, UI polish

Engine
- Split /api/vision out from /api/interact so client can drive
  prefetch + cache lookup independently of click interpretation
- Image client switched to chat-completions+modalities API (OpenRouter/
  provider style), supporting markdown image URL responses
- annotateClick now resizes to 768w before composite to keep vision
  payloads small and avoid CDN timeouts
- Prompts updated to mention "JSON" in user messages (required by
  Gemini's strict JSON mode)
- Shared fetchWithRetry helper: 2 retries for chat/image, 0 for vision
  (with 60s hard timeout)

Client
- Parallel prefetch of all three choice branches on each new frame
- Effect deliberately excludes phase from deps so user-click doesn't
  abort in-flight prefetches
- Cache hit/miss/free-form fallback handled in handleClick
- PlayCanvas reads img naturalWidth/Height and adapts container to
  whatever aspect AI returns (no more cropped third choice)
- max-width raised to 560px, max-height calc(100dvh - 200px)

Misc
- README env-path corrected to apps/web/.env.local
- users.md: BGM/TTS idea note
- .env.example moved into apps/web alongside next config

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

2026-05-12 19:38:03 +08:00

Name	Last commit	Date
apps/web	feat: prefetch, vision split, provider adapter, UI polish	2026-05-12 19:38:03 +08:00
packages	feat: prefetch, vision split, provider adapter, UI polish	2026-05-12 19:38:03 +08:00
.gitignore	fix(web): tame Next.js 16 dev server CPU runaway	2026-05-10 10:12:54 +08:00
LICENSE	Initial commit: AI-driven visual novel scaffold	2026-05-09 13:29:58 +08:00
package.json	Initial commit: AI-driven visual novel scaffold	2026-05-09 13:29:58 +08:00
pnpm-lock.yaml	fix(web): tame Next.js 16 dev server CPU runaway	2026-05-10 10:12:54 +08:00
pnpm-workspace.yaml	Initial commit: AI-driven visual novel scaffold	2026-05-09 13:29:58 +08:00
README.md	feat: prefetch, vision split, provider adapter, UI polish	2026-05-12 19:38:03 +08:00
tsconfig.base.json	Initial commit: AI-driven visual novel scaffold	2026-05-09 13:29:58 +08:00
vercel.json	feat: prefetch, vision split, provider adapter, UI polish	2026-05-12 19:38:03 +08:00

README.md

Dada

An AI-driven visual novel where every frame — scenes, dialogue, choices — is rendered by an AI, one frame at a time. You click. It paints. The story unfolds.

Open source, MIT.

How it works

Each turn is three model calls:

[user clicks somewhere on the image]
        │
        ▼
1. Vision model    interprets the click against the visible UI
        │
        ▼
2. Text LLM        writes the next frame (narration, dialogue, choices)
        │
        ▼
3. Image model     renders the entire next UI screen — scene, dialogue,
                   buttons, all of it — as one painted frame
        │
        ▼
[new image is shown; repeat]

There is no traditional UI. There is only the image. The AI chooses the layout, the colors, the typography, the buttons. Pick "stick figure on grid paper" as your style and you'll get hand-drawn UI. Pick "cyberpunk noir" and you'll get neon HUDs. Whatever fits the world.
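
As a rough illustration of the three-call turn described above, the following TypeScript sketch chains the stages in order. The `TurnStages` interface, the `Click` and `Frame` shapes, and `runTurn` are hypothetical names chosen for this example; the actual exports of packages/engine may look different.

```typescript
// Hypothetical shapes for illustration; the real types in packages/types may differ.
interface Click { x: number; y: number }
interface Frame { narration: string; dialogue: string[]; choices: string[] }

// Each stage is injected, so the sketch does not depend on any provider SDK.
interface TurnStages {
  interpretClick(imageUrl: string, click: Click): Promise<string>;
  writeFrame(history: Frame[], intent: string): Promise<Frame>;
  renderFrame(frame: Frame, style: string): Promise<string>;
}

async function runTurn(
  stages: TurnStages,
  currentImageUrl: string,
  click: Click,
  history: Frame[],
  style: string,
): Promise<{ frame: Frame; imageUrl: string }> {
  // 1. Vision model: what did the user mean by clicking here?
  const intent = await stages.interpretClick(currentImageUrl, click);
  // 2. Text LLM: write the next frame given the story so far and the intent.
  const frame = await stages.writeFrame(history, intent);
  // 3. Image model: paint the whole next screen in the chosen style.
  const imageUrl = await stages.renderFrame(frame, style);
  return { frame, imageUrl };
}
```

Because the stages run strictly in sequence, the latency of a turn is the sum of the three calls.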

One-click deploy

After deploying, set the nine environment variables (see below) in your Vercel project. That's it.

Environment variables

Three providers, all independently configurable. Any OpenAI-compatible chat / image endpoint works (OpenAI, Anthropic via OpenAI-compat proxy, Gemini, OpenRouter, DeepSeek, local Ollama, …).

Provider	Variables	Recommended
Text · story director	`TEXT_BASE_URL` `TEXT_API_KEY` `TEXT_MODEL`	`claude-opus-4-7` via Anthropic
Image · UI renderer	`IMAGE_BASE_URL` `IMAGE_API_KEY` `IMAGE_MODEL`	`gpt-image-2` via OpenAI
Vision · click reader	`VISION_BASE_URL` `VISION_API_KEY` `VISION_MODEL`	`gemini-3-flash` via Google

See apps/web/.env.example for the exact shape.
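
One way to catch a missing variable early is to validate all nine at startup. The sketch below is an assumption about how such a check could look, not the project's actual loader; `loadProviderConfigs` and `ProviderConfig` are hypothetical names, while the variable names come from the table above.

```typescript
type ProviderName = "TEXT" | "IMAGE" | "VISION";

interface ProviderConfig {
  baseUrl: string;
  apiKey: string;
  model: string;
}

// Reads <PROVIDER>_BASE_URL, <PROVIDER>_API_KEY and <PROVIDER>_MODEL for each
// provider and reports every missing variable in a single error.
function loadProviderConfigs(
  env: Record<string, string | undefined>,
): Record<ProviderName, ProviderConfig> {
  const providers: ProviderName[] = ["TEXT", "IMAGE", "VISION"];
  const missing: string[] = [];

  const read = (key: string): string => {
    const value = env[key];
    if (!value) missing.push(key);
    return value ?? "";
  };

  const configs = {} as Record<ProviderName, ProviderConfig>;
  for (const provider of providers) {
    configs[provider] = {
      baseUrl: read(`${provider}_BASE_URL`),
      apiKey: read(`${provider}_API_KEY`),
      model: read(`${provider}_MODEL`),
    };
  }

  if (missing.length > 0) {
    throw new Error(`Missing environment variables: ${missing.join(", ")}`);
  }
  return configs;
}
```

In a Node process this would typically be called once as `loadProviderConfigs(process.env)`.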

Local development

Requires Node 20+ and pnpm 9+.

pnpm install
cp apps/web/.env.example apps/web/.env.local
# fill in the nine env vars
pnpm dev
# open http://localhost:3000

Project layout

dada/
├── apps/web/              Next.js 16 app — pages + API routes
└── packages/
    ├── types/             shared TypeScript types
    ├── ai-client/         unified OpenAI-compatible clients
    └── engine/            three-stage AI orchestration (open core)

packages/engine is the open core — pure TS, no Next.js or browser dependency. Import it directly to build your own visual-novel front-end (Tauri, Electron, CLI, anywhere).

Cost & limits

Each turn costs roughly $0.15–0.25 in API fees with the recommended model trio. A 30-turn session therefore runs about $4.50–7.50. There is no rate limiting or auth out of the box, so if you make your deployment public, your bill will reflect that. Add limits before sharing widely.
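
As a starting point for adding limits, here is a minimal fixed-window limiter keyed by client (for example, an IP address). It is a sketch under simple assumptions: `FixedWindowLimiter` is a hypothetical name, the state lives in memory, and on a serverless platform each instance would keep its own counters, so a shared store would be needed for a strict global limit.

```typescript
// Allows at most `maxPerWindow` requests per key in each `windowMs` window.
class FixedWindowLimiter {
  private readonly maxPerWindow: number;
  private readonly windowMs: number;
  private readonly hits = new Map<string, { windowStart: number; count: number }>();

  constructor(maxPerWindow: number, windowMs: number) {
    this.maxPerWindow = maxPerWindow;
    this.windowMs = windowMs;
  }

  // `now` is injectable so the behavior can be tested without real time passing.
  allow(key: string, now: number = Date.now()): boolean {
    const entry = this.hits.get(key);
    if (!entry || now - entry.windowStart >= this.windowMs) {
      // First request from this key, or the previous window has expired.
      this.hits.set(key, { windowStart: now, count: 1 });
      return true;
    }
    if (entry.count >= this.maxPerWindow) {
      return false;
    }
    entry.count += 1;
    return true;
  }
}
```

An API route could call `allow(clientKey)` before starting a turn and return HTTP 429 when it yields `false`.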

License

MIT.
