infiplot-web

T

Zonghao Yuan df3b1d3307 Merge pull request #106 from zonghaoyuan/feat/freeform-always-new-scene

feat(play): freeform input always generates new scene + enhanced insert-beat

2026-06-24 19:40:04 +08:00

.github

chore(ci): add noreply@anthropic.com to CLA allowlist

2026-06-24 19:08:24 +08:00

app

Merge pull request #106 from zonghaoyuan/feat/freeform-always-new-scene

2026-06-24 19:40:04 +08:00

components

style(play): increase dialogue and choice font sizes by 3px

2026-06-24 18:59:21 +08:00

docs

docs(readme): update architecture, add OpenDeploy sponsor, fix LINUX DO link

2026-06-24 18:53:26 +08:00

drizzle

feat(engine): merge cloudflare-migration — paradigm D engine, BYOK proxy, story persistence (#95 )

2026-06-18 18:05:38 +08:00

lib

Merge pull request #106 from zonghaoyuan/feat/freeform-always-new-scene

2026-06-24 19:40:04 +08:00

public

refactor(presets): split mixed narration+dialogue beats into separate beats

2026-06-24 19:06:43 +08:00

scripts

refactor(presets): split mixed narration+dialogue beats into separate beats

2026-06-24 19:06:43 +08:00

.dev.vars.example

chore: remove stale GALLERY_SECRET from .dev.vars.example

2026-06-18 23:50:35 +08:00

.dockerignore

feat(deploy): add Docker Compose self-hosted deployment option

2026-06-06 21:57:57 +08:00

.env.example

refactor(share): remove GALLERY_SECRET, use plaintext + SHA-256 integrity for .infiplot files

2026-06-18 21:41:56 +08:00

.gitignore

chore: add piolium/ to .gitignore

2026-06-15 17:14:58 +08:00

.pr_agent.toml

chore(ci): disable PR Agent auto-describe and AI title

2026-06-07 22:27:54 +08:00

AGENTS.md

refactor(share): remove GALLERY_SECRET, use plaintext + SHA-256 integrity for .infiplot files

2026-06-18 21:41:56 +08:00

best_practices.md

chore(ci): tune PR Agent config

2026-06-07 15:29:34 +08:00

CLA.md

feat(repo): add CLA mechanism for external contributors

2026-06-17 12:23:46 +08:00

CLA.zh.md

feat(repo): add CLA mechanism for external contributors

2026-06-17 12:23:46 +08:00

CLAUDE.md

fix(play): fix CLAUDE.md typo and dialogue history memo anti-pattern

2026-06-06 21:39:24 +08:00

cloudflare-env.d.ts

feat(engine): merge cloudflare-migration — paradigm D engine, BYOK proxy, story persistence (#95 )

2026-06-18 18:05:38 +08:00

CONTRIBUTING.md

feat(repo): add CLA mechanism for external contributors

2026-06-17 12:23:46 +08:00

docker-compose.yml

feat(deploy): add Docker Compose self-hosted deployment option

2026-06-06 21:57:57 +08:00

Dockerfile

feat(deploy): add Docker Compose self-hosted deployment option

2026-06-06 21:57:57 +08:00

drizzle.config.ts

feat(engine): merge cloudflare-migration — paradigm D engine, BYOK proxy, story persistence (#95 )

2026-06-18 18:05:38 +08:00

LICENSE

docs: streamline 3 READMEs and fix EN language switcher (#6 )

2026-06-02 15:33:08 +08:00

middleware.ts

fix(i18n): overhaul i18n with [locale] routing, SSR translations, and hreflang SEO

2026-06-18 23:16:17 +08:00

next-env.d.ts

chore: revert auto-generated next-env.d.ts change

2026-06-19 01:36:15 +08:00

next.config.ts

feat(deploy): add Docker Compose self-hosted deployment option

2026-06-06 21:57:57 +08:00

open-next.config.ts

feat(engine): merge cloudflare-migration — paradigm D engine, BYOK proxy, story persistence (#95 )

2026-06-18 18:05:38 +08:00

package.json

perf(web): self-host Font Awesome instead of cdnjs CDN

2026-06-19 01:33:40 +08:00

pnpm-lock.yaml

perf(web): self-host Font Awesome instead of cdnjs CDN

2026-06-19 01:33:40 +08:00

postcss.config.mjs

refactor: flatten monorepo to single web package (#12 )

2026-06-03 00:55:45 +08:00

README.en.md

docs(readme): update architecture, add OpenDeploy sponsor, fix LINUX DO link

2026-06-24 18:53:26 +08:00

README.ja.md

docs(readme): update architecture, add OpenDeploy sponsor, fix LINUX DO link

2026-06-24 18:53:26 +08:00

README.md

docs(readme): update architecture, add OpenDeploy sponsor, fix LINUX DO link

2026-06-24 18:53:26 +08:00

tailwind.config.ts

refactor: flatten monorepo to single web package (#12 )

2026-06-03 00:55:45 +08:00

tsconfig.json

feat(engine): merge cloudflare-migration — paradigm D engine, BYOK proxy, story persistence (#95 )

2026-06-18 18:05:38 +08:00

vercel.json

feat(engine): merge cloudflare-migration — paradigm D engine, BYOK proxy, story persistence (#95 )

2026-06-18 18:05:38 +08:00

wrangler.jsonc

refactor(share): remove GALLERY_SECRET, use plaintext + SHA-256 integrity for .infiplot files

2026-06-18 21:41:56 +08:00

README.en.md

An interactive story game, generated in real time for you

简体中文 · English · 日本語

⚡ Overview

InfiPlot is an interactive story game with content generated by AI in real time. There are no pre-written plots and no pre-made characters — everything is generated on demand, tailored to you.

In one line: what we're building is an AI-generated, real-time take on Love Is All Around (《完蛋！我被美女包围了！》).

Whether you're a six-year-old, a twenty-something, thirty-five, or sixty, there's a fantasy here that belongs to you and you alone:

Learn magic in the world of Harry Potter; become the one everyone at school adores and confesses to; publish paper after paper in top journals and conferences with grant money to spare; step into Empresses in the Palace and live out the court intrigue; or return to your younger self and make a different choice about something you regret…

🌐 Live Demo

Free to play, no setup required: infiplot.com

Deploy

InfiPlot offers multiple deployment options. For personal use, we recommend the one-click Vercel deploy; to self-host on your own server or local machine, use Docker.

OpenDeploy / Vercel / Cloudflare (one-click)

Cloudflare deployment requires the Workers Paid Plan because the scene pipeline needs longer CPU time. OpenDeploy lets your AI agent handle the deployment for you.

After deploy, fill in the environment variables — see the Configuration guide below. The repo root is the app itself: Vercel needs no special root directory; on Cloudflare, just set the build command to pnpm build:cf.

Docker (self-hosted)

For VPS, home servers, or local machines. Supports x86 and ARM (including Apple Silicon Macs). No need to clone the repo — just download two files:

mkdir -p infiplot && cd infiplot
curl -fsSL https://raw.githubusercontent.com/zonghaoyuan/infiplot/main/docker-compose.yml -o docker-compose.yml
curl -fsSL https://raw.githubusercontent.com/zonghaoyuan/infiplot/main/.env.example -o .env.example
[ -f .env.local ] || cp .env.example .env.local

Edit .env.local with your API keys (see Configuration guide), then start:

docker compose up -d

Visit http://localhost:3000 to start playing.

You can also run the image directly without Compose:
docker run -d -p 3000:3000 --env-file .env.local ghcr.io/zonghaoyuan/infiplot:latest

📸 Screenshots

How it works

Built on text, image, and audio models, we've assembled a multi-agent framework to deliver on InfiPlot's goal. We split the agents into four roles — Writer, Character Designer, Cinematographer, and Painter — that work together to keep the plot coherent, the characters consistent, and the scenes continuous, all while making the story as compelling as we can. The Writer also handles overall story architecture.

We call each complete playthrough a story.

A story unfolds as a sequence of scenes. Each scene is one AI-painted background plus a short tree of beats — moments of narration, dialogue, and the occasional choice. You tap through a scene's beats and the image stays put; only when a choice leads somewhere genuinely new — another place, a new point of view, a jump in time — does the AI paint the next scene.

While you're reading one scene, the engine speculatively generates the scenes your choices could lead to — and, for unavoidable next steps, the scene after that. By the time you pick a direction, its image is usually already painted, so the cut feels instant. If you still notice some lag today, don't worry — we're working hard to bring it down.

Clicking the background itself (not a button) routes through a vision model: it reads where you tapped and decides whether you're exploring the current scene (it inserts a beat — no new image) or moving on (a new scene). This builds on a valuable lesson we learned from flipbook, and we believe it will become one of InfiPlot's defining features — taking the experience to the next level.

There is no traditional game UI baked into the art. The AI paints the world in whatever style you pick — "stick figure on grid paper" or "cyberpunk noir" — and the dialogue panel and choice buttons are a light HTML layer drawn on top, tuned to sit over the scene. In other words, the UI fits the story of each playthrough, rather than staying the same every time.

Team & Vision

We're a group of young people from Tsinghua University and other schools.

On one hand, we're longtime, devoted players of galgames, otome games, FMV, and AI role-play games. Even while enjoying them, we kept imagining how much more delightful and thrilling it would be if the story choices weren't fixed in advance — or if you could truly interact with an AI character in depth, instead of just texting it through a chat app.

On the other hand, we happen to know a little about large-model technology: enough to turn ideas into working software quickly with AI, and to have formed some modest views on the technical paths available and the limits of what today's tech can build.

The spark came on April 22, 2026, when @zan2434 and others released flipbook. We were stunned and delighted by this entirely new form of interaction.

So one day in May, we agreed on the spot to build something like this — both to help people live out the fantasies they'd once set aside, and to explore the new modes of interaction that multimodal models make possible.

The project is still very early and many features are far from polished. We'd love your feedback — open an issue, or join our dev team and explore the new possibilities with us, and satisfy your own curiosity.

Get in touch: hi@infiplot.com

Scan to join our beta community on QQ (group ID 575404333) to share feedback and help shape the project:

InfiPlot beta community QQ group QR code

Configuration guide

InfiPlot talks to four kinds of model providers. Text and Vision use any OpenAI-compatible endpoint, so you can mix and match freely — for Google Gemini, point *_BASE_URL at its OpenAI-compatible endpoint (https://generativelanguage.googleapis.com/v1beta/openai). For Anthropic Claude, a compatible gateway (e.g. LiteLLM) is recommended — Anthropic's official endpoint offers an OpenAI-compatible layer but no caching, which raises cost and latency. Image supports Runware (its own task-array protocol) and OpenAI (gpt-image). TTS supports Xiaomi MiMo (its own voice design / clone protocol — per-character voice design, clone, and per-line delivery direction; free) and StepFun (32 preset voices, auto-matched by AI; paid but better quality).

1. Choose your providers

Provider	Variables	Required?	Recommended
Text · story director	`TEXT_BASE_URL` `TEXT_API_KEY` `TEXT_MODEL`	✅	`deepseek-v4-flash` via DeepSeek
Image · scene renderer	`IMAGE_BASE_URL` `IMAGE_API_KEY` `IMAGE_MODEL`	✅	`runware:400@6` (FLUX.2 [klein] 9B KV) via Runware
Vision · click reader	`VISION_BASE_URL` `VISION_API_KEY` `VISION_MODEL`	✅	`gemini-3.5-flash` via Google
TTS · per-character voice	`TTS_BASE_URL` `TTS_API_KEY` `TTS_SPEECH_MODEL`	optional — leave blank to run silently	`mimo-v2.5-tts` via Xiaomi MiMo (free); paid alternative: `step-tts-2` via StepFun

2. Set the environment variables

Nine variables are required; TTS is optional (leave blank to run silently). There's also a flag for cheap testing:

Variable	Effect
`MOCK_IMAGE=true`	Skip image generation; the renderer returns a static placeholder. Story, voice, and choices still run normally. Great for iterating on TTS without burning Runware credits.

Where to set them (see .env.example for the exact shape):

Local dev — .env.local
Vercel — Project Settings → Environment Variables
Cloudflare Workers — from the repo root, run wrangler secret put <NAME> for each variable, or set them in the dashboard (Workers → infiplot → Settings → Variables and Secrets). For a private staging instance, gate the Worker behind Cloudflare Access — zero-code email-whitelist auth in front of the Worker.

3. Mind the cost

With the recommended trio, each scene's cost comes mainly from the image generation model. The FLUX.2 [klein] 9B KV image is roughly $0.00078 per scene (1792×1024, 4 steps, sub-second); the text model uses deepseek-v4-flash, so text costs are negligible by comparison. Tapping through a scene's beats is free. To keep transitions instant, the engine also pre-generates scenes you might pick but ultimately don't — so real spend runs somewhat higher than the scenes you actually see.

4. Image proxy (optional)

By default the browser fetches images directly from the provider — no setup needed; leave NEXT_PUBLIC_IMAGE_PROXY_URL blank and you're completely unaffected. You only want this if you hit progressive "top-to-bottom" image loading (Chrome's ERR_QUIC_PROTOCOL_ERROR on some networks paints partial PNGs row by row): deploy a tiny Cloudflare Worker that re-fetches images server-side and serves them atomically over HTTP/2. One-click deploy at infiplot-image-proxy, then paste the workers.dev URL it prints into NEXT_PUBLIC_IMAGE_PROXY_URL.

5. Let players bring their own voice Key (optional, recommended)

Xiaomi rate-limits the TTS model by RPM/TPM. When a public deployment has many people playing at once through a single shared TTS_API_KEY, those limits are easy to hit — the symptom is story and visuals work fine, but there's no audio. To fix this, players can optionally enter their own Xiaomi MiMo key on the homepage (free to obtain). Synthesis then runs browser-direct to Xiaomi, the key stays in the player's browser and never touches your server, and they get stable voice with lower latency. It's purely additive: leave it blank and playback falls back to your server key exactly as before.

See the Bring-your-own voice Key guide for how to obtain and enter one.

Roadmap

Completed

Latency optimized to ~10s
Vision-based image interaction
One-click deploy & custom model config
Frontend API Key & model setup
Mobile web support
Story sharing (.infiplot format)
OpenDeploy quick deployment

To Do

Mobile app & creator platform
ComfyUI custom image generation
Reduce latency to under 5s
Story save & resume
Custom character cards & world settings
Prompt cache hit-rate optimization

Star history

License & contributing

This project is open-sourced under AGPL-3.0.

Contributions are welcome! External contributors must sign our Contributor License Agreement (CLA) once before a PR can be merged — see CONTRIBUTING.md and CLA.md. Sign it directly in the PR via a comment after opening it; no separate step needed beforehand.

README.en.md Unescape Escape

⚡ Overview

🌐 Live Demo

Deploy

OpenDeploy / Vercel / Cloudflare (one-click)

Docker (self-hosted)

📸 Screenshots

How it works

Team & Vision

Configuration guide

Roadmap

Star history

License & contributing

README.en.md