infiplot-web

Author	SHA1	Message	Date
Zonghao Yuan	77f5296e18	Merge pull request #30 from zonghaoyuan/feat/multi-provider-compat feat(ai-client): multi-provider compat — native Anthropic/Google	2026-06-04 17:10:35 +08:00
yuanzonghao	865bf322e9	fix(ai-client): parse Runware host by hostname; doc nits - inferImageProtocol: match runware.ai by parsed hostname (exact match or subdomain) instead of a bare substring, so notrunware.ai / runware.ai.evil.com no longer misroute to the Runware protocol - README: document the image-2-vip → OpenAI-compatible exception; correct the Imagen wording (deprecated, EOL 2026-06-24 — not yet discontinued) Addresses Copilot review on #30. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-06-04 17:09:05 +08:00
yuanzonghao	83fd5717e7	feat(ai-client): multi-provider compat — native Anthropic/Google + URL tolerance - TEXT/VISION: add native Anthropic & Google Gemini paths via Vercel AI SDK, selectable through TEXT_PROVIDER / VISION_PROVIDER (default openai_compatible) - IMAGE: expand to openai (gpt-image) / google (Nano Banana) via AI SDK alongside the existing Runware task-array and OpenAI-compatible REST paths - normalizeBaseUrl: tolerate URLs with/without /v1 (or /chat/completions); append the per-protocol version segment only for bare hosts - config: readProvider() reads *_PROVIDER; types: ProviderProtocol + provider? - deps: @ai-sdk/anthropic, @ai-sdk/google; docs in .env.example + README Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-06-04 17:09:05 +08:00
Zonghao Yuan	a4dc57a1b6	Merge pull request #28 from zonghaoyuan/feat/byo-tts-key feat(web): optional bring-your-own Xiaomi MiMo TTS key	2026-06-04 17:00:42 +08:00
yuanzonghao	f6226facbd	fix(web): address PR #28 review — explicit clientTts boolean + BYO key prefix hint Harden the BYO-mode signal at the API boundary (start/scene/insert-beat): only clientTts === true drops server TTS, so a stray truthy non-boolean can't silently disable it. Add a non-blocking prefix hint in TtsKeyModal that warns when the pasted key prefix (tp-/sk-) mismatches the selected key type — a mismatch hits the wrong endpoint and plays silently, the symptom BYO fixes. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-06-04 16:58:55 +08:00
yuanzonghao	b0b2e922d3	feat(web): optional bring-your-own Xiaomi MiMo TTS key (browser-side synthesis) Public users share one server TTS key, so Xiaomi's per-key RPM/TPM limits cause silent playback under concurrency. This adds an OPTIONAL path: a user can store their own Xiaomi MiMo key in the browser and synthesize voice client-side against Xiaomi's CORS-open endpoints. The key lives only in localStorage and is never sent to or logged by our server; the shared server key still serves everyone who does not opt in. - components/TtsKeyModal.tsx: shared key modal (key-family + region picker), reused by both the home and play pages - app/play/page.tsx: silence nudge moved beside the mute toggle; modal opens in place instead of redirecting to the home page - app/page.tsx: home page consumes the shared modal + readStoredTtsConfig - lib/clientTtsConfig.ts, lib/ttsPresets.ts: browser config + region presets - app/api/{start,scene,insert-beat}: thread per-request voice; lib/types update - docs/xiaomi-tts-key.md + README note Verified with tsc --noEmit (exit 0). Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-06-04 16:58:55 +08:00
Zonghao Yuan	24b674d792	Merge pull request #27 from zonghaoyuan/perf/writer-split perf(engine): split Writer into Phase A (plan) + Phase B (beats)	2026-06-04 16:53:21 +08:00
yuanzonghao	efe021d886	fix(engine): pin entry-beat roster to the plan in Phase B The Painter composites exactly plan.entryActiveCharacters into the entry frame (the same roster the Cinematographer framed). Phase B is told to reuse that roster, but only the entry beat's id was code-enforced — so an LLM slip could leave a character in the painted frame that the runtime entry beat says isn't there. Pin activeCharacters onto the plan's entry beat as a last line of defense, mirroring the existing id pin. Speaker is intentionally left to the prompt: it's coupled to line/TTS, so overwriting it could mis-attribute or orphan Phase B's dialogue. Addresses Copilot review feedback on PR #27. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-06-04 15:48:14 +08:00
$DESKTOP-I1T6TF3\Q$ DESKTOP-I1T6TF3\Q	592c82816a	Revert "feat(loading): support typewriter story teaser during first scene generation" This reverts commit `4e4e06ec8a`.	2026-06-04 15:13:03 +08:00
$DESKTOP-I1T6TF3\Q$ DESKTOP-I1T6TF3\Q	587e1e4e7d	Revert "fix(loading): use left-aligned text for typewriter teaser to prevent jitter" This reverts commit `e875ac8fd7`.	2026-06-04 15:13:03 +08:00
$DESKTOP-I1T6TF3\Q$ DESKTOP-I1T6TF3\Q	3f45cd4e0f	Revert "fix(loading): set w-full on teaser container to prevent horizontal shifting on first line" This reverts commit `68999aca2a`.	2026-06-04 15:13:03 +08:00
$DESKTOP-I1T6TF3\Q$ DESKTOP-I1T6TF3\Q	d19baa2127	Revert "feat(loading): hide footer text when teaser appears and apply pulse animation to teaser text when typing completes" This reverts commit `5e1a4656ed`.	2026-06-04 15:13:03 +08:00
$DESKTOP-I1T6TF3\Q$ DESKTOP-I1T6TF3\Q	a311c24f70	Revert "feat(loading): delay teaser slow-pulse animation by 1s after typewriter ends" This reverts commit `1ac665ad88`.	2026-06-04 15:13:03 +08:00
$DESKTOP-I1T6TF3\Q$ DESKTOP-I1T6TF3\Q	589bb31416	Revert "feat(loading): slow down teaser typing speed to 65ms and change fallback text to " 请等待\" This reverts commit `05d9060dc2`.	2026-06-04 15:13:03 +08:00
$DESKTOP-I1T6TF3\Q$ DESKTOP-I1T6TF3\Q	a1f3750b6f	Revert "feat(loading): make teaser title pulse together with body" This reverts commit `7164c05b4e`.	2026-06-04 15:13:03 +08:00
$DESKTOP-I1T6TF3\Q$ DESKTOP-I1T6TF3\Q	a00095df66	Revert "fix(image): try fetching image as a blob directly first to avoid progressive rendering" This reverts commit `676c0f1af8`.	2026-06-04 15:13:03 +08:00
$DESKTOP-I1T6TF3\Q$ DESKTOP-I1T6TF3\Q	676c0f1af8	fix(image): try fetching image as a blob directly first to avoid progressive rendering	2026-06-04 15:08:39 +08:00
$DESKTOP-I1T6TF3\Q$ DESKTOP-I1T6TF3\Q	7164c05b4e	feat(loading): make teaser title pulse together with body	2026-06-04 15:03:50 +08:00
$DESKTOP-I1T6TF3\Q$ DESKTOP-I1T6TF3\Q	05d9060dc2	feat(loading): slow down teaser typing speed to 65ms and change fallback text to " 请等待\	2026-06-04 15:00:50 +08:00
$DESKTOP-I1T6TF3\Q$ DESKTOP-I1T6TF3\Q	1ac665ad88	feat(loading): delay teaser slow-pulse animation by 1s after typewriter ends	2026-06-04 14:58:57 +08:00
$DESKTOP-I1T6TF3\Q$ DESKTOP-I1T6TF3\Q	5e1a4656ed	feat(loading): hide footer text when teaser appears and apply pulse animation to teaser text when typing completes	2026-06-04 14:56:06 +08:00
$DESKTOP-I1T6TF3\Q$ DESKTOP-I1T6TF3\Q	68999aca2a	fix(loading): set w-full on teaser container to prevent horizontal shifting on first line	2026-06-04 14:51:12 +08:00
$DESKTOP-I1T6TF3\Q$ DESKTOP-I1T6TF3\Q	e875ac8fd7	fix(loading): use left-aligned text for typewriter teaser to prevent jitter	2026-06-04 14:49:42 +08:00
$DESKTOP-I1T6TF3\Q$ DESKTOP-I1T6TF3\Q	4e4e06ec8a	feat(loading): support typewriter story teaser during first scene generation	2026-06-04 14:40:35 +08:00
$DESKTOP-I1T6TF3\Q$ DESKTOP-I1T6TF3\Q	e04c51e875	feat(api): support custom BYO API header override on client fetches and backend config	2026-06-04 13:49:46 +08:00
Zonghao Yuan	af155ac107	Merge pull request #24 from zonghaoyuan/fix/optional-image-proxy fix(play): make scene-image proxy opt-in (default direct-connect)	2026-06-04 11:25:11 +08:00
yuanzonghao	3bf5c92841	perf(engine): split Writer into Phase A (plan) + Phase B (beats) The Writer was the serial long pole: a single LLM call wrote the scene skeleton AND the full beats[] graph before anything downstream could start, so variable-length beat generation blew up tail latency. Split it into two calls: - Phase A (runWriterPlan): minimal skeleton the image pipeline needs (sceneSummary, sceneKey, entryBeatId, cast, entry roster, entry speaker). Serial, on the critical path, kept lightweight. - Phase B (runWriterBeats): full beats[] + storyStatePatch, written to honor the plan. Launched immediately, overlaps the ENTIRE image pipeline (cards / cinematographer / portraits / painter), awaited last. Critical path becomes PhaseA + max(imagePipeline, PhaseB), so the long beat-writing is hidden behind image gen. A Phase B failure degrades to a single playable beat synthesized from the plan. Paired distinct-payload A/B (6 content-matched stories, baseline vs split): - median end-to-end 42.6s -> 32.2s (-24%) - mean 46.4s -> 33.1s (-29%) - worst case 74.7s -> 37.6s (halved) - no content regression: total Writer output tokens 12858 -> 13699 Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-06-04 11:17:34 +08:00
Zonghao Yuan	8ebacbeb83	Merge pull request #26 from zonghaoyuan/feat/umami-events feat(web): 隐私友好的 Umami 自定义埋点	2026-06-04 11:05:15 +08:00
yuanzonghao	4bc47d8210	fix(play): bound preloadImage decode by the timeout; clarify proxy env docs Addresses two GitHub Copilot review comments on PR #24: - preloadImage cleared the 20s timeout in onload, before awaiting img.decode(), leaving the decode phase unguarded — a hung decode could keep the promise pending forever and stall the play loop. Move clearTimeout into a single idempotent done() so the timeout stays armed through decode() too, matching the stated "timeouts resolve quietly" intent. - .env.example said to leave BOTH proxy vars blank, but shipped NEXT_PUBLIC_IMAGE_PROXY_ALLOWED_HOSTS=im.runware.ai. Only NEXT_PUBLIC_IMAGE_PROXY_URL gates the feature; the allowlist is inert until the URL is set. Corrected the wording, kept the self-documenting default value. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-06-04 11:04:16 +08:00
yuanzonghao	e095650944	refactor(web): enforce content-free Umami fields at compile time Address the Copilot review on #26. #1 The game_start / art_style_select payload fields were typed as bare `string`, so free text could still slip through despite the "content-free by construction" claim. Add lib/options.ts as the single source of truth for the selector option sets (`as const` → literal-union types), have the home OPTS render from those arrays, and type the analytics fields from the derived unions (gender/art_style/plot_style/pacing/style) plus a template type for `card`. Free text now fails to compile; no casts at call sites. #2 The /play heartbeat scheduled its 30s interval unconditionally. Gate the effect on the same NEXT_PUBLIC_UMAMI_* env used for script injection, so nothing is scheduled when the tracker is off (visibility check kept — a hidden tab still never emits). #3 choice_select no longer emits a -1 choice_index: skip the event when the index can't be resolved instead of polluting the index distribution. Verified with tsc (exit 0) and a throwaway negative test: free text in any of the six fields raises TS2322, valid enum/template values compile. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-06-04 10:59:31 +08:00
yuanzonghao	4bf05f6784	feat(web): add privacy-friendly Umami custom events Instrument the play flow with 9 content-free custom events (game_start, art_style_select, style_image_upload, scene_reached, choice_select, vision_click, tts_toggle, fullscreen_toggle, play_heartbeat) to measure retention, engagement depth and session duration. Privacy is enforced by construction, not convention: - lib/analytics.ts types each event with a discriminated union, so a payload has no slot for free text — prompts, world guides, uploaded images and vision output can never reach analytics (compile-time guarantee, not a comment). - track() no-ops without window.umami and never throws into the app. - coarse 30s heartbeat fires only while the tab is visible. - script stays gated on NEXT_PUBLIC_UMAMI_* env (blank → no script), honours Do-Not-Track, and locks to an exact data-domains allowlist. - one-line on-site disclosure with a link, shown only when tracking is on. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-06-04 10:14:08 +08:00
Zonghao Yuan	9f4dcc097b	Merge pull request #25 from zonghaoyuan/perf/home-cache-headers perf(web): pin /home/* assets to 1y immutable cache	2026-06-04 10:04:27 +08:00
yuanzonghao	1fbeea14e6	perf(web): pin /home/* assets to 1y immutable cache Next.js serves /public files with `Cache-Control: public, max-age=0, must-revalidate`, so the home covers + first-act JSON were re-fetched on every visit. Verified against 30 days of Vercel metrics: /home/* alone was ~62% of Fast Data Transfer egress (5.42 GB) while the files total only ~31 MB — the same bytes re-downloaded hundreds of times. Add a headers() rule scoping `public, max-age=31536000, immutable` to /home/:path* only; other paths keep their defaults (verified /icon.svg still returns no-cache). Filenames under /home are stable (covers fN/mN.webp, first-act JSON by card name), so immutable is safe; if a first-act JSON is ever re-baked under the same name, bump a query string or purge the cache. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-06-04 10:01:06 +08:00
yuanzonghao	4347e5bfdf	fix(play): make scene-image proxy opt-in — default deployers connect direct `b805b1d` routed every scene <img> through fetch → Blob → createObjectURL to kill QUIC progressive-paint, but in doing so added an unconditional dependency on a CORS-adding proxy. That breaks the default deployment: im.runware.ai sends no Access-Control-Allow-Origin, so a direct fetch().blob() throws and the scene image silently fails to load for anyone who hasn't stood up the Cloudflare Worker. Restore the pre-b805b1d behavior as the default and make the proxy strictly opt-in: - Direct path (no env set): preloadImage() warms the HTTP cache + decodes, then <img> uses the original https://im.runware.ai URL — as before `b805b1d`. No fetch().blob(), no CORS dependency: a fresh clone just works. - Proxy path (NEXT_PUBLIC_IMAGE_PROXY_URL set): fetch the proxied URL → Blob → createObjectURL, exactly as `b805b1d`, gaining the QUIC-immune HTTP/2 edge + atomic paint. shouldProxy(url) gates the two paths: proxy only when a base is configured AND the host is in NEXT_PUBLIC_IMAGE_PROXY_ALLOWED_HOSTS (default im.runware.ai). data: / non-http / unknown-host URLs always take the direct path. blobUrlCache + revoke logic is unchanged and safe for both paths (revoke is a no-op on non-blob: URLs). The Cloudflare Worker moves out of this repo into a standalone, one-click- deployable project (infiplot-image-proxy) so the optional infra isn't carried by every clone; .env.example and the READMEs link to it. restore: preloadImage() helper deleted by `b805b1d` add: NEXT_PUBLIC_IMAGE_PROXY_ALLOWED_HOSTS (default im.runware.ai) remove: worker/ (moved to standalone repo) Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-06-04 09:57:29 +08:00
Zonghao Yuan	b86a9507e3	Merge pull request #23 from zonghaoyuan/fix/play-card-click-no-vision fix(play): story-card clicks no longer trigger vision	2026-06-04 09:34:31 +08:00
$DESKTOP-I1T6TF3\Q$ DESKTOP-I1T6TF3\Q	010239de44	fix(home): localize first-scene images — drop Runware URL TTL dependency Card click flow now serves /home/firstscene/{name}.webp from Vercel static hosting instead of fetching im.runware.ai/... — those URLs have a finite TTL and would silently rot. Side benefit: backfilled the 18 stories that never had a local webp (f14-f29, m14, m29), and refreshed the 44 stale webps left over from a pre-prebake story batch so they actually match their cover art again. Scope is scene.imageUrl only; characters[].basePortraitUrl still points at Runware (painter consumes it server-side as referenceImages, where a local public path won't resolve). localize-firstact-images.mjs: - skip the network when the local webp is already on disk (don't re-encode what's already correct) - read imageUrlRemote as a fallback URL when imageUrl is already localized, so --force can refresh from the original Runware source - also localize scene.imageUrl alongside the top-level imageUrl Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-06-04 09:34:12 +08:00
yuanzonghao	a18b91c48c	fix(play): story-card clicks no longer trigger vision Symptom: on a choice beat, clicking the dialogue/narration card fired the vision ("识图") flow instead of doing nothing. Picking an option with fast clicks that landed on the card repeatedly kicked off the expensive /api/vision → insert-beat/scene chain — janky and confusing. Root cause: the story-card <div> had `pointer-events-none`, so clicks passed through to the background <img> onClick (handleImageClick), which on choice beats calls onBackgroundClick → vision. Fix: the card now owns its clicks (`pointer-events-auto` + handleCardClick): - mid-typing → completes the text (VN skip affordance, unchanged) - continue beat → advances, as before - choice beat → no-op (no vision) Clicking the actual scene art still triggers vision; choice buttons already had pointer-events-auto and are unaffected. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-06-04 09:17:30 +08:00
$DESKTOP-I1T6TF3\Q$ DESKTOP-I1T6TF3\Q	b805b1d9c2	fix(play): scene image renders progressively from top → CF Worker proxy Symptom: in Chrome on certain networks the scene <img> renders row-by-row from top to bottom — "层层加载" — instead of appearing atomically. Root cause (confirmed via DevTools): - Chrome opportunistically opens HTTP/3 (QUIC) to im.runware.ai. - QUIC streams to Runware sometimes error mid-transfer: net::ERR_QUIC_PROTOCOL_ERROR HTTP-level status stays 200 (response headers received), but bytes are truncated. The browser paints whatever PNG bytes it has so far → visible row-by-row decode. - The earlier preloadImage()+decode() trick can't fix this — neither HTTP-cache reuse nor sync decode helps when the bytes themselves were never fully delivered. Two-tier fix: 1. Client: fetch → Blob → URL.createObjectURL() (app/play/page.tsx) - <img src> only ever points to a blob: URL whose bytes are 100% resident in the JS heap. No network-backed src = no possibility of progressive paint. - Module-level blobUrlCache keys by original URL so speculative prefetch + the eventual commit share one fetch. - Old blobs are URL.revokeObjectURL()'d on scene swap + unmount to release memory. 2. Network: optional Cloudflare Worker proxy (worker/) - Browser ↔ Worker is HTTP/2 over CF edge (extremely stable). - Worker ↔ Runware is a server-to-server fetch (no QUIC fragility, Cloudflare's backbone handles transit). - Worker buffers the full upstream response → client never sees a half-stream. - Bonus: CF edge cache (cacheEverything, 1y TTL) on Runware UUIDs; Access-Control-Allow-Origin: * so client fetch() can't hit CORS. - Hardened: only proxies im.runware.ai, only GET/HEAD/OPTIONS, all other hosts/methods → 403/405. Wired via NEXT_PUBLIC_IMAGE_PROXY_URL (inlined at build). Empty → no proxy → direct fetch (which still uses the blob path, just exposed to QUIC). ────────────────────────────────────────────────────────────────────── Deploy steps (one-time, do this AFTER pulling this commit): 1. Install wrangler globally: npm i -g wrangler 2. Log in to Cloudflare (opens browser for OAuth): wrangler login 3. From the worker/ directory, deploy: cd worker wrangler deploy wrangler will print the deployed URL, e.g. https://infiplot-image-proxy.<your-cf-username>.workers.dev 4. Paste that URL into .env.local for local dev: NEXT_PUBLIC_IMAGE_PROXY_URL=https://infiplot-image-proxy.<...>.workers.dev …and into Vercel project settings (Environment Variables) for prod. NEXT_PUBLIC_ vars are inlined at build time, so the URL bakes into the bundle on the next deploy/dev-server restart. 5. Restart dev server (pnpm dev) so the new env baked in. Generate a scene; Network tab should show requests going to *.workers.dev instead of im.runware.ai, no ERR_QUIC_PROTOCOL_ERROR, image renders atomically. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-06-03 22:50:48 +08:00
$DESKTOP-I1T6TF3\Q$ DESKTOP-I1T6TF3\Q	347ab297d5	feat(web,engine): custom style — image upload, AI-extract prompt, painter ref 自定义画风入口里加上传按钮：客户端把图缩到 512px webp(base64)，传到新路由 /api/parse-style-image，vision LLM 解析成英文 style prompt 回填 textarea；图本身随 sessionStorage → /api/start → Session.styleReferenceImage 透传， painter.collectReferenceImages 把它置于 slot 0，整局每一幕都作为 reference 图锚定画风（brush / color / mood），比 priorScene 优先级更高。 Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-06-03 19:15:19 +08:00
$DESKTOP-I1T6TF3\Q$ DESKTOP-I1T6TF3\Q	298ecd4ec0	perf(engine): reorder Writer/Cinematographer prompts for prefix caching Goal: lift prompt-cache hit rate from the ~75% baseline toward 95%+ on DeepSeek/MiMo-style 64-token chunked prefix caches. Both providers match a stable byte-identical prefix from message[0]; once a single byte changes everything after it misses, so the trick is to push every session-stable bit to the front and concentrate per-call churn in a short suffix. Three coordinated changes: 1. Split storyState rendering into spine + dynamic. renderStoryStateSpine: logline / genreTags / protagonist / castNotes — Architect-set fields that StoryStatePatch literally cannot touch (the type only declares the 4 volatile ones; coerce and apply both cherry-pick), so spine bytes are guaranteed stable for the entire session. Goes in the STABLE PREFIX. renderStoryStateDynamic: synopsis / openThreads / relationships / nextHook — the Writer rewrites these every scene via storyStatePatch. Goes in the DYNAMIC SUFFIX. renderStoryState kept as a convenience wrapper that joins both, for anything that still wants the merged bible. 2. Rewrite buildWriterUserMessage with a stable/dynamic split. STABLE PREFIX (byte-identical or pure append across consecutive calls): - 世界观 / 画风 (session-immutable scalars) - story bible spine - 已登记角色 [sentinel: "（以下每行一个已登记角色，开场前为空。）"] + entries - 已使用的 sceneKey [sentinel] + entries - 场景历史，已完结 [sentinel] + archivedHistory entries ↑ archivedHistory = history.slice(0, -1), NOT the full history — the live entry (history[-1]) keeps mutating mid-scene as the player walks new beats and speculative prefetches snapshot it at different moments, so it MUST stay out of the stable prefix or the byte-monotonic invariant breaks. DYNAMIC SUFFIX: - storyState dynamic patch - last-beat snippet (the exact emotional cliffhanger to continue from) - lastExit hint - format reminder tail The previous structure put the full storyState (including patched fields) at the very top of the user message, so the very first byte of the user message changed every scene — user-side cache hit was effectively 0% across the board. 3. Sentinel pattern for variable-length sections. Every list (characters / sceneKeys / archivedHistory) now emits a constant placeholder line after its header REGARDLESS of whether it has entries. With the old "if empty print '（暂无）' else print entries" pattern, adding the first item silently rewrites those placeholder bytes — the byte at offset N moves from a Chinese parenthesis to a dash, prefix cache torched. The sentinel line is the same bytes whether the list has 0 or N items; new items are pure appends after it. 4. Rewrite buildCinematographerUserMessage. New CINE_STABLE_HINT constant (~80 tokens of fixed guidance) glued right after the session-stable styleGuide line, so the stable prefix is long enough to cross at least one full 64-token chunk boundary beyond the system prompt. The per-scene inputs (sceneSummary, entryBeatActive, entryBeatSpeaker policy, prior-sceneKey continuity hint) all moved into the dynamic suffix below. Verified (see [cache] / [debug-writer] logs from staging): hash of 500-byte slices of the user message is byte-identical across two same-historyLen Writer calls through the entire stable prefix; only the dynamic suffix slice differs. The remaining cache-hit gap under MiMo is a server-side quirk (hit plateaus near 3072 tokens, occasionally jumps to 4096); on DeepSeek the same prefix should hit fully. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-06-03 10:42:33 +08:00
$DESKTOP-I1T6TF3\Q$ DESKTOP-I1T6TF3\Q	37c911f510	chore(engine): log prompt-cache hit/miss per chat call Add a `tag` option to chat() and have it print one `[cache] <tag> hit=X miss=Y rate=Z%` line per call. Three Usage-shape variants are probed in order so the same logger works across providers: - DeepSeek (v3+): usage.prompt_cache_hit_tokens / _miss_tokens - OpenAI / o-series: usage.prompt_tokens_details.cached_tokens - Anthropic: usage.cache_read_input_tokens / _creation_* When none of them are present (MiMo / local Ollama / others) we still print prompt + completion totals so the cost baseline is visible. Tag every callsite so the log is greppable: architect / writer / character-designer / cinematographer / insert-beat This is the prerequisite for the prefix-cache reordering work that follows — without per-agent visibility there's no way to tell if a prompt rearrangement actually moved the needle. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-06-03 10:42:33 +08:00
Zonghao Yuan	334c9808c6	Merge pull request #22 from zonghaoyuan/chore/clean-play-ui chore(play): remove session-id readout and decorative footer mark	2026-06-03 16:35:23 +08:00
yuanzonghao	3fa3da5378	chore(play): remove session-id readout and decorative footer mark The session-id slice shown in the play header was an opaque timestamp that reads as noise to players. The footer's "Ⅰ · Ⅰ" was a leftover decorative mark after its sibling controls were moved above the canvas. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-06-03 16:00:16 +08:00
$DESKTOP-I1T6TF3\Q$ DESKTOP-I1T6TF3\Q	b5f73d8082	fix(play): scene image renders as 1px sliver while CDN bytes still arrive When the Runware CDN download was slow (~10-20s over VPN / strict networks, vs. the optimistic <2s the existing comment assumed), the preload's 8s timeout fired and setImageUrl committed before the bytes were actually decoded. The rendered <img> has w-auto h-auto and no intrinsic aspect-ratio source — until the image loads the layout collapses to roughly 1px tall, giving the "等了很久 → 一根线 → 突然出图" jank. Two compounding fixes: app/play/page.tsx IMAGE_PRELOAD_TIMEOUT_MS 8000 → 20000. Real CDN+decode usually finishes well before this; pushing the ceiling out just stops the window where we commit a half-loaded URL. components/PlayCanvas.tsx Add width={1792} height={1024} HTML attrs to the scene <img>. Doesn't affect rendered size (still driven by w-auto h-auto and the maxWidth/maxHeight in sizeStyle); the browser uses them purely as an intrinsic aspect-ratio source, so the placeholder box reserves a 16:9-ish frame even mid-download. Together: slow networks now mostly wait through preload; on the rare genuine timeout the layout still holds shape instead of collapsing. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-06-03 07:24:42 +08:00
$DESKTOP-I1T6TF3\Q$ DESKTOP-I1T6TF3\Q	ea861b7c99	feat(web): style modal — custom prompt + per-preset override, inline edit Two coordinated additions to the 绘画风格 modal so the user can shape the styleGuide that ultimately feeds every painter/director agent, without ever mutating the source-of-truth STYLE_MAP: 1. New "自定义" entry sits right under "自动" — opens an inline textarea where the user can write a free-form styleGuide (mix of Chinese / English, sent verbatim to the image model). Stored as in-memory state on HomePage (customStyleGuide), so refresh clears it — fits the "one-shot session" semantics of this UI. 2. Every preset card now exposes a small pencil on the right of its prompt area. Clicking it inlines a textarea pre-filled with the current effective prompt (override if any, else STYLE_MAP value). Saving writes to styleOverrides[name] — a separate in-memory record keyed by preset name. STYLE_MAP is never written to. start() selects the styleGuide with this priority: customStyleGuide (when 自动→自定义) > styleOverrides[artStyle] > STYLE_MAP[artStyle] > STYLE_MAP[DEFAULT_STYLE] UX polish in the same change: - 标题永远只读 (only the prompt is editable) - 只读 prompt 行去掉边框/底色，回归纯文字 + 右上铅笔 - 「自动」项无 prompt 可编辑，标题下直接放一行说明 - 编辑态 textarea 用 ember 边框作为"正在编辑"视觉反馈 - 「保存并选用」一并 onPick + close；「还原默认」清除该预设的 override - 搜索框同时匹配标题/原名/prompt 内容 - 移除「自由输入」标签 (now visually redundant with the pencil affordance) Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-06-03 07:24:42 +08:00
Zonghao Yuan	a6825f0292	Merge pull request #20 from zonghaoyuan/docs/readme-2col-screenshots docs: replace README screenshots with 14-image 2-column gallery	2026-06-03 13:37:24 +08:00
Zonghao Yuan	a333b21759	feat(web): add favicon (IP monogram SVG) (#18 ) 站点此前没有 favicon，浏览器显示默认地球图标。新增 app/icon.svg：站点同款 Cormorant Garamond 衬线的「IP」字母组合 —— I 直立 (clay #2D1810)、 P 斜体 (ember #D97A2E)，米白圆角底，呼应 README 的 wordmark。字形以矢量轮廓内嵌（从 Cormorant wght 600 / italic 400 实例化抽取），不依赖 webfont，各处渲染一致。 Next.js 16 见到 app/icon.svg 会自动注入 <link rel="icon">，无需改动 layout.tsx。 Co-authored-by: Claude Opus 4.7 <noreply@anthropic.com>	2026-06-03 11:23:34 +08:00
$DESKTOP-I1T6TF3\Q$ DESKTOP-I1T6TF3\Q	4d46bd0664	docs: replace README screenshots with 14-image 2-column gallery Drop the old 9-image 3x3 grid (4/5/a4/c3/c5/c7/d2/f2/f5.webp) and bring in 14 new stills as 1.webp..14.webp, laid out as 7 rows of 2 columns at width=420. Source PNGs (1920x1080 for 1-8, 1200x680 for 9-14) are resized to fit inside 1200x680 and saved as q=85 WebP — 70-150KB each. All three README locales (zh/en/ja) share the same paths so a single asset swap refreshes every edition. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-06-03 06:53:46 +08:00
Zonghao Yuan	95cfe24065	Merge pull request #19 from zonghaoyuan/staging chore: promote staging to main (favicon + home revamp)	2026-06-03 11:27:16 +08:00
$DESKTOP-I1T6TF3\Q$ DESKTOP-I1T6TF3\Q	1bcea098d5	chore(web): swap 数据幽灵 and 极简杀机 positions in male DISPLAY_ORDER m14 (极简杀机) is currently a 14.7KB placeholder while m18 (数据幽灵) got a real curated cover this round — promote 数据幽灵 into the front row and demote 极简杀机 back to its original neighborhood so the visible 首屏 only shows finished art. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-06-03 04:13:51 +08:00

... 2 3 4 5 6

260 Commits