Research + script
AI learns your benchmark content (video link or profile), analyzes viral patterns on each platform, and writes high-converting voice-over script.
Script · voice clone · digital human · titles & tags · subtitles & music · cover · one-click publish — AI handles all 7 steps. You just do Step 1.
Not template stitching — the agent plans and executes 7 connected steps from script through voice, digital human, titles, subtitles, cover, to publish. In Step 1 you pick a creation mode and fill a profile; the remaining 6 steps run untouched.
Upload a 10-30 second clip of yourself once. The AI learns your face and lip pattern, then drives unlimited new videos — no per-video license, no need to re-record.
Xinghan Cloud holds three telecom value-added licenses (IDC / CDN / ISP). AI-generated content follows model-platform and regulatory norms; the review trail is preserved and auditable.
Type your IP info — the agent handles the rest.
AI learns your benchmark content (video link or profile), analyzes viral patterns on each platform, and writes high-converting voice-over script.
NodeKey TTS drive — 50+ voices to pick, or clone your own. Natural delivery and intonation.
Upload yourself once (10-30s). The AI drives lip-sync and likeness — no re-recording required.
Auto-generates video titles, platform tags, and SEO keywords — improving algorithmic reach.
AI transcribes audio into smart subtitles and matches background music to the video rhythm — no post-production tools needed.
AI generates click-optimized cover images that fit each platform's thumbnail spec.
One click to Douyin, Kuaishou, Xiaohongshu, and WeChat Channels — no duplicate work.
| Traditional · Humans + pro tools | AI Agent · 7 steps automated | |
|---|---|---|
| Topic & script | 2-4 hours / video | ~30 seconds |
| Filming | 1-2 hours / video | Digital human 3-5 min |
| Editing | 2-4 hours / video | Auto |
| Subtitles & title | 30-60 minutes | Auto |
| Multi-platform publish | ~30 minutes | ~5 minutes · 4 platforms |
| Total | Total per video: 6-12 hours | Total per video: ~30 minutes |
Reference example · Actual time depends on content complexity and machine configuration. We make no specific efficiency guarantees.
💡 macOS "unverified" prompt → System Settings → Privacy & Security → Allow.
💡 One key powers everything — text, voice, ASR, digital human.
💡 Run a full cycle once to learn the flow before scaling.
You type the IP info — the remaining 7 steps run on AI.
Every step is backed by a dedicated engine — together they support the full automation.
Whole-pipeline video creation
Pick "Benchmark video learning" or "Profile-based generation" — the AI writes the script, drives the digital human, and ships the video.
50+ voices · multilingual · natural emotion
NodeKey TTS drive. 50+ professional voices, dialects and multiple languages supported. Voice cloning available.
Audio in · text / subtitle out
Upload video or audio for auto-transcription. Subtitle export (SRT / VTT). NodeKey ASR for precise alignment.
If your customers need short-video traffic, the agent fits.
Any vertical that needs short-video lead generation.
Native macOS (Apple Silicon + Intel) and Windows builds. Client is free; runtime billed via NodeKey token usage.
The client is free to download and install. Runtime is billed by actual token usage — AI copy / voice / digital human all run through the Xinghan Cloud Computing Power Router, settled in one NodeKey account. See the pricing center for details; most light workloads stay well under a typical monthly budget.
Quality depends on the source clip you upload. We recommend a face-forward 10-30 second video with clear lip movement and even lighting. Under those conditions, lip-sync is well-aligned and the output reads naturally for everyday short-video use. For formal commercial campaigns, pair with a human review pass.
Content the agent generates follows model-platform and regulatory norms with an auditable review trail. But platform throttling policies (especially around AI-generated content labels) change constantly. We recommend: 1) keep the content truthful, 2) add human review on the final cut, 3) avoid overstatement. We track platform rule changes and adjust the generation strategy accordingly.
macOS 12+ on Apple Silicon or Intel; Windows 10+ 64-bit. 16GB RAM recommended. Install footprint ~2 GB; each generated video is around 10-20 MB — minimal local storage required.
Currently Douyin, Kuaishou, Xiaohongshu, and WeChat Channels. Link your accounts under "Media" in the client. More platforms (Bilibili, Zhihu Video, Weibo, etc.) are on the roadmap.
Three things: 1) Truly end-to-end — 7 connected steps, not isolated tools stitched together; 2) Unlimited digital-human drive — upload yourself once, drive unlimited videos with no per-video license; 3) Compliance base — Xinghan Cloud holds three telecom value-added licenses (IDC / CDN / ISP), with auditable data handling.