Back to Skills
inference-sh•376•Content Management
AI 影片生成
透過 inference.sh CLI 使用 40 多種模型(Veo、Seedance、Wan、Grok 等)生成 AI 影片,支援文字轉影片、圖片轉影片、虛擬形象/對嘴型、影片超解析度和音效設計。
Installation Commandexplicit
$ npx skills add https://github.com/infsh-skills/skills --skill ai-video-generation

關於
透過 inference.sh CLI 使用 40 多種模型生成 AI 影片。支援文字轉影片、圖片轉影片、對嘴型、虛擬形象動畫、影片超解析度和音效設計。適用於社群媒體、行銷、解說影片、產品展示和 AI 虛擬形象。
主要特性
- 40 多種影片模型,包括 Google Veo 3.1/3、Seedance、Wan 2.5、Grok Imagine Video、OmniHuman 和 P-Video
- 可配置提示詞和參數的文字轉影片及圖片轉影片生成
- 虛擬形象動畫與逼真的對嘴型功能
- 影片超解析度、音效設計以及帶轉場的媒體合併
- 透過
beltCLI 整合,支援即時進度與串流處理 - 模型路由及範例,支援快速/經濟或高品質工作流程
使用情境
- 創作社群媒體短片、行銷影片和產品展示
- 從圖片 + 音訊生成 AI 虛擬形象與人物訪談內容
- 為解說影片和原型製作文字轉影片或圖片轉影片動畫
- 為現有影片添加超解析度與音效強化
安裝
SKILL.md
---
name: ai-video-generation
description: "Generate AI videos with Google Veo, Seedance, Wan, Grok and 40+ models via inference.sh CLI. Models: Veo 3.1, Veo 3, Seedance 1.5 Pro, Wan 2.5, Grok Imagine Video, OmniHuman, Fabric, HunyuanVideo. Capabilities: text-to-video, image-to-video, lipsync, avatar animation, video upscaling, foley sound. Use for: social media videos, marketing content, explainer videos, product demos, AI avatars. Triggers: video generation, ai video, text to video, image to video, veo, animate image, video from image, ai animation, video generator, generate video, t2v, i2v, ai video maker, create video with ai, runway alternative, pika alternative, sora alternative, kling alternative"
allowed-tools: Bash(belt *)
---
# AI Video Generation
Generate videos with 40+ AI models via [inference.sh](https://inference.sh) CLI.

## Quick Start
> Requires inference.sh CLI (`belt`). [Install instructions](https://raw.githubusercontent.com/inference-sh/skills/refs/heads/main/cli-install.md)
```bash
belt login
# Generate a video with Veo
belt app run google/veo-3-1-fast --input '{"prompt": "drone shot flying over a forest"}'
```
## Available Models
### Text-to-Video
| Model | App ID | Best For |
|-------|--------|----------|
| Veo 3.1 Fast | `google/veo-3-1-fast` | Fast, with optional audio |
| Veo 3.1 | `google/veo-3-1` | Best quality, frame interpolation |
| Veo 3 | `google/veo-3` | High quality with audio |
| Veo 3 Fast | `google/veo-3-fast` | Fast with audio |
| Veo 2 | `google/veo-2` | Realistic videos |
| **P-Video** | `pruna/p-video` | Fast, economical, with audio support |
| **WAN-T2V** | `pruna/wan-t2v` | Economical 480p/720p |
| Grok Video | `xai/grok-imagine-video` | xAI, configurable duration |
| Seedance 1.5 Pro | `bytedance/seedance-1-5-pro` | With first-frame control |
| Seedance 1.0 Pro | `bytedance/seedance-1-0-pro` | Up to 1080p |
### Image-to-Video
| Model | App ID | Best For |
|-------|--------|----------|
| Wan 2.5 | `falai/wan-2-5` | Animate any image |
| Wan 2.5 I2V | `falai/wan-2-5-i2v` | High quality i2v |
| **WAN-I2V** | `pruna/wan-i2v` | Economical 480p/720p |
| **P-Video** | `pruna/p-video` | Fast i2v with audio |
| Seedance Lite | `bytedance/seedance-1-0-lite` | Lightweight 720p |
### Avatar / Lipsync
| Model | App ID | Best For |
|-------|--------|----------|
| OmniHuman 1.5 | `bytedance/omnihuman-1-5` | Multi-character |
| OmniHuman 1.0 | `bytedance/omnihuman-1-0` | Single character |
| Fabric 1.0 | `falai/fabric-1-0` | Image talks with lipsync |
| PixVerse Lipsync | `falai/pixverse-lipsync` | Realistic lipsync |
### Utilities
| Tool | App ID | Description |
|------|--------|-------------|
| HunyuanVideo Foley | `infsh/hunyuanvideo-foley` | Add sound effects to video |
| Topaz Upscaler | `falai/topaz-video-upscaler` | Upscale video quality |
| Media Merger | `infsh/media-merger` | Merge videos with transitions |
## Browse All Video Apps
```bash
belt app list --category video
```
## Examples
### Text-to-Video with Veo
```bash
belt app run google/veo-3-1-fast --input '{
"prompt": "A timelapse of a flower blooming in a garden"
}'
```
### Grok Video
```bash
belt app run xai/grok-imagine-video --input '{
"prompt": "Waves crashing on a beach at sunset",
"duration": 5
}'
```
### Image-to-Video with Wan 2.5
```bash
belt app run falai/wan-2-5 --input '{
"image_url": "https://your-image.jpg"
}'
```
### AI Avatar / Talking Head
```bash
belt app run bytedance/omnihuman-1-5 --input '{
"image_url": "https://portrait.jpg",
"audio_url": "https://speech.mp3"
}'
```
### Fabric Lipsync
```bash
belt app run falai/fabric-1-0 --input '{
"image_url": "https://face.jpg",
"audio_url": "https://audio.mp3"
}'
```
### PixVerse Lipsync
```bash
belt app run falai/pixverse-lipsync --input '{
"image_url": "https://portrait.jpg",
"audio_url": "https://speech.mp3"
}'
```
### Video Upscaling
```bash
belt app run falai/topaz-video-upscaler --input '{"video_url": "https://..."}'
```
### Add Sound Effects (Foley)
```bash
belt app run infsh/hunyuanvideo-foley --input '{
"video_url": "https://silent-video.mp4",
"prompt": "footsteps on gravel, birds chirping"
}'
```
### Merge Videos
```bash
belt app run infsh/media-merger --input '{
"videos": ["https://clip1.mp4", "https://clip2.mp4"],
"transition": "fade"
}'
```
## Related Skills
```bash
# Full platform skill (all 250+ apps)
npx skills add inference-sh/skills@infsh-cli
# Pruna P-Video (fast & economical)
npx skills add inference-sh/skills@p-video
# Google Veo specific
npx skills add inference-sh/skills@google-veo
# AI avatars & lipsync
npx skills add inference-sh/skills@ai-avatar-video
# Text-to-speech (for video narration)
npx skills add inference-sh/skills@text-to-speech
# Image generation (for image-to-video)
npx skills add inference-sh/skills@ai-image-generation
# Twitter (post videos)
npx skills add inference-sh/skills@twitter-automation
```
Browse all apps: `belt app list`
## Documentation
- [Running Apps](https://inference.sh/docs/apps/running) - How to run apps via CLI
- [Streaming Results](https://inference.sh/docs/api/sdk/streaming) - Real-time progress updates
- [Content Pipeline Example](https://inference.sh/docs/examples/content-pipeline) - Building media workflowsRelated Skills
More workflow tools in the Content Management category.
Compatible MCP Servers
Power these skills with protocol-compliant data sources.
