inference-sh•376•Content Management

AI Video Generation

Generate AI videos with 40+ models (Veo, Seedance, Wan, Grok, and more) using the inference.sh CLI. Supports text-to-video, image-to-video, avatar/lipsync, video upscaling, and foley sound effects.

Installation Command

$ npx skills add https://github.com/infsh-skills/skills --skill ai-video-generation

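Overview

Generate AI videos with 40+ models using the inference.sh CLI. Supports text-to-video, image-to-video, lipsync, avatar animation, video upscaling, and foley sound effects. Suited to social media, marketing, explainer videos, product demos, and AI avatars.

Key Features

40+ video models, including Google Veo 3.1/3, Seedance, Wan 2.5, Grok Imagine Video, OmniHuman, and P-Video
Text-to-video and image-to-video generation with configurable prompts and parameters
Avatar animation and realistic lipsync
Video upscaling, foley sound effects, and media merging with transitions
Real-time progress and streaming support through belt CLI integration
Model routing and examples, with support for fast/economical or high-quality workflows

Use Cases

Creating short social media videos, marketing videos, and product demos
Generating AI avatars and talking-head interview content from an image plus audio
Text-to-video or image-to-video animation for explainer videos and prototyping
Upscaling existing videos and enhancing them with sound effects

Installation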

SKILL.md

---
name: ai-video-generation
description: "Generate AI videos with Google Veo, Seedance, Wan, Grok and 40+ models via inference.sh CLI. Models: Veo 3.1, Veo 3, Seedance 1.5 Pro, Wan 2.5, Grok Imagine Video, OmniHuman, Fabric, HunyuanVideo. Capabilities: text-to-video, image-to-video, lipsync, avatar animation, video upscaling, foley sound. Use for: social media videos, marketing content, explainer videos, product demos, AI avatars. Triggers: video generation, ai video, text to video, image to video, veo, animate image, video from image, ai animation, video generator, generate video, t2v, i2v, ai video maker, create video with ai, runway alternative, pika alternative, sora alternative, kling alternative"
allowed-tools: Bash(belt *)
---

# AI Video Generation

Generate videos with 40+ AI models via [inference.sh](https://inference.sh) CLI.

![AI Video Generation](https://cloud.inference.sh/app/files/u/4mg21r6ta37mpaz6ktzwtt8krr/01kg2c0egyg243mnyth4y6g51q.jpeg)

## Quick Start

> Requires inference.sh CLI (`belt`). [Install instructions](https://raw.githubusercontent.com/inference-sh/skills/refs/heads/main/cli-install.md)

```bash
belt login

# Generate a video with Veo
belt app run google/veo-3-1-fast --input '{"prompt": "drone shot flying over a forest"}'
```
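Prompts that contain double quotes or backslashes will break a hand-written `--input` string. The sketch below shows one way to build the JSON payload from a shell variable. The helper names `json_escape` and `build_prompt_input` are hypothetical and not part of `belt`, and the escaping covers only backslashes and double quotes, not newlines or control characters.

```bash
# Escape backslashes and double quotes so the text is safe inside a JSON string.
# Assumption: the prompt is a single line without control characters.
json_escape() {
  printf '%s' "$1" | sed 's/\\/\\\\/g; s/"/\\"/g'
}

# Wrap an escaped prompt in the {"prompt": ...} object used in the Quick Start.
build_prompt_input() {
  printf '{"prompt": "%s"}' "$(json_escape "$1")"
}

payload=$(build_prompt_input 'a neon sign that says "open" at dusk')
echo "$payload"

# Pass the payload to the CLI once you are logged in:
# belt app run google/veo-3-1-fast --input "$payload"
```

For prompts that may contain newlines or other special characters, a dedicated JSON tool is a safer choice than this hand-rolled escaping.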

## Available Models

### Text-to-Video

| Model | App ID | Best For |
|-------|--------|----------|
| Veo 3.1 Fast | `google/veo-3-1-fast` | Fast, with optional audio |
| Veo 3.1 | `google/veo-3-1` | Best quality, frame interpolation |
| Veo 3 | `google/veo-3` | High quality with audio |
| Veo 3 Fast | `google/veo-3-fast` | Fast with audio |
| Veo 2 | `google/veo-2` | Realistic videos |
| **P-Video** | `pruna/p-video` | Fast, economical, with audio support |
| **WAN-T2V** | `pruna/wan-t2v` | Economical 480p/720p |
| Grok Video | `xai/grok-imagine-video` | xAI, configurable duration |
| Seedance 1.5 Pro | `bytedance/seedance-1-5-pro` | With first-frame control |
| Seedance 1.0 Pro | `bytedance/seedance-1-0-pro` | Up to 1080p |

### Image-to-Video

| Model | App ID | Best For |
|-------|--------|----------|
| Wan 2.5 | `falai/wan-2-5` | Animate any image |
| Wan 2.5 I2V | `falai/wan-2-5-i2v` | High quality i2v |
| **WAN-I2V** | `pruna/wan-i2v` | Economical 480p/720p |
| **P-Video** | `pruna/p-video` | Fast i2v with audio |
| Seedance Lite | `bytedance/seedance-1-0-lite` | Lightweight 720p |

### Avatar / Lipsync

| Model | App ID | Best For |
|-------|--------|----------|
| OmniHuman 1.5 | `bytedance/omnihuman-1-5` | Multi-character |
| OmniHuman 1.0 | `bytedance/omnihuman-1-0` | Single character |
| Fabric 1.0 | `falai/fabric-1-0` | Image talks with lipsync |
| PixVerse Lipsync | `falai/pixverse-lipsync` | Realistic lipsync |

### Utilities

| Tool | App ID | Description |
|------|--------|-------------|
| HunyuanVideo Foley | `infsh/hunyuanvideo-foley` | Add sound effects to video |
| Topaz Upscaler | `falai/topaz-video-upscaler` | Upscale video quality |
| Media Merger | `infsh/media-merger` | Merge videos with transitions |
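
When scripting against several models, it can help to keep the choice of app ID in one place. The following is a minimal sketch that maps a task and a priority to app IDs taken from the tables above. The function name `pick_app` and the labels `t2v`, `i2v`, `avatar`, `fast`, `quality`, and `economy` are hypothetical conventions for this example, not `belt` options, and the mapping is one reading of the "Best For" columns.

```bash
# Map a task and a priority to one of the app IDs listed in the tables above.
# The task/priority labels are illustrative only.
pick_app() {
  case "$1:$2" in
    t2v:fast)     echo "google/veo-3-1-fast" ;;
    t2v:quality)  echo "google/veo-3-1" ;;
    t2v:economy)  echo "pruna/wan-t2v" ;;
    i2v:quality)  echo "falai/wan-2-5-i2v" ;;
    i2v:economy)  echo "pruna/wan-i2v" ;;
    avatar:*)     echo "bytedance/omnihuman-1-5" ;;
    *)            echo "unknown task/priority: $1 $2" >&2; return 1 ;;
  esac
}

app=$(pick_app t2v fast)
echo "$app"

# Then run the selected app:
# belt app run "$app" --input '{"prompt": "drone shot flying over a forest"}'
```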

## Browse All Video Apps

```bash
belt app list --category video
```

## Examples

### Text-to-Video with Veo

```bash
belt app run google/veo-3-1-fast --input '{
  "prompt": "A timelapse of a flower blooming in a garden"
}'
```

### Grok Video

```bash
belt app run xai/grok-imagine-video --input '{
  "prompt": "Waves crashing on a beach at sunset",
  "duration": 5
}'
```

### Image-to-Video with Wan 2.5

```bash
belt app run falai/wan-2-5 --input '{
  "image_url": "https://your-image.jpg"
}'
```

### AI Avatar / Talking Head

```bash
belt app run bytedance/omnihuman-1-5 --input '{
  "image_url": "https://portrait.jpg",
  "audio_url": "https://speech.mp3"
}'
```

### Fabric Lipsync

```bash
belt app run falai/fabric-1-0 --input '{
  "image_url": "https://face.jpg",
  "audio_url": "https://audio.mp3"
}'
```

### PixVerse Lipsync

```bash
belt app run falai/pixverse-lipsync --input '{
  "image_url": "https://portrait.jpg",
  "audio_url": "https://speech.mp3"
}'
```

### Video Upscaling

```bash
belt app run falai/topaz-video-upscaler --input '{"video_url": "https://..."}'
```

### Add Sound Effects (Foley)

```bash
belt app run infsh/hunyuanvideo-foley --input '{
  "video_url": "https://silent-video.mp4",
  "prompt": "footsteps on gravel, birds chirping"
}'
```

### Merge Videos

```bash
belt app run infsh/media-merger --input '{
  "videos": ["https://clip1.mp4", "https://clip2.mp4"],
  "transition": "fade"
}'
```
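
If the clip list comes from a script rather than being typed by hand, the `videos` array can be assembled programmatically. This is a rough sketch, assuming the input shape shown in the example above (`videos` and `transition`). The helper `build_merge_input` is hypothetical, and the URLs are assumed to contain no double quotes or backslashes.

```bash
# Build the media-merger input from a transition name and any number of clip URLs.
# Assumption: URLs contain no characters that need JSON escaping.
build_merge_input() {
  transition=$1
  shift
  list=""
  for url in "$@"; do
    if [ -n "$list" ]; then
      list="$list, "
    fi
    list="$list\"$url\""
  done
  printf '{"videos": [%s], "transition": "%s"}' "$list" "$transition"
}

merge_input=$(build_merge_input fade "https://clip1.mp4" "https://clip2.mp4")
echo "$merge_input"

# belt app run infsh/media-merger --input "$merge_input"
```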

## Related Skills

```bash
# Full platform skill (all 250+ apps)
npx skills add inference-sh/skills@infsh-cli

# Pruna P-Video (fast & economical)
npx skills add inference-sh/skills@p-video

# Google Veo specific
npx skills add inference-sh/skills@google-veo

# AI avatars & lipsync
npx skills add inference-sh/skills@ai-avatar-video

# Text-to-speech (for video narration)
npx skills add inference-sh/skills@text-to-speech

# Image generation (for image-to-video)
npx skills add inference-sh/skills@ai-image-generation

# Twitter (post videos)
npx skills add inference-sh/skills@twitter-automation
```

Browse all apps: `belt app list`

## Documentation

- [Running Apps](https://inference.sh/docs/apps/running) - How to run apps via CLI
- [Streaming Results](https://inference.sh/docs/api/sdk/streaming) - Real-time progress updates
- [Content Pipeline Example](https://inference.sh/docs/examples/content-pipeline) - Building media workflows
