Mountsea AI API Introduction
Discover all you can do with our Mountsea AI API! Our comprehensive AI platform empowers developers to integrate advanced AI capabilities into their applications โ including video generation, image creation, music production, and multi-model LLM chat.Overview
Mountsea AI provides a unified API gateway to access the worldโs leading AI models across video, image, music, and language domains. We offer the following core services:Google (Gemini)
Video & image generation powered by Veo 2 / Veo 3 / Veo 3.1 and Nano Banana models
Sora2
OpenAIโs Sora-2 / Sora-2-Pro text-to-video generation with character roles
OpenAI (GPT Image)
GPT Image 2 image generation & editing โ async task API + official
openai SDK compatibilityXAI (Grok)
Image & video generation powered by xAIโs Grok models
Suno
Full-suite AI music generation, voice persona, custom model training, and audio processing via Suno AI
ElevenLabs
AI music generation with composition plans, video scoring, and stem separation via ElevenLabs
Producer
AI music generation powered by Google DeepMindโs Lyria 3 Pro model
Chat
Multi-protocol AI chat gateway supporting OpenAI, Claude, and Gemini APIs
Why Mountsea AI Stands Out
All-in-One API Platform
A single API key gives you access to multiple AI services across different domains โ no need to manage separate accounts and credentials for each AI provider.Multi-Protocol LLM Gateway
Our Chat service supports OpenAI Compatible API, Anthropic (Claude) API, OpenAI Responses API, and Google Gemini Native API. Use official SDKs directly โ just change thebase_url.
Comprehensive AI Model Coverage
Access top-tier AI models from leading providers:| Domain | Models |
|---|---|
| Video | Veo 2, Veo 3, Veo 3.1, Sora-2, Sora-2-Pro, Grok Imagine Video |
| Image | Nano Banana Fast/Pro/2, GPT Image 2, Grok Imagine Image |
| Music | Suno (chirp-v35 ~ chirp-v55, custom models), ElevenLabs music_v1, Lyria 3 Pro |
| LLM | GPT-5.1, GPT-5.2, Claude 4.5, Claude Opus 4.6, Claude Sonnet 4.6, Gemini 2.5/3/3.1 |
Budget-Friendly Excellence
Enjoy premium AI tools at competitive prices. Our platform delivers high-quality, scalable solutions while keeping your projects cost-effective.Available Services
๐ฌ Google (Gemini) โ Video, Image & SDK Compat
Powered by Googleโs cutting-edge AI, this service is organized into three dedicated sections:- ๐ฅ Video Generation โ Create videos with Veo 2 / Veo 3 / Veo 3.1. Supports
text2video,img2video,ingredients2video, upsample to 1080p/4K, extend, reshoot, and object insert/remove. - ๐ผ๏ธ Image Generation (Nano Banana) โ Create and edit images with Nano Banana Fast / Pro / 2 models, multiple aspect ratios, up to 4K resolution.
- ๐ Gemini Compat (Official SDK) โ Drop-in replacement for Googleโs official @google/genai SDK. Use
generateContent/streamGenerateContentwith base URLhttps://api.mountsea.ai/gemini. Image models (gemini-2.5-flash-image,gemini-3.1-flash-image-preview,gemini-3-pro-image-preview) auto-route to Nano Banana.
๐ฅ Sora2 โ OpenAI Video Generation
Powered by OpenAIโs Sora, this service provides:- Text-to-Video Generation โ Create high-quality videos from text prompts using Sora-2 or Sora-2-Pro models
- Image-to-Video โ Use reference images to guide video generation
- Character Roles โ Create custom characters from short video clips, then reference them in prompts with
@character_name - Style Presets โ Choose from styles like anime, retro, comic, vintage, and more
- Flexible Video Formats โ Landscape/portrait orientation, variable durations (10s/15s/25s), optional watermark removal
๐ผ๏ธ XAI (Grok) โ Image & Video Generation
Powered by xAIโs Grok models, this service provides:- Text-to-Image โ Generate high-quality images from text prompts using grok-imagine-image, with customizable aspect ratios (1:1, 2:3, 3:2, 9:16, 16:9)
- Image-to-Image โ Provide a reference image to guide generation โ output size follows the reference
- Text-to-Video โ Generate videos from text prompts using grok-imagine-video, with control over duration (6s/10s/15s), aspect ratio, and resolution (480P/720P)
- Image-to-Video โ Use a reference image to guide video generation โ aspect ratio and resolution follow the reference automatically
- Async Task System โ All generation requests return a
taskIdfor polling
๐จ OpenAI โ GPT Image Generation & Editing
Powered by OpenAIโs latest GPT Image 2 model, this service provides:- Text-to-Image โ Generate images from text prompts with flexible sizes (1024x1024, 1024x1536, 1536x1024) and quality levels
- Image Editing โ Edit existing images by providing a URL or base64 data URL
- Inpainting โ Use a mask image to repaint specific areas of an input image
- Transparent Background โ Generate images with transparent background (
background: transparent) - Two API Styles:
- Async Task API (
/openai/images) โ unified endpoint withtaskIdpolling - OpenAI Compat API (
/openai/v1/images/*) โ drop-in replacement for OpenAIโs official SDK, just changebase_url
- Async Task API (
๐ต Suno โ Music Generation
Powered by Suno AI, this service provides a full suite of music creation and processing tools:- Music Generation โ Create, extend, cover, mashup, sample, and generate inspiration-based tracks using 15 task types via a unified
/generateendpoint, with models fromchirp-v35tochirp-v55 - Sound Effects โ Generate one-shot or looped sound effects from text descriptions
- Lyrics Generation โ Generate original lyrics or mashup lyrics from two songs
- Voice Persona โ Create verified voice personas from your own recordings through a single-task two-phase voice verification flow (init โ await phrase โ complete)
- Custom Models โ Train personalized music models on your own audio (6+ training clips), then use
chirp-custom:<uuid>for generation - Audio Processing โ Concat clips, remaster tracks (V4.5+/V5 models), adjust playback speed with pitch preservation
- Stem Separation โ Separate tracks into vocals + instrumental (two-track) or all individual stems
- Audio Export โ Export to MP4 (with visualizer), lossless WAV, or MDI (MIDI) format
- Audio Analysis โ Get synchronized lyrics timeline, downbeat detection, and enhanced style tags
- Vocal Persona โ Extract vocal characteristics from clips and create reusable personas for consistent vocal style
๐ถ ElevenLabs โ AI Music Generation
Powered by ElevenLabsโ music_v1 model, this service provides:- Text-to-Music โ Generate music from simple text prompts or structured composition plans with section-level style and lyrics control
- Composition Plan โ AI-generated structured plans with sections, global/local styles, and lyrics โ free, no credits consumed
- Video to Music โ Automatically generate background music that matches your video content (up to 10 videos, 600s total)
- Stem Separation โ Split audio into 2 tracks (vocals + instrumental) or 6 individual stems
- Inpainting โ Edit specific sections of existing songs (enterprise only)
- Multiple Output Formats โ MP3, PCM, Opus with configurable sample rates and bitrates
๐ง Producer โ AI Music Generation
Powered by Google DeepMindโs Lyria 3 Pro model, this service provides high-quality music generation:- Create Music โ Generate original tracks from sound prompts, lyrics, and images
- Image-Guided Generation โ Use images to influence the mood and style of generated music
- Instrumental Mode โ Generate without vocals
- Stem Separation โ Separate audio into individual stems (vocals, drums, bass, etc.)
- Multi-Format Export โ Download as MP3/M4A/WAV audio or generate video with preset visualizers
๐ฌ Chat โ Multi-Protocol AI Gateway
A unified gateway supporting multiple API protocols:| Protocol | Base URL | Description |
|---|---|---|
| OpenAI Compatible | https://api.mountsea.ai/chat | Drop-in replacement for OpenAI Chat Completions API |
| OpenAI Responses | https://api.mountsea.ai/chat | OpenAI Responses API format |
| Anthropic (Claude) | https://api.mountsea.ai/chat/claude | Claude Code & Anthropic SDK compatible |
| Gemini Native | https://api.mountsea.ai/chat/gemini | Google Gemini Native API format |
- โ Use official SDKs (OpenAI, Anthropic, Google GenAI) directly
- โ Full streaming, function calling, and tool support
- โ Compatible with Claude Code, Cursor, Cherry Studio, and other AI tools
- โ Access GPT-5.1/5.2, Claude 4.5/Opus/Sonnet, Gemini 2.5/3/3.1 models
How to Get Started
Get Your API Key
Sign up at shanhaiapi.com, go to API ๅฏ้ฅ็ฎก็, and create a new API key.
Make Your First API Call
Use
Authorization: Bearer your-api-key in your request header and call the appropriate endpoint.API Base URL
All API requests are made to:Exception: The Claude (Anthropic) compatible API uses
https://api.mountsea.ai/chat/claude as the base URL.Get Started Today
Ready to integrate AI capabilities into your applications?- ๐ Check out our Quick Start Guide
- ๐ฌ Explore Gemini Video & Image API
- ๐ Try Gemini Compat API with the official
@google/genaiSDK - ๐ฅ Explore Sora2 Video API
- ๐ผ๏ธ Explore XAI (Grok) Image & Video API
- ๐จ Try OpenAI GPT Image API with both async tasks and the official
openaiSDK - ๐ต Explore Suno Music API
- ๐ถ Explore ElevenLabs Music API
- ๐ง Explore Producer Music API
- ๐ฌ Explore Chat LLM Gateway
- ๐ Need help? Contact us
Transform your creative projects with the power of AI. Start building amazing applications today!