Skip to main content

Documentation Index

Fetch the complete documentation index at: https://rimp.mintlify.app/llms.txt

Use this file to discover all available pages before exploring further.

The headline feature: send a single prompt to N models and get all results in one structured response. No more juggling Replicate, OpenAI, Runway, ElevenLabs, and a dozen other dashboards to find which model handles your use case best. Rimp spans 8 modalities behind one API key and one credit balance:
ModalityEndpointModeWhat it does
ImagePOST /v1/imagessync 200Text-to-image and image-to-image
VideoPOST /v1/videosasync 202Text-to-video and image-to-video
VoicePOST /v1/voicesync 200Text-to-speech, voice cloning, transcription
MusicPOST /v1/musicasync 202Full songs, with or without vocals
AvatarPOST /v1/avatarsasync 202Talking avatars and lipsync
ChatPOST /v1/chatsync 200LLM chat completions
3DPOST /v1/three-dasync 202Text-to-3D and image-to-3D (GLB meshes)
UpscalePOST /v1/upscaleasync 202Image upscaling and face restoration
Costs below are in credits — Rimp’s universal currency (≈ 1,000 credits per $1 of underlying provider cost). The figures are the base cost per unit, exactly as shown in Studio; your wallet debit then applies your plan’s margin multiplier (Free ×1, Pro & Studio ×1.6, Team ×1.3). Call GET /v1/models for the live catalog. cr = credits.
Models tagged Studio+ require a Studio or Team plan. Calling them on Free or Pro returns 402 plan_upgrade_required.

Image models

SlugProviderTierCreditsCapabilities
flux-2-klein-9bBlack Forest Labsfast3 cr / imagetext-to-image
seedream-5-liteByteDancefast20 cr / imagetext-to-image, image-to-image
openai-imageOpenAIstandard40 cr / imagetext-to-image, image-to-image
nano-banana-proGoogle (Gemini)standard40 cr / imagetext-to-image, image-to-image
qwen-image-2-proQwenpro40 cr / imagetext-to-image
flux-proBlack Forest Labspro40 cr / imagetext-to-image
flux-pro-i2iBlack Forest Labspro50 cr / imageimage-to-image
imagen-4Googlepro50 cr / imagetext-to-image
recraft-v4-proRecraftpro50 cr / imagetext-to-image (design + SVG)
flux-2-proBlack Forest Labspro60 cr / imagetext-to-image, image-to-image
ideogram-v2-turboIdeogramfast50 cr / imagetext-to-image (typography)
imagen-4-ultraGooglepro80 cr / imagetext-to-image — Studio+
ideogram-v2Ideogramstandard80 cr / imagetext-to-image (typography)
flux-2-maxBlack Forest Labspro100 cr / imagetext-to-image — Studio+
openai-gpt-image-2OpenAIpro210 cr / imagetext-to-image, image-to-image — Studio+

Video models

Billed per second unless noted. Image-to-video models accept an image_url.
SlugProviderTierCreditsCapabilities
runway-gen4-turboRunwayfast50 cr / sectext-to-video, image-to-video
kling-v3Klingstandard100 cr / sectext-to-video, image-to-video
seedance-2-0ByteDancestandard120 cr / sectext-to-video (audio sync)
kling-v3-omniKlingpro280 cr / sectext-to-video, image-to-video (audio + 4K) — Studio+
runway-gen4-5Runwaypro300 cr / sectext-to-video, image-to-video — Studio+
openai-sora-2OpenAIstandard300 cr / sectext-to-video (synced audio) — Studio+
luma-ray2Lumastandard320 cr / Mpxtext-to-video, image-to-video — Studio+
veo-3-1-fastGoogle Veofast400 cr / sectext-to-video, image-to-video
openai-sora-2-proOpenAIpro500 cr / sectext-to-video, image-to-video (synced audio) — Studio+
veo-3-1-standardGoogle Veostandard750 cr / sectext-to-video, image-to-video — Studio+

Voice models

SlugProviderTierCreditsCapabilities
inworld-realtime-tts-2Inworldfast20 cr / 1k charstext-to-voice
minimax-speech-2-8-hdMiniMaxpro30 cr / 1k charstext-to-voice, voice cloning
elevenlabs-scribe-v2ElevenLabspro12 cr / mintranscription

Music models

SlugProviderTierCreditsCapabilities
minimax-music-2-8MiniMaxstandard100 cr / songtext-to-music (vocals)
lyria-3-proGooglepro120 cr / songtext-to-music (up to 3 min)

Avatar models

Billed per second of output. Two request shapes: text-to-avatar (text + optional avatar_id/voice_id) or lipsync (video_url + audio_url).
SlugProviderTierCreditsCapabilities
kling-lip-syncKlingstandard40 cr / seclipsync
sync-lipsync-2-proSync Labspro50 cr / seclipsync
heygen-avatar-vHeyGenstandard67 cr / seclipsync
heygen-avatar-ivHeyGenstandard67 cr / seclipsync — Studio+
kling-avatar-v2Klingpro120 cr / seclipsync (realistic + cartoon) — Studio+

3D models

Output is a GLB mesh URL. Billed per mesh.
SlugProviderTierCreditsCapabilities
trellis-3dTRELLISfast33 cr / modelimage-to-3D
hunyuan-3d-3-1Tencentstandard400 cr / modeltext-to-3D, image-to-3D
hyper3d-rodinRodinpro400 cr / modeltext-to-3D, image-to-3D, rigged characters — Studio+
trellis-2-proTRELLISpro820 cr / modelimage-to-3D (PBR materials) — Studio+

Upscale models

SlugProviderTierCreditsCapabilities
real-esrganReal-ESRGANfast2 cr / imageimage upscale, face restore
gfpgan-face-upscaleGFPGANstandard4 cr / imageface restore
recraft-crisp-upscaleRecraftstandard4 cr / imageimage upscale (commercial-safe)
topaz-image-upscaleTopazpro3 cr / Mpximage upscale, face restore — Studio+
clarity-upscalerClaritypro17 cr / imageimage upscale (Magnific-like)

Chat models

Billed per output token. The figures below are credits per 1M output tokens — a single turn typically uses far fewer (the default cap is 1,024 tokens). Charged on max_tokens, then reconciled with actual usage on completion.
SlugProviderTierCredits / 1M tokensCapabilities
deepseek-v3DeepSeekfast300text
meta-llama-4-scoutMetafast350text, multimodal (10M context)
meta-llama-4-maverickMetastandard500text, multimodal
openai-gpt-4o-miniOpenAIfast600text, tool use
deepseek-v4-proDeepSeekpro800text, tool use
qwen-3-maxQwenpro1,200text, tool use
anthropic-claude-haiku-4-5Anthropicfast1,250text, tool use
openai-gpt-5-miniOpenAIfast2,000text, tool use, multimodal
mistral-large-3Mistralstandard3,000text, tool use — Studio+
gemini-3-proGooglepro7,500text, tool use, multimodal — Studio+
openai-gpt-4oOpenAIpro10,000text, tool use, multimodal — Studio+
anthropic-claude-sonnet-4-5Anthropicstandard15,000text, tool use, multimodal — Studio+
xai-grok-4xAIpro15,000text, tool use — Studio+
openai-gpt-5OpenAIpro30,000text, tool use, multimodal — Studio+
anthropic-claude-opus-4-7Anthropicpro75,000text, tool use, multimodal — Studio+

Comparison API

Send one prompt to several models at once. Works across any models that share a modality.
const cmp = await client.comparisons.create({
  prompt: 'cinematic portrait of a fisherman at golden hour, 35mm film',
  models: ['flux-pro', 'imagen-4', 'ideogram-v2'],
  params_shared: { aspect_ratio: '1:1' },
});

Atomic credit reservation

Rimp sums the estimated cost of all N models and reserves it in one transaction. If your wallet can’t cover the sum, you get 402 Insufficient credits and nothing is charged — not even the models that would have fit.

Plan limits

The number of models per comparison is capped by your plan:
PlanMax models per comparison
Free2
Pro3
Studio6
Team12

Polling vs webhooks

A comparison returns 202 with a parent comparison object. Two ways to wait for results:
const result = await client.comparisons.waitFor(cmp.id);
// resolves when all child generations reach a terminal state
result.generations.forEach((g) => console.log(g.model, g.outputs[0]?.url));
Fetch the full result any time with GET /v1/comparisons/{id} — it returns the parent plus every child generation (status, charged credits, and signed output URLs).

Capability matrix

Not every model supports every operation. Check the capabilities array on each model (visible in the tables above and in GET /v1/models):
{
  "slug": "flux-pro-i2i",
  "modality": "image",
  "capabilities": ["image_to_image"]
}
Common capabilities: text_to_image, image_to_image, text_to_video, image_to_video, text_to_voice, voice_clone, transcription, text_to_music, lipsync, text_to_text, tool_use, multimodal_input, text_to_3d, image_to_3d, rigged_3d, image_upscale, face_restore.