Get started in minutes
Follow the guide for your framework and add Agent Human to your pipeline.
Built for voice AI developers
Agent Human adds a video layer to your existing voice agent — nothing more, nothing less.
Drop-in for Pipecat & LiveKit
Plug directly into your existing voice AI pipeline. Agent Human sits at the end of your STT → LLM → TTS stack and renders the video — no rearchitecting required.
Simple API
Clean REST API and SDK. Send audio, get back a talking avatar video stream. Integrate in minutes.
Multi-Language & Lip Sync
Works across languages out of the box. Avatar lip movements stay in sync regardless of the language your TTS outputs.
Bring Your Own Avatar
Use any portrait photo — upload an image or provide a URL. No need to pick from a preset library.
Custom Avatar Generation
Generate a photorealistic avatar from scratch for your product or brand.
Real-Time Video
Low-latency video generation built to keep up with live voice conversations.
Simple, Transparent Pricing
Choose the plan that's right for you
Explorer
For individuals and small projects
- 500 minutes/month
- 20-minute session limit
- 5 concurrent sessions
- Unlimited custom image avatars
- 20 avatar generations/month
Growth
For growing teams and products
- 3,000 minutes/month
- 60-minute session limit
- 10 concurrent sessions
- Unlimited custom image avatars
- 60 avatar generations/month
Pro
For high-volume production use
- 8,000 minutes/month
- 180-minute session limit
- 20 concurrent sessions
- Unlimited custom image avatars
- 180 avatar generations/month
Need more minutes? No problem.
Additional usage is billed at $0.18 per extra minute.
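To estimate what extra minutes might cost, here is a rough sketch in Python. The included minutes and the $0.18 rate come from the plans above; the function name `overage_charge` and the assumption that overage is billed linearly per whole minute beyond the monthly allowance are ours for illustration, not a statement of how billing actually works.

```python
from decimal import Decimal

# Included minutes per plan, taken from the plan lists above.
INCLUDED_MINUTES = {"Explorer": 500, "Growth": 3000, "Pro": 8000}

# Overage rate from the pricing note: $0.18 per extra minute.
OVERAGE_RATE = Decimal("0.18")


def overage_charge(plan: str, minutes_used: int) -> Decimal:
    """Estimate the extra charge in USD for minutes beyond the plan's allowance.

    Assumption (hypothetical): overage is billed per whole minute, linearly,
    with no rounding rules, discounts, or minimums.
    """
    extra_minutes = max(0, minutes_used - INCLUDED_MINUTES[plan])
    return extra_minutes * OVERAGE_RATE


# Growth plan with 3,500 minutes used: 500 extra minutes at $0.18 each.
print(overage_charge("Growth", 3500))  # prints 90.00
```

Under these assumptions, staying within the included minutes yields a charge of zero, so the estimate only matters once usage passes the plan's monthly allowance.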
Get in Touch
We'd love to hear from you. Choose how you'd like to reach us.
Contact Form
Send us a message
Schedule a Meeting
Book a time that works for you