Your AI stage hand for live streaming.

We watch the camera, the screen, and the mic so you can focus on the show.

View on GitHub
The Problem

Live streaming has exploded, and the streamer is drowning in manual work.

Behind every successful stream sits a single person juggling four jobs at once. They play the game, they read chat, they react to alerts, and they pilot OBS and VTube Studio with hotkeys.

Hotkey conflicts

A scene-switch hotkey accidentally fires a game ability mid-fight.

Forgotten triggers

A streamer misses a Hype Train or a raid because the small notification slipped past their attention.

Awkward scene switches

They mis-click a transition and viewers see a black screen or the wrong overlay.

VTuber emote selection

A VTuber wants their character to react with a surprised expression, but they have to remember which of 30 hotkeys triggers it while playing.

Cognitive overload

Memorizing 10+ hotkey combinations during demanding gameplay drives the documented rise in streamer burnout for 2025 and 2026.

Small and mid-size streamers cannot afford a producer to handle this. They need software that does it for them.

The Solution

AuTuber is an AI agent that runs in the background as your stage hand.

Instead of you pressing shortcut keys, the agent observes your context from multiple sources at once. It watches your webcam, listens to your microphone, captures your game or work screen, and reads OBS and VTube Studio state. Then it controls your local streaming tools to match the moment.

Scene and overlay control via OBS WebSocket

switch from gameplay to facecam when you start talking to chat, drop in a BRB scene when you step away, fade overlays in and out based on the activity on screen.

VTube Studio expression triggers

fire a surprised emote when chat reacts to a clutch play, swap to a thinking pose when you open a code editor, animate a celebration on a Hype Train.

Audience moment recognition

catch raids, donations, and Hype Trains and produce visible reactions so the moment lands with the audience.

Safety by default

every action passes through validation, cooldowns, and configurable autonomy levels so you stay in control.

AuTuber general overview

You keep your existing OBS scenes, VTS hotkeys, and platform setup. AuTuber plugs into them.

Why AuTuber

The streaming automation market exists, and every product in it stops short of contextual intelligence.

We surveyed the competitive landscape across event-based automation platforms, hardware controllers, AI co-pilots, and OBS/VTS plugins.

ProductOBSVTSLLMMultimodal ContextCost
Streamer.botYesYesNoChat onlyFree
AitumYesYesNoChat only$5/mo
Advanced Scene SwitcherYesNoNoMotion onlyFree
OBS AgentYesNoYesMetrics onlyFree
Streamlabs Intelligent AgentYesUnclearYes4 supported games onlyFree/Paid
Elgato Stream DeckYesManualNoNone$99-$299
VSeeFaceNoManualNoFacial onlyFree
AuTuberYesYesYesCamera + Screen + Audio + StateOpen
1. Multimodal context

We combine the webcam, screen capture, microphone audio, and live OBS/VTS state into one observation that the model reasons over.

2. LLM-driven action planning

The agent decides what to do next from context.

3. Plug-and-play with existing workflows

We control OBS and VTube Studio through their official WebSocket APIs and auto-discover default ports.

4. Open and platform-agnostic

Twitch, YouTube, Kick, and TikTok all work because we control the local tools, not the platform.

5. Affordable for solo streamers

Small and mid-size creators get the producer-class behavior that has only been available to professional studios.

Demo
Development Team
Development team

Jacob Berger

Project ManagerAI EngineerFull Stack Engineer

Anthony Kung

AI Engineer

Brian Phan

Full Stack EngineerCameramanVideo Editor

Marcus Tin

Full Stack EngineerPitch Deck