Video Synthesizer API & FFmpeg Usage
Overview
The video synthesizer can now be controlled remotely via REST API and output to FFmpeg for recording or streaming.
Features
- REST API: Control all parameters remotely via HTTP
- FFmpeg Output: Record to file or stream via UDP/SRT/RTMP
- Headless Mode: Run without GUI for server deployments
- Agent Control: Perfect for AI agents or automation scripts
Installation
Install the additional dependencies:
pip install fastapi uvicorn
Make sure FFmpeg is installed and in your PATH:
- Windows: Download from https://ffmpeg.org/download.html
- macOS: brew install ffmpeg
- Linux: apt-get install ffmpeg or yum install ffmpeg
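A quick way to verify the FFmpeg install from Python before launching the synth (a small sketch using only the standard library):

```python
import shutil
import subprocess

def check_ffmpeg():
    """Return the first line of `ffmpeg -version`, or None if ffmpeg
    is not on PATH or fails to run."""
    path = shutil.which('ffmpeg')
    if path is None:
        return None
    result = subprocess.run([path, '-version'], capture_output=True, text=True)
    if result.returncode != 0:
        return None
    return result.stdout.splitlines()[0]

if __name__ == '__main__':
    version = check_ffmpeg()
    print(version or 'ffmpeg not found - install it and check your PATH')
```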
Usage
Enable API Server
Start the synthesizer with API enabled:
python -m video_synth --api
The API server starts at http://0.0.0.0:8000 by default (accessible from any device on the network).
To restrict to localhost only, pass --api-host 127.0.0.1.
Custom host and port:
python -m video_synth --api --api-host 127.0.0.1 --api-port 8080
Web UI
A browser-based control panel is bundled with the API. Build it once from the web/ directory:
cd web
npm install
npm run build
After building, start the synth with --api and open http://<host>:8000/ui in a browser.
The web UI provides sliders for all parameters, live video preview, LFO controls, and MIDI learn — no separate server needed.
Development mode (auto-reloads on source changes):
cd web
npm run dev
The dev server runs on port 5173 and proxies API and WebSocket traffic to the Python backend automatically.
Enable FFmpeg Output
Record to file:
python -m video_synth --ffmpeg --ffmpeg-output output.mp4
Stream over UDP (lowest latency, no server needed):
python -m video_synth --ffmpeg --ffmpeg-output udp://127.0.0.1:1234
Encoding options:
python -m video_synth --ffmpeg --ffmpeg-output output.mp4 \
--ffmpeg-preset fast --ffmpeg-crf 20
Presets: ultrafast, superfast, veryfast, faster, fast, medium, slow, slower, veryslow
CRF: 0-51 (lower = better quality, 23 is default)
Headless Mode
Run without GUI (requires --api or --ffmpeg):
python -m video_synth --headless --api --ffmpeg --ffmpeg-output output.mp4
This is useful for server deployments or when running on systems without a display.
Combined Usage
API + FFmpeg + Headless for remote-controlled recording:
python -m video_synth --headless --api --ffmpeg --ffmpeg-output recording.mp4
API Endpoints
The REST API provides the following endpoints:
Get All Parameters
GET http://127.0.0.1:8000/params
Returns a list of all parameters with their current values, min/max bounds, and metadata.
Example response:
[
{
"name": "glitch_intensity_max",
"value": 50,
"min": 0,
"max": 100,
"default": 50,
"group": "Groups.SRC_1_EFFECTS",
"subgroup": "Glitch_General",
"type": "Widget.SLIDER"
},
...
]
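Because each entry carries a group field, the full parameter list is easy to organize client-side. A small sketch (field names follow the example response above; the sample data below is illustrative, including the hypothetical Groups.PATTERN group):

```python
from collections import defaultdict

def group_params(params):
    """Index a /params response by its 'group' field.

    Field names ('group', 'name') follow the example response above."""
    groups = defaultdict(list)
    for p in params:
        groups[p['group']].append(p['name'])
    return dict(groups)

# Illustrative data shaped like the example response
# ('Groups.PATTERN' is a hypothetical group name):
sample = [
    {'name': 'glitch_intensity_max', 'group': 'Groups.SRC_1_EFFECTS'},
    {'name': 'pattern_speed', 'group': 'Groups.PATTERN'},
]
print(group_params(sample))
```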
Get Specific Parameter
GET http://127.0.0.1:8000/params/{param_name}
Example:
GET http://127.0.0.1:8000/params/glitch_intensity_max
Set Parameter Value
PUT http://127.0.0.1:8000/params/{param_name}
Content-Type: application/json
{
"value": 75
}
Example with curl:
curl -X PUT http://127.0.0.1:8000/params/glitch_intensity_max \
-H "Content-Type: application/json" \
-d '{"value": 75}'
Example with Python requests:
import requests
response = requests.put(
'http://127.0.0.1:8000/params/glitch_intensity_max',
json={'value': 75}
)
print(response.json())
Reset Parameter
POST http://127.0.0.1:8000/params/reset/{param_name}
Resets the parameter to its default value.
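A minimal Python sketch of the reset call, using the endpoint as documented above and the parameter name from the earlier examples:

```python
import requests

API_BASE = 'http://127.0.0.1:8000'

def reset_param(name, base_url=API_BASE):
    """POST to the reset endpoint and return the server's JSON reply."""
    response = requests.post(f'{base_url}/params/reset/{name}')
    response.raise_for_status()
    return response.json()

if __name__ == '__main__':
    print(reset_param('glitch_intensity_max'))
```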
Get Snapshot
GET http://127.0.0.1:8000/snapshot
Returns the current frame as a JPEG image. Useful for monitoring or analysis.
Example with Python:
import requests
from PIL import Image
import io
response = requests.get('http://127.0.0.1:8000/snapshot')
image = Image.open(io.BytesIO(response.content))
image.show()
MJPEG Stream
GET http://127.0.0.1:8000/stream
Continuous MJPEG stream at ~30 fps. Open directly in a browser or media player.
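If you want the individual JPEG frames in Python rather than a player, you can pull them out of the stream yourself. This sketch scans for the JPEG start/end markers (0xFFD8 / 0xFFD9) instead of parsing multipart boundaries; that is a simplification that works for typical MJPEG payloads but is not a robust multipart parser:

```python
def extract_jpegs(buffer):
    """Split complete JPEG frames (SOI 0xFFD8 .. EOI 0xFFD9) out of a
    byte buffer; returns (frames, leftover_bytes)."""
    frames = []
    while True:
        start = buffer.find(b'\xff\xd8')
        end = buffer.find(b'\xff\xd9', start + 2)
        if start == -1 or end == -1:
            break
        frames.append(buffer[start:end + 2])
        buffer = buffer[end + 2:]
    return frames, buffer

def iter_mjpeg_frames(url='http://127.0.0.1:8000/stream'):
    """Yield raw JPEG blobs from the MJPEG endpoint (requires requests)."""
    import requests
    buffer = b''
    with requests.get(url, stream=True) as resp:
        for chunk in resp.iter_content(chunk_size=8192):
            buffer += chunk
            frames, buffer = extract_jpegs(buffer)
            for frame in frames:
                yield frame
```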
WebSocket Stream
ws://127.0.0.1:8000/ws/stream
Low-latency binary JPEG frames pushed at the render rate (~30 fps). Each message is a raw JPEG blob. Used by the web UI for the live preview panel.
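A receiving sketch using the third-party websockets package (an assumption: install it with pip install websockets), with the endpoint as documented above:

```python
import asyncio

async def grab_frames(url='ws://127.0.0.1:8000/ws/stream', count=10):
    """Receive `count` binary JPEG frames and save the last one.

    Requires the third-party 'websockets' package (pip install websockets)."""
    import websockets
    async with websockets.connect(url) as ws:
        frame = None
        for _ in range(count):
            frame = await ws.recv()  # each message is one raw JPEG blob
        with open('last_frame.jpg', 'wb') as f:
            f.write(frame)

if __name__ == '__main__':
    asyncio.run(grab_frames())
```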
Web UI
http://127.0.0.1:8000/ui
Browser-based control panel (requires building with npm run build in web/).
API Documentation
Interactive API documentation (Swagger UI) is available at:
http://127.0.0.1:8000/docs
Agent Control Examples
Python Agent Example
import requests
import time
API_BASE = 'http://127.0.0.1:8000'
class VideoSynthAgent:
def __init__(self, base_url=API_BASE):
self.base_url = base_url
def get_params(self):
"""Get all parameters."""
response = requests.get(f'{self.base_url}/params')
return response.json()
def set_param(self, name, value):
"""Set a parameter value."""
response = requests.put(
f'{self.base_url}/params/{name}',
json={'value': value}
)
return response.json()
def get_snapshot(self):
"""Get current frame as PIL Image."""
from PIL import Image
import io
response = requests.get(f'{self.base_url}/snapshot')
return Image.open(io.BytesIO(response.content))
def animate_parameter(self, param_name, start, end, duration=5.0, steps=100):
"""Smoothly animate a parameter from start to end value."""
for i in range(steps):
progress = i / (steps - 1)
value = start + (end - start) * progress
self.set_param(param_name, value)
time.sleep(duration / steps)
# Usage
agent = VideoSynthAgent()
# Get all parameters
params = agent.get_params()
print(f"Found {len(params)} parameters")
# Animate glitch intensity
agent.animate_parameter('glitch_intensity_max', 0, 100, duration=10.0)
# Set multiple parameters
agent.set_param('pattern_alpha', 0.8)
agent.set_param('pattern_speed', 2.5)
# Capture snapshot
image = agent.get_snapshot()
image.save('snapshot.jpg')
LLM Agent Integration
The API is designed to be easily used by LLM agents. Example prompt:
You are controlling a video synthesizer via REST API.
Available endpoints:
- GET /params - list all parameters
- PUT /params/{name} - set parameter value (JSON: {"value": number})
- GET /snapshot - get current frame as image
Task: Create a psychedelic visual effect by:
1. Enabling pattern feedback
2. Setting high pattern warp
3. Animating the pattern speed
API base URL: http://127.0.0.1:8000
Automation Example
import requests
import time
import random
API_BASE = 'http://127.0.0.1:8000'
def random_glitch_sequence():
"""Create a random glitch art sequence."""
glitch_params = [
'enable_pixel_shift',
'enable_color_split',
'enable_block_corruption',
'enable_slitscan'
]
# Randomly enable/disable glitch effects
for param in glitch_params:
value = random.choice([0, 1])
requests.put(f'{API_BASE}/params/{param}', json={'value': value})
# Randomize intensity
intensity = random.randint(30, 100)
requests.put(f'{API_BASE}/params/glitch_intensity_max', json={'value': intensity})
# Run random glitch sequence every 5 seconds
while True:
random_glitch_sequence()
time.sleep(5)
OBS Integration
The video synthesizer can be integrated with OBS Studio in several ways.
Method 1: UDP Stream to OBS (Recommended)
MPEG-TS over UDP provides the lowest latency with zero setup - no external server required.
Step 1: Stream from Video Synth
python -m video_synth --ffmpeg \
--ffmpeg-output udp://127.0.0.1:1234 \
--ffmpeg-preset veryfast
Step 2: Add to OBS
- In OBS, add a Media Source
- Uncheck "Local File"
- Input: udp://127.0.0.1:1234
- Check "Restart playback when source becomes active"
- Set "Close file when inactive" to Off
Tips:
- Use the veryfast or ultrafast preset for lowest latency
- UDP on localhost has near-zero packet loss
- No external RTMP server needed - just start the synth and point OBS at the UDP address
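Before pointing OBS at the address, you can sanity-check that the UDP stream is decodable by probing it with ffprobe (ships with FFmpeg). A sketch:

```python
import subprocess

def udp_stream_alive(url='udp://127.0.0.1:1234', timeout=5):
    """Return True if ffprobe can read stream info from the UDP address.

    ffprobe blocks waiting for packets, so a timeout means no stream."""
    try:
        result = subprocess.run(
            ['ffprobe', '-v', 'error', '-show_streams', url],
            capture_output=True, text=True, timeout=timeout)
        return result.returncode == 0
    except (subprocess.TimeoutExpired, FileNotFoundError):
        return False

if __name__ == '__main__':
    print('stream up' if udp_stream_alive() else 'no stream detected')
```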
Method 2: OBS WebSocket Control
Control OBS programmatically while using virtual camera or UDP/SRT stream.
Install Dependencies
pip install obs-websocket-py
Enable OBS WebSocket
- In OBS, go to Tools > WebSocket Server Settings
- Enable WebSocket server
- Set a password (optional but recommended)
- Note the port (default: 4455 for OBS 28+, 4444 for older versions)
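If you are not using the bundled obs_controller wrapper, the same operations can be driven directly with obs-websocket-py. A sketch; note the request names (StartRecording/StopRecording) follow the obs-websocket 4.x protocol, while OBS 28+ speaks protocol 5 with different names (e.g. StartRecord/StopRecord), so match this to your plugin and library versions:

```python
def record_clip(seconds=10, host='localhost', port=4444, password=''):
    """Start and stop an OBS recording via obs-websocket-py.

    Request names follow the 4.x protocol; for OBS 28+ (protocol 5)
    the equivalents are StartRecord/StopRecord -- check your versions."""
    import time
    from obswebsocket import obsws, requests as obs_requests
    ws = obsws(host, port, password)
    ws.connect()
    try:
        ws.call(obs_requests.StartRecording())
        time.sleep(seconds)
        ws.call(obs_requests.StopRecording())
    finally:
        ws.disconnect()
```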
Use OBS Controller
from obs_controller import OBSController
# Connect to OBS
obs = OBSController(password="your_password")
obs.connect()
# Start recording
obs.start_recording()
# Switch scene
obs.set_scene("Scene 2")
# Stop recording after some time
import time
time.sleep(60)
obs.stop_recording()
# Disconnect
obs.disconnect()
Combined API + OBS Control Example
import requests
import time
from obs_controller import OBSController
API_BASE = 'http://127.0.0.1:8000'
# Connect to OBS
obs = OBSController(password="your_password")
obs.connect()
# Start OBS recording
obs.start_recording()
# Animate video synth parameters via API
for i in range(100):
intensity = int(i)
requests.put(f'{API_BASE}/params/glitch_intensity_max',
json={'value': intensity})
time.sleep(0.1)
# Stop recording
obs.stop_recording()
obs.disconnect()
Method 3: Virtual Camera
Use OBS Virtual Camera as an intermediate device.
Step 1: Enable OBS Virtual Camera
- In OBS, click Start Virtual Camera
- This creates a virtual webcam device
Step 2: Use Virtual Camera in Video Synth
The video synth can capture from the virtual camera as a video device:
# List available devices first
python -m video_synth
# In the GUI, select "DEVICE_X" that corresponds to OBS Virtual Camera
Step 3: Create Feedback Loop
This creates interesting feedback effects:
- Video Synth → OBS (via UDP/SRT)
- OBS → Virtual Camera
- Virtual Camera → Video Synth (as input)
Warning: This can create intense visual feedback! Start with low effect intensities.
Method 4: NDI (Network Device Interface)
NDI allows low-latency video over network.
Install NDI Tools
Download from: https://ndi.tv/tools/
Install OBS NDI Plugin
Download from: https://github.com/obs-ndi/obs-ndi/releases
Install NDI Python Library
pip install ndi-python
Stream via NDI
NDI output is not currently implemented in the video synth. It could be added alongside the existing FFmpeg output path.
Method 5: SRT Protocol (Low Latency with Error Recovery)
SRT provides low latency with built-in error recovery, useful for streaming over networks.
python -m video_synth --ffmpeg \
--ffmpeg-output "srt://127.0.0.1:9999?pkt_size=1316" \
--ffmpeg-preset veryfast
In OBS, add a Media Source with input: srt://127.0.0.1:9999
Automated Recording Workflow
Complete example: Automated video generation with OBS recording.
import requests
import time
from obs_controller import OBSController
API_BASE = 'http://127.0.0.1:8000'
def automated_recording_session():
"""Automated 5-minute recording with parameter automation."""
# Connect to OBS
obs = OBSController(password="your_password")
obs.connect()
# Setup initial parameters
requests.put(f'{API_BASE}/params/pattern_alpha', json={'value': 0.5})
requests.put(f'{API_BASE}/params/pattern_fb_enable', json={'value': 1})
# Start recording
obs.start_recording()
print("Recording started...")
# Animate parameters over 5 minutes
duration = 300 # 5 minutes
steps = 300
for i in range(steps):
progress = i / steps
# Animate multiple parameters
params = {
'glitch_intensity_max': int(progress * 100),
'pattern_speed': progress * 3.0,
'pattern_fb_warp': progress * 15.0,
'warp_angle_amt': int(progress * 180)
}
for param, value in params.items():
requests.put(f'{API_BASE}/params/{param}', json={'value': value})
time.sleep(duration / steps)
# Print progress
if i % 30 == 0:
print(f"Progress: {int(progress * 100)}%")
# Stop recording
obs.stop_recording()
print("Recording stopped")
# Wait for OBS to finalize the file
time.sleep(2)
# Get recording info
status = obs.get_recording_status()
print(f"Recording status: {status}")
obs.disconnect()
if __name__ == "__main__":
# Start the video synth with UDP output
print("Start video_synth with:")
print("python -m video_synth --api --ffmpeg --ffmpeg-output udp://127.0.0.1:1234 --ffmpeg-preset veryfast")
print("\nThen run this script to automate the recording")
input("Press Enter when video_synth and OBS are ready...")
automated_recording_session()
Recommended Setup for Best Results
For Live Streaming (lowest latency):
# Terminal 1: Start video synth with UDP output
python -m video_synth --api --ffmpeg \
--ffmpeg-output udp://127.0.0.1:1234 \
--ffmpeg-preset ultrafast
# Terminal 2: Run automation script
python automation_script.py
For High-Quality Recording:
# Terminal 1: Video synth with medium quality
python -m video_synth --api --ffmpeg \
--ffmpeg-output udp://127.0.0.1:1234 \
--ffmpeg-preset medium
# OBS: Record at high quality settings
# OBS Settings > Output > Recording Quality: "High Quality, Medium File Size"
For Headless Server:
# No GUI, just API and UDP output
python -m video_synth --headless --api --ffmpeg \
--ffmpeg-output udp://127.0.0.1:1234 \
--ffmpeg-preset fast
Performance Notes
- API calls are thread-safe and non-blocking
- Parameter changes take effect immediately (next frame)
- Snapshot endpoint may have slight delay depending on frame rate
- FFmpeg encoding adds minimal overhead with appropriate presets
- For real-time streaming, use the ultrafast or veryfast preset
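These figures are worth measuring in your own setup. A small sketch timing the PUT round-trip for a parameter, using the endpoint and parameter name from the examples above:

```python
import time

def measure_put_latency(name='glitch_intensity_max', value=50,
                        base_url='http://127.0.0.1:8000', n=50):
    """Average round-trip time in milliseconds for n PUTs (requires requests)."""
    import requests
    start = time.perf_counter()
    for _ in range(n):
        requests.put(f'{base_url}/params/{name}', json={'value': value})
    return (time.perf_counter() - start) / n * 1000.0

if __name__ == '__main__':
    print(f'avg PUT latency: {measure_put_latency():.2f} ms')
```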
Troubleshooting
FFmpeg not found
Make sure FFmpeg is installed and in your system PATH:
ffmpeg -version
API server won't start
Check if port is already in use:
# Windows
netstat -ano | findstr :8000
# macOS/Linux
lsof -i :8000
Use a different port with --api-port.
Headless mode validation error
Headless mode requires either --api or --ffmpeg to be enabled.
Permission errors
On Linux/macOS, you may need to allow the port in your firewall:
# Allow port 8000
sudo ufw allow 8000