docs: add HuggingFace cache troubleshooting to README

- Document HF_HOME environment variable for writable cache - Add systemd service permission guidance for /tmp paths - Troubleshooting steps for read-only file system errors
docs: update README with comprehensive effects documentation and bump version to 1.2.0
2026-02-26 15:56:09 -06:00 · 2026-01-31 17:33:28 -06:00 · 2026-01-31 17:28:47 -06:00 · 2026-01-31 17:25:52 -06:00 · 2026-01-31 17:10:19 -06:00 · 2026-01-31 16:56:15 -06:00
38 changed files with 1315 additions and 256 deletions
--- a/.env.example
+++ b/.env.example
--- a/.env.testing
+++ b/.env.testing
@@ -0,0 +1,21 @@
 # Discord Bot Configuration
 # Testing environment configuration
 # This file is used when running: python bot.py testing
 # Your Discord bot token (from Discord Developer Portal) - use a DIFFERENT bot for testing!
 DISCORD_TOKEN=MTQyNDU3MjA4MjI1MTEwODQyNQ.GJ8iyw.B2O1nlAsw6AlRz3YR5eSN-OcHm4j1l7lEHzxY0
 # The text channel ID to monitor for messages
 # (Right-click channel with Developer Mode enabled -> Copy ID)
 # Use a DIFFERENT channel for testing!
 TEXT_CHANNEL_ID=1424585470616146061
 # Directory containing voice .wav files
 VOICES_DIR=./voices
 # Default voice name (optional - uses first found voice if not set)
 # This should match the filename without .wav extension (case-insensitive)
 # DEFAULT_VOICE=masterchief
 # HuggingFace cache directory (must be writable)
 HF_HOME=/tmp/huggingface
--- a/.gitignore
+++ b/.gitignore
@@ -117,10 +117,15 @@ dmypy.json
 .venv
 env/
 venv/
 linux_venv/
 ENV/
 env.bak/
 venv.bak/
 /venv
 .numba_cache/
 # Gemini files
 GEMINI.md
 PROGRESS.md
 .vscode/launch.json
 voices/preferences.json
--- a/README.md
+++ b/README.md
@@ -11,6 +11,12 @@ A Discord bot that reads messages aloud using [Pocket TTS](https://github.com/ky
 - 🔄 **Per-User Voice Selection**: Each user can choose their own TTS voice via `/voice` commands
 - 💾 **Voice Persistence**: User voice preferences are saved and restored on restart
 - 🔄 **Hot-reload Voices**: Add new voices without restarting the bot using `/voice refresh`
 - 🧪 **Test Mode**: Separate testing configuration for safe development
 - 📦 **Auto-updates**: Automatically checks for and installs dependency updates on startup
 - 👂 **Voice Preview**: Preview voices with `/voice preview` before committing to them
 - 🎵 **Audio Effects**: 7 different effects to customize your voice (pitch, speed, echo, robot, chorus, tremolo)
 - ⚡ **Unlimited Effects**: Use as many effects as you want (warning shown when >2 active)
 - ⏱️ **Processing Indicator**: Shows when audio processing is taking longer than expected
 ## Prerequisites
@@ -107,6 +113,108 @@ A Discord bot that reads messages aloud using [Pocket TTS](https://github.com/ky
   - `/voice set <name>` - Change your personal TTS voice
   - `/voice current` - Shows your current voice
   - `/voice refresh` - Re-scan for new voice files (no restart needed)
   - `/voice preview <name>` - Preview a voice before selecting it
 ### Test Mode
 Run the bot in testing mode to use a separate configuration:
 ```bash
 python bot.py testing
 ```
 This loads `.env.testing` instead of `.env`, allowing you to:
 - Use a different Discord bot token for testing
 - Monitor a different text channel
 - Test new features without affecting the production bot
 Create `.env.testing` by copying `.env.example` and configuring it with your testing values.
 ### Audio Effects
 Transform your TTS voice with 7 different audio effects:
 #### Available Effects:
 **🎵 Pitch** (`/effects set pitch <semitones>`)
 - Range: -12 to +12 semitones
 - Default: 0 (no change)
 - Positive = higher/chipmunk voice
 - Negative = lower/deeper voice
 **⚡ Speed** (`/effects set speed <multiplier>`)
 - Range: 0.5 to 2.0
 - Default: 1.0x (normal speed)
 - Higher = faster speech
 - Lower = slower speech
 **🔊 Echo** (`/effects set echo <percentage>`)
 - Range: 0-100%
 - Default: 0% (off)
 - Adds spatial delay and reverb effect
 - Higher values = more pronounced echo
 **🤖 Robot** (`/effects set robot <percentage>`)
 - Range: 0-100%
 - Default: 0% (off)
 - Applies ring modulation for sci-fi robotic voice
 - Higher values = more robotic distortion
 **🎶 Chorus** (`/effects set chorus <percentage>`)
 - Range: 0-100%
 - Default: 0% (off)
 - Creates "multiple voices" effect with slight pitch variations
 - Higher values = more voices and depth
 **〰️ Tremolo Depth** (`/effects set tremolo_depth <value>`)
 - Range: 0.0 to 1.0
 - Default: 0.0 (off)
 - Controls amplitude modulation amount
 - Higher = more warble/vintage radio effect
 **📳 Tremolo Rate** (`/effects set tremolo_rate <hertz>`)
 - Range: 0.0 to 10.0 Hz
 - Default: 0.0 Hz (off)
 - Controls how fast the tremolo warbles
 - Requires tremolo_depth > 0 to have effect
 #### Effect Commands:
 - `/effects list` - Show all your current effect settings
 - `/effects set <effect> <value>` - Change an effect value
 - `/effects reset` - Reset all effects to defaults (with confirmation)
 #### Effect Application Order:
 Effects are applied in this sequence:
 1. Pitch shift
 2. Speed change
 3. Echo/Reverb
 4. Chorus
 5. Tremolo
 6. Robot voice
 #### Performance Notes:
 - **No limit** on number of active effects
 - ⚠️ Warning shown when you have more than 2 active effects
 - More effects = longer processing time
 - Some effects (like pitch shift and chorus) are more CPU-intensive
 - Processing time is logged to console for monitoring
 ### Preview with Effects
 Test any combination of voice and effects before committing:
 **Preview a voice:**
 - `/voice preview <voice_name>` - Preview with your current effects
 **Preview with specific effects:**
 - `/voice preview <voice_name> pitch:5 speed:1.5` - Preview with pitch +5 and 1.5x speed
 - All effect parameters are optional and default to your current settings
 **Example combinations to try:**
 - Robot voice: `/effects set robot 75`
 - Deep scary voice: `/effects set pitch -8`
 - Fast chipmunk: `/effects set pitch 8 speed:1.5`
 - Radio announcer: `/effects set echo 40 tremolo_depth:0.3 tremolo_rate:4`
 ## How It Works
@@ -145,6 +253,27 @@ A Discord bot that reads messages aloud using [Pocket TTS](https://github.com/ky
 - Ensure the reference audio is clear with minimal background noise
 - Try a longer reference clip (5-10 seconds)
 ### HuggingFace cache read-only error
 If you see errors like `OSError: [Errno 30] Read-only file system` when the bot tries to download the TTS model:
 1. **Set a writable cache directory**: Add to your `.env` file:
   ```env
   HF_HOME=/tmp/huggingface
   ```
 2. **Create and set permissions** on the directory:
   ```bash
   sudo mkdir /tmp/huggingface
   sudo chown -R $USER:$USER /tmp/huggingface
   ```
 3. **If using systemd service**: Ensure the service has write access to `/tmp` or the chosen cache directory. You may need to add `ReadWritePaths=/tmp/huggingface` to the service file or remove `ProtectHome=read-only`.
 4. **Restart the bot**:
   ```bash
   sudo systemctl restart vox.service
   ```
 ## Linux Server Deployment
 To run the bot as a service on a Linux server:
--- a/audio_effects.py
+++ b/audio_effects.py
@@ -0,0 +1,345 @@
 """Audio effects processing for TTS output."""
 import time
 from typing import Any
 import librosa
 import numpy as np
 class AudioEffects:
    """Apply post-processing effects to TTS audio."""
    # No limit on effects, but warnings shown when > 2 active
    MAX_ACTIVE_EFFECTS = None
    # Effect ranges and defaults
    PITCH_MIN = -12
    PITCH_MAX = 12
    PITCH_DEFAULT = 0
    SPEED_MIN = 0.5
    SPEED_MAX = 2.0
    SPEED_DEFAULT = 1.0
    ECHO_MIN = 0
    ECHO_MAX = 100
    ECHO_DEFAULT = 0
    ROBOT_MIN = 0
    ROBOT_MAX = 100
    ROBOT_DEFAULT = 0
    CHORUS_MIN = 0
    CHORUS_MAX = 100
    CHORUS_DEFAULT = 0
    TREMOLO_DEPTH_MIN = 0.0
    TREMOLO_DEPTH_MAX = 1.0
    TREMOLO_DEPTH_DEFAULT = 0.0
    TREMOLO_RATE_MIN = 0.0
    TREMOLO_RATE_MAX = 10.0
    TREMOLO_RATE_DEFAULT = 0.0
    @classmethod
    def apply_effects(
        cls,
        audio: np.ndarray,
        sr: int,
        pitch: int = PITCH_DEFAULT,
        speed: float = SPEED_DEFAULT,
        echo: int = ECHO_DEFAULT,
        robot: int = ROBOT_DEFAULT,
        chorus: int = CHORUS_DEFAULT,
        tremolo_depth: float = TREMOLO_DEPTH_DEFAULT,
        tremolo_rate: float = TREMOLO_RATE_DEFAULT,
    ) -> tuple[np.ndarray, bool]:
        """
        Apply effects to audio in order: pitch → speed → echo → chorus → tremolo → robot
        Args:
            audio: Input audio array (1D)
            sr: Sample rate
            pitch: Pitch shift in semitones (-12 to +12, 0 = no shift)
            speed: Speed multiplier (0.5 to 2.0, 1.0 = normal)
            echo: Echo intensity (0-100, 0 = no echo)
            robot: Robot voice intensity (0-100, 0 = no robot)
            chorus: Chorus intensity (0-100, 0 = no chorus)
            tremolo_depth: Tremolo depth (0.0-1.0, 0.0 = no tremolo)
            tremolo_rate: Tremolo rate in Hz (0.0-10.0)
        Returns:
            Tuple of (processed_audio, show_processing_message)
            show_processing_message is True if processing took > 1 second
        """
        start_time = time.time()
        original_length = len(audio)
        # Validate inputs
        pitch = max(cls.PITCH_MIN, min(cls.PITCH_MAX, pitch))
        speed = max(cls.SPEED_MIN, min(cls.SPEED_MAX, speed))
        echo = max(cls.ECHO_MIN, min(cls.ECHO_MAX, echo))
        robot = max(cls.ROBOT_MIN, min(cls.ROBOT_MAX, robot))
        chorus = max(cls.CHORUS_MIN, min(cls.CHORUS_MAX, chorus))
        tremolo_depth = max(cls.TREMOLO_DEPTH_MIN, min(cls.TREMOLO_DEPTH_MAX, tremolo_depth))
        tremolo_rate = max(cls.TREMOLO_RATE_MIN, min(cls.TREMOLO_RATE_MAX, tremolo_rate))
        # Apply pitch shift first
        if pitch != cls.PITCH_DEFAULT:
            print(f"  Applying pitch shift: {pitch:+d} semitones...")
            audio = librosa.effects.pitch_shift(
                audio, sr=sr, n_steps=pitch, bins_per_octave=12
            )
        # Apply speed change second
        if speed != cls.SPEED_DEFAULT:
            print(f"  Applying speed change: {speed:.1f}x...")
            audio = librosa.effects.time_stretch(audio, rate=speed)
        # Apply echo third
        if echo > 0:
            print(f"  Applying echo: {echo}%...")
            audio = cls._apply_echo(audio, sr, echo)
        # Apply chorus fourth
        if chorus > 0:
            print(f"  Applying chorus: {chorus}%...")
            audio = cls._apply_chorus(audio, sr, chorus)
        # Apply tremolo fifth
        if tremolo_depth > 0 and tremolo_rate > 0:
            print(f"  Applying tremolo: depth={tremolo_depth:.1f}, rate={tremolo_rate:.1f}Hz...")
            audio = cls._apply_tremolo(audio, sr, tremolo_depth, tremolo_rate)
        # Apply robot voice last
        if robot > 0:
            print(f"  Applying robot effect: {robot}%...")
            audio = cls._apply_robot(audio, sr, robot)
        processing_time = time.time() - start_time
        print(f"  Effects applied in {processing_time:.2f}s")
        # Show processing message if it took more than 1 second
        show_message = processing_time > 1.0
        return audio, show_message
    @classmethod
    def _apply_echo(cls, audio: np.ndarray, sr: int, intensity: int) -> np.ndarray:
        """Apply simple echo/reverb effect."""
        if intensity == 0:
            return audio
        # Calculate delay in samples (50-300ms based on intensity)
        delay_ms = 50 + (intensity / 100) * 250
        delay_samples = int((delay_ms / 1000) * sr)
        # Create output array
        output = np.copy(audio)
        # Add delayed copy with decay
        decay = 0.3 + (intensity / 100) * 0.4  # 0.3-0.7 decay factor
        if delay_samples < len(audio):
            output[delay_samples:] += audio[:-delay_samples] * decay
        # Normalize
        max_val = np.max(np.abs(output))
        if max_val > 0:
            output = output / max_val * np.max(np.abs(audio))
        return output
    @classmethod
    def _apply_chorus(cls, audio: np.ndarray, sr: int, intensity: int) -> np.ndarray:
        """Apply chorus effect using multiple delayed voices."""
        if intensity == 0:
            return audio
        # Number of voices based on intensity (1-3)
        num_voices = 1 + int((intensity / 100) * 2)
        # Base delay (15-30ms)
        base_delay_ms = 15 + (intensity / 100) * 15
        base_delay_samples = int((base_delay_ms / 1000) * sr)
        output = np.copy(audio) * 0.6  # Reduce original to make room for voices
        for i in range(num_voices):
            # Slight pitch variation for each voice (±3%)
            pitch_var = 1.0 + (0.03 * (i - 1))
            try:
                voice = librosa.effects.time_stretch(audio, rate=pitch_var)
                # Slight delay variation
                delay_samples = base_delay_samples + int((i * 5 / 1000) * sr)
                # Mix voice into output
                voice_len = min(len(voice), len(output) - delay_samples)
                if voice_len > 0:
                    output[delay_samples:delay_samples + voice_len] += voice[:voice_len] * 0.2
            except Exception as e:
                print(f"    Warning: Chorus voice {i+1} failed: {e}")
        # Normalize
        max_val = np.max(np.abs(output))
        if max_val > 0:
            output = output / max_val * 0.95
        return output
    @classmethod
    def _apply_tremolo(cls, audio: np.ndarray, sr: int, depth: float, rate: float) -> np.ndarray:
        """Apply tremolo effect (amplitude modulation)."""
        if depth == 0 or rate == 0:
            return audio
        # Create modulation signal
        duration = len(audio) / sr
        t = np.linspace(0, duration, len(audio))
        # Sine wave modulation at specified rate
        modulation = 1.0 - depth * 0.5 * (1 - np.sin(2 * np.pi * rate * t))
        return audio * modulation
    @classmethod
    def _apply_robot(cls, audio: np.ndarray, sr: int, intensity: int) -> np.ndarray:
        """Apply robot voice effect using ring modulation."""
        if intensity == 0:
            return audio
        # Carrier frequency based on intensity (80-300 Hz)
        carrier_freq = 80 + (intensity / 100) * 220
        # Create carrier signal
        duration = len(audio) / sr
        t = np.linspace(0, duration, len(audio))
        carrier = np.sin(2 * np.pi * carrier_freq * t)
        # Mix original with ring-modulated version based on intensity
        mix = intensity / 100
        robot_signal = audio * carrier
        output = audio * (1 - mix * 0.7) + robot_signal * mix * 0.7
        # Normalize
        max_val = np.max(np.abs(output))
        if max_val > 0:
            output = output / max_val * 0.95
        return output
    @classmethod
    def validate_effect(cls, effect_name: str, value: Any) -> tuple[bool, str]:
        """
        Validate an effect value.
        Returns:
            Tuple of (is_valid, error_message)
        """
        validators = {
            "pitch": (int, cls.PITCH_MIN, cls.PITCH_MAX, "Pitch must be a whole number", "semitones"),
            "speed": (float, cls.SPEED_MIN, cls.SPEED_MAX, "Speed must be a number", "x"),
            "echo": (int, cls.ECHO_MIN, cls.ECHO_MAX, "Echo must be a whole number", "%"),
            "robot": (int, cls.ROBOT_MIN, cls.ROBOT_MAX, "Robot must be a whole number", "%"),
            "chorus": (int, cls.CHORUS_MIN, cls.CHORUS_MAX, "Chorus must be a whole number", "%"),
            "tremolo_depth": (float, cls.TREMOLO_DEPTH_MIN, cls.TREMOLO_DEPTH_MAX, "Tremolo depth must be a number", ""),
            "tremolo_rate": (float, cls.TREMOLO_RATE_MIN, cls.TREMOLO_RATE_MAX, "Tremolo rate must be a number", "Hz"),
        }
        if effect_name not in validators:
            return False, f"Unknown effect: {effect_name}"
        type_func, min_val, max_val, error_msg, unit = validators[effect_name]
        try:
            val = type_func(value)
            if min_val <= val <= max_val:
                return True, ""
            unit_str = f" {unit}" if unit else ""
            return False, f"{effect_name.replace('_', ' ').title()} must be between {min_val} and {max_val}{unit_str}"
        except (ValueError, TypeError):
            return False, error_msg
    @classmethod
    def count_active_effects(cls, **effects) -> int:
        """Count how many effects are active (non-default)."""
        count = 0
        # Convert values to proper types (JSON stores them as strings)
        pitch = int(effects.get("pitch", cls.PITCH_DEFAULT))
        speed = float(effects.get("speed", cls.SPEED_DEFAULT))
        echo = int(effects.get("echo", cls.ECHO_DEFAULT))
        robot = int(effects.get("robot", cls.ROBOT_DEFAULT))
        chorus = int(effects.get("chorus", cls.CHORUS_DEFAULT))
        tremolo_depth = float(effects.get("tremolo_depth", cls.TREMOLO_DEPTH_DEFAULT))
        if pitch != cls.PITCH_DEFAULT:
            count += 1
        if speed != cls.SPEED_DEFAULT:
            count += 1
        if echo > cls.ECHO_DEFAULT:
            count += 1
        if robot > cls.ROBOT_DEFAULT:
            count += 1
        if chorus > cls.CHORUS_DEFAULT:
            count += 1
        if tremolo_depth > cls.TREMOLO_DEPTH_DEFAULT:
            count += 1
        # tremolo_rate only counts if depth is also active
        return count
    @classmethod
    def get_effect_description(cls, effect_name: str) -> str:
        """Get a human-readable description of what an effect does."""
        descriptions = {
            "pitch": f"Changes voice pitch ({cls.PITCH_MIN} to {cls.PITCH_MAX} semitones). Positive = higher/chipmunk, Negative = lower/deeper.",
            "speed": f"Changes speech speed ({cls.SPEED_MIN} to {cls.SPEED_MAX}x). Higher = faster, Lower = slower.",
            "echo": f"Adds echo/reverb ({cls.ECHO_MIN} to {cls.ECHO_MAX}%). Higher = more pronounced echo.",
            "robot": f"Applies robot voice effect ({cls.ROBOT_MIN} to {cls.ROBOT_MAX}%). Higher = more robotic.",
            "chorus": f"Adds chorus effect ({cls.CHORUS_MIN} to {cls.CHORUS_MAX}%). Higher = more voices/depth.",
            "tremolo_depth": f"Tremolo amplitude modulation ({cls.TREMOLO_DEPTH_MIN} to {cls.TREMOLO_DEPTH_MAX}). Higher = more warble.",
            "tremolo_rate": f"Tremolo speed ({cls.TREMOLO_RATE_MIN} to {cls.TREMOLO_RATE_MAX} Hz). Higher = faster warble.",
        }
        return descriptions.get(effect_name, "Unknown effect")
    @classmethod
    def format_effect_value(cls, effect_name: str, value: Any) -> str:
        """Format an effect value for display."""
        if effect_name == "pitch":
            pitch = int(value)
            if pitch == 0:
                return "0 (normal)"
            direction = "higher" if pitch > 0 else "lower"
            return f"{pitch:+d} ({direction})"
        elif effect_name == "speed":
            speed = float(value)
            if speed == 1.0:
                return "1.0x (normal)"
            direction = "faster" if speed > 1.0 else "slower"
            return f"{speed:.1f}x ({direction})"
        elif effect_name == "echo":
            echo = int(value)
            if echo == 0:
                return "0% (off)"
            return f"{echo}%"
        elif effect_name == "robot":
            robot = int(value)
            if robot == 0:
                return "0% (off)"
            return f"{robot}%"
        elif effect_name == "chorus":
            chorus = int(value)
            if chorus == 0:
                return "0% (off)"
            return f"{chorus}%"
        elif effect_name == "tremolo_depth":
            depth = float(value)
            if depth == 0.0:
                return "0.0 (off)"
            return f"{depth:.1f}"
        elif effect_name == "tremolo_rate":
            rate = float(value)
            if rate == 0.0:
                return "0.0 Hz (off)"
            return f"{rate:.1f} Hz"
        return str(value)
--- a/audio_preprocessor.py
+++ b/audio_preprocessor.py
@@ -190,16 +190,16 @@ def print_audio_analysis(file_path: str) -> None:
    print(f"\n{'=' * 50}")
    print(f"Audio Analysis: {info['path']}")
    print(f"{'=' * 50}")
-    print(f"  Sample Rate:    {info['sample_rate']} Hz {'⚠️  (should be 22050)' if info['needs_resampling'] else '✓'}")
+    print(f"  Sample Rate:    {info['sample_rate']} Hz {'[WARN] (should be 22050)' if info['needs_resampling'] else '[OK]'}")
    print(f"  Duration:       {info['duration_seconds']:.2f}s", end="")
    if info['is_too_short']:
-        print(" ⚠️  (too short, aim for 5-15s)")
+        print(" [WARN] (too short, aim for 5-15s)")
    elif info['is_too_long']:
-        print(" ⚠️  (quite long, 5-15s is ideal)")
+        print(" [WARN] (quite long, 5-15s is ideal)")
    else:
-        print(" ✓")
+        print(" [OK]")
-    print(f"  Channels:       {'Stereo' if info['is_stereo'] else 'Mono'} {'⚠️  (will convert to mono)' if info['is_stereo'] else '✓'}")
+    print(f"  Channels:       {'Stereo' if info['is_stereo'] else 'Mono'} {'[WARN] (will convert to mono)' if info['is_stereo'] else '[OK]'}")
-    print(f"  Max Amplitude:  {info['max_amplitude']:.3f} {'✓' if info['is_normalized'] else '⚠️  (low volume)'}")
+    print(f"  Max Amplitude:  {info['max_amplitude']:.3f} {'[OK]' if info['is_normalized'] else '[WARN] (low volume)'}")
    print(f"  RMS Level:      {info['rms_level']:.4f}")
    print(f"  Noise Floor:    {info['estimated_noise_floor']:.4f}")
    print(f"{'=' * 50}\n")
--- a/bot.py
+++ b/bot.py
@@ -1,5 +1,21 @@
 __version__ = "1.2.0"
 import random
 import sys
 import os
 # Parse command line arguments before loading any config
 if len(sys.argv) > 1 and sys.argv[1] == "testing":
    os.environ["ENV_MODE"] = "testing"
    # Remove the argument so it doesn't interfere with other parsing
    sys.argv.pop(1)
 import numba_config
 import asyncio
 import io
 import subprocess
 import sys
 import time
 from typing import Any
 import discord
@@ -8,10 +24,27 @@ import scipy.io.wavfile as wavfile
 from discord import app_commands
 from discord.ext import commands
 from audio_effects import AudioEffects
 from config import Config
 from voice_manager import VoiceManager
 # Inactivity timeout in seconds (10 minutes)
 INACTIVITY_TIMEOUT = 10 * 60
 # Sample lines for voice preview
 PREVIEW_LINES = [
    "Hello! This is how I sound. Choose me as your voice with /voice set.",
    "Testing, one, two, three! Can you hear me clearly?",
    "Here's a preview of my voice. Pretty cool, right?",
    "Greetings! I am ready to speak for you.",
    "Voice check! This is what I sound like.",
    "Audio test complete. This voice is ready to go!",
    "Sample message incoming. How do I sound to you?",
    "Preview mode activated. Testing speech synthesis.",
 ]
 class TTSBot(commands.Bot):
    """Discord bot that reads messages aloud using Pocket TTS."""
@@ -22,28 +55,50 @@ class TTSBot(commands.Bot):
        super().__init__(command_prefix="!", intents=intents)
        self.voice_manager = VoiceManager(Config.VOICES_DIR, Config.DEFAULT_VOICE)
-        self.message_queue: asyncio.Queue[tuple[discord.Message, str]] = asyncio.Queue()
+        self.message_queue: asyncio.Queue[tuple[discord.Message, str] | tuple[discord.Message, str, str]] = asyncio.Queue()
        self.last_activity: float = 0.0
        print("\n=== Command Registration ===")
        self._setup_slash_commands()
        self._setup_effects_commands()
        self._log_registered_commands()
        print("=== End Command Registration ===\n")
    def _log_registered_commands(self) -> None:
        """Log all registered commands to console."""
        print("\nRegistered commands:")
        commands = list(self.tree.get_commands())
        if not commands:
            print("  ⚠️  No commands registered!")
        else:
            for cmd in commands:
                print(f"  ✓ /{cmd.name} - {cmd.description}")
        print(f"\nTotal commands registered: {len(commands)}")
    def _setup_slash_commands(self) -> None:
        """Set up slash commands for voice management."""
        print("Setting up voice commands...")
        @self.tree.command(name="voice", description="Manage your TTS voice")
        @app_commands.describe(
            action="What to do",
-            voice_name="Name of the voice (for 'set' action)"
+            voice_name="Name of the voice (for 'set' or 'preview' action)",
            preview_pitch="Optional pitch for preview (-12 to 12, default: use your settings)",
            preview_speed="Optional speed for preview (0.5 to 2.0, default: use your settings)",
        )
        @app_commands.choices(action=[
            app_commands.Choice(name="list", value="list"),
            app_commands.Choice(name="set", value="set"),
            app_commands.Choice(name="current", value="current"),
            app_commands.Choice(name="refresh", value="refresh"),
            app_commands.Choice(name="preview", value="preview"),
        ])
        async def voice_command(
            interaction: discord.Interaction,
            action: app_commands.Choice[str],
-            voice_name: str | None = None
+            voice_name: str | None = None,
            preview_pitch: int | None = None,
            preview_speed: float | None = None,
        ):
            if action.value == "list":
                await self._handle_voice_list(interaction)
@@ -53,6 +108,8 @@ class TTSBot(commands.Bot):
                await self._handle_voice_current(interaction)
            elif action.value == "refresh":
                await self._handle_voice_refresh(interaction)
            elif action.value == "preview":
                await self._handle_voice_preview(interaction, voice_name, preview_pitch, preview_speed)
        @voice_command.autocomplete("voice_name")
        async def voice_name_autocomplete(
@@ -66,6 +123,197 @@ class TTSBot(commands.Bot):
                if current.lower() in v.lower()
            ][:25]
    def _setup_effects_commands(self) -> None:
        """Set up slash commands for audio effects management."""
        print("Setting up effects commands...")
        @self.tree.command(name="effects", description="Manage your TTS audio effects")
        @app_commands.describe(
            action="What to do",
            effect_name="Name of the effect (for 'set' action)",
            value="Value for the effect (for 'set' action)"
        )
        @app_commands.choices(action=[
            app_commands.Choice(name="list", value="list"),
            app_commands.Choice(name="set", value="set"),
            app_commands.Choice(name="reset", value="reset"),
        ])
        @app_commands.choices(effect_name=[
            app_commands.Choice(name="pitch", value="pitch"),
            app_commands.Choice(name="speed", value="speed"),
            app_commands.Choice(name="echo", value="echo"),
            app_commands.Choice(name="robot", value="robot"),
            app_commands.Choice(name="chorus", value="chorus"),
            app_commands.Choice(name="tremolo_depth", value="tremolo_depth"),
            app_commands.Choice(name="tremolo_rate", value="tremolo_rate"),
        ])
        async def effects_command(
            interaction: discord.Interaction,
            action: app_commands.Choice[str],
            effect_name: app_commands.Choice[str] | None = None,
            value: str | None = None
        ):
            if action.value == "list":
                await self._handle_effects_list(interaction)
            elif action.value == "set":
                await self._handle_effects_set(interaction, effect_name, value)
            elif action.value == "reset":
                await self._handle_effects_reset(interaction)
    async def _handle_effects_list(self, interaction: discord.Interaction) -> None:
        """Handle /effects list command."""
        effects = self.voice_manager.get_user_effects(interaction.user.id)
        active_count = self.voice_manager.count_active_effects(interaction.user.id)
        lines = ["**Your Audio Effects:**\n"]
        # Pitch
        pitch_desc = AudioEffects.get_effect_description("pitch")
        pitch_val = AudioEffects.format_effect_value("pitch", effects["pitch"])
        lines.append(f"🎵 **Pitch**: {pitch_val}")
        lines.append(f"   {pitch_desc}\n")
        # Speed
        speed_desc = AudioEffects.get_effect_description("speed")
        speed_val = AudioEffects.format_effect_value("speed", effects["speed"])
        lines.append(f"⚡ **Speed**: {speed_val}")
        lines.append(f"   {speed_desc}\n")
        # Echo
        echo_desc = AudioEffects.get_effect_description("echo")
        echo_val = AudioEffects.format_effect_value("echo", effects["echo"])
        lines.append(f"🔊 **Echo**: {echo_val}")
        lines.append(f"   {echo_desc}\n")
        # Robot
        robot_desc = AudioEffects.get_effect_description("robot")
        robot_val = AudioEffects.format_effect_value("robot", effects["robot"])
        lines.append(f"🤖 **Robot**: {robot_val}")
        lines.append(f"   {robot_desc}\n")
        # Chorus
        chorus_desc = AudioEffects.get_effect_description("chorus")
        chorus_val = AudioEffects.format_effect_value("chorus", effects["chorus"])
        lines.append(f"🎶 **Chorus**: {chorus_val}")
        lines.append(f"   {chorus_desc}\n")
        # Tremolo Depth
        tremolo_depth_desc = AudioEffects.get_effect_description("tremolo_depth")
        tremolo_depth_val = AudioEffects.format_effect_value("tremolo_depth", effects["tremolo_depth"])
        lines.append(f"〰️ **Tremolo Depth**: {tremolo_depth_val}")
        lines.append(f"   {tremolo_depth_desc}\n")
        # Tremolo Rate
        tremolo_rate_desc = AudioEffects.get_effect_description("tremolo_rate")
        tremolo_rate_val = AudioEffects.format_effect_value("tremolo_rate", effects["tremolo_rate"])
        lines.append(f"📳 **Tremolo Rate**: {tremolo_rate_val}")
        lines.append(f"   {tremolo_rate_desc}\n")
        # Active count warning
        lines.append(f"**Active Effects**: {active_count}")
        if active_count > 2:
            lines.append("⚠️ You have more than 2 active effects. Processing may be slower!")
        elif active_count > 0:
            lines.append("ℹ️ Add more effects for fun variations (may slow processing)")
        lines.append(f"\n*Use `/effects set <effect> <value>` to change settings*")
        lines.append(f"*Use `/effects reset` to clear all effects*")
        await interaction.response.send_message(
            "\n".join(lines),
            ephemeral=True
        )
    async def _handle_effects_set(
        self,
        interaction: discord.Interaction,
        effect_name: app_commands.Choice[str] | None,
        value: str | None
    ) -> None:
        """Handle /effects set command."""
        if not effect_name or value is None:
            await interaction.response.send_message(
                "❌ Please provide both effect name and value. Example: `/effects set pitch 3`",
                ephemeral=True
            )
            return
        success, message = self.voice_manager.set_user_effect(
            interaction.user.id,
            effect_name.value,
            value
        )
        if success:
            await interaction.response.send_message(
                f"✅ {message}",
                ephemeral=True
            )
        else:
            await interaction.response.send_message(
                f"❌ {message}",
                ephemeral=True
            )
    async def _handle_effects_reset(self, interaction: discord.Interaction) -> None:
        """Handle /effects reset command with confirmation UI."""
        # Check if user has any effects to reset
        active_count = self.voice_manager.count_active_effects(interaction.user.id)
        if active_count == 0:
            await interaction.response.send_message(
                "ℹ️ You don't have any active effects to reset.",
                ephemeral=True
            )
            return
        # Create confirmation buttons
        class ConfirmResetView(discord.ui.View):
            def __init__(self, voice_manager, user_id):
                super().__init__(timeout=30)
                self.voice_manager = voice_manager
                self.user_id = user_id
                self.confirmed = False
            @discord.ui.button(label="✅ Yes, Reset All", style=discord.ButtonStyle.danger)
            async def confirm_button(self, interaction: discord.Interaction, button: discord.ui.Button):
                if interaction.user.id != self.user_id:
                    await interaction.response.send_message("This button is not for you!", ephemeral=True)
                    return
                self.voice_manager.reset_user_effects(self.user_id)
                self.confirmed = True
                await interaction.response.edit_message(
                    content="✅ All audio effects have been reset to defaults!",
                    view=None
                )
                self.stop()
            @discord.ui.button(label="❌ Cancel", style=discord.ButtonStyle.secondary)
            async def cancel_button(self, interaction: discord.Interaction, button: discord.ui.Button):
                if interaction.user.id != self.user_id:
                    await interaction.response.send_message("This button is not for you!", ephemeral=True)
                    return
                await interaction.response.edit_message(
                    content="❌ Reset cancelled. Your effects remain unchanged.",
                    view=None
                )
                self.stop()
        view = ConfirmResetView(self.voice_manager, interaction.user.id)
        await interaction.response.send_message(
            f"⚠️ **Reset Confirmation**\n\n"
            f"You have {active_count} active effect(s).\n"
            f"This will reset **all** your audio effects to defaults:\n"
            f"• Pitch: 0 (normal)\n"
            f"• Speed: 1.0x (normal)\n\n"
            f"Are you sure you want to continue?",
            view=view,
            ephemeral=True
        )
    async def _handle_voice_list(self, interaction: discord.Interaction) -> None:
        """Handle /voice list command."""
        voices = self.voice_manager.get_available_voices()
@@ -186,6 +434,113 @@ class TTSBot(commands.Bot):
            ephemeral=True
        )
    async def _handle_voice_preview(
        self,
        interaction: discord.Interaction,
        voice_name: str | None,
        preview_pitch: int | None = None,
        preview_speed: float | None = None,
    ) -> None:
        """Handle /voice preview command."""
        if not voice_name:
            await interaction.response.send_message(
                "❌ Please provide a voice name. Use `/voice list` to see available voices.",
                ephemeral=True
            )
            return
        # Check if user is in a voice channel
        if interaction.user.voice is None:
            await interaction.response.send_message(
                "❌ You need to be in a voice channel to hear a preview!",
                ephemeral=True
            )
            return
        voice_name = voice_name.lower()
        # Validate voice exists
        if not self.voice_manager.is_voice_available(voice_name):
            voices = self.voice_manager.get_available_voices()
            await interaction.response.send_message(
                f"❌ Voice `{voice_name}` not found.\n"
                f"Available voices: {', '.join(f'`{v}`' for v in voices)}",
                ephemeral=True
            )
            return
        # Validate pitch if provided
        if preview_pitch is not None:
            is_valid, error_msg = AudioEffects.validate_effect("pitch", preview_pitch)
            if not is_valid:
                await interaction.response.send_message(
                    f"❌ Invalid pitch value: {error_msg}",
                    ephemeral=True
                )
                return
        # Validate speed if provided
        if preview_speed is not None:
            is_valid, error_msg = AudioEffects.validate_effect("speed", preview_speed)
            if not is_valid:
                await interaction.response.send_message(
                    f"❌ Invalid speed value: {error_msg}",
                    ephemeral=True
                )
                return
        # Select a random preview line
        preview_text = random.choice(PREVIEW_LINES)
        # Create a preview message object with all necessary attributes
        class PreviewMessage:
            def __init__(self, user, channel, voice_channel):
                self.author = user
                self.channel = channel
                self._voice_channel = voice_channel
            @property
            def voice(self):
                class VoiceState:
                    def __init__(self, channel):
                        self.channel = channel
                return VoiceState(self._voice_channel)
        preview_message = PreviewMessage(
            interaction.user,
            interaction.channel,
            interaction.user.voice.channel
        )
        # Use user's current effects if not overridden
        user_effects = self.voice_manager.get_user_effects(interaction.user.id)
        effect_overrides = {}
        if preview_pitch is not None:
            effect_overrides["pitch"] = preview_pitch
        if preview_speed is not None:
            effect_overrides["speed"] = preview_speed
        # Use default effects from user settings for preview
        preview_effects = user_effects.copy()
        preview_effects.update(effect_overrides)
        # Queue the preview with voice override and effects
        await self.message_queue.put((preview_message, preview_text, voice_name, preview_effects))
        # Build effect description
        effect_desc = []
        if preview_effects.get("pitch", 0) != 0:
            effect_desc.append(f"pitch: {preview_effects['pitch']:+d}")
        if preview_effects.get("speed", 1.0) != 1.0:
            effect_desc.append(f"speed: {preview_effects['speed']:.1f}x")
        effect_str = f" (with {', '.join(effect_desc)})" if effect_desc else ""
        await interaction.response.send_message(
            f"⏳ Queued preview for `{voice_name}`{effect_str}. Sample: \"{preview_text[:50]}{'...' if len(preview_text) > 50 else ''}\"",
            ephemeral=True
        )
    async def setup_hook(self) -> None:
        """Called when the bot is starting up."""
        print("Initializing TTS...")
@@ -200,17 +555,52 @@ class TTSBot(commands.Bot):
            await asyncio.to_thread(self.voice_manager.get_voice_state, default)
        self.loop.create_task(self.process_queue())
-        
+        self.loop.create_task(self.check_inactivity())
        # Sync slash commands
        print("Syncing slash commands...")
        await self.tree.sync()
        print("Slash commands synced!")
    async def on_ready(self) -> None:
        print(f"Logged in as {self.user}")
        print(f"Bot ID: {self.user.id}")
        print(f"Monitoring channel ID: {Config.TEXT_CHANNEL_ID}")
        print(f"Available voices: {', '.join(self.voice_manager.get_available_voices())}")
-        print("Bot is ready!")
+        
        # Log registered commands before sync
        registered_cmds = list(self.tree.get_commands())
        print(f"\nCommands in tree before sync: {len(registered_cmds)}")
        for cmd in registered_cmds:
            print(f"  - /{cmd.name}")
        # Sync slash commands to each guild for immediate availability
        print(f"\nConnected to {len(self.guilds)} guild(s):")
        for guild in self.guilds:
            print(f"  - {guild.name} (ID: {guild.id})")
        print("\nSyncing slash commands to guilds...")
        sync_count = 0
        for guild in self.guilds:
            try:
                # Copy global commands to this guild before syncing
                # This is necessary for guild-specific command registration
                self.tree.copy_global_to(guild=discord.Object(guild.id))
                print(f"  📋 Copied global commands to guild: {guild.name}")
                synced = await self.tree.sync(guild=discord.Object(guild.id))
                print(f"  ✓ Synced {len(synced)} commands to guild: {guild.name}")
                for cmd in synced:
                    print(f"      - /{cmd.name}")
                sync_count += 1
            except discord.errors.Forbidden as e:
                print(f"  ✗ Forbidden: Cannot sync to guild {guild.name}. Missing 'applications.commands' scope!")
                print(f"    Error: {e}")
            except Exception as e:
                print(f"  ✗ Failed to sync to guild {guild.name}: {type(e).__name__}: {e}")
        if sync_count == 0:
            print("\n⚠️  WARNING: No guilds were synced! Commands won't appear in Discord.")
            print("   Make sure the bot was invited with 'applications.commands' scope.")
        else:
            print(f"\n✓ Successfully synced to {sync_count}/{len(self.guilds)} guild(s)")
        print("\nBot is ready!")
    async def on_message(self, message: discord.Message) -> None:
        if message.author.bot:
@@ -237,16 +627,36 @@ class TTSBot(commands.Bot):
    async def process_queue(self) -> None:
        """Process messages from the queue one at a time."""
        while True:
-            message, text = await self.message_queue.get()
+            queue_item = await self.message_queue.get()
            # Handle queue items:
            # - (message, text) - regular message
            # - (message, text, voice_override) - preview with voice override
            # - (message, text, voice_override, effects_dict) - preview with effect overrides
            if len(queue_item) == 4 and isinstance(queue_item[3], dict):
                message, text, voice_override, effect_overrides = queue_item
            elif len(queue_item) == 3:
                message, text, voice_override = queue_item
                effect_overrides = {}
            else:
                message, text = queue_item
                voice_override = None
                effect_overrides = {}
            try:
-                await self.speak_message(message, text)
+                await self.speak_message(message, text, voice_override, effect_overrides)
            except Exception as e:
                print(f"Error processing message: {e}")
            finally:
                self.message_queue.task_done()
-    async def speak_message(self, message: discord.Message, text: str) -> None:
+    async def speak_message(
        self,
        message: discord.Message,
        text: str,
        voice_override: str | None = None,
        effect_overrides: dict | None = None,
    ) -> None:
        """Generate TTS and play it in the user's voice channel."""
        if message.author.voice is None:
            return
@@ -259,22 +669,34 @@ class TTSBot(commands.Bot):
        print(f"Generating TTS for: {text[:50]}...")
-        # Get user's voice (loads on-demand if needed)
+        # Get voice state (use override for previews, otherwise user's voice)
        user_id = message.author.id
        try:
-            voice_state = await asyncio.to_thread(
+            if voice_override:
-                self.voice_manager.get_user_voice_state, user_id
+                voice_state = await asyncio.to_thread(
-            )
+                    self.voice_manager.get_voice_state, voice_override
                )
            else:
                user_id = message.author.id
                voice_state = await asyncio.to_thread(
                    self.voice_manager.get_user_voice_state, user_id
                )
        except Exception as e:
-            print(f"Error loading voice for user {user_id}: {e}")
+            print(f"Error loading voice: {e}")
-            await message.channel.send(
+            if not voice_override:
-                f"{message.author.mention}, failed to load your voice. Use `/voice set` to choose a voice.",
+                await message.channel.send(
-                delete_after=5
+                    f"{message.author.mention}, failed to load your voice. Use `/voice set` to choose a voice.",
-            )
+                    delete_after=5
                )
            return
        # Get user's effects and apply any overrides
        user_effects = self.voice_manager.get_user_effects(message.author.id)
        effects = user_effects.copy()
        if effect_overrides:
            effects.update(effect_overrides)
        wav_bytes = await asyncio.to_thread(
-            self._generate_wav_bytes, voice_state, text
+            self._generate_wav_bytes, voice_state, text, effects
        )
        audio_source = discord.FFmpegPCMAudio(
@@ -294,11 +716,17 @@ class TTSBot(commands.Bot):
            self.loop.call_soon_threadsafe(play_complete.set)
        voice_client.play(audio_source, after=after_playing)
        self.last_activity = time.time()
        print(f"Playing audio in {voice_channel.name}")
        await play_complete.wait()
-    def _generate_wav_bytes(self, voice_state: Any, text: str) -> bytes:
+    def _generate_wav_bytes(
        self,
        voice_state: Any,
        text: str,
        effects: dict,
    ) -> bytes:
        """Generate audio and return as WAV file bytes."""
        model = self.voice_manager.model
        if model is None:
@@ -307,9 +735,32 @@ class TTSBot(commands.Bot):
        audio = model.generate_audio(voice_state, text)
        audio_np = audio.numpy()
        # Ensure audio is 2D [samples, channels] for storage
        if audio_np.ndim == 1:
            audio_np = audio_np.reshape(-1, 1)
        # Apply audio effects if any are active
        pitch = effects.get("pitch", AudioEffects.PITCH_DEFAULT)
        speed = effects.get("speed", AudioEffects.SPEED_DEFAULT)
        echo = effects.get("echo", AudioEffects.ECHO_DEFAULT)
        robot = effects.get("robot", AudioEffects.ROBOT_DEFAULT)
        chorus = effects.get("chorus", AudioEffects.CHORUS_DEFAULT)
        tremolo_depth = effects.get("tremolo_depth", AudioEffects.TREMOLO_DEPTH_DEFAULT)
        tremolo_rate = effects.get("tremolo_rate", AudioEffects.TREMOLO_RATE_DEFAULT)
        if any([pitch != 0, speed != 1.0, echo > 0, robot > 0, chorus > 0, tremolo_depth > 0]):
            print(f"Applying {AudioEffects.count_active_effects(**effects)} effect(s)...")
            # Squeeze to 1D for librosa effects, then reshape back
            audio_1d = audio_np.squeeze()
            audio_1d, show_processing = AudioEffects.apply_effects(
                audio_1d, model.sample_rate,
                pitch, speed, echo, robot, chorus, tremolo_depth, tremolo_rate
            )
            # Reshape back to 2D
            audio_np = audio_1d.reshape(-1, 1)
            if show_processing:
                print("⚠️ Audio processing took longer than expected due to effects")
        max_val = np.max(np.abs(audio_np))
        if max_val > 0:
            audio_np = audio_np / max_val
@@ -320,6 +771,23 @@ class TTSBot(commands.Bot):
        wav_buffer.seek(0)
        return wav_buffer.read()
    async def check_inactivity(self) -> None:
        """Periodically check for inactivity and disconnect from voice channels."""
        while True:
            await asyncio.sleep(60)  # Check every minute
            if self.last_activity == 0.0:
                continue
            elapsed = time.time() - self.last_activity
            if elapsed >= INACTIVITY_TIMEOUT:
                # Disconnect from all voice channels
                for guild in self.guilds:
                    if guild.voice_client is not None:
                        print(f"Disconnecting from {guild.name} due to inactivity")
                        await guild.voice_client.disconnect()
                self.last_activity = 0.0
    async def ensure_voice_connection(self, channel: discord.VoiceChannel) -> discord.VoiceClient | None:
        """Ensure we're connected to the specified voice channel."""
        guild = channel.guild
@@ -332,13 +800,34 @@ class TTSBot(commands.Bot):
        try:
            voice_client = await channel.connect(timeout=10.0)
            self.last_activity = time.time()
            return voice_client
        except Exception as e:
            print(f"Failed to connect to voice channel: {e}")
            return None
 def auto_update_dependencies() -> None:
    """Auto-update pip packages on startup."""
    try:
        print("Checking for package updates...")
        result = subprocess.run(
            [sys.executable, "-m", "pip", "install", "-r", "requirements.txt", "-U", "-q"],
            capture_output=True,
            text=True,
            check=False
        )
        if result.returncode == 0:
            print("Packages updated successfully (or already up to date)")
        else:
            print(f"Warning: Package update had issues: {result.stderr}")
    except Exception as e:
        print(f"Warning: Could not auto-update packages: {e}")
 def main():
    auto_update_dependencies()
    errors = Config.validate()
    if errors:
        print("Configuration errors:")
--- a/config.py
+++ b/config.py
@@ -1,7 +1,10 @@
 import os
 from dotenv import load_dotenv
-load_dotenv()
+# Load appropriate .env file based on ENV_MODE
 env_mode = os.getenv("ENV_MODE", "production")
 env_file = ".env.testing" if env_mode == "testing" else ".env"
 load_dotenv(env_file)
 class Config:
--- a/launch.sh
+++ b/launch.sh
@@ -0,0 +1,4 @@
 #!/bin/bash
 cd /home/artanis/Documents/Vox/
 source venv/bin/activate
 python bot.py
--- a/media/Subnautica/CyclopsEngineOff.oga
+++ b/media/Subnautica/CyclopsEngineOff.oga
--- a/media/Subnautica/CyclopsEngineOn.oga
+++ b/media/Subnautica/CyclopsEngineOn.oga
--- a/media/Subnautica/CyclopsOverheat.oga
+++ b/media/Subnautica/CyclopsOverheat.oga
--- a/media/Subnautica/Cyclops_Welcome.oga
+++ b/media/Subnautica/Cyclops_Welcome.oga
--- a/media/Subnautica/Cyclops_Welcome2.oga
+++ b/media/Subnautica/Cyclops_Welcome2.oga
--- a/media/TF2/Ronin/diag_gs_titanRonin_embark_03.wav
+++ b/media/TF2/Ronin/diag_gs_titanRonin_embark_03.wav
--- a/media/TF2/Ronin/diag_gs_titanRonin_embark_05.wav
+++ b/media/TF2/Ronin/diag_gs_titanRonin_embark_05.wav
--- a/media/TF2/Ronin/diag_gs_titanRonin_embark_06.wav
+++ b/media/TF2/Ronin/diag_gs_titanRonin_embark_06.wav
--- a/media/TF2/Ronin/diag_gs_titanRonin_embark_08.wav
+++ b/media/TF2/Ronin/diag_gs_titanRonin_embark_08.wav
--- a/media/TF2/Ronin/diag_gs_titanRonin_embark_09.wav
+++ b/media/TF2/Ronin/diag_gs_titanRonin_embark_09.wav
--- a/media/TF2/Ronin/diag_gs_titanRonin_embark_10.wav
+++ b/media/TF2/Ronin/diag_gs_titanRonin_embark_10.wav
--- a/media/TF2/Ronin/diag_gs_titanRonin_embark_11.wav
+++ b/media/TF2/Ronin/diag_gs_titanRonin_embark_11.wav
--- a/numba_config.py
+++ b/numba_config.py
@@ -0,0 +1,19 @@
 import os
 import sys
 # Set a writable cache directory for Numba
 # This is crucial when running as a systemd service with restricted home directory access.
 # The cache will be created in the bot's root directory.
 CACHE_DIR = os.path.join(os.path.dirname(__file__), '.numba_cache')
 if not os.path.exists(CACHE_DIR):
    try:
        os.makedirs(CACHE_DIR)
        print(f"Numba cache directory created at: {CACHE_DIR}")
    except OSError as e:
        print(f"Error creating Numba cache directory: {e}", file=sys.stderr)
 # Set the environment variable for Numba
 os.environ['NUMBA_CACHE_DIR'] = CACHE_DIR
 print(f"Numba cache directory set to: {os.environ.get('NUMBA_CACHE_DIR')}")
--- a/pockettts.service
+++ b/pockettts.service
--- a/requirements.txt
+++ b/requirements.txt
--- a/research/overview.md
+++ b/research/overview.md
@@ -0,0 +1,140 @@
 # Vox - Discord Text-to-Speech Bot
 A Python-based Discord bot that generates neural text-to-speech using voice cloning from reference WAV files.
 ## Project Structure
 ```
 Vox/
 ├── bot.py                 # Main entry point, Discord bot implementation
 ├── config.py              # Configuration management using environment variables
 ├── voice_manager.py       # Voice discovery, loading, and user preferences
 ├── audio_effects.py       # Audio post-processing effects (7 effects)
 ├── audio_preprocessor.py  # Audio preprocessing for voice cloning
 ├── numba_config.py        # Numba JIT compiler cache configuration
 ├── requirements.txt       # Python dependencies
 ├── launch.sh              # Shell script to start the bot
 ├── pockettts.service      # Systemd service file for Linux deployment
 ├── README.md             # Comprehensive documentation
 ├── .env                   # Production environment configuration
 ├── .env.testing           # Testing environment configuration
 ├── .env.example           # Environment configuration template
 └── voices/               # Directory for voice WAV files
    ├── preferences.json  # User voice/effect preferences (auto-generated)
    └── *.wav             # Voice reference files
 ```
 ## Core Functionality
 ### TTS Implementation
 - **Engine**: Pocket TTS (`pocket-tts` library) for neural text-to-speech synthesis
 - **Voice Cloning**: Uses reference WAV files to clone voices via `model.get_state_for_audio_prompt()`
 - **On-demand Loading**: Voices are loaded only when first needed, then cached
 ### Discord Integration
 - Monitors a configured text channel for messages
 - Joins the user's voice channel when they speak
 - Uses `discord.FFmpegPCMAudio` with piped WAV data for streaming
 ### Audio Processing Pipeline
 ```
 Text Message → Pocket TTS → Audio Effects → Normalize → FFmpeg → Discord VC
 ```
 ## Dependencies
 | Library | Purpose |
 |---------|---------|
 | `discord.py[voice]>=2.3.0` | Discord bot API with voice support |
 | `pocket-tts>=0.1.0` | Neural TTS engine with voice cloning |
 | `scipy>=1.10.0` | Scientific computing (audio I/O) |
 | `numpy>=1.24.0` | Numerical computing |
 | `librosa>=0.10.0` | Audio analysis and effects |
 | `noisereduce>=3.0.0` | Noise reduction preprocessing |
 | `soundfile>=0.12.0` | Audio file I/O |
 | `python-dotenv>=1.0.0` | Environment variable loading |
 **System Requirements**: Python 3.10+, FFmpeg
 ## Key Modules
 ### `TTSBot` (bot.py)
 Main Discord bot class that extends `commands.Bot`. Handles:
 - Message processing and TTS queue
 - Voice channel connections
 - Slash command registration
 - Startup initialization (loads TTS model, discovers voices)
 ### `VoiceManager` (voice_manager.py)
 Manages voice files and user preferences:
 - Discovers voices from WAV files in `voices/` directory
 - On-demand voice loading with caching
 - Per-user voice selection and effect preferences
 - Preferences persistence to JSON
 ### `AudioEffects` (audio_effects.py)
 Provides 7 post-processing effects:
 1. **Pitch** (-12 to +12 semitones)
 2. **Speed** (0.5x to 2.0x)
 3. **Echo** (0-100%)
 4. **Robot** (0-100%) - Ring modulation
 5. **Chorus** (0-100%) - Multiple voice layering
 6. **Tremolo Depth** (0.0-1.0)
 7. **Tremolo Rate** (0.0-10.0 Hz)
 ### `AudioPreprocessor` (audio_preprocessor.py)
 Prepares voice reference files for cloning:
 1. Load and resample to 22050 Hz
 2. Normalize volume
 3. Trim silence
 4. Noise reduction
 5. Limit length (default 15 seconds)
 ### `Config` (config.py)
 Centralized configuration management with environment-aware loading and validation.
 ## Slash Commands
 | Command | Description |
 |---------|-------------|
 | `/voice list` | Show available voices |
 | `/voice set <name>` | Select your voice |
 | `/voice current` | Show current voice |
 | `/voice refresh` | Rescan for new voices |
 | `/voice preview <name>` | Preview before committing |
 | `/effects list` | Show your effect settings |
 | `/effects set <effect> <value>` | Adjust effects |
 | `/effects reset` | Reset to defaults |
 ## Features
 - **Voice Cloning**: Add new voices by placing `.wav` files in `voices/` directory
 - **Per-User Customization**: Each user can have their own voice and effect preferences
 - **Hot-Reload**: Rescan for new voices without restart (`/voice refresh`)
 - **Message Queue**: Queues messages for sequential playback
 - **Inactivity Management**: Disconnects after 10 minutes of inactivity
 - **Testing Support**: Separate `.env.testing` configuration for safe development
 ## Configuration (.env)
 ```env
 DISCORD_TOKEN=your_bot_token
 TEXT_CHANNEL_ID=channel_id_to_monitor
 VOICES_DIR=./voices
 DEFAULT_VOICE=optional_default_voice_name
 ```
 ## Running the Bot
 ```bash
 # Production
 python bot.py
 # Testing (uses .env.testing)
 python bot.py testing
 # Or use the launch script
 ./launch.sh
 ```
 For production deployment on Linux, a systemd service file (`pockettts.service`) is included.
--- a/setup_linux.sh
+++ b/setup_linux.sh
@@ -1,213 +0,0 @@
 #!/bin/bash
 # Pocket TTS Discord Bot - Linux Setup Script
 # This script helps set up the bot and install it as a systemd service
 set -e
 # Colors for output
 RED='\033[0;31m'
 GREEN='\033[0;32m'
 YELLOW='\033[1;33m'
 NC='\033[0m' # No Color
 echo -e "${GREEN}========================================${NC}"
 echo -e "${GREEN}  Pocket TTS Discord Bot - Linux Setup${NC}"
 echo -e "${GREEN}========================================${NC}"
 echo
 # Get the directory where this script is located
 SCRIPT_DIR="$(cd "$(dirname "${BASH_SOURCE[0]}")" && pwd)"
 USERNAME="$(whoami)"
 # Check if running as root
 if [ "$EUID" -eq 0 ]; then
    echo -e "${RED}Please do not run this script as root.${NC}"
    echo "Run it as the user who will own the bot."
    exit 1
 fi
 # Function to check if a command exists
 command_exists() {
    command -v "$1" >/dev/null 2>&1
 }
 echo -e "${YELLOW}Step 1: Checking system dependencies...${NC}"
 # Check for Python 3.10+
 if command_exists python3; then
    PYTHON_VERSION=$(python3 -c 'import sys; print(f"{sys.version_info.major}.{sys.version_info.minor}")')
    PYTHON_MAJOR=$(echo $PYTHON_VERSION | cut -d. -f1)
    PYTHON_MINOR=$(echo $PYTHON_VERSION | cut -d. -f2)
    if [ "$PYTHON_MAJOR" -ge 3 ] && [ "$PYTHON_MINOR" -ge 10 ]; then
        echo -e "  ${GREEN}✓${NC} Python $PYTHON_VERSION found"
    else
        echo -e "  ${RED}✗${NC} Python 3.10+ required, found $PYTHON_VERSION"
        echo "  Please install Python 3.10 or later"
        exit 1
    fi
 else
    echo -e "  ${RED}✗${NC} Python 3 not found"
    echo "  Please install Python 3.10 or later"
    exit 1
 fi
 # Check for FFmpeg
 if command_exists ffmpeg; then
    FFMPEG_VERSION=$(ffmpeg -version 2>&1 | head -n1 | cut -d' ' -f3)
    echo -e "  ${GREEN}✓${NC} FFmpeg found ($FFMPEG_VERSION)"
 else
    echo -e "  ${RED}✗${NC} FFmpeg not found"
    echo ""
    echo "  Please install FFmpeg:"
    echo "    Ubuntu/Debian: sudo apt install ffmpeg"
    echo "    Fedora: sudo dnf install ffmpeg"
    echo "    Arch: sudo pacman -S ffmpeg"
    exit 1
 fi
 # Check for pip
 if command_exists pip3; then
    echo -e "  ${GREEN}✓${NC} pip3 found"
 else
    echo -e "  ${RED}✗${NC} pip3 not found"
    echo "  Please install python3-pip"
    exit 1
 fi
 echo
 echo -e "${YELLOW}Step 2: Setting up virtual environment...${NC}"
 cd "$SCRIPT_DIR"
 if [ -d "venv" ]; then
    echo "  Virtual environment already exists"
 else
    echo "  Creating virtual environment..."
    python3 -m venv venv
    echo -e "  ${GREEN}✓${NC} Virtual environment created"
 fi
 echo "  Activating virtual environment..."
 source venv/bin/activate
 echo "  Installing dependencies..."
 pip install --upgrade pip -q
 pip install -r requirements.txt -q
 echo -e "  ${GREEN}✓${NC} Dependencies installed"
 echo
 echo -e "${YELLOW}Step 3: Checking configuration...${NC}"
 # Check for .env file
 if [ -f ".env" ]; then
    echo -e "  ${GREEN}✓${NC} .env file found"
 else
    echo -e "  ${YELLOW}!${NC} .env file not found"
    echo "  Creating .env template..."
    cat > .env << 'EOF'
 # Discord Bot Configuration
 DISCORD_TOKEN=your_bot_token_here
 TEXT_CHANNEL_ID=your_channel_id_here
 # Voice Configuration
 VOICES_DIR=./voices
 # DEFAULT_VOICE=estinien
 EOF
    echo -e "  ${YELLOW}!${NC} Please edit .env with your Discord token and channel ID"
 fi
 # Check for voices directory
 if [ -d "voices" ]; then
    VOICE_COUNT=$(find voices -name "*.wav" 2>/dev/null | wc -l)
    echo -e "  ${GREEN}✓${NC} voices directory found ($VOICE_COUNT voice files)"
    if [ "$VOICE_COUNT" -eq 0 ]; then
        echo -e "  ${YELLOW}!${NC} No voice files found. Add .wav files to the voices directory."
    fi
 else
    echo "  Creating voices directory..."
    mkdir -p voices
    echo -e "  ${YELLOW}!${NC} Add voice .wav files to the voices directory"
 fi
 echo
 echo -e "${YELLOW}Step 4: Setting up systemd service...${NC}"
 read -p "Do you want to install the bot as a systemd service? (y/n) " -n 1 -r
 echo
 if [[ $REPLY =~ ^[Yy]$ ]]; then
    # Create the service file with correct paths
    SERVICE_FILE="/tmp/pockettts.service"
    cat > "$SERVICE_FILE" << EOF
 [Unit]
 Description=Pocket TTS Discord Bot
 After=network-online.target
 Wants=network-online.target
 [Service]
 User=$USERNAME
 Group=$USERNAME
 WorkingDirectory=$SCRIPT_DIR
 ExecStart=$SCRIPT_DIR/venv/bin/python bot.py
 Restart=on-failure
 RestartSec=10
 TimeoutStopSec=30
 StandardOutput=journal
 StandardError=journal
 SyslogIdentifier=pockettts
 # Security hardening
 NoNewPrivileges=true
 ProtectSystem=strict
 ProtectHome=read-only
 ReadWritePaths=$SCRIPT_DIR/voices
 PrivateTmp=true
 [Install]
 WantedBy=multi-user.target
 EOF
    echo "  Installing systemd service (requires sudo)..."
    sudo cp "$SERVICE_FILE" /etc/systemd/system/pockettts.service
    sudo systemctl daemon-reload
    echo -e "  ${GREEN}✓${NC} Service installed"
    read -p "Do you want to enable the service to start on boot? (y/n) " -n 1 -r
    echo
    if [[ $REPLY =~ ^[Yy]$ ]]; then
        sudo systemctl enable pockettts
        echo -e "  ${GREEN}✓${NC} Service enabled for boot"
    fi
    read -p "Do you want to start the service now? (y/n) " -n 1 -r
    echo
    if [[ $REPLY =~ ^[Yy]$ ]]; then
        sudo systemctl start pockettts
        echo -e "  ${GREEN}✓${NC} Service started"
        sleep 2
        echo
        echo "  Service status:"
        sudo systemctl status pockettts --no-pager || true
    fi
 fi
 echo
 echo -e "${GREEN}========================================${NC}"
 echo -e "${GREEN}  Setup Complete!${NC}"
 echo -e "${GREEN}========================================${NC}"
 echo
 echo "Useful commands:"
 echo "  Start bot:      sudo systemctl start pockettts"
 echo "  Stop bot:       sudo systemctl stop pockettts"
 echo "  Restart bot:    sudo systemctl restart pockettts"
 echo "  View status:    sudo systemctl status pockettts"
 echo "  View logs:      journalctl -u pockettts -f"
 echo "  Disable boot:   sudo systemctl disable pockettts"
 echo
 echo "To run the bot manually (without systemd):"
 echo "  cd $SCRIPT_DIR"
 echo "  source venv/bin/activate"
 echo "  python bot.py"
 echo
--- a/voice_manager.py
+++ b/voice_manager.py
@@ -6,6 +6,7 @@ from typing import Any
 from pocket_tts import TTSModel
 from audio_effects import AudioEffects
 from audio_preprocessor import (
    AudioPreprocessor,
    PreprocessingConfig,
@@ -26,6 +27,8 @@ class VoiceManager:
        self._voice_states: dict[str, Any] = {}
        # Per-user voice preferences: user_id -> voice_name
        self._user_voices: dict[int, str] = {}
        # Per-user audio effects: user_id -> {"pitch": int, "speed": float}
        self._user_effects: dict[int, dict[str, Any]] = {}
        # Available voices: voice_name -> file_path
        self._available_voices: dict[str, Path] = {}
@@ -181,10 +184,129 @@ class VoiceManager:
            self.preferences_file.parent.mkdir(parents=True, exist_ok=True)
            data = {
-                "user_voices": {str(k): v for k, v in self._user_voices.items()}
+                "user_voices": {str(k): v for k, v in self._user_voices.items()},
                "user_effects": {str(k): v for k, v in self._user_effects.items()},
            }
            with open(self.preferences_file, "w") as f:
                json.dump(data, f, indent=2)
        except Exception as e:
            print(f"Warning: Failed to save preferences: {e}")
    # Effects management methods
    def get_user_effects(self, user_id: int) -> dict[str, int | float]:
        """Get the audio effects for a user. Returns defaults if not set."""
        effects = self._user_effects.get(user_id, {})
        # Convert to proper types (JSON stores them as strings)
        pitch = effects.get("pitch", AudioEffects.PITCH_DEFAULT)
        speed = effects.get("speed", AudioEffects.SPEED_DEFAULT)
        echo = effects.get("echo", AudioEffects.ECHO_DEFAULT)
        robot = effects.get("robot", AudioEffects.ROBOT_DEFAULT)
        chorus = effects.get("chorus", AudioEffects.CHORUS_DEFAULT)
        tremolo_depth = effects.get("tremolo_depth", AudioEffects.TREMOLO_DEPTH_DEFAULT)
        tremolo_rate = effects.get("tremolo_rate", AudioEffects.TREMOLO_RATE_DEFAULT)
        return {
            "pitch": int(pitch) if pitch is not None else AudioEffects.PITCH_DEFAULT,
            "speed": float(speed) if speed is not None else AudioEffects.SPEED_DEFAULT,
            "echo": int(echo) if echo is not None else AudioEffects.ECHO_DEFAULT,
            "robot": int(robot) if robot is not None else AudioEffects.ROBOT_DEFAULT,
            "chorus": int(chorus) if chorus is not None else AudioEffects.CHORUS_DEFAULT,
            "tremolo_depth": float(tremolo_depth) if tremolo_depth is not None else AudioEffects.TREMOLO_DEPTH_DEFAULT,
            "tremolo_rate": float(tremolo_rate) if tremolo_rate is not None else AudioEffects.TREMOLO_RATE_DEFAULT,
        }
    def set_user_effect(self, user_id: int, effect_name: str, value: Any) -> tuple[bool, str]:
        """
        Set an audio effect for a user.
        Returns:
            Tuple of (success, message)
        """
        # Validate the effect
        is_valid, error_msg = AudioEffects.validate_effect(effect_name, value)
        if not is_valid:
            return False, error_msg
        # Get current effects
        if user_id not in self._user_effects:
            self._user_effects[user_id] = {}
        # Save the effect
        current_effects = self._user_effects[user_id].copy()
        if effect_name == "pitch":
            current_effects["pitch"] = int(value)
        elif effect_name == "speed":
            current_effects["speed"] = float(value)
        elif effect_name == "echo":
            current_effects["echo"] = int(value)
        elif effect_name == "robot":
            current_effects["robot"] = int(value)
        elif effect_name == "chorus":
            current_effects["chorus"] = int(value)
        elif effect_name == "tremolo_depth":
            current_effects["tremolo_depth"] = float(value)
        elif effect_name == "tremolo_rate":
            current_effects["tremolo_rate"] = float(value)
        # Count active effects and show warning if > 2
        active_count = AudioEffects.count_active_effects(
            pitch=current_effects.get("pitch", AudioEffects.PITCH_DEFAULT),
            speed=current_effects.get("speed", AudioEffects.SPEED_DEFAULT),
            echo=current_effects.get("echo", AudioEffects.ECHO_DEFAULT),
            robot=current_effects.get("robot", AudioEffects.ROBOT_DEFAULT),
            chorus=current_effects.get("chorus", AudioEffects.CHORUS_DEFAULT),
            tremolo_depth=current_effects.get("tremolo_depth", AudioEffects.TREMOLO_DEPTH_DEFAULT),
        )
        self._user_effects[user_id][effect_name] = value
        self._save_preferences()
        if active_count > 2:
            return True, f"Effect applied! ⚠️ You have {active_count} active effects. Performance may be slower with more effects."
        else:
            return True, "Effect applied successfully!"
    def reset_user_effects(self, user_id: int) -> None:
        """Reset all audio effects to defaults for a user."""
        if user_id in self._user_effects:
            del self._user_effects[user_id]
            self._save_preferences()
    def count_active_effects(self, user_id: int) -> int:
        """Count how many effects are active for a user."""
        effects = self.get_user_effects(user_id)
        return AudioEffects.count_active_effects(
            pitch=effects["pitch"],
            speed=effects["speed"],
            echo=effects["echo"],
            robot=effects["robot"],
            chorus=effects["chorus"],
            tremolo_depth=effects["tremolo_depth"],
        )
    def _load_preferences(self) -> None:
        """Load user voice preferences from JSON file."""
        if not self.preferences_file.exists():
            return
        try:
            with open(self.preferences_file, "r") as f:
                data = json.load(f)
            # Load user preferences (convert string keys back to int)
            for user_id_str, voice_name in data.get("user_voices", {}).items():
                user_id = int(user_id_str)
                # Only load if voice still exists
                if voice_name.lower() in self._available_voices:
                    self._user_voices[user_id] = voice_name.lower()
            # Load user effects (convert string keys back to int)
            for user_id_str, effects in data.get("user_effects", {}).items():
                user_id = int(user_id_str)
                self._user_effects[user_id] = effects
            print(f"  Loaded {len(self._user_voices)} user voice preferences")
            print(f"  Loaded {len(self._user_effects)} user effect preferences")
        except Exception as e:
            print(f"  Warning: Failed to load preferences: {e}")
--- a/voices/ChoGath.wav
+++ b/voices/ChoGath.wav
--- a/voices/Estinien.wav
+++ b/voices/Estinien.wav
--- a/voices/Gaius.wav
+++ b/voices/Gaius.wav
--- a/voices/Gibralter_funny.wav
+++ b/voices/Gibralter_funny.wav
--- a/voices/Gibralter_good.wav
+++ b/voices/Gibralter_good.wav
--- a/voices/HankHill.wav
+++ b/voices/HankHill.wav
--- a/voices/Johnny.wav
+++ b/voices/Johnny.wav
--- a/voices/MasterChief.wav
+++ b/voices/MasterChief.wav
--- a/voices/SelfHelpSingh.wav
+++ b/voices/SelfHelpSingh.wav
--- a/voices/Trump.wav
+++ b/voices/Trump.wav
--- a/voices/preferences.json
+++ b/voices/preferences.json
@@ -1,5 +0,0 @@
 {
  "user_voices": {
    "122139828182712322": "hankhill"
  }
 }
Author	SHA1	Message	Date
Spencer	9917d44f5d	docs: add HuggingFace cache troubleshooting to README - Document HF_HOME environment variable for writable cache - Add systemd service permission guidance for /tmp paths - Troubleshooting steps for read-only file system errors	2026-02-26 15:56:09 -06:00
Spencer Grimes	85a334a57b	docs: update README with comprehensive effects documentation and bump version to 1.2.0 README Updates: - Updated features list with all new capabilities - Comprehensive Audio Effects section covering all 7 effects: - Pitch, Speed, Echo, Robot, Chorus, Tremolo Depth, Tremolo Rate - Detailed effect ranges, defaults, and descriptions - Effect application order documentation - Performance notes and warnings - Enhanced Preview with Effects section with examples - Example effect combinations for users to try Version Bump: - Bumped __version__ from 1.1.0 to 1.2.0 Major features in 1.2.0: - 4 new voice effects (echo, robot, chorus, tremolo) - Unlimited effects with performance warnings - Complete effects pipeline implementation - Enhanced preview system	2026-01-31 17:33:28 -06:00
Spencer Grimes	40843e4ac9	fix: convert string values to proper types in count_active_effects JSON stores effect values as strings, but count_active_effects was tryting to compare them directly with integers/floats. Now properly converts: - pitch, echo, robot, chorus -> int - speed, tremolo_depth -> float Before comparison to avoid TypeError: '>' not supported between instances of 'str' and 'int'	2026-01-31 17:28:47 -06:00
Spencer Grimes	7e76deed3d	feat: wire up all effects to audio processing pipeline - Updated queue system to pass effects as dict instead of individual params - Updated process_queue to handle effects_dict for previews - Updated speak_message to extract all 7 effects from user settings - Updated _generate_wav_bytes to accept effects dict and pass all params - Updated _handle_voice_preview to use new effects dict system - Effects now actually process the audio: - pitch, speed, echo, robot, chorus, tremolo_depth, tremolo_rate - Fixed preview effect description to use preview_effects dict	2026-01-31 17:25:52 -06:00
Spencer Grimes	795d5087e9	feat: add 4 new voice effects (echo, robot, chorus, tremolo) - Removed MAX_ACTIVE_EFFECTS limit (effects unlimited) - Added echo effect (0-100%): spatial delay/reverb - Added robot effect (0-100%): ring modulation voice - Added chorus effect (0-100%): multiple voices effect - Added tremolo depth (0.0-1.0) and rate (0.0-10.0 Hz): amplitude modulation - Effects apply in order: pitch → speed → echo → chorus → tremolo → robot - Updated /effects command with all 7 effect choices - Updated /effects list to display all 7 effects with emojis - Updated warning system: warns when > 2 active effects - Added validation and formatting for all new effects - Updated voice_manager.py to handle all 7 effect storage/loading Note: Cancel button for processing >10s not yet implemented Note: Queue system needs updating to handle all effect parameters	2026-01-31 17:10:19 -06:00
Spencer Grimes	8d4ac59f73	chore: untrack voices/preferences.json from git Remove the preferences.json file from git tracking while keeping it locally. This file contains user-specific effect settings that should not be committed or shared between installations.	2026-01-31 16:56:15 -06:00
Spencer Grimes	68bc3b2c7d	chore: add voices/preferences.json to .gitignore User effect preferences should not be committed to git as they are personal user data that varies per installation.	2026-01-31 16:53:38 -06:00
Spencer Grimes	4cb0a78486	fix: squeeze audio to 1D before applying effects The TTS model returns a 2D array [samples, 1], but librosa.effects functions expect 1D arrays. This was causing the warning: 'n_fft=2048 is too large for input signal of length=1' Fix: Squeeze to 1D before effects, reshape back after. Also moved the effects application logic to handle the shape conversion properly.	2026-01-31 16:50:43 -06:00
Spencer Grimes	b12639a618	fix: convert effect values to proper types when loading from preferences JSON stores numbers as strings, so pitch and speed were being returned as strings from get_user_effects(), causing format string errors like: 'Unknown format code d for object of type str' Now get_user_effects() explicitly converts: - pitch to int - speed to float This fixes the format string errors when logging or displaying effects.	2026-01-31 16:46:24 -06:00
Spencer Grimes	f082c62a16	fix: use copy_global_to before guild sync for immediate command availability The issue: Commands registered as global commands weren't being synced when calling tree.sync(guild=...) because they weren't associated with the specific guild context. The fix: Call tree.copy_global_to(guild=...) before sync() to copy global commands to each guild's context. This makes commands appear immediately instead of requiring global sync (which can take up to 1 hour). Reference: discord.py FAQ recommends copy_global_to for development when you want immediate command availability in specific guilds.	2026-01-31 16:43:10 -06:00
Spencer Grimes	85f3e79d2a	debug: add comprehensive logging for command registration and sync - Added _log_registered_commands() to list all commands in tree - Added logging in __init__ to track command registration - Enhanced on_ready() sync logging with detailed information - Shows registered commands before and during sync - Shows specific guild sync status with command counts - Added error handling for Forbidden errors (missing permissions) - Clear warnings when no guilds are synced	2026-01-31 16:40:23 -06:00
Spencer Grimes	9f14e8c745	feat: add audio effects (pitch and speed control) - Added new audio_effects.py module with pitch shift and speed change - Pitch range: -12 to +12 semitones (higher = chipmunk, lower = deeper) - Speed range: 0.5 to 2.0x (higher = faster, lower = slower) - Maximum 2 active effects per user (performance optimization) - Added /effects command group: - /effects list - Shows current effects with descriptions - /effects set pitch\|speed <value> - Apply effects - /effects reset - Confirmation UI to clear all effects - Effects persist across restarts in preferences.json - Updated /voice preview to support optional pitch/speed parameters - Effects applied in _generate_wav_bytes using librosa - Added performance warnings when processing takes >1 second - Updated README with effects documentation	2026-01-31 15:43:29 -06:00
Spencer Grimes	4a2d72517f	feat: add /voice preview command - Added 8 random preview sample lines for voice testing - New /voice preview <name> command to hear voices before selecting - Previews play in queue like regular messages (no queue jumping) - Preview does NOT change user's active voice preference - Updated queue system to support voice override for previews - Added documentation for new command in README	2026-01-31 15:06:45 -06:00
Spencer Grimes	2403b431e9	chore: bump version to 1.1.0 Major features added since 1.0.0: - Test Mode support for safe development - Auto-updates dependencies on startup - Multi-voice support with per-user preferences - Voice persistence across restarts - Hot-reload voices without restart	2026-01-31 14:47:52 -06:00
Spencer Grimes	c0e5d4bcb6	docs: update README with Test Mode and Auto-update features - Added Test Mode documentation for safe development - Added Auto-updates feature description - Added usage instructions for testing mode	2026-01-31 14:46:37 -06:00
Spencer Grimes	c5e3fd33c4	Added Test Mode	2026-01-31 14:42:08 -06:00
Spencer Grimes	d0de47bdd7	fix: replace emoji characters with ASCII-safe markers for Windows compatibility - Replace Unicode emoji (✓, ⚠️) with [OK] and [WARN] in audio_preprocessor.py to prevent UnicodeEncodeError on Windows console (cp1252 codec) - Add auto-update dependencies function to bot.py for easier maintenance - Remove setup_linux.sh (no longer needed) - Update .gitignore to exclude VS Code launch.json	2026-01-31 13:54:27 -06:00
Spencer Grimes	9e537b7d20	Added SelfHelpSingh	2026-01-18 23:03:16 -06:00
Spencer Grimes	d40f895e2a	Added Chogath	2026-01-18 19:36:40 -06:00
Spencer Grimes	a46ddc9b21	Added Disconnect	2026-01-18 18:27:01 -06:00
Spencer	736a819493	feat: Rename pockettts service to vox and improve numba caching Renamed the systemd service from "pockettts" to "vox" for better branding and clarity. Updated the script to reflect the new service name. Addressed numba caching issues when running as a systemd service: - Created to explicitly set to a project-local directory (). - Modified to import early in the execution flow. - Updated the systemd service file to grant write permissions to the directory. - Added to to prevent caching files from being committed.	2026-01-18 18:09:10 -06:00