Skip to content

PaciStardust/HOSCY

Repository files navigation

REWORK IN PROGRESS

For the following reasons this project is currently being rewritten from the ground up:

  • The original is reliant on WPF and other Windows-Only components
  • The project started development with 1 year of experience, leading to an unmaintainable mess
  • As a personal challenge to try out things

If you wish to view the code of the current release version, refer to any tag lower than v1

Current Progress

Initial Release V1

  • Initial Startup
    • ✅ Dependency injection
    • ✅ Logging
    • ✅ Config Loading
    • ❌ Displaying startup errors (Currently windows only)
    • 🆗 Loading splash screen
    • ❌ Version checking & updating
  • User Interface
    • ✅ Recreation of original Hoscy components
    • ❌ Recreation of UI
  • Features
    • ✅ OSC
      • ✅ Sending
      • ✅ Receiving
      • ✅ Routing
      • ✅ OscQuery
    • ❌ Hotkeys
    • ❌ Media
      • ❌ Control
      • ❌ Display
    • ✅ Translation
    • ✅ Textbox Control
    • ✅ STT
      • ✅ Voice Commands
      • ✅ API Recognition
      • ✅ Azure Recognition
      • ✅ Vosk Recognition
      • ✅ Whisper Recognition
      • ✅ Windows Recognition (V1 & V2)
    • ❌ TTS
      • ❌ Azure TTS
      • ❌ Windows TTS

Future Updates

  • ❌ UI Themes
  • ❌ Updating Whisper
  • ❌ Improved TTS with Piper

HOSCY (Old version)

HOSCY is a free and Open-Source tool with many utilities for communication and OSC aimed at making communication and use of OSC easier

Need help setting this up? Check the Quickstart Guide

If you wish to contribute or need support, join the Discord

Features

  • Speech Recognition
    • Windows Speech Recognition
    • Locally running AI (Thanks to WHISPER & VOSK)
    • Azure Cognitive Services
    • Most external APIs (Provided they use raw audio data)
  • Utility for communicating
    • A manual textbox for input with preset support
    • A customizable system for displaying Text on VRChats chatbox
    • Integrated Text-to-Speech support
  • Translation of whatever you say using an external API of your choice
  • OSC (Open Sound Control) Support using CoreOsc
    • Configurable routing of incomming OSC data
    • Sending out OSC data
    • Creation of your own OSC command sequences
    • Support for OSCQuery
    • Counters for any parameter and AFK detection
  • Media control using Voice:
    • Simple and non-intrusive "Now Playing" display

Credits

  • CoreOSC by ValdemarOrn and Dalesjo for sending and receiving OSC Data
  • OSCQuery by VRChat for OSC-Service communication
  • VOSK by AlphaCephei for local AI speech recognition
  • Azure Cognitive Services for API speech recognition
  • Const-me for creating a usable C# whisper wrapper
  • AuroraNemoia for branding (Logo, Name)
  • Hyblocker for providing assistance when I got stuck
  • Realmlist for providing assistance and API keys for testing
  • M.O.O.N for creating a youtube tutorial and sfx for muting and unmuting
  • DrBlackRat for creating the Avatar 3.0 menu for VRChat use

About

Companion for OSC and Communication

Topics

Resources

License

Stars

Watchers

Forks

Contributors

Languages