Skip to content

Thairu-dev/StealthIt

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

23 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

StealthIt

StealthIt is a powerful Vibe coding Challenge, AI-integrated desktop utility designed for stealth and efficiency(Cluely Alternative). It provides instant access to AI capabilities, screen analysis, and voice interaction while remaining completely hidden from the taskbar and screen capture software.

Developed by Antigravity and Gemini-3-pro


🤖 Supported AI Providers

StealthIt supports 5 AI providers for maximum flexibility:

Provider Vision Support Notes
Google Gemini Cloud API, excellent vision capabilities
Ollama Local LLMs, requires vision models like llava
OpenAI GPT-4o, GPT-4 Turbo, o1 models
Anthropic Claude 4, Claude 3.5 Sonnet/Haiku/Opus
Cerebras Blazing-fast inference, Llama models

🚀 Ollama Support

StealthIt now fully supports local LLMs via Ollama, including vision capabilities!

Requirements for Vision (Screen Capture)

To use the Capture & Analyze feature (Ctrl+Enter) with Ollama, you MUST use a multimodal (vision-capable) model. Standard text models like llama3 will not work with images.

Recommended Models:

  • llava (Lightweight, fast)
  • llama3.2-vision (Higher quality)

Setup:

  1. Install Ollama from ollama.com.
  2. Pull a vision model:
    ollama pull llava
  3. In StealthIt Settings, select Ollama provider and choose llava as the model.

✨ Features

  • 👻 True Stealth Mode: The application is hidden from the Windows Taskbar and is invisible to screen capture tools (OBS, Discord, Teams, etc.) thanks to advanced window affinity settings.
  • 🧠 Multi-Provider AI: Powered by Google Gemini, OpenAI, Anthropic Claude, Cerebras, or local Ollama models.
  • 📸 Instant Vision: Press Ctrl+Enter to instantly capture a screenshot and analyze it with AI.
  • 🎤 Voice Interaction: Press Ctrl+R to record audio and get instant transcriptions and AI responses.
  • ⌨️ Global Hotkeys: Control the application from anywhere without losing focus.
  • 🎨 Modern UI: A sleek, dark, semi-transparent interface that floats unobtrusively on your desktop.
  • 📝 Markdown Support: Rich text formatting for AI responses (bold, italics, lists, etc.).
  • ⚡ Quick Model Switching: Click the model chip to instantly switch between providers and models.

🛠️ Installation

  1. Clone the Repository:

    git clone https://github.com/Thairu-dev/StealthIt.git
    cd StealthIt
  2. Install Dependencies: Ensure you have Python 3.10+ installed.

    pip install -r requirements.txt
  3. Run the Application:

    python main.py

🔑 API Keys

Get API keys from your preferred providers:

Provider Get API Key
Google Gemini Google AI Studio
OpenAI OpenAI Platform
Anthropic Anthropic Console
Cerebras Cerebras Cloud
Ollama No API key needed (runs locally)

⚙️ Configuration

  1. Open the Settings menu by clicking the ⚙️ icon or pressing Ctrl+,.
  2. Select your Active Provider from the dropdown.
  3. Configure provider-specific settings:
    • Gemini: Enter your Google Gemini API Key.
    • OpenAI: Enter your OpenAI API Key, select model (GPT-4o, GPT-4 Turbo, etc.).
    • Anthropic: Enter your Anthropic API Key, select model (Claude 4, Claude 3.5, etc.).
    • Cerebras: Enter your Cerebras API Key, select model (Llama 4 Scout, Llama 3.1, etc.).
    • Ollama: Configure your Ollama host URL (default: http://localhost:11434).

Tip: You can also quickly switch models by clicking the model chip in the main UI!

🎮 Usage & Hotkeys

Hotkey Action
Ctrl + Enter Capture & Analyze: Takes a screenshot and sends it to the AI with your prompt.
Ctrl + R Record Audio: Toggles microphone recording for voice queries.
Ctrl + T Toggle Chat: Expands or collapses the chat window.
Ctrl + W Close App: Completely terminates the application.
Ctrl + \ Hide/Show: Instantly hides or shows the entire application window.
Ctrl + , Settings: Opens the configuration dialog.

🤝 Contributing

Feel free to submit issues and enhancement requests.


Built with ❤️ by Antigravity & Gemini-3-pro

About

A Stealthy, AI-Agnostic Desktop Utility built with Python (PySide/Qt). Features screen-share invisibility, real-time transcription, and multi-LLM support (Gemini, Ollama).

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors