Ghost Capture for live interviews

Invisible on shared screens. Grounded in your own resume. Ready when the interview turns technical.

Wraith v4.0.0 is a frameless desktop app for Windows, macOS, and Linux built for high-pressure technical interviews. Ghost Capture keeps it out of shared-screen workflows, Dual Audio Intelligence captures both system and mic audio, Connector System (Ollama + Kilo Code) provides flexible AI backends, Resume Intelligence keeps answers rooted in your real work, and access starts at ₹100 for 24 hours inside the app.

>

macOS users: read before installing

After downloading the .dmg, you may need to remove the quarantine attribute before macOS lets you install. Open Terminal and run:

$ xattr -d com.apple.quarantine ~/Downloads/Wraith-*.dmg
If Gatekeeper blocks the app after install, go to System Preferences → Privacy & Security and click "Open Anyway" in the Security section.

Read this before you install

  • Download Wraith here. Every payment and recharge happens inside Wraith, not on this website.
  • The current public build supports Windows x86, x64, and ARM, macOS for both Intel and Apple Silicon, and Linux x64 processors.
  • Keep an active internet connection because Wraith validates the device license while you use it.
  • Ghost Capture plus hotkey-driven visibility keep the workflow discreet when the interview is live.
  • Connector System (Ollama + Kilo Code) provides flexible local/cloud AI backends with auto-model detection.
  • Stealth Browser hardware toggles control mic, camera, audio, and Stealth Mode from the invisible dashboard.
  • Resume Intelligence makes answers sound like your real projects, stack, and experience instead of generic AI filler.
Ghost Capture Frameless window with content protection (Browser is visible)
Connector System Local Ollama + Kilo Code cloud via unified API
Resume Intelligence Your background gets injected into prompt context
Stealth Browser Hardware toggles + Stealth Mode for browser
Auto Model Sync Ollama models auto-detected on launch
GPU Offload Real-time inference performance control
Precision Snapping Global hotkeys move Wraith around a 3x3 grid

See it in action

The real Wraith interface, not mockups.

Every screenshot below comes from the live desktop app. Swipe through to see how Ollama model sync, Resume Intelligence, Ghost Capture controls, and the chat interface actually look on a real machine.

Featured on leading platform discovery sites

What Wraith changes

Stay calmer in the room. Sound more like yourself.

Wraith is built for developers who want discreet support during live interviews without dragging a visible browser tab, overlay extension, or second-screen mess into the call.

01

Keep your composure when the question gets hard

Use Wraith for system design, debugging, architecture, and follow-up questions without needing a visible browser tab or overlay.

02

Answer from your real experience, not generic AI tone

Resume Intelligence injects your projects, tools, and work history into prompt context so answers stay specific and believable.

03

Use speech and screen context instead of frantic typing

Local Whisper transcription and vision capture help Wraith follow the conversation and the screen, but both are manual triggers so you stay in control of when listening or screenshot analysis starts.

04

Keep the workflow local and discreet

Wraith keeps interview prompts, resume context, and local inference on-device while only reaching the internet for live license checks and in-app payments.

05

Stealth Extraction & Control Dashboard

Navigate interview platforms via an invisible dashboard. While the controls and extraction stay hidden, any external browser window remains visible to others for normal interaction.

Inside the app

These are real Wraith systems, not fluffy feature names.

The language below comes directly from how the desktop app is built today, so power users can understand what Wraith is actually doing under the hood.

Ghost Capture

Built to stay out of shared-screen workflows

Wraith uses a frameless always-on-top window with content protection so it can remain discreet during screen sharing. Note: The assistant dashboard is hidden, but the external browser window can be controlled to appear either visible or invisible based on use case.

Connector System

Local + Cloud AI in one app

Wraith v4 unified connector system supports both Ollama (local) and Kilo Code (cloud). Auto-detect Ollama models on launch, or enter your Kilo Code API key to access 100+ cloud models.

Resume Intelligence

Your background becomes prompt context

Load a resume PDF once and Wraith locally parses it so answers can reference the projects and stack you actually want to talk about.

Auto Model Sync

Ollama models auto-detected on launch

Wraith dynamically fetches installed Ollama models on startup. Pull any model via ollama pull and it appears automatically in the model selector.

Stealth Browser

Invisible control + hardware toggles

Navigate interview platforms via an invisible dashboard. Toggle mic, camera, audio permissions, and Stealth Mode (content protection) on the browser directly from the hidden control panel.

Seamless Audio

Local Whisper keeps up with the conversation

Microphone transcription is manual, not always-on. Start it from the microphone icon or hotkey, and Wraith uses local Whisper to transcribe whatever reaches the active microphone before sending that transcript to the selected model.

Vision Intelligence

Capture the screen when the prompt is visual

Screen snapshot support is also manual. Click the screenshot control or use the hotkey when you want Wraith to analyze what is currently on your display for coding prompts and visual tasks.

Atomic Sizing + Precision Snapping

Fast placement without fidgeting

Wraith supports 25%, 50%, and 75% sizing plus directional hotkeys that move the window through a 3x3 grid for fast placement.

GPU Offload Controller

Real-time performance tuning

Toggle between maximum VRAM utilization (GPU: 100%) and automatic resource management (GPU: Auto) directly from the dashboard to optimize inference speed.

GPU Offload Controller

Real-time performance tuning

Toggle between maximum VRAM utilization and automatic resource management directly from the dashboard to optimize inference speed.

Download Wraith

Download the app here, then unlock only the time window you actually need inside Wraith.

Wraith is a desktop product, not a browser tab. This website handles the installer, compatibility guidance, and docs. Every payment happens inside Wraith after you launch it, and the current public release is available for Windows x86, x64, and ARM, macOS for both Intel and Apple Silicon, and Linux x64 processors.

Important compatibility warning before paying inside the app

All Wraith payments happen inside Wraith after download. Only continue if your machine uses a supported processor: Windows x86, x64, or ARM, macOS Intel or Apple Silicon, or Linux x64.

Plans Inside Wraith

The 24-hour pass is the honest default, because not every candidate should be pushed into a month.

Start with the day pass if you just need Wraith for interview day or one short burst of usage. The longer plans are there for active interview loops, coaching, and heavier ongoing use, but the website should not pressure people into buying more time than they need.

Best for active loops

1 Month

₹3000

A better fit if you are moving through multiple rounds in the same month.

  • 30-day access window
  • Useful for repeated screens and follow-up rounds
  • Purchased and extended inside Wraith
Best for sustained use

1 Year

₹36500

Best for sustained prep, heavier usage, coaching, or broader operational use.

  • 365-day access window
  • Useful when Wraith is part of a long search cycle
  • Purchased and managed inside Wraith

Every Wraith purchase and recharge is completed inside the desktop app after install.

What you need

4 GB RAM minimum, 8 GB recommended, around 500 MB free space, a supported Windows x86, x64, or ARM machine, macOS Intel/Apple Silicon Mac, or Linux x64 machine, and an active internet connection for live license validation. Heavier Ollama models and larger context sizes can need more system RAM, more CPU headroom, and in many cases enough GPU memory to stay usable.

Validate before the real call

Run multiple Google Meet checks with a laptop and a phone, confirm Wraith stays invisible on the phone view, verify the active microphone receives the audio you want Whisper to transcribe, and confirm the manual mic and screenshot actions behave correctly with the Ollama model and context size that feel right on your hardware.

Licensing Model

No accounts. No signup wall. Just a device-bound license.

Wraith does not ask users to register or log in. Instead, the license is tied to a machine fingerprint. That is intentional. Not collecting email addresses or account profiles keeps the product simpler, reduces privacy risk, lowers breach exposure, and avoids storing personal user data the app does not actually need. It also means moving to a new device, reinstalling to a different OS, or materially changing the machine can require a new purchase.

How it works

Download, sync your stack, and configure Wraith around your machine.

Wraith is straightforward when the setup is clear: install it on supported Windows, macOS, or Linux hardware, connect your local model stack, choose the Ollama model and context size that fit your machine, then verify Ghost Capture and the manual controls.

1

Install Wraith on a supported machine

Download the current installer for your platform, then open Wraith. Every plan purchase and recharge happens inside the app after install, not on the website.

Windows x86/x64/ARM, macOS Intel/Apple Silicon, and Linux x64
2

Connect Ollama, choose a model, and load your resume

Use Dynamic Model Sync to connect Ollama, pick a local model such as moondream or qwen3.5:0.8b, try model and context-size combinations that feel stable on your machine, load your resume PDF for Resume Intelligence, and verify that the audio source you want Whisper to transcribe reaches the active microphone.

Install preferred models

Run these commands in your terminal to pull the recommended local LLMs:

$ ollama pull moondream
$ ollama pull qwen3.5:0.8b
Local AI plus real context
3

Keep the license live and validate the control surface

Keep internet active so Wraith can validate the license while actions are performed. Run one Google Meet visibility test with a laptop and phone, confirm Ghost Capture stays out of the shared view, then verify the microphone, screenshot, and snapping controls behave as expected.

Ghost Capture plus manual controls

Before a real interview day, run this four-point check.

Compatibility and hardware check

Make sure the machine is Windows x86, x64, or ARM, macOS Intel or Apple Silicon, or Linux x64, and has enough CPU, RAM, and if applicable GPU memory for the Ollama model and context size you plan to use.

Connection and audio check

Keep internet active for live license validation, and verify that the audio source you want Whisper to transcribe reaches the active microphone cleanly.

Visibility check

Join Google Meet from your laptop and phone, share the laptop screen, and confirm Wraith does not appear on the phone view.

Control check

Verify Ctrl+Shift+H, M, S, and the arrow hotkeys, and confirm the mic and screenshot actions trigger correctly with the Ollama setup you plan to use.

100+ Cloud AI Models

Meet your new secret weapon: Kilo Code Connector

Wraith v4.0.0 introduces the Kilo Code connector - a unified gateway that gives you access to 100+ cloud AI models including OpenAI, Anthropic, Google, DeepSeek, xAI, Mistral, and many more. No local GPU required. No Ollama setup needed. Just enter your free API key and you're ready for interview day.

100+ Models

Access GPT-4o, Claude 3.5 Sonnet, Gemini Pro, Llama 3.1, Mistral, DeepSeek, and 100+ more through a single unified API.

Free API Key

Get your Kilo Code API key from app.kilo.ai. Encrypts locally with AES-256-GCM and stays on your device.

Instant Setup

No local model downloads. No GPU requirements. Just paste your API key and start using cloud models immediately.

Why use Kilo Code with Wraith?

No Local Setup

Skip Ollama installation entirely. Enter your API key and go.

No GPU Needed

Cloud models run on remote hardware. Your laptop stays cool.

100+ Options

Switch between GPT-4o, Claude, Gemini, DeepSeek, and more.

Encrypted Storage

Your API key is encrypted with AES-256-GCM locally.

$ Visit https://app.kilo.ai to get your free API key Get Kilo Code API Key

Why Wraith

Made for discreet live interviews, not just another visible AI helper.

Wraith still needs internet for live license checks, but the model workflow, resume context, and interview data can stay local. That is a very different operating model from a visible browser tab or overlay extension.

Tool Visible on screen-share? Uses your resume? Keeps data local?
Browser AI chat tab Yes Sometimes No
Browser overlay extension Yes No No
Wraith desktop app No Yes Yes

FAQ

Questions worth answering before a real interview day.

Short version: Wraith keeps the interview workflow local, needs an active internet connection for live license validation, and relies on manual mic and screenshot controls.

Wraith is built around Ghost Capture, a frameless desktop window, and content protection so it can stay out of shared-screen and recording workflows. That said, you should always test your exact OS, monitor setup, and interview platform before relying on it in a high-stakes call.

Docs for power users

Read the operator notes if you want the full picture before you install.

The landing page above is optimized for first-time visitors. The docs below preserve the deeper details around setup, controls, local AI flow, processor support, and device-bound licensing.

Supported release

Current Windows x86/x64/ARM, macOS Intel/Apple Silicon, and Linux x64 availability, plus the explicit ARM Windows warning before anyone pays.

Local AI workflow

Ollama setup, Dynamic Model Sync, Whisper transcription, and resume loading.

Stealth controls

Ghost Capture, Google Meet visibility checks, hotkeys, and the manual trigger model.

Licensing details

In-app plans, live validation, device fingerprints, and what changes can force a repurchase.