AIRGAP Studio

Supported Models

Catalog of the 6 AI models supported by AIRGAP Studio

Overview

AIRGAP Studio ships with the baseline model, Qwen3-8B-Q4_K_M.gguf, bundled in the installer. Additional models can be installed via a separate Modelpack installer or by placing .gguf files directly into the build/vscodium/models/ directory, where they are detected automatically.

The single source of truth for the model catalog is phase3/models-metadata.json, and baseline priority is managed in phase3/model-version.json.

Model Catalog

| Display Name | Filename | Class | GPU Context (tokens) | CPU Context (tokens) | Compatibility Tier |
|---|---|---|---|---|---|
| Qwen3 8B (Q4_K_M) | Qwen3-8B-Q4_K_M.gguf | 8B | 40,960 | 16,384 | recommended (default) |
| Qwen3 4B (Q4_K_M) | Qwen3-4B-Q4_K_M.gguf | 4B | 40,960 | 8,192 | compatible |
| Granite 4.0 Micro (Q4_K_M) | granite-4.0-micro-Q4_K_M.gguf | micro | 32,768 | 8,192 | experimental |
| Qwen3 1.7B (Q8_0) | Qwen3-1.7B-Q8_0.gguf | micro | 32,768 | 8,192 | experimental |
| Mistral 7B Instruct v0.3 (Q4_K_M) | Mistral-7B-Instruct-v0.3.Q4_K_M.gguf | 7B | 32,768 | 8,192 | compatible |
| Meta Llama 3.1 8B Instruct (Q4_K_M) | Meta-Llama-3.1-8B-Instruct-Q4_K_M.gguf | 8B | 32,768 | 16,384 | compatible |
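
For scripting against the catalog, the table above can be mirrored as a small in-memory structure. The sketch below is only an illustration: the `ModelEntry` type, its field names, and the helper functions are hypothetical and do not reflect the actual schema of phase3/models-metadata.json. The values are copied from the table.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass(frozen=True)
class ModelEntry:
    """One catalog row. Field names are illustrative, not the real JSON schema."""
    display_name: str
    filename: str
    model_class: str
    gpu_context: int  # context window (tokens) when running on GPU
    cpu_context: int  # context window (tokens) when running on CPU
    tier: str         # "recommended", "compatible", or "experimental"


# Values copied from the Model Catalog table.
CATALOG: List[ModelEntry] = [
    ModelEntry("Qwen3 8B (Q4_K_M)", "Qwen3-8B-Q4_K_M.gguf", "8B", 40960, 16384, "recommended"),
    ModelEntry("Qwen3 4B (Q4_K_M)", "Qwen3-4B-Q4_K_M.gguf", "4B", 40960, 8192, "compatible"),
    ModelEntry("Granite 4.0 Micro (Q4_K_M)", "granite-4.0-micro-Q4_K_M.gguf", "micro", 32768, 8192, "experimental"),
    ModelEntry("Qwen3 1.7B (Q8_0)", "Qwen3-1.7B-Q8_0.gguf", "micro", 32768, 8192, "experimental"),
    ModelEntry("Mistral 7B Instruct v0.3 (Q4_K_M)", "Mistral-7B-Instruct-v0.3.Q4_K_M.gguf", "7B", 32768, 8192, "compatible"),
    ModelEntry("Meta Llama 3.1 8B Instruct (Q4_K_M)", "Meta-Llama-3.1-8B-Instruct-Q4_K_M.gguf", "8B", 32768, 16384, "compatible"),
]


def find_by_filename(filename: str) -> Optional[ModelEntry]:
    """Return the catalog entry whose filename matches exactly, or None."""
    for entry in CATALOG:
        if entry.filename == filename:
            return entry
    return None


def context_for(entry: ModelEntry, use_gpu: bool) -> int:
    """Pick the GPU or CPU context size listed for this model."""
    return entry.gpu_context if use_gpu else entry.cpu_context
```

A lookup such as `find_by_filename("Qwen3-4B-Q4_K_M.gguf")` followed by `context_for(entry, use_gpu=False)` yields the CPU context listed for that model in the table.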

Compatibility Tier Definitions

  • recommended: The officially recommended baseline model for AIRGAP Studio. The most thoroughly verified model across all assistants, Designer, and Monitor.
  • compatible: Verified to work correctly. Response quality on some features may differ from the baseline.
  • experimental: Functional but with limited verification of response quality and stability. Intended as a fallback for low-spec environments.

Model Selection Guide

| Environment | Recommended Model | Notes |
|---|---|---|
| VRAM 4GB | Qwen3 1.7B (Q8_0) or Granite 4.0 Micro | Lightweight, fast responses |
| VRAM 6GB | Qwen3 4B (Q4_K_M) | Balanced choice |
| VRAM 8GB or more | Qwen3 8B (Q4_K_M), default / recommended | Mistral 7B / Llama 3.1 8B also viable |
| CPU-only | Qwen3 1.7B or Granite 4.0 Micro | 16GB+ RAM recommended |
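
The selection guide above can be expressed as a simple threshold lookup. This is a minimal sketch, not AIRGAP Studio's own logic: the function name `recommend_models` is hypothetical, and it assumes that each VRAM figure in the table is a lower bound (so 7GB falls in the 6GB row) and that anything under 4GB is treated like the CPU-only row.

```python
from typing import List, Optional

# Display names as they appear in the Model Catalog.
LIGHTWEIGHT = ["Qwen3 1.7B (Q8_0)", "Granite 4.0 Micro (Q4_K_M)"]
BALANCED = ["Qwen3 4B (Q4_K_M)"]
FULL = [
    "Qwen3 8B (Q4_K_M)",                    # default / recommended
    "Mistral 7B Instruct v0.3 (Q4_K_M)",    # also viable
    "Meta Llama 3.1 8B Instruct (Q4_K_M)",  # also viable
]


def recommend_models(vram_gb: Optional[float]) -> List[str]:
    """Return candidate models, first choice first.

    Pass None for a CPU-only machine. Thresholds are read as minimums,
    which is an assumption about how the table is meant to be applied.
    """
    if vram_gb is None or vram_gb < 4:
        # CPU-only row (assumed to also cover GPUs under 4GB).
        return list(LIGHTWEIGHT)
    if vram_gb >= 8:
        return list(FULL)
    if vram_gb >= 6:
        return list(BALANCED)
    return list(LIGHTWEIGHT)
```

For CPU-only machines the table also recommends 16GB or more of RAM, which this sketch does not check.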

See the System Requirements document for detailed GPU/CPU requirements.

Automatic Model Detection

llama-server automatically recognizes .gguf files in the build/vscodium/models/ directory.

build/vscodium/models/
├── Qwen3-8B-Q4_K_M.gguf           # Bundled by default
├── Qwen3-4B-Q4_K_M.gguf           # If additionally installed
└── Mistral-7B-Instruct-v0.3.Q4_K_M.gguf

After placing a new model and restarting AIRGAP Studio, it will be added to the model selection UI automatically.
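
To check which files would be picked up before restarting, the models directory can be listed by hand. The sketch below is an assumption about what detection amounts to (a non-recursive scan for files ending in .gguf) and is not taken from AIRGAP Studio's source; the function name `list_gguf_models` is hypothetical.

```python
from pathlib import Path
from typing import List, Union


def list_gguf_models(models_dir: Union[str, Path]) -> List[str]:
    """Return the sorted filenames of .gguf files directly inside models_dir.

    Subdirectories are not searched, and the extension match is
    case-insensitive; both choices are assumptions of this sketch.
    """
    directory = Path(models_dir)
    if not directory.is_dir():
        return []
    return sorted(
        p.name
        for p in directory.iterdir()
        if p.is_file() and p.suffix.lower() == ".gguf"
    )
```

Calling `list_gguf_models("build/vscodium/models")` from the project root returns the filenames that should then appear in the model selection UI after a restart.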

How to Change Models

Model switching is the sole responsibility of the Monitor extension. Other assistants only display the currently active model (read-only).

  1. Open the Command Palette (Ctrl + Shift + P)
  2. Run the AIRGAP: Select LLM Model command
  3. Choose your model from the QuickPick list
  4. llama-server restarts automatically and loads the new model
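
After step 4, one way to confirm which model the server loaded is to query it directly. The sketch below assumes that AIRGAP Studio's llama-server exposes the OpenAI-compatible `GET /v1/models` endpoint found in upstream llama.cpp, and that it listens on upstream's default address `http://127.0.0.1:8080`; neither is confirmed by this document, so adjust the URL to your installation. The helper names are hypothetical.

```python
import json
import urllib.request
from typing import List


def model_ids(payload: dict) -> List[str]:
    """Extract model identifiers from a /v1/models style response body."""
    return [item["id"] for item in payload.get("data", []) if "id" in item]


def fetch_loaded_models(
    base_url: str = "http://127.0.0.1:8080",  # assumed address, not from the docs
    timeout: float = 5.0,
) -> List[str]:
    """Ask a running llama-server which model(s) it currently serves.

    Raises urllib.error.URLError if the server is not reachable, for
    example while it is still restarting after a model switch.
    """
    with urllib.request.urlopen(f"{base_url}/v1/models", timeout=timeout) as resp:
        return model_ids(json.load(resp))
```

The returned identifier is typically the model path or alias the server was started with, so compare it against the filename chosen in the QuickPick list.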

For details, see the Monitor documentation.

Model Downloads

Beyond the bundled default (Qwen3 8B), additional models are distributed via a separate Modelpack installer.

  • Download: Official downloads page
  • For deployment in air-gapped environments, distribute the Modelpack .exe alongside the main installer.