Skip to content

Installation

Step 1: Install via pip

pip install tinfer-ai

This installs the Tinfer Python wrapper and the following commands:

  • tinfer — CLI for chat and text completion
  • tinfer-server — HTTP server with WebUI
  • tinfer-bench — Model benchmarking tool
  • tinfer-quantize — Model quantization tool
  • tinfer-setup — Inference engine installer

Step 2: Setup the Inference Engine

After installing the pip package, you need to download the inference engine binary for your system. There are two options:

Run the setup command — it will automatically detect your OS, CPU, and GPU, then download the correct engine:

tinfer-setup

You will see output like:

=======================================================
  Tinfer Setup -- Inference Engine Installer
=======================================================

[Tinfer] OS:   Windows (win)
[Tinfer] Arch: AMD64 (x64)
[Tinfer] GPU:  NVIDIA GeForce RTX 3060

[Tinfer] Selected Engine: win-x64-cuda
[Tinfer] Downloading: tinfer-v0.2.0-win-x64-cuda.zip
[########################################] 100% (505.5/505.5 MB)

[Tinfer] Extracting to C:\Users\you\.tinfer\bin...

[Tinfer] Done! Setup complete!

The engine is installed to ~/.tinfer/bin/ and all tinfer commands will automatically use it.

Option B: Manual Download (For Developers)

If you prefer to download the engine manually:

1. Download the correct archive for your platform from the GitHub Releases page:

Platform GPU Download
Windows x64 NVIDIA CUDA tinfer-v0.2.0-win-x64-cuda.zip
Windows x64 CPU only tinfer-v0.2.0-win-x64-cpu.zip
Linux x64 NVIDIA CUDA tinfer-v0.2.0-linux-x64-cuda.tar.gz
Linux x64 CPU only tinfer-v0.2.0-linux-x64-cpu.tar.gz
Linux ARM64 CPU only tinfer-v0.2.0-linux-arm64-cpu.tar.gz
macOS ARM64 (Apple Silicon) Metal tinfer-v0.2.0-macos-arm64-metal.tar.gz
macOS x64 (Intel) CPU only tinfer-v0.2.0-macos-x64-cpu.tar.gz

2. Extract the downloaded archive to a folder of your choice.

3. Set the TINFER_ENGINE_PATH environment variable to the extracted folder path:

$env:TINFER_ENGINE_PATH = "D:\path\to\extracted\folder"
export TINFER_ENGINE_PATH="/path/to/extracted/folder"

After setting the variable, all tinfer commands will use the engine from that custom path.

Prerequisites

Requirement Details
Python 3.8 or higher
OS Windows (x64), Linux (x64, ARM64), macOS (x64, ARM64)
GPU (optional) NVIDIA GPU with CUDA drivers for GPU acceleration

GPU is optional

Tinfer works on CPU-only systems. GPU acceleration speeds up inference but is not required. When a CUDA-capable GPU is detected, tinfer-setup will automatically download the GPU-accelerated engine.

Verify Installation

After installing and running setup, verify everything works:

tinfer --help

Upgrade

pip install --upgrade tinfer-ai
tinfer-setup  # Re-download the engine for the new version

Uninstall

pip uninstall tinfer-ai

Next Steps

Once installed, download a model and start using Tinfer!