# Installation

## Requirements

- Python 3.9 or higher
- PyTorch 2.0 or higher
- A HuggingFace transformer model
## Install from PyPI
The simplest way to install:
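The package's PyPI name is not given in this excerpt, so the command below uses a placeholder:

```bash
# Hypothetical: substitute the project's actual PyPI name
pip install <package-name>
```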
## Install from Source
For development or the latest features:
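Cloning and installing in editable mode is the standard pattern; the repository URL below is a placeholder, since the real one is not shown in this excerpt:

```bash
# Hypothetical URL — substitute the project's actual repository
git clone https://github.com/<org>/<repo>.git
cd <repo>
pip install -e .
```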
## Optional Dependencies

### Development
For running tests and contributing:
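Assuming the project defines a `dev` extra in its packaging metadata (a common convention, not confirmed by this excerpt):

```bash
# Hypothetical "dev" extra pulling in the test dependencies
pip install -e ".[dev]"
```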
This installs pytest and pytest-cov.
### Documentation
For building the documentation locally:
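Assuming a `docs` extra is defined (again a convention, not confirmed here):

```bash
# Hypothetical "docs" extra covering the documentation toolchain
pip install -e ".[docs]"
mkdocs serve  # preview the docs locally
```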
This installs MkDocs, the Material theme, and mkdocstrings.
## Verify Installation
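The package's import name is not shown in this excerpt, but a minimal sanity check of the core dependencies it requires looks like this:

```python
import torch
import transformers

# Both requirements should import cleanly and report their versions
print(f"PyTorch: {torch.__version__}")
print(f"Transformers: {transformers.__version__}")
```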
## GPU Support
The package automatically uses CUDA if available. To verify GPU support:
```python
import torch

print(f"CUDA available: {torch.cuda.is_available()}")
print(f"MPS available: {torch.backends.mps.is_available()}")
```
The tracer will automatically select the best available device:
- CUDA (NVIDIA GPU)
- MPS (Apple Silicon)
- CPU (fallback)
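The fallback order above can be sketched as a small helper (`pick_device` is an illustrative name, not part of the package's API):

```python
import torch

def pick_device() -> torch.device:
    # Mirrors the fallback order: CUDA, then Apple-Silicon MPS, then CPU
    if torch.cuda.is_available():
        return torch.device("cuda")
    if torch.backends.mps.is_available():
        return torch.device("mps")
    return torch.device("cpu")

print(f"Selected device: {pick_device()}")
```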
You can also specify a device explicitly:
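The tracer's exact device parameter is not shown in this excerpt; at the PyTorch level, an explicit device is simply a string or `torch.device`:

```python
import torch

# Force CPU regardless of available accelerators;
# "cuda", "cuda:0", or "mps" work the same way on suitable hardware.
device = torch.device("cpu")
print(device)
```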
## Troubleshooting

### SDPA Attention Error
If you see `'sdpa' attention does not support 'output_attentions=True'`:

```python
from transformers import AutoModelForCausalLM

# Fix: use the eager attention implementation, which can return attention weights
model = AutoModelForCausalLM.from_pretrained(
    "model-name",
    attn_implementation="eager",
)
```

Modern transformers releases (4.36+) default to SDPA, which does not support returning attention weights. Always load models with `attn_implementation="eager"`.
### Out of Memory

If you encounter OOM errors:

- Use a smaller model (e.g., `Qwen/Qwen2-0.5B` instead of larger variants)
- Reduce the `top_beam` and `top_k` parameters
- Use shorter input sequences
- Enable half-precision: load the model with `torch_dtype=torch.float16`
### Model Not Outputting Attention

Ensure your model supports `output_attentions=True`. Most HuggingFace models do, but some custom models may not. See Model Compatibility for details.