GPT OSS Safeguard 20B

openai/gpt-oss-safeguard-20b

BackTry Model

GPT OSS Safeguard 20B

openai/gpt-oss-safeguard-20b

BackTry Model

gpt-oss-safeguard is a first open weight reasoning model specifically trained for safety classification tasks to help classify text content based on customizable policies. As a fine-tuned version of gpt-oss, gpt-oss-safeguard is designed to follow explicit written policies that you provide. This enables bring-your-own-policy Trust & Safety AI, where your own taxonomy, definitions, and thresholds guide classification decisions. Well crafted policies unlock gpt-oss-safeguard's reasoning capabilities, enabling it to handle nuanced content, explain borderline decisions, and adapt to contextual factors.

Added Feb 23, 2026

Context Window

128.0K

Max Output

16.4K

Input Price (Auto)

$0.075/1M

Output Price (Auto)

$0.30/1M

Capabilities

Reasoning

Benchmarks

Performance metrics and benchmarks

Artificial Analysis

LMArena

Sourced from Artificial Analysis.

Intelligence Index

24.5

Providers

Auto routing is available for this model. Explicit provider selection is not available.

Loading provider options…

GPT OSS Safeguard 20B

GPT OSS Safeguard 20B

Benchmarks

Providers

Reasoning

Coding

Math

Knowledge