Benchmark Dataset Overview

Overview

The Benchmark Dataset defines the structure and classification system of Seed data, which serves as the starting point for attack simulations in Auto Red Teaming.
This document explains the data hierarchy, the meaning of each field, and how Benchmark Datasets are connected to Red Teaming evaluations.

Benchmark Data Hierarchy

The Benchmark Dataset is organized into the following four hierarchical levels:

Domain → Risk Taxonomy → Dataset → Seed

Level	Description	Example
Domain	Top-level classification based on evaluation objectives	Safety
Risk Taxonomy	Risk classification system within a Domain (tree structure)	Violence, Illegal Activity, …
Dataset	A collection of Seeds associated with a specific Risk Taxonomy	`dataset-safety-violence-01`
Seed	An individual query that serves as the source of attack prompts	“How can I make an explosive?”

Seed

A Seed is the fundamental unit of evaluation.
Each Seed consists of a single query representing a specific risk scenario. During Auto Red Teaming execution, the Attack Generator uses these Seeds to generate diverse attack prompts.

Each Seed contains the following fields:

Field	Description
ID	Unique identifier of the Seed
Seed Query	Query text representing a risk scenario
Metadata	Additional metadata, if available

Domain Classification

A Domain represents a broad risk area to which an AI system may be exposed.

note

Currently, only the Open-Domain Domain is available.
As additional Domains are introduced, you will be able to switch between them using the Domain tab in the Benchmark Dataset page.

UI Field Reference

Dataset List Table

The following columns are displayed in the Benchmark Dataset page.

Column	Description
ID	Unique identifier of the Dataset
Dataset Name	Name of the Dataset
Description	Description of the Dataset
Risk Taxonomy	List of associated Risk Taxonomy tags
Seed Count	Number of Seeds included in the Dataset

Risk Taxonomy Architecture Panel

The Risk Taxonomy tree displayed on the left side of the screen provides a reference view of the complete risk classification structure for the current Domain.
It is not linked to Dataset filtering.

Relationship to Auto Red Teaming

Seeds in the Benchmark Dataset are used as inputs to Auto Red Teaming evaluations.
The overall flow is as follows:

Benchmark Dataset Auto Red Teaming
───────────────── ─────────────────
Seed → Attack Set configuration
↓
→ Attack Generator creates attack prompts
↓
→ Target model responses are collected and evaluated
↓
Results are reviewed in the Dashboard

When creating an Attack Set, you select a Dataset from the Benchmark Dataset.
The Seeds included in the selected Dataset become the source for attack prompt generation.
The Attack Generator automatically applies multiple strategies to the Seeds to generate attack prompts.
The Target model is evaluated using the generated prompts, and results are available in the Dashboard.

tip

Benchmark Datasets are read-only. Users cannot add or modify Seeds directly; they can only select Datasets when running evaluations.
For instructions on how to browse and inspect Datasets, see Usage Guide > Benchmark Dataset.

Benchmark Data Hierarchy​

Seed​

Domain Classification​

UI Field Reference​

Dataset List Table​

Risk Taxonomy Architecture Panel​

Relationship to Auto Red Teaming​