menu
Eval Plugins Demo
home
Sample Data Evals
tune
Custom Data Evals
Reset All
Clear Evals
Non-Rag Evals
Rag Evals
Explain and Score Evals
Explain and Score Rag Evals
Prompts and Content
Topic
On what topic should the generated questions be?
Number
Number of questions to generate
Generate
Generate or add questions and then evaluate a system prompt
HelpfulAI sample
ChildishAI sample
NeutralAI sample
System Prompt
Enter the system prompt to evaluate
Alternative System Prompt
An alternative system prompt to test the evals that require it.
Add
gpt-4.1-mini
gpt-4.1-mini
gpt-4.1-nano
gpt-4o-mini
Answer Model
gpt-4.1-nano
gpt-4.1-mini
gpt-4.1-nano
gpt-4o-mini
Eval Model
Evaluate
Question
filter_alt
Filter
Contains
Equals
Not equals
Contains
Starts with
Ends with
Does not contain
Is null
Is empty
Is not null
Is not empty
And
And
Or
Equals
Equals
Not equals
Contains
Starts with
Ends with
Does not contain
Is null
Is empty
Is not null
Is not empty
Clear
Apply
Eval
filter_alt
Filter
Contains
Equals
Not equals
Contains
Starts with
Ends with
Does not contain
Is null
Is empty
Is not null
Is not empty
And
And
Or
Equals
Equals
Not equals
Contains
Starts with
Ends with
Does not contain
Is null
Is empty
Is not null
Is not empty
Clear
Apply
Score
filter_alt
Filter
Equals
Equals
Not equals
Less than
Less than or equals
Greater than
Greater than or equals
Is null
Is not null
Custom
And
And
Or
Equals
Equals
Not equals
Less than
Less than or equals
Greater than
Greater than or equals
Is null
Is not null
Custom
Clear
Apply
Weighted
filter_alt
Filter
Equals
Equals
Not equals
Less than
Less than or equals
Greater than
Greater than or equals
Is null
Is not null
Custom
And
And
Or
Equals
Equals
Not equals
Less than
Less than or equals
Greater than
Greater than or equals
Is null
Is not null
Custom
Clear
Apply
No records to display.
An unhandled error has occurred.
Reload
🗙