Eval Plugins Demo
Reset All
Clear Evals
Non-Rag Evals
Rag Evals
Explain and Score Evals
Explain and Score Rag Evals
Prompts and Content
Topic
On what topic should the generated questions be?
Number
Number of questions to generate
Generate
Generate or add questions and then evaluate a system prompt
HelpfulAI sample
ChildishAI sample
NeutralAI sample
System Prompt
Enter the system prompt to evaluate
Add
gpt-4o-mini
gpt-3.5-turbo
gpt-4o-mini
Answer Model
gpt-4o-mini
gpt-3.5-turbo
gpt-4o-mini
Eval Model
Evaluate
Question
filter_alt
Filter
Contains
Equals
Not equals
Contains
Starts with
Ends with
Does not contain
Is null
Is empty
Is not null
Is not empty
And
And
Or
Equals
Equals
Not equals
Contains
Starts with
Ends with
Does not contain
Is null
Is empty
Is not null
Is not empty
Clear
Apply
Eval
filter_alt
Filter
Contains
Equals
Not equals
Contains
Starts with
Ends with
Does not contain
Is null
Is empty
Is not null
Is not empty
And
And
Or
Equals
Equals
Not equals
Contains
Starts with
Ends with
Does not contain
Is null
Is empty
Is not null
Is not empty
Clear
Apply
Score
filter_alt
Filter
Equals
Equals
Not equals
Less than
Less than or equals
Greater than
Greater than or equals
Is null
Is not null
Custom
And
And
Or
Equals
Equals
Not equals
Less than
Less than or equals
Greater than
Greater than or equals
Is null
Is not null
Custom
Clear
Apply
Weighted
filter_alt
Filter
Equals
Equals
Not equals
Less than
Less than or equals
Greater than
Greater than or equals
Is null
Is not null
Custom
And
And
Or
Equals
Equals
Not equals
Less than
Less than or equals
Greater than
Greater than or equals
Is null
Is not null
Custom
Clear
Apply
No records to display.
An unhandled error has occurred.
Reload
🗙