Mistral
Mistral Small 3.2 24B
Rank: #59
Score Overview
Overall Score: 19.9%
Accuracy: 20.0%
Syntax Valid: 33.3%
Tasks Correct: 6 / 30
Avg Response Time: 180829 ms

Score Dimensions
Correctness: 68
Best Practices: 77
Performance: 80
Clarity: 77

Performance by Complexity
Basic: 50.0%
Intermediate: 19.8%
Advanced: 9.7%
Task Breakdown
ID | Task | Category | Status | Score | Time (ms)
--- | --- | --- | --- | --- | ---
task-001 | Total Sales Amount | aggregation | Invalid Response | 0% | 600314
task-002 | Count of Customers | aggregation | truncation_error | 0% | 268351
task-003 | Average Unit Price | aggregation | - | 100% | 951
task-004 | Distinct Product Count | aggregation | - | 100% | 709
task-005 | Total Order Quantity | aggregation | - | 100% | 3454
task-006 | Year-to-Date Sales | time intelligence | - | 100% | 988
task-007 | Previous Year Sales | time intelligence | truncation_error | 0% | 240797
task-008 | Sales by Category Filter | filtering | truncation_error | 0% | 241138
task-009 | Year-over-Year Growth Percentage | calculation | Query Failed | 10% | 2475
task-010 | Running Total with CALCULATE and FILTER | iterator | truncation_error | 0% | 240934
task-011 | Sales Summary by Category | table manipulation | truncation_error | 0% | 241224
task-012 | Product List with Renamed Columns | table manipulation | Invalid Response | 0% | 57063
task-013 | Union of High-Value Transactions | table manipulation | truncation_error | 0% | 241149
task-014 | Year-Category Analysis Matrix | table manipulation | truncation_error | 0% | 273497
task-015 | Product Percentage of Category Total | context transition | truncation_error | 0% | 240844
task-016 | Virtual Relationship with TREATAS | context transition | truncation_error | 0% | 323800
task-017 | Granularity-Aware Measure with VALUES | context transition | Query Failed | 10% | 1299
task-018 | Running Count with EARLIER | context transition | truncation_error | 0% | 275020
task-019 | Multiple Filter Conditions | filtering | truncation_error | 0% | 241613
task-020 | Percentage of Total with ALLEXCEPT | filtering | truncation_error | 0% | 263026
task-021 | Filter Intersection with KEEPFILTERS | filtering | truncation_error | 0% | 285390
task-022 | Product Ranking with RANKX | iterator | truncation_error | 0% | 262773
task-023 | Top 5 Products with TOPN | table manipulation | - | 30% | 2591
task-024 | 90th Percentile Order Value | iterator | truncation_error | 0% | 310456
task-025 | Handle Missing Data with BLANK | calculation | truncation_error | 0% | 241183
task-026 | Safe Ratio with Cascading Fallbacks | calculation | truncation_error | 0% | 253977
task-027 | Safe Year-over-Year with Missing Data | time intelligence | - | 30% | 3966
task-028 | 3-Month Rolling Average | time intelligence | - | 100% | 2161
task-029 | Same Month Previous Year Comparison | time intelligence | - | 100% | 753
task-030 | Fiscal Year-to-Date (July Start) | time intelligence | truncation_error | 0% | 302960