Loading model details...
Back to Leaderboard
Meta
Llama 3.2 1B Instruct
Rank
#60
Overview
Analytics
Score Overview
11.9%
Overall Score
6.7%
Accuracy
90.0%
Syntax Valid
Score Dimensions
17
Correctness
7
Best Practices
7
Performance
7
Clarity
Performance by Complexity
Basic
38.9%
Intermediate
8.3%
Advanced
6.1%
Tasks Correct
2 / 30
Avg Response Time
1807ms
Task Breakdown
Summary
Dimensions
Task
Category
Score
Time
Total Sales Amount
Query Failed
task-001
aggregation
10%
527ms
Count of Customers
Query Failed
task-002
aggregation
10%
423ms
Average Unit Price
task-003
aggregation
100%
489ms
Distinct Product Count
Query Failed
task-004
aggregation
10%
514ms
Total Order Quantity
task-005
aggregation
100%
462ms
Year-to-Date Sales
Query Failed
task-006
time intelligence
10%
538ms
Previous Year Sales
Query Failed
task-007
time intelligence
10%
604ms
Sales by Category Filter
Query Failed
task-008
filtering
10%
644ms
Year-over-Year Growth Percentage
Query Failed
task-009
calculation
10%
944ms
Running Total with CALCULATE and FILTER
Query Failed
task-010
iterator
10%
668ms
Sales Summary by Category
Query Failed
task-011
table manipulation
10%
1951ms
Product List with Renamed Columns
Query Failed
task-012
table manipulation
10%
564ms
Union of High-Value Transactions
Invalid Response
task-013
table manipulation
0%
637ms
Year-Category Analysis Matrix
Query Failed
task-014
table manipulation
10%
1324ms
Product Percentage of Category Total
Query Failed
task-015
context transition
10%
538ms
Virtual Relationship with TREATAS
Query Failed
task-016
context transition
10%
724ms
Granularity-Aware Measure with VALUES
Query Failed
task-017
context transition
10%
876ms
Running Count with EARLIER
Query Failed
task-018
context transition
10%
509ms
Multiple Filter Conditions
Query Failed
task-019
filtering
10%
571ms
Percentage of Total with ALLEXCEPT
Query Failed
task-020
filtering
10%
970ms
Filter Intersection with KEEPFILTERS
Query Failed
task-021
filtering
10%
601ms
Product Ranking with RANKX
Query Failed
task-022
iterator
10%
444ms
Top 5 Products with TOPN
Query Failed
task-023
table manipulation
10%
586ms
90th Percentile Order Value
truncation_error
task-024
iterator
0%
480ms
Handle Missing Data with BLANK
Query Failed
task-025
calculation
10%
476ms
Safe Ratio with Cascading Fallbacks
Query Failed
task-026
calculation
10%
611ms
Safe Year-over-Year with Missing Data
Query Failed
task-027
time intelligence
10%
919ms
3-Month Rolling Average
truncation_error
task-028
time intelligence
0%
34089ms
Same Month Previous Year Comparison
Query Failed
task-029
time intelligence
10%
826ms
Fiscal Year-to-Date (July Start)
Query Failed
task-030
time intelligence
10%
701ms