Loading model details...
Back to Leaderboard
Google
Gemini 2.5 Pro
Rank
#37
Overview
Analytics
Score Overview
64.4%
Overall Score
63.3%
Accuracy
80.0%
Syntax Valid
Score Dimensions
68
Correctness
93
Best Practices
70
Performance
69
Clarity
Performance by Complexity
Basic
99.7%
Intermediate
64.1%
Advanced
52.6%
Tasks Correct
19 / 30
Avg Response Time
11225ms
Task Breakdown
Summary
Dimensions
Task
Category
Score
Time
Total Sales Amount
task-001
aggregation
100%
5342ms
Count of Customers
task-002
aggregation
100%
16312ms
Average Unit Price
task-003
aggregation
100%
2785ms
Distinct Product Count
task-004
aggregation
100%
2993ms
Total Order Quantity
task-005
aggregation
100%
2610ms
Year-to-Date Sales
task-006
time intelligence
100%
8830ms
Previous Year Sales
task-007
time intelligence
100%
11530ms
Sales by Category Filter
task-008
filtering
100%
5091ms
Year-over-Year Growth Percentage
task-009
calculation
30%
8568ms
Running Total with CALCULATE and FILTER
task-010
iterator
100%
14395ms
Sales Summary by Category
task-011
table manipulation
100%
13748ms
Product List with Renamed Columns
task-012
table manipulation
100%
10441ms
Union of High-Value Transactions
Query Failed
task-013
table manipulation
10%
13604ms
Year-Category Analysis Matrix
task-014
table manipulation
100%
12911ms
Product Percentage of Category Total
Query Failed
task-015
context transition
10%
14725ms
Virtual Relationship with TREATAS
Query Failed
task-016
context transition
10%
9282ms
Granularity-Aware Measure with VALUES
Query Failed
task-017
context transition
10%
14122ms
Running Count with EARLIER
task-018
context transition
30%
12904ms
Multiple Filter Conditions
task-019
filtering
100%
6130ms
Percentage of Total with ALLEXCEPT
task-020
filtering
100%
14973ms
Filter Intersection with KEEPFILTERS
task-021
filtering
100%
13711ms
Product Ranking with RANKX
task-022
iterator
100%
15285ms
Top 5 Products with TOPN
Query Failed
task-023
table manipulation
10%
15090ms
90th Percentile Order Value
Query Failed
task-024
iterator
10%
12881ms
Handle Missing Data with BLANK
task-025
calculation
100%
9002ms
Safe Ratio with Cascading Fallbacks
Query Failed
task-026
calculation
10%
14353ms
Safe Year-over-Year with Missing Data
Query Failed
task-027
time intelligence
10%
13115ms
3-Month Rolling Average
Query Failed
task-028
time intelligence
10%
13077ms
Same Month Previous Year Comparison
task-029
time intelligence
100%
16940ms
Fiscal Year-to-Date (July Start)
task-030
time intelligence
100%
12004ms