Moonshot AI
Kimi K2.5
Rank: #65
Score Overview
Overall Score: 61.6%
Accuracy: 60.0%
Syntax Valid: 100.0%
Score Dimensions
Correctness: 67
Best Practices: 73
Performance: 73
Clarity: 73
Performance by Complexity
Basic: 100.0%
Intermediate: 74.3%
Advanced: 36.4%
Tasks Correct: 18 / 30
Avg Response Time: 40712 ms
Task Breakdown
| Task | ID | Category | Score | Time | Status |
|---|---|---|---|---|---|
| Total Sales Amount | task-001 | aggregation | 100% | 6366 ms | |
| Count of Customers | task-002 | aggregation | 100% | 7330 ms | |
| Average Unit Price | task-003 | aggregation | 100% | 6622 ms | |
| Distinct Product Count | task-004 | aggregation | 100% | 6966 ms | |
| Total Order Quantity | task-005 | aggregation | 100% | 7564 ms | |
| Year-to-Date Sales | task-006 | time intelligence | 100% | 34542 ms | |
| Previous Year Sales | task-007 | time intelligence | 100% | 25252 ms | |
| Sales by Category Filter | task-008 | filtering | 100% | 13024 ms | |
| Year-over-Year Growth Percentage | task-009 | calculation | 10% | 33993 ms | Query Failed |
| Running Total with CALCULATE and FILTER | task-010 | iterator | 10% | 53045 ms | Query Failed |
| Sales Summary by Category | task-011 | table manipulation | 30% | 55509 ms | |
| Product List with Renamed Columns | task-012 | table manipulation | 100% | 10346 ms | |
| Union of High-Value Transactions | task-013 | table manipulation | 30% | 33240 ms | |
| Year-Category Analysis Matrix | task-014 | table manipulation | 100% | 19095 ms | |
| Product Percentage of Category Total | task-015 | context transition | 10% | 102652 ms | Query Failed |
| Virtual Relationship with TREATAS | task-016 | context transition | 10% | 25598 ms | Query Failed |
| Granularity-Aware Measure with VALUES | task-017 | context transition | 10% | 82874 ms | Query Failed |
| Running Count with EARLIER | task-018 | context transition | 30% | 33280 ms | |
| Multiple Filter Conditions | task-019 | filtering | 100% | 18323 ms | |
| Percentage of Total with ALLEXCEPT | task-020 | filtering | 10% | 106096 ms | Query Failed |
| Filter Intersection with KEEPFILTERS | task-021 | filtering | 10% | 54835 ms | Query Failed |
| Product Ranking with RANKX | task-022 | iterator | 100% | 43051 ms | |
| Top 5 Products with TOPN | task-023 | table manipulation | 30% | 88577 ms | |
| 90th Percentile Order Value | task-024 | iterator | 100% | 28828 ms | |
| Handle Missing Data with BLANK | task-025 | calculation | 100% | 12976 ms | |
| Safe Ratio with Cascading Fallbacks | task-026 | calculation | 100% | 61902 ms | |
| Safe Year-over-Year with Missing Data | task-027 | time intelligence | 100% | 47632 ms | |
| 3-Month Rolling Average | task-028 | time intelligence | 10% | 128006 ms | Query Failed |
| Same Month Previous Year Comparison | task-029 | time intelligence | 100% | 32053 ms | |
| Fiscal Year-to-Date (July Start) | task-030 | time intelligence | 100% | 41794 ms | |
Kimi K2.5 | DAXBench