Loading model details...
GPT-5.3-Codex | DAXBench
Back to Leaderboard
OpenAI
GPT-5.3-Codex
HIGH
Rank
#8
Overview
Analytics
Score Overview
88.6%
Overall Score
86.7%
Accuracy
100.0%
Syntax Valid
Score Dimensions
90
Correctness
96
Best Practices
96
Performance
96
Clarity
Performance by Complexity
Basic
100.0%
Intermediate
87.2%
Advanced
86.2%
Tasks Correct
26 / 30
Avg Response Time
11268ms
Task Breakdown
Summary
Dimensions
Task
Category
Score
Time
Total Sales Amount
task-001
aggregation
100%
2523ms
Count of Customers
task-002
aggregation
100%
2747ms
Average Unit Price
task-003
aggregation
100%
4440ms
Distinct Product Count
task-004
aggregation
100%
1731ms
Total Order Quantity
task-005
aggregation
100%
1693ms
Year-to-Date Sales
task-006
time intelligence
100%
4187ms
Previous Year Sales
Query Failed
task-007
time intelligence
10%
3535ms
Sales by Category Filter
task-008
filtering
100%
2766ms
Year-over-Year Growth Percentage
task-009
calculation
100%
7367ms
Running Total with CALCULATE and FILTER
task-010
iterator
100%
9459ms
Sales Summary by Category
task-011
table manipulation
100%
7780ms
Product List with Renamed Columns
task-012
table manipulation
100%
5918ms
Union of High-Value Transactions
task-013
table manipulation
100%
12361ms
Year-Category Analysis Matrix
task-014
table manipulation
100%
7887ms
Product Percentage of Category Total
task-015
context transition
100%
9715ms
Virtual Relationship with TREATAS
task-016
context transition
100%
15676ms
Granularity-Aware Measure with VALUES
task-017
context transition
100%
17397ms
Running Count with EARLIER
task-018
context transition
30%
13429ms
Multiple Filter Conditions
task-019
filtering
100%
4408ms
Percentage of Total with ALLEXCEPT
task-020
filtering
100%
26299ms
Filter Intersection with KEEPFILTERS
task-021
filtering
100%
11031ms
Product Ranking with RANKX
task-022
iterator
100%
45271ms
Top 5 Products with TOPN
task-023
table manipulation
30%
34707ms
90th Percentile Order Value
task-024
iterator
30%
20969ms
Handle Missing Data with BLANK
task-025
calculation
100%
2215ms
Safe Ratio with Cascading Fallbacks
task-026
calculation
100%
23240ms
Safe Year-over-Year with Missing Data
task-027
time intelligence
100%
14361ms
3-Month Rolling Average
task-028
time intelligence
100%
16620ms
Same Month Previous Year Comparison
task-029
time intelligence
100%
4555ms
Fiscal Year-to-Date (July Start)
task-030
time intelligence
100%
3767ms