
Interfaces & Events

Interface definitions, implementations, event publishers and subscribers

Report generated: February 25, 2026 at 7:49 AM

Benchmark data: Feb 18, 2026 – Feb 25, 2026

Models: 15
Tasks: 8
Pass Rate: 78.3%

Model Rankings

| Model | pass@1 | pass@3 (additional) | pass@3 |
|---|---|---|---|
| Kimi K2.5 (t0.1) | 88% | 13% | 100.0% |
| Claude Opus 4.6 (t0.1) | 79% | 8% | 87.5% |
| Glm 5 (t0.1) | 75% | 12% | 87.5% |
| Deepseek V3.2 (t0.1) | 63% | 13% | 75.0% |
| Grok Code Fast 1 (t0.1) | 63% | 13% | 75.0% |
| Minimax M2.5 (t0.1) | 58% | 17% | 75.0% |

Model Performance

openrouter/moonshotai/kimi-k2.5

Runs:3
pass@1:87.5%
pass@3:100.0%
Consistency:75.0%
1st: 14, 2nd: 7, 8/8 passed
Temperature:0.1
Thinking:-
Tokens/run:32,327
Cost/run:$0.39

Known Shortcomings (11)

  • query-object-syntax 2x
  • event-subscriber-parameter-syntax 1x
  • page-extension-cardpageid-override 1x
  • parse-failure 1x
  • multiline-string-literals 1x
+6 more

gemini/gemini-3-pro-preview

Runs:3
pass@1:87.5%
pass@3:87.5%
Consistency:100.0%
1st: 20, 2nd: 1, Failed: 1, 7/8 passed
Temperature:0.1
Thinking:-
Tokens/run:38,623
Cost/run:$0.07

Known Shortcomings (23)

  • codeunit-syntax-structure 3x
  • multiline-string-literals 1x
  • inherent-permissions-syntax 1x
  • query-crossjoin-column-datasource 1x
  • complete-codeunit-generation 1x
+18 more

anthropic/claude-sonnet-4-5-20250929

Runs:3
pass@1:87.5%
pass@3:87.5%
Consistency:100.0%
1st: 21, Failed: 1, 7/8 passed
Temperature:0.1
Thinking:-
Tokens/run:11,531
Cost/run:$0.11

Known Shortcomings (14)

  • multiline-string-literals 1x
  • query-filter-element-syntax 1x
  • jsonobject-get-method-signature 1x
  • cross-join-dataitem-link-constraints 1x
  • reserved-keyword-as-variable-name 1x
+9 more

gemini/gemini-3.1-pro-preview

Runs:3
pass@1:83.3%
pass@3:87.5%
Consistency:87.5%
1st: 19, 2nd: 1, Failed: 1, 7/8 passed
Temperature:0.1
Thinking:-
Tokens/run:39,773
Cost/run:$0.04

Known Shortcomings (16)

  • empty-or-missing-code-generation 2x
  • empty-or-missing-code-generation 2x
  • empty-or-missing-code-generation 1x
  • table-extension-with-page-extension 1x
  • al-syntax-structure 1x
+11 more

anthropic/claude-opus-4-5-20251101@thinking=50000

Runs:3
pass@1:79.2%
pass@3:87.5%
Consistency:87.5%
1st: 18, 2nd: 1, Failed: 1, 7/8 passed
Temperature:0.1
Thinking:50,000
Tokens/run:24,029
Cost/run:$0.41

Known Shortcomings (10)

  • page-extension-with-table-extension 1x
  • reserved-keyword-as-parameter-name 1x
  • dictionary-iteration-syntax 1x
  • empty-or-malformed-code-generation 1x
  • temporary-table-parameter-handling 1x
+5 more

anthropic/claude-opus-4-6

Runs:3
pass@1:79.2%
pass@3:87.5%
Consistency:87.5%
1st: 18, 2nd: 1, Failed: 1, 7/8 passed
Temperature:0.1
Thinking:-
Tokens/run:30,003
Cost/run:$0.53

Known Shortcomings (8)

  • reserved-keyword-as-parameter-name 1x
  • cross-join-dataitem-link 1x
  • incomplete-procedure-body 1x
  • flowfield-calcfields-requirement 1x
  • parse-failure 1x
+3 more

openrouter/z-ai/glm-5

Runs:3
pass@1:75.0%
pass@3:87.5%
Consistency:62.5%
1st: 9, 2nd: 9, Failed: 1, 7/8 passed
Temperature:0.1
Thinking:-
Tokens/run:79,768
Cost/run:$1.09

Known Shortcomings (25)

  • query-object-syntax 2x
  • list-dictionary-of-interface-clear-method 1x
  • event-subscriber-event-name 1x
  • al-string-literal-escaping 1x
  • fluent-api-return-self-codeunit 1x
+20 more

openrouter/qwen/qwen3-max-thinking

Runs:3
pass@1:75.0%
pass@3:87.5%
Consistency:75.0%
1st: 10, 2nd: 8, Failed: 1, 7/8 passed
Temperature:0.1
Thinking:-
Tokens/run:13,722
Cost/run:$0.12

Known Shortcomings (27)

  • option-field-optionmembers-required 2x
  • variant-type-argument-and-interface-definition 2x
  • query-object-syntax 2x
  • enum-frominteger-syntax 1x
  • list-iteration-pattern 1x
+22 more

openai/gpt-5.3-codex

Runs:3
pass@1:75.0%
pass@3:75.0%
Consistency:100.0%
1st: 11, 2nd: 7, Failed: 2, 6/8 passed
Temperature:0.1
Thinking:-
Tokens/run:18,431
Cost/run:$0.11

Known Shortcomings (14)

  • query-object-syntax 2x
  • empty-or-missing-code-generation 2x
  • parse-failure 2x
  • dictionary-keys-method-signature 1x
  • al-syntax-basics 1x
+9 more

openai/gpt-5.2-2025-12-11@thinking=high

Runs:3
pass@1:75.0%
pass@3:75.0%
Consistency:100.0%
1st: 14, 2nd: 4, Failed: 2, 6/8 passed
Temperature:0.1
Thinking:high
Tokens/run:17,985
Cost/run:$0.16

Known Shortcomings (13)

  • interface-definition-syntax 2x
  • table-field-caption-property 2x
  • query-object-syntax 2x
  • query-crossjoin-syntax 2x
  • jsonvalue-type-checking-methods 2x
+8 more

openrouter/deepseek/deepseek-v3.2

Runs:3
pass@1:62.5%
pass@3:75.0%
Consistency:75.0%
1st: 14, 2nd: 1, Failed: 2, 6/8 passed
Temperature:0.1
Thinking:-
Tokens/run:20,372
Cost/run:$0.22

Known Shortcomings (29)

  • interface-definition-syntax 3x
  • application-area-in-page-extension-field 2x
  • reserved-keyword-as-variable-name 2x
  • query-cross-join-syntax 2x
  • jsonobject-get-vs-selecttoken 2x
+24 more

openrouter/x-ai/grok-code-fast-1

Runs:3
pass@1:62.5%
pass@3:75.0%
Consistency:75.0%
1st: 9, 2nd: 6, Failed: 2, 6/8 passed
Temperature:0.1
Thinking:-
Tokens/run:28,803
Cost/run:$0.27

Known Shortcomings (22)

  • query-object-syntax 2x
  • jsonobject-selecttoken-vs-get 2x
  • httpclient-getheaders-usage 2x
  • multiline-string-literals 1x
  • page-extension-cardpageid-override 1x
+17 more

openrouter/minimax/minimax-m2.5

Runs:3
pass@1:58.3%
pass@3:75.0%
Consistency:75.0%
1st: 7, 2nd: 7, Failed: 2, 6/8 passed
Temperature:0.1
Thinking:-
Tokens/run:26,164
Cost/run:$0.28

Known Shortcomings (32)

  • json-object-key-iteration 3x
  • interface-definition-syntax 2x
  • multiline-string-literals 2x
  • text-char-conversion-copystr 1x
  • page-object-definition 1x
+27 more

anthropic/claude-sonnet-4-6

Runs:3
pass@1:62.5%
pass@3:62.5%
Consistency:100.0%
1st: 15, Failed: 3, 5/8 passed
Temperature:0.1
Thinking:-
Tokens/run:44,033
Cost/run:$0.47

Known Shortcomings (5)

  • event-subscriber-event-name 1x
  • count-method-column-syntax 1x
  • code-truncation-incomplete-output 1x
  • jsonvalue-type-checking-api 1x
  • flowfield-calcsums-restriction 1x

openrouter/qwen/qwen3-coder-next

Runs:3
pass@1:16.7%
pass@3:25.0%
Consistency:87.5%
1st: 1, 2nd: 3, Failed: 6, 2/8 passed
Temperature:0.1
Thinking:-
Tokens/run:19,874
Cost/run:$0.16

Known Shortcomings (35)

  • codeunit-generation-empty-output 5x
  • interface-definition-syntax 4x
  • query-object-syntax 2x
  • reserved-keyword-as-parameter-name 2x
  • initvalue-vs-defaultvalue 1x
+30 more

Task Results Matrix

N/M = passed N of M runs

| Task | Description | Kimi K2.5 | Gemini 3 Pro | Claude Sonnet 4.5 | Gemini 3.1 Pro Preview | Claude Opus 4.5 (50K) | Claude Opus 4.6 | Glm 5 | Qwen3 Max Thinking | GPT-5.3 Codex | GPT-5.2 | Deepseek V3.2 | Grok Code Fast 1 | Minimax M2.5 | Claude Sonnet 4.6 | Qwen3 Coder Next |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| CG-AL-E008 | Create a simple AL interface called "Payment Processor" with ID 70000. The interface should define the following procedures: - ProcessPayment(Amount: Decimal; PaymentMethod: Text): Boolean - ValidatePayment(PaymentData: Text): Boolean - GetTransactionFee(Amount: Decimal): Decimal | 3/3 | 3/3 | 3/3 | 3/3 | 3/3 | 3/3 | 2/3 | 3/3 | 3/3 | 3/3 | 1/3 | 2/3 | 1/3 | 3/3 | 0/3 |
| CG-AL-E010 | Create a simple AL codeunit called "Item Event Subscriber" with ID 70001 that subscribes to Item table events. Create an event subscriber procedure that: - Subscribes to the OnAfterInsert event of the Item table - Displays a message when a new item is created - Includes proper EventSubscriber attributes | 3/3 | 3/3 | 3/3 | 3/3 | 3/3 | 3/3 | 2/3 | 3/3 | 3/3 | 3/3 | 3/3 | 1/3 | 0/3 | 0/3 | 3/3 |
| CG-AL-E032 | Create an interface called "CG Token Provider". | 3/3 | 3/3 | 3/3 | 3/3 | 3/3 | 3/3 | 3/3 | 3/3 | 3/3 | 3/3 | 2/3 | 3/3 | 3/3 | 3/3 | 0/3 |
| CG-AL-H010 | Create a codeunit with IntegrationEvent publishers and demonstrate proper event patterns. | 3/3 | 3/3 | 3/3 | 3/3 | 3/3 | 3/3 | 3/3 | 3/3 | 3/3 | 3/3 | 3/3 | 3/3 | 3/3 | 3/3 | 1/3 |
| CG-AL-H015 | 1. Define an Interface named "Payment Gateway". | 3/3 | 3/3 | 3/3 | 3/3 | 3/3 | 3/3 | 3/3 | 1/3 | 3/3 | 3/3 | 3/3 | 3/3 | 3/3 | 3/3 | 0/3 |
| CG-AL-H021 | Create AL objects demonstrating Lists and Dictionaries of interfaces (available in BC 2025 Wave 1). | 1/3 | 0/3 | 0/3 | 0/3 | 0/3 | 0/3 | 0/3 | 0/3 | 0/3 | 0/3 | 0/3 | 0/3 | 1/3 | 0/3 | 0/3 |
| CG-AL-H205 | Create a codeunit called "CG Line Amount Engine" with ID 70205. The codeunit must have Access = Public. | 3/3 | 3/3 | 3/3 | 3/3 | 3/3 | 3/3 | 3/3 | 3/3 | 3/3 | 3/3 | 3/3 | 3/3 | 3/3 | 3/3 | 0/3 |
| CG-AL-M009 | Create a comprehensive interface implementation for a shipping service. | 2/3 | 3/3 | 3/3 | 2/3 | 1/3 | 1/3 | 2/3 | 2/3 | 0/3 | 0/3 | 0/3 | 0/3 | 0/3 | 0/3 | 0/3 |
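
The report does not define pass@1, pass@3, or Consistency. The Python sketch below shows one reading that reproduces the figures listed for the models checked here: pass@1 as passed runs over all runs, pass@3 as the share of tasks passed in at least one run, and Consistency as the share of tasks whose runs all had the same outcome. These formulas are inferred from the numbers, and the helper name `summarize` is hypothetical, so treat this as an illustration and not as the benchmark's actual implementation.

```python
def summarize(cells):
    """Summarize one model's matrix column.

    cells: list of "N/M" strings (passed N of M runs), one per task.
    The three formulas are assumptions inferred from the report's numbers.
    """
    parsed = [tuple(int(part) for part in cell.split("/")) for cell in cells]
    total_runs = sum(m for _, m in parsed)
    passed_runs = sum(n for n, _ in parsed)
    tasks = len(parsed)
    return {
        # Share of individual runs that passed.
        "pass@1": 100 * passed_runs / total_runs,
        # Share of tasks that passed in at least one run.
        "pass@3": 100 * sum(1 for n, _ in parsed if n > 0) / tasks,
        # Share of tasks where every run agreed (all passed or all failed).
        "consistency": 100 * sum(1 for n, m in parsed if n in (0, m)) / tasks,
    }


# Kimi K2.5 column from the matrix, in task order E008 ... M009.
kimi = ["3/3", "3/3", "3/3", "3/3", "3/3", "1/3", "3/3", "2/3"]
print(summarize(kimi))
# {'pass@1': 87.5, 'pass@3': 100.0, 'consistency': 75.0}
```

The Kimi K2.5 result matches the 87.5%, 100.0%, and 75.0% shown in its Model Performance entry. The same function applied to the Glm 5 column gives 75.0%, 87.5%, and 62.5%.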