Arithmetic Addition - 2025

1. Probe description

In Term 1 (BOY), students answered by dragging and dropping the correct number (0–13) into the answer box. Because response times were slow at BOY, the task was changed in Term 3 (MOY) and Term 4 (EOY) to a multiple-choice format: students selected the correct answer from four options, with distractors generated from the most common BOY incorrect responses. As a result, the separate “speed test” used to familiarise students with the response format was no longer required at MOY/EOY.
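The distractor rule described above can be sketched as follows. This is a minimal illustration, not the code used to build the probe: the helper name `build_options`, the tie-breaking rule, the fallback to neighbouring numbers when fewer than three distinct errors were observed, and the example responses are all assumptions.

```python
from collections import Counter

def build_options(correct, boy_responses, n_options=4):
    """Return the correct answer plus distractors taken from the most
    common incorrect BOY responses (hypothetical helper)."""
    wrong = Counter(r for r in boy_responses if r != correct)
    # Most frequent errors first; ties broken by the smaller number (assumption).
    ranked = sorted(wrong, key=lambda r: (-wrong[r], r))
    distractors = ranked[: n_options - 1]
    # Assumed fallback: pad with the nearest unused numbers in the 0-13 range.
    candidates = sorted(range(0, 14), key=lambda r: (abs(r - correct), r))
    for c in candidates:
        if len(distractors) == n_options - 1:
            break
        if c != correct and c not in distractors:
            distractors.append(c)
    # Options are returned in ascending order; the on-screen order is not
    # described in the report.
    return sorted(distractors + [correct])

# Made-up BOY responses to the item 4+6 (correct answer 10).
responses = [10, 10, 9, 9, 9, 11, 11, 2, 10, 8]
print(build_options(10, responses))
```

Drawing distractors from observed errors keeps the wrong options plausible, so a correct choice is less likely to come from eliminating obviously wrong numbers.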

Term 1 example

Term 3 / Term 4 example

Table 1

Download Year 1 student data (wide format)

2. Overview of test results

Table 2
| Term | Test name | Test ID | Students | Median items attempted | Median accuracy (%)* | Median correct / min |
|---|---|---|---|---|---|---|
| 1 | Addition drag and drop | AADD_2025 | 2343 | 6/40 | 100 | 7.14 |
| 3 | Addition Multiple Choice - New | AADD_2025-NEW | 1489 | 9/40 | 100 | 10.80 |
| 4 | Addition Multiple Choice - (MOY) | AAMC_2025-MOY | 1065 | 11/40 | 100 | 12.24 |
* Accuracy is calculated after removing non-attempted items.
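To make the footnote and the "correct / min" column concrete, the sketch below computes both measures per student and then takes medians. The report does not state which time denominator is used for "correct / min", so `seconds_on_task` is an assumed input, and the student records are made up for illustration.

```python
from statistics import median

def student_summary(item_scores, seconds_on_task):
    """item_scores: 1 (correct), 0 (incorrect) or None (not attempted).
    seconds_on_task: time denominator in seconds (assumed definition)."""
    # Non-attempted items are removed before accuracy is calculated.
    attempted = [s for s in item_scores if s is not None]
    correct = sum(attempted)
    accuracy = 100 * correct / len(attempted) if attempted else None
    correct_per_min = correct / (seconds_on_task / 60)
    return len(attempted), accuracy, correct_per_min

# Three made-up students, each with 60 seconds on task.
students = [
    ([1, 1, 1, 1, 1, 1, None, None], 60),
    ([1, 0, 1, 1, None, None, None, None], 60),
    ([1, 1, 1, 1, 1, 1, 1, 1], 60),
]
rows = [student_summary(scores, secs) for scores, secs in students]
median_attempted = median(r[0] for r in rows)
median_accuracy = median(r[1] for r in rows)
median_fluency = median(r[2] for r in rows)
print(median_attempted, median_accuracy, median_fluency)
```

Because accuracy ignores items a student never reached, a slow but careful student can score 100% accuracy, which is why the fluency column separates students more than the accuracy column does.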

3. Accuracy distribution shows a strong ceiling effect across iterations

4. Faster responses with the alternate form in Term 3

Some of the speed-up may also reflect students who sat the test in Term 1 repeating it in Term 3, rather than the change of response format alone.


5. Fluency shows a wider spread of student ability

Term 1 drag and drop vs Term 3 multiple choice

The bottom quartile performs at roughly half the pace of the rest
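One way to check a statement like this is to compare the median fluency of the bottom quartile with the median of everyone else. The sketch below assumes that definition (the report does not say how the comparison was made) and uses made-up correct-per-minute values.

```python
from statistics import median, quantiles

def pace_ratio(correct_per_min):
    """Median fluency of the bottom quartile divided by that of the rest
    (assumed definition of 'pace')."""
    q1 = quantiles(correct_per_min, n=4)[0]  # 25th percentile
    bottom = [x for x in correct_per_min if x <= q1]
    rest = [x for x in correct_per_min if x > q1]
    return median(bottom) / median(rest)

# Made-up fluency values (correct items per minute).
fluency = [4, 5, 6, 8, 9, 10, 11, 12]
ratio = pace_ratio(fluency)
print(round(ratio, 2))
```

A ratio near 0.5 would correspond to the bottom quartile working at about half the pace of the other students.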

6. Item statistics

Note: Item discrimination

Item discrimination (point-biserial) measures how well an item separates higher-ability from lower-ability students. Values above ~0.3 are typically “good”, below ~0.2 “weak”. In timed tests, late items can look artificially strong because only fast/able students reach them.
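As a rough illustration of the statistic, the point-biserial is the Pearson correlation between a 0/1 item score and a numeric criterion. The sketch below uses each student's total score as the criterion and skips students who did not attempt the item; both choices are assumptions, since the report does not state how its values were computed, and the data are made up.

```python
from math import sqrt

def point_biserial(item, criterion):
    """Pearson correlation between a 0/1 item score and a numeric criterion.
    Students with item score None (not attempted) are skipped (assumption)."""
    pairs = [(x, y) for x, y in zip(item, criterion) if x is not None]
    n = len(pairs)
    if n < 2:
        return None
    mx = sum(x for x, _ in pairs) / n
    my = sum(y for _, y in pairs) / n
    sxy = sum((x - mx) * (y - my) for x, y in pairs)
    sxx = sum((x - mx) ** 2 for x, _ in pairs)
    syy = sum((y - my) ** 2 for _, y in pairs)
    if sxx == 0 or syy == 0:
        return None  # undefined when all item scores or all totals are equal
    return sxy / sqrt(sxx * syy)

# Made-up data: six students, the last did not reach the item.
item = [1, 1, 0, 1, 0, None]
total = [9, 7, 3, 8, 4, 2]
r = point_biserial(item, total)
print(round(r, 2))
```

With only a handful of responses the estimate is unstable, which is consistent with the table reporting NA for the items that very few students reached.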

Median RT and 95th RT are response times for correct responses, in seconds.
Question ID Question Flag No. responses % Correct Median RT 95th RT Item discrim.
Term 1 – Addition drag and drop (AADD_2025)
AADD_001 2+1 A1 2264 98.1 6.0 17.00 0.14
AADD_002 1+4 1A 2265 94.0 7.0 20.00 0.26
AADD_003 4+0 P0 2199 92.1 6.0 16.00 0.29
AADD_004 3+3 D 2073 93.4 6.0 15.00 0.27
AADD_005 4+6 M10 1676 79.6 10.0 22.00 0.33
AADD_006 8+1 A1 1414 94.6 6.0 12.00 0.33
AADD_007 2+3 C 1100 92.5 6.0 13.00 0.38
AADD_008 9+1 M10 893 97.1 4.0 8.00 0.35
AADD_009 2+7 C 573 85.7 6.0 13.00 0.37
AADD_010 7+5 C 279 69.2 7.0 14.40 0.34
AADD_011 5+1 A1 207 90.3 4.0 6.00 0.72
AADD_012 1+3 1A 135 91.9 3.0 5.00 0.73
AADD_013 0+2 P0 87 85.1 3.0 6.00 0.75
AADD_014 4+4 D 55 83.6 3.0 6.00 0.76
AADD_015 1+9 M10 35 91.4 2.0 7.25 0.59
AADD_016 7+1 A1 26 76.9 3.0 7.20 0.50
AADD_017 4+3 C 15 53.3 5.5 9.00 NA
AADD_018 6+4 M10 7 28.6 2.5 2.95 NA
AADD_019 6+2 C 4 0.0 NA NA NA
AADD_020 3+8 C 4 75.0 2.0 2.00 NA
AADD_021 3+1 A1 3 0.0 NA NA NA
AADD_022 1+5 1A 2 0.0 NA NA NA
AADD_023 3+0 P0 1 0.0 NA NA NA
AADD_024 2+2 D 2 0.0 NA NA NA
Term 3 – Addition Multiple Choice - New (AADD_2025-NEW)
AAMC-001 2+1 A1 1465 98.4 4.0 14.00 0.17
AAMC-002 1+4 1A 1447 95.6 3.0 12.00 0.18
AAMC-003 4+0 P0 1431 94.2 3.0 8.00 0.23
AAMC-004 3+3 D 1431 94.1 3.0 9.00 0.15
AAMC-005 4+6 M10 1342 80.7 7.0 21.00 0.27
AAMC-006 8+1 A1 1294 93.1 4.0 10.00 0.20
AAMC-007 2+3 C 1195 93.1 4.0 10.00 0.21
AAMC-008 9+1 M10 1117 95.1 3.0 8.00 0.20
AAMC-009 2+7 C 956 84.3 5.0 12.00 0.22
AAMC-010 7+5 C 710 70.7 7.0 17.00 0.24
AAMC-011 5+1 A1 595 95.3 3.0 5.00 0.35
AAMC-012 1+3 1A 508 95.5 3.0 5.00 0.32
AAMC-013 0+2 P0 419 92.1 3.0 6.00 0.36
AAMC-014 4+4 D 313 91.4 3.0 5.00 0.45
AAMC-015 1+9 M10 250 92.8 2.0 5.00 0.54
AAMC-016 7+1 A1 184 93.5 2.0 4.00 0.60
AAMC-017 4+3 C 132 87.9 3.0 6.00 0.50
AAMC-018 6+4 M10 103 84.5 2.0 4.70 0.61
AAMC-019 6+2 C 75 78.7 2.0 4.10 0.66
AAMC-020 3+8 C 49 71.4 3.0 7.00 0.73
AAMC-021 3+1 A1 37 81.1 2.0 3.55 0.50
AAMC-022 1+5 1A 25 72.0 1.0 3.15 0.69
AAMC-023 3+0 P0 21 47.6 2.0 4.10 0.77
AAMC-024 2+2 D 17 41.2 2.0 3.00 NA
AAMC-025 2+8 M10 8 37.5 2.0 2.00 NA
AAMC-026 9+1 A1 6 100.0 1.5 5.25 NA
AAMC-027 2+4 C 3 0.0 NA NA NA
AAMC-028 8+2 M10 6 33.3 7.5 8.85 NA
AAMC-029 3+5 C 3 0.0 NA NA NA
AAMC-030 8+4 C 2 0.0 NA NA NA
AAMC-031 4+1 A1 2 100.0 1.0 1.00 NA
AAMC-032 1+2 1A 2 50.0 2.0 2.00 NA
AAMC-033 0+5 P0 1 100.0 2.0 2.00 NA
AAMC-034 5+5 D 1 0.0 NA NA NA
AAMC-035 3+7 M10 1 0.0 NA NA NA
Term 4 – Addition Multiple Choice - (MOY) (AAMC_2025-MOY)
AAMC-001-copy 2+1 A1 1 0.0 NA NA NA
AAMC-006-copy 8+1 A1 1045 93.2 5.0 15.00 0.24
AAMC-007-copy 2+3 C 1028 91.8 4.0 13.00 0.19
AAMC-008-copy 9+1 M10 1025 95.8 3.0 9.00 0.19
AAMC-009-copy 2+7 C 987 84.6 6.0 17.00 0.29
AAMC-010-copy 7+5 C 920 73.7 7.0 19.00 0.23
AAMC-011-copy 5+1 A1 878 96.5 3.0 7.00 0.25
AAMC-012-copy 1+3 1A 850 96.9 3.0 7.00 0.17
AAMC-013-copy 0+2 P0 802 94.3 3.0 6.00 0.20
AAMC-014-copy 4+4 D 738 95.3 3.0 6.00 0.25
AAMC-015-copy 1+9 M10 672 95.2 3.0 6.00 0.27
AAMC-016-copy 7+1 A1 610 94.6 3.0 5.20 0.21
AAMC-017-copy 4+3 C 491 86.8 4.0 8.00 0.29
AAMC-018-copy 6+4 M10 397 86.6 3.0 7.00 0.46
AAMC-019-copy 6+2 C 312 88.1 3.0 7.00 0.44
AAMC-020-copy 3+8 C 202 78.7 4.0 7.00 0.53
AAMC-021-copy 3+1 A1 158 89.2 2.0 4.00 0.53
AAMC-022-copy 1+5 1A 134 90.3 2.0 4.00 0.57
AAMC-023-copy 3+0 P0 110 88.2 2.0 4.00 0.65
AAMC-024-copy 2+2 D 87 89.7 2.0 4.00 0.62
AAMC-025-copy 2+8 M10 64 81.2 2.0 3.00 0.74
AAMC-026-copy 9+1 A1 46 87.0 2.0 2.05 0.27
AAMC-027-copy 2+4 C 32 65.6 2.0 3.00 0.82
AAMC-028-copy 8+2 M10 25 64.0 1.0 2.25 0.90
AAMC-029-copy 3+5 C 22 77.3 2.0 3.00 0.75
AAMC-030-copy 8+4 C 15 60.0 2.0 3.00 NA
AAMC-031-copy 4+1 A1 13 84.6 1.0 2.00 NA
AAMC-032-copy 1+2 1A 12 58.3 1.0 3.40 NA
AAMC-033-copy 0+5 P0 8 62.5 2.0 2.00 NA
AAMC-034-copy 5+5 D 8 62.5 1.0 1.80 NA
AAMC-035-copy 3+7 M10 7 28.6 1.5 1.95 NA
AAMC-036-copy 6+1 A1 6 33.3 2.0 2.90 NA
AAMC-037-copy 3+2 C 6 33.3 1.5 1.95 NA
AAMC-038-copy 7+3 M10 4 25.0 2.0 2.00 NA
AAMC-039-copy 5+4 C 6 50.0 2.0 2.90 NA
AAMC-040-copy 2+9 C 4 75.0 1.0 1.00 NA
Flag Question group
1A First Addend Is 1
A1 Second Addend Is 1
C Common
D Double
M10 Make Ten
P0 One Addend Is 0

7. Item correct response time

AADD_2025 (T1)

AADD_2025-NEW (T3)

AAMC_2025-MOY (T4)

8. Question group performance

Make-ten and Common items are the most difficult

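Median RT and 95th RT are response times for correct responses, in seconds.

Term 1 – Addition drag and drop (AADD_2025)

| Question group | Avg items attempted | No. responses | % Correct | Median RT | 95th RT |
|---|---|---|---|---|---|
| Common | 1.8/12 | 1975 | 86.7 | 6 | 13 |
| Double | 1/4 | 2130 | 93.1 | 6 | 15 |
| First Addend Is 1 | 1.1/4 | 2402 | 93.8 | 6 | 20 |
| Make Ten | 1.5/8 | 2611 | 85.6 | 6 | 20 |
| One Addend Is 0 | 1/4 | 2287 | 91.8 | 6 | 16 |
| Second Addend Is 1 | 1.7/8 | 3914 | 96.2 | 5 | 15 |

Term 3 – Addition Multiple Choice - New (AADD_2025-NEW)

| Question group | Avg items attempted | No. responses | % Correct | Median RT | 95th RT |
|---|---|---|---|---|---|
| Common | 2.5/12 | 3125 | 84.2 | 5 | 13 |
| Double | 1.2/4 | 1762 | 93.1 | 3 | 8 |
| First Addend Is 1 | 1.4/4 | 1982 | 95.2 | 3 | 10 |
| Make Ten | 2/8 | 2827 | 87.3 | 4 | 16 |
| One Addend Is 0 | 1.3/4 | 1872 | 93.2 | 3 | 8 |
| Second Addend Is 1 | 2.4/8 | 3583 | 95.5 | 4 | 11 |

Term 4 – Addition Multiple Choice - (MOY) (AAMC_2025-MOY)

| Question group | Avg items attempted | No. responses | % Correct | Median RT | 95th RT |
|---|---|---|---|---|---|
| Common | 3.8/12 | 4025 | 83.8 | 5 | 15 |
| Double | 1.1/4 | 833 | 94.4 | 3 | 6 |
| First Addend Is 1 | 1.2/4 | 996 | 95.6 | 3 | 7 |
| Make Ten | 2.1/8 | 2194 | 92.8 | 3 | 7 |
| One Addend Is 0 | 1.1/4 | 920 | 93.3 | 3 | 6 |
| Second Addend Is 1 | 2.6/8 | 2757 | 94.0 | 3 | 11 |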
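The group rows appear to pool item-level responses by flag: for example, the Term 1 Double count of 2130 equals 2073 + 55 + 2 from the three Double items in the item table. A minimal sketch of that pooling is shown below. The record layout, the function names, and the linear-interpolation percentile are assumptions, and the responses are made up.

```python
from collections import defaultdict
from statistics import median

def percentile(values, p):
    """Percentile by linear interpolation between order statistics
    (an assumed method; the report does not name one)."""
    xs = sorted(values)
    k = (len(xs) - 1) * p
    lo = int(k)
    hi = min(lo + 1, len(xs) - 1)
    return xs[lo] + (xs[hi] - xs[lo]) * (k - lo)

def group_summary(responses):
    """responses: iterable of (flag, is_correct, rt_seconds) tuples."""
    by_flag = defaultdict(list)
    for flag, ok, rt in responses:
        by_flag[flag].append((ok, rt))
    out = {}
    for flag, rows in by_flag.items():
        # Response-time statistics use correct responses only.
        correct_rts = [rt for ok, rt in rows if ok]
        out[flag] = {
            "n": len(rows),
            "pct_correct": round(100 * len(correct_rts) / len(rows), 1),
            "median_rt": median(correct_rts) if correct_rts else None,
            "p95_rt": percentile(correct_rts, 0.95) if correct_rts else None,
        }
    return out

# Made-up responses: (flag, correct?, response time in seconds).
data = [
    ("M10", True, 7), ("M10", False, 12), ("M10", True, 5),
    ("D", True, 3), ("D", True, 4),
]
summary = group_summary(data)
print(summary["M10"])
```

Pooling responses rather than averaging item percentages weights each group towards its early items, which far more students reach.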