Host
Intel Core Ultra 9 285K · 24 cores
Platform
linux/amd64
Go
go1.26.0
CPython
python:3.13-slim
PyPy
pypy:3.10-slim
Runs / combo
10 + 2 warmup

Smoke: integer addition

Sum 1..N in a tight loop, the smallest workload in the suite. It exists to surface interpreter dispatch overhead at the smallest possible payload.

Runtime · median per inner-loop window

median of 10 runs

Native Gocompiled
24.9 µs0.06×
Piko interpbytecode VM
444 µsbaseline
CPython 3.13bytecode VM
1.53 ms3.44×
PyPy 7.3tracing JIT
481 µs1.08×
Ttengobytecode VM
5.06 ms11.4×
Sscriggobytecode VM
4.48 ms10.1×
Mmvmbytecode VM
2.81 ms6.33×
YyaegiAST walker
2.26 ms5.09×

Full statistics

RunnerNCompileRuntimeP95StddevRSSvs pikoStatus
Native Gocompiled10178 ms24.9 µs28.3 µs1.01 µs68 MiB0.06×OK
Piko interpbytecode VM10452 µs444 µs449 µs2.77 µs78 MiB1.00×OK
CPython 3.13bytecode VM10185 µs1.53 ms1.84 ms116 µsn/a3.44×OK
PyPy 7.3tracing JIT10166 µs481 µs510 µs13.9 µsn/a1.08×OK
tengobytecode VM10140 µs5.06 ms5.66 ms341 µs84 MiB11.4×OK
scriggobytecode VM10131 µs4.48 ms4.56 ms537 µs70 MiB10.1×OK
mvmbytecode VM10138 µs2.81 ms3.43 ms511 µs57 MiB6.33×OK
yaegiAST walker10193 µs2.26 ms2.28 ms6.01 µs58 MiB5.09×OK
Workload & symmetry rules

Workload

Sum 1..1,000,000 in a single tight loop, print the result mod 2^32.

Why this benchmark exists

It's the simplest possible interpreter workload. If dispatch dominates anything, it dominates this. End-to-end mode is therefore mostly startup time; inner-loop mode is mostly add+jump throughput.

Symmetry rules

  • One loop, two locals, no function calls in the hot path.
  • No sum() / numpy / strconv shortcuts.
Source code