Host: Intel Core Ultra 9 285K · 24 cores
Platform: linux/amd64
Go: go1.26.0
CPython: python:3.13-slim
PyPy: pypy:3.10-slim
Runs / combo: 10 + 2 warmup

Smoke: integer addition

Sums 1..N in a tight loop. As the smallest workload in the suite, it exists to surface interpreter dispatch overhead with a minimal payload.

Compile time · median (cold)

median of 10 runs

Native Go (compiled): 178 ms, 394×
Piko interp (bytecode VM): 452 µs, baseline
CPython 3.13 (bytecode VM): 185 µs, 0.41×
PyPy 7.3 (tracing JIT): 166 µs, 0.37×
tengo (bytecode VM): 140 µs, 0.31×
scriggo (bytecode VM): 131 µs, 0.29×
mvm (bytecode VM): 138 µs, 0.30×
yaegi (AST walker): 193 µs, 0.43×

Full statistics

Runner                      N   Compile  Runtime  P95      Stddev   RSS     vs piko  Status
Native Go (compiled)        10  178 ms   24.9 µs  28.3 µs  1.01 µs  68 MiB  394×     OK
Piko interp (bytecode VM)   10  452 µs   444 µs   449 µs   2.77 µs  78 MiB  1.00×    OK
CPython 3.13 (bytecode VM)  10  185 µs   1.53 ms  1.84 ms  116 µs   n/a     0.41×    OK
PyPy 7.3 (tracing JIT)      10  166 µs   481 µs   510 µs   13.9 µs  n/a     0.37×    OK
tengo (bytecode VM)         10  140 µs   5.06 ms  5.66 ms  341 µs   84 MiB  0.31×    OK
scriggo (bytecode VM)       10  131 µs   4.48 ms  4.56 ms  537 µs   70 MiB  0.29×    OK
mvm (bytecode VM)           10  138 µs   2.81 ms  3.43 ms  511 µs   57 MiB  0.30×    OK
yaegi (AST walker)          10  193 µs   2.26 ms  2.28 ms  6.01 µs  58 MiB  0.43×    OK

Workload & symmetry rules

Workload

Sum 1..1,000,000 in a single tight loop, print the result mod 2^32.

Why this benchmark exists

It's the simplest possible interpreter workload. If dispatch dominates anything, it dominates this. End-to-end mode is therefore mostly startup time; inner-loop mode is mostly add+jump throughput.

Symmetry rules

  • One loop, two locals, no function calls in the hot path.
  • No sum() / numpy / strconv shortcuts.
Source code