Research
We investigate whether AI systems know what they know — measuring confidence, self-monitoring and change with the methods of clinical psychology and signal detection theory, so that deployment decisions rest on evidence rather than benchmark scores.
Research areas: MetacognitionEvaluation methodologyPsychophysicsFine-tuning
Metacognition & self-monitoring
Whether models know what they know: confidence validity, keep-or-withdraw behaviour, and self-monitoring mapped across domains and model families.
Evaluation methodology
How testing itself can mislead: positional artefacts, prompted underperformance, and change detection that survives the version treadmill.
Psychophysics of representations
What a century of perception science reveals inside model internals: magnitude, noise and category boundaries.
Fine-tuning for reliability
Making open-weight models more reliable at specific tasks: tuning that restores usable confidence, re-graded with the same public tests.
Does the system know what it knows? We mapped 33 models to find out.
Concurrent Criterion Validation of a Validity Screen
Models that pass the screen are safer when allowed to act on confidence. Models that fail are worse than a coin flip.
METHODOLOGY APR 2026Beyond the Mean: Reliable Change Detection
Averages hide item-level change between model versions. Clinical change statistics find it.
METACOGNITION MAY 2026Thinking Mode Induces Confidence Compression
Reasoning modes improve answers while compressing the confidence signal that oversight relies on.
The reliability of AI sold into Australian government
An annual report applying the atlas methodology to the systems agencies actually buy, in the configurations they run.
Publications
Dates are arXiv submission months. Published with open code and data; pre-registered where noted. These are single-author preprints, and we describe them that way. Plain-language lines are ours; the papers say it precisely. Author’s page: synthiumjp.github.io.
Put the instruments to work.
Commission the methods, re-run them yourself, or watch the screen run on a live model.
Request a technical briefing