Johannes Bertram

Johannes Bertram

Machine Learning & Neuroscience Researcher

Understanding mechanistic models of the brain.

About Me

Understanding biological and artificial visual systems

I am a machine learning researcher broadly interested in understanding biological and artificial visual systems. My work focuses on mechanistic models of the brain — studying how neural populations represent and transform sensory information, and what this tells us about the computational principles underlying perception.

I am a PhD student at the University of Tübingen, IMPRS-IS, and ELLIS Tübingen, in the Machine Learning in Science Group. My research combines neural population geometry, encoding and decoding manifold analysis, and foundation models of neural activity to probe the inner workings of biological and mechanistic visual systems.

When I'm not immersed in research, I enjoy long-distance running, hiking, and the game of Go.

Research Areas

Mechanistic Models of the Brain NeuroAI Neural Population Geometry Mouse Visual System Foundation Models Interpretability

Professional Experience

Building expertise across academia and industry

2026 - Present

PhD Student

IMPRS-IS Tübingen · University of Tübingen · ELLIS Tübingen

Machine Learning in Science Group

NeuroAI Mechanistic Models Neural Population Geometry
05/2026

Research Visit

IE Madrid

2025 - 03/2026

Graduate Research Assistant

ELLIS Institute and Max Planck Institute for Intelligent Systems,

Safety- & Efficiency-Aligned Learning Group

Principled AI Safety Benchmarking

  • Identification of simple, abstract test scenarios for AI safety evaluation
  • Benchmarking of SOTA models
  • Publication of benchmark for the community
PyTorch LMEval vLLM
2025 - Present

Engineering Lead

FulcrumCare - Yale Startup

Full-stack development of front desk automation software tools

  • Frontend and backend development
  • Database integration
  • API development
SQL Software Engineering Chatbots
2024

Graduate Research Assistant

Scalable Trustworthy AI Group - University of Tübingen

Training Data Attribution (TDA) research, teaching assistant for graduate-level trustworthy machine learning course

  • Evaluation of Training Data Attribution (TDA) experiments
  • Published research on TDA practitioner needs
  • Developed educational materials and taught trustworthy machine learning concepts
Explainable AI Academic Research Teaching
2021 - 2024

Research Assistant

Neurocognitive Modeling Group - University of Tübingen

Rain forecasting benchmark creation, modeling social inference, teaching assistant for cognitive modeling

  • Large-scale data management for rain forecasting benchmarks
  • Experiment design, implementation, and evaluation within the rational speech act framework
  • Teaching and grading
Benchmarking Experimental Design Teaching
2023

Deep Learning and Robotics Intern

FANUC Deutschland GmbH

Automatic in-time detection of external influences to a robot arm

  • Deep Learning model development for robotic perception
  • Robot arm control and manipulation
  • On-device inference optimization
PyTorch Robotics Deep Learning
2020 - 2021

Teaching Assistant for Mathematics

University of Tübingen

Assisted in teaching linear algebra and calculus to undergraduate students

  • Conducted tutorial sessions and graded assignments
Linear Algebra Calculus Teaching

Education

2023 - 2026

Master of Science in Machine Learning

University of Tübingen

Thesis: "How Neural is a Neural Foundation Model"

Advisors: Prof. Steven Zucker, Prof. Luciano Dyballa, Prof. Martin Butz, Prof. Matthias Bethge | GPA: 4.0/4.0

Machine Learning Deep Learning Interpretability NeuroAI
2025

Student Exchange

Yale University

Neural Networks Reinforcement Learning NLP Statistics
2019 - 2023

Bachelor of Science in Cognitive Science

University of Tübingen

Thesis: Using Epistemic Uncertainty as Intrinsic Reward for Exploration to Efficiently Learn Affordances for Action Planning

Advisor: Prof. Martin Butz | GPA: 3.93/4.0

Algorithms Data Structures Linear Algebra Probability

Research & Publications

Advancing the frontiers of machine learning

How "Neural" is a Neural Foundation Model?

arXiv preprint · NeurIPS 2025 Workshop: Data on the Brain and Mind

J. Bertram, L. Dyballa, T. A. Keller, S. Kinger, S. Zucker

We examine a state-of-the-art neural foundation model by analyzing how its individual components respond to visual stimuli, similar to how physiologists study biological neurons. Investigating three processing stages — encoder, recurrent module, and readout — we find that each exhibits distinct representational patterns. The recurrent module outperforms the encoder by better separating representations of different temporal stimulus patterns; a tubularity metric indicates biologically plausible temporal response development; while the readout module achieves accuracy through specialized feature maps rather than biologically realistic mechanisms.

Decoding Alignment without Encoding Alignment: A Critique of Similarity Analysis in Neuroscience

arXiv preprint

J. Bertram, L. Dyballa, T. A. Keller, S. Kinger, S. Zucker

We challenge the common assumption that representational geometry reflects entire neural populations. Similar decoding behavior and high alignment metrics can emerge from small, non-representative subpopulations of neurons. We propose encoding manifolds as a complementary analytical approach, and demonstrate using a controlled MNIST experiment that decoding metrics remain stable even when encoding topology is intentionally altered during training — suggesting that similarity in decoding does not imply similarity in function or computation.

Inferring Active Neural Circuits Using Diffusion Scores

arXiv preprint

S. Kinger, J. Bertram, L. Dyballa, E. Yemini, S. Zucker

We present a method for identifying directed neural interactions from population activity recordings without assuming specific dynamic models. Using denoising score models to estimate transitions between brain states, our Score-Block Time Graphs (SBTG) approach converts these into directed edge tests, recovers the Jacobian of state transition maps under nonlinear dynamics, and separates lag-specific circuit effects. Applied to whole-brain C. elegans calcium imaging, SBTG reveals circuit structures better aligned with connectome data and demonstrates cell-type temporal organization consistent with known neuromodulator kinetics.

Towards user-focused research in training data attribution for human-centered explainable ai

E. Nguyen, J. Bertram, E. Kortukov, J. Y. Song, S. J. Oh

While Explainable AI (XAI) aims to make AI understandable and useful to humans, it has been criticised for relying too much on formalism and solutionism, focusing more on mathematical soundness than user needs. We propose an alternative to this bottom-up approach inspired by design thinking: the XAI research community should adopt a top-down, user-focused perspective to ensure user relevance. We illustrate this with a relatively young subfield of XAI, Training Data Attribution (TDA). With the surge in TDA research and growing competition, the field risks repeating the same patterns of solutionism. We conducted a needfinding study with a diverse group of AI practitioners to identify potential user needs related to TDA. Through interviews (N=10) and a systematic survey (N=31), we uncovered new TDA tasks that are currently largely overlooked. We invite the TDA and XAI communities to consider these novel tasks and improve the user relevance of their research outcomes.

Quick and Accurate Affordance Learning

arXiv preprint

F. Scholz, E. Ayari, J. Bertram, M. V. Butz

Infants learn actively in their environments, shaping their own learning curricula. They learn about their environments' affordances, that is, how local circumstances determine how their behavior can affect the environment. Here we model this type of behavior by means of a deep learning architecture. The architecture mediates between global cognitive map exploration and local affordance learning. Inference processes actively move the simulated agent towards regions where they expect affordance-related knowledge gain. We contrast three measures of uncertainty to guide this exploration: predicted uncertainty of a model, standard deviation between the means of several models (SD), and the Jensen-Shannon Divergence (JSD) between several models. We show that the first measure gets fooled by aleatoric uncertainty inherent in the environment, while the two other measures focus learning on epistemic uncertainty. JSD exhibits the most balanced exploration strategy. From a computational perspective, our model suggests three key ingredients for coordinating the active generation of learning curricula: (1) Navigation behavior needs to be coordinated with local motor behavior for enabling active affordance learning. (2) Affordances need to be encoded locally for acquiring generalized knowledge. (3) Effective active affordance learning mechanisms should use density comparison techniques for estimating expected knowledge gain. Future work may seek collaborations with developmental psychology to model active play in children in more realistic scenarios.

Featured Projects

Bridging research and practical applications

Dental Appointment Chatbot

Dental Appointment Chatbot

A conversational AI chatbot designed to assist patients in scheduling dental appointments, providing information about services, and answering frequently asked questions. In progress work, MVP is publicly available. Closed-source productization and deployment in progress.

Python Azure SQL
NLP Analysis Tool

Principled Analysis of DeepRARE for Unsupervised Saliency Prediction

DeepRARE (Mancas, Kong, and Gosselin 2020) is a compelling approach to visual saliency prediction. It combines the traditional idea of low-level pop-out with modern high-level deep features. DeepRARE assumes high saliency for rare VGG16 (Simonyan and Zisserman 2015) features within an image without needing saliency supervision. This paper evaluates DeepRARE on the MIT/Tuebingen Saliency Benchmark (Kümmerer, Bylinskii, et al. n.d.) using the principled information gain metric and conducts a detailed error case inspection. Error case analysis reveals that human faces and text are insufficiently attended to. Adding face and text detectors improves model performance suggesting potential improvements of DeepRARE by using stronger pre-trained features. DeepRARE performs at least 15% better than low-level saliency models, but still 60% short of state-of-the-art models.

Visual Saliency Pytorch Benchmarking
DB Delays

Switching Perspectives: Analysing train delays considering train transfer and cancellation

Course project for "Data Literacy" at University of Tübingen.

Python Pandas Scraping

Get In Touch

Let's discuss research opportunities and collaborations

Let's Connect

I'm always interested in discussing new research opportunities, potential collaborations, or simply connecting with fellow researchers and practitioners in the ML community.

Email

johannes.bertram@yale.edu

Location

Tübingen, BW, Germany