I am a 3rd year PhD student at The University of Edinburgh, advised by Amos Storkey as a member of the Bayesian and Neural Systems Group in the Institute for Adaptive and Neural Computation.
I am interested in getting reinforcement learning into the real world by focusing on:
- Offline settings (and variations therein)
- Representation learning for high-dimensional observation spaces
- Model-based methods
- Off-policy methods

SELECTED PUBLICATIONS
()♔: co-first authors
Click title for arXiv link
Title | Authors | Venue |
---|---|---|
Enhancing Tactile-based Reinforcement Learning for Robotic Control | Elle Miller, Trevor McInroe, David Abel, Oisin Mac Aodha, Sethu Vijayakumar | NeurIPS (2025) |
Studying the Interplay Between the Actor and Critic Representations in Reinforcement Learning | (Samuel Garcin & Trevor McInroe)♔, Pablo Samuel Castro, Christopher G. Lucas, David Abel, Prakash Panangaden, Stefano V Albrecht | ICLR (2025) |
Planning to Go Out-of-Distribution in Offline-to-Online Reinforcement Learning | Trevor McInroe, Adam Jelley, Stefano V. Albrecht, Amos Storkey | RLC (2024) |
Conditional Mutual Information for Disentangled Representations in Reinforcement Learning | Mhairi Dunion, Trevor McInroe, Kevin Luck, Josiah Hanna, Stefano V. Albrecht | NeurIPS (Spotlight) (2023) |
Temporal Disentanglement of Representations for Improved Generalisation in Reinforcement Learning | Mhairi Dunion, Trevor McInroe, Kevin Sebastian Luck, Josiah P. Hanna, Stefano V. Albrecht | ICLR (2023) |
LLM-Personalize: Aligning LLM Planners with Human Preferences via Reinforced Self-Training for Housekeeping Robots | Dongge Han, Trevor McInroe, Adam Jelley, Stefano V. Albrecht, Peter Bell, Amos Storkey | COLING (2025) |
Efficient Offline Reinforcement Learning: The Critic is Critical | Adam Jelley, Trevor McInroe, Sam Devlin, Amos Storkey | ICML (Workshop) (2024) |