EPFL @ ICLR 2026

EPFL @ ICLR 2026

The following EPFL papers have been accepted to ICLR 2026 (the Fourteenth International Conference on Learning Representations).
The conference will be held in Rio de Janeiro, Brazil from April 23-27, 2026.

Below is a list of ICLR 2026 papers with at least one EPFL author:

Selective Rotary Position Embedding by Sajad Movahedi, Arshia Afzal, Timur Carstensen, Frank Hutter, Antonio Orvieto, Volkan Cevher
Matching multiple experts: on the exploitability of multi-agent imitation learning by Antoine Bergerault, Volkan Cevher, Negar Mehr
ABBA-Adapters: Efficient and Expressive Fine-Tuning of Foundation Models by Raghav Singhal*, Kaustubh Ponkshe*, Rohit Vartak*, Praneeth Vepakomma
Safety Subspaces are Not Linearly Distinct: A Fine-Tuning Case Study by Kaustubh Ponkshe*, Shaan Shah*, Raghav Singhal*, Praneeth Vepakomma
Scaling Laws and Spectra of Shallow Neural Networks in the Feature Learning Regime by Leonardo Defilippis, Yizhou Xu, Julius Girardin, Vittorio Erba, Emanuele Troiani, Lenka Zdeborová, Bruno Loureiro, Florent Krzakala
Efficient Best-of-Both-Worlds Algorithms for Contextual Combinatorial Semi-Bandits by Mengmeng Li, Philipp J. Schneider, Jelisaveta Aleksic, Daniel Kuhn
PERK: Long-Context Reasoning as Parameter-Efficient Test-Time Learning by Zeming Chen, Angelika Romanou, Gail Weiss, Antoine Bosselut
GeoFAR: Geography-informed frequency-aware super-resolution for climate data by Chang Xu, Gencer Sumbuel, Li Mi, Robin Zbinden, and Devis Tuia
Learning to Weight Parameters for Data Attribution by Shuangqi Li, Hieu Le, Jingyi Xu, Mathieu Salzmann
Weight Decay may matter more than µP for Learning Rate Transfer in Practice by Atli Kosson, Jeremy Welborn, Yang Liu, Martin Jaggi, Xi Chen
MIAM: Modality Imbalance-Aware Masking for Multimodal Ecological Applications by Robin Zbinden*, Wesley Monteith-Finas*, Gencer Sumbuel, Nina van Tiel, Chiara Vanalli, and Devis Tuia
LLaVAction: evaluating and training multi-modal large language models for action understanding by Haozhe Qi*, Shaokai Ye*, Alexander Mathis, Mackenzie Weygandt Mathis
RL for Reasoning by Adaptively Revealing Rationales by Mohammad Hossein Amani, Aryo Lotfi, Nicolas Baldwin, Samy Bengio, Mehrdad Farajtabar, Emmanuel Abbe, Robert West
Narrow Finetuning Leaves Clearly Readable Traces in the Activation Differences by Julian Minder, Clément Dumas, Stewart Slocum, Helena Casademunt, Cameron Holmes, Robert West, Neel Nanda
Navigating the Accuracy-Size Trade-Off with Flexible Model Merging by Akash Dhasade, Divyansh Jhunjhunwala, Milos Vujasinovic, Gauri Joshi, Anne-Marie Kermarrec
Robust Federated Inference by Akash Dhasade, Sadegh Farhadkhani, Rachid Guerraoui, Nirupam Gupta, Maxime Jacovella, Anne-Marie Kermarrec, Rafael Pinot
HeurekaBench: A Benchmarking Framework for AI Co-scientist by Siba Smarak Panigrahi*, Jovana Videnovic*, Maria Brbic
When LLMs Speak with Confidence, Preference Alignment Gets Stronger by Myeongho Jeon*, Amirabbas Afzali*, Maria Brbic
Meta-RL Induces Exploration in Language Agents by Yulun Jiang*, Liangze Jiang*, Damien Teney, Michael Moor, Maria Brbic
Optimizing Agent Planning for Security and Autonomy by Aashish Kolluri, Rishi Sharma, Manuel Costa, Boris Köpf, Tobias Nießen, Mark Russinovich, Shruti Tople, Santiago Zanella-Beguelin
Mixture of Cognitive Reasoners: Modular Reasoning with Brain-Like Specialization by Badr AlKhamissi, C. Nicolò De Sabbata, Greta Tuckute, Zeming Chen, Martin Schrimpf, Antoine Bosselut
Inducing Dyslexia in Vision Language Models by Melika Honarmand, Ayati Sharma, Badr AlKhamissi, Johannes Mehrer, Martin Schrimpf
Model-Guided Microstimulation Steers Primate Visual Behavior by Johannes Mehrer, Ben Lonnqvist, Abdülkadir Gökce, Anna Mitola, Paolo Papale, Martin Schrimpf
Generating Directed Graphs with Dual Attention and Asymmetric Encoding by Alba Carballo-Castro, Manuel Madeira, Yiming Qin, Dorina Thanou, Pascal Frossard
Symmetry-Aware Bayesian Optimization via Max Kernels by Anthony Bardou, Antoine Gonon, Aryan Ahadinia, Patrick Thiran
SynthWorlds: Controlled Parallel Worlds for Disentangling Reasoning and Knowledge in Language Models by Ken Gu, Advait Bhat, Mike A Merrill, Robert West, Xin Liu, Daniel McDuff, Tim Althoff
Partition Generative Modeling: Masked Modeling Without Masks by Justin Deschenaux, Lan Tran, Caglar Gulcehre
The Diffusion Duality, Chapter II: Ψ-Samplers and Efficient Curriculum by Justin Deschenaux, Caglar Gulcehre, Subham Sekhar Sahoo
Loopholing Discrete Diffusion: Deterministic Bypass of the Sampling Wall by Mingyu Jo, Jaesik Yoon, Justin Deschenaux, Caglar Gulcehre, Sungjin Ahn
Non-Asymptotic Analysis of Efficiency in Conformalized Regression by Yunzhen Yao, Lie He, Michael Gastpar
Lookup multivariate Kolmogorov-Arnold Networks by Sergey Pozdnyakov, Philippe Schwaller
Beyond URLs: Metadata Diversity and Position for Efficient LLM Pretraining by Dongyang Fan*, Diba Hashemi*, Sai Praneeth Karimireddy, Martin Jaggi
Multimodality As Supervision: Self-Supervised Specialization To The Test Environment Via Multimodality by Kunal Pratap Singh*, Ali Garjani*, Rishubh Singh, Muhammad Uzair Khattak, Jason Toskov, Efe Tarhan, Andrei Atanov, Oghuzan Fatih Kar, Amir Zamir
Gradient-Normalized Smoothness for Optimization with Inexact Hessians by Andrei Semenov, Martin Jaggi, Nikita Doikov
JointDiff: Bridging Continuous and Discrete in Multi-Agent Trajectory Generation by Guillem Capellera, Luis Ferraz, Antonio Rubio, Alexandre Alahi, Antonio Agudo
Loc2: Interpretable Cross-View Localization via Depth-Lifted Local Feature Matching by Zimin Xia*, Chenghao Xu*, Alexandre Alahi
LayerSync: Self-aligning Intermediate Layers by Yasaman Haghighi*, Bastien Van Delft*, Mariam Hassan, Alexandre Alahi
Stable video infinity: Infinite-length video generation with error recycling by Wuyang Li, Wentao Pan, Po-Chien Luan, Yang Gao, Alexandre Alahi
Rap: 3d rasterization augmented end-to-end planning by Lan Feng, Yang Gao, Eloi Zablocki, Quanyi Li, Wuyang Li, Sichao Liu, Matthieu Cord, Alexandre Alahi
From Markov to Laplace: How Mamba In-Context Learns Markov Chains by Marco Bondaschi, Ashok Vardhan Makkuva, Nived Rajaraman, Xiuying Wei, Razvan Pascanu, Caglar Gulcehre, Michael Gastpar
Faster Gradient Methods for Highly-smooth Stochastic Bilevel Optimization by Lesi Chen, Junru Li, El Mahdi Chayti, Jingzhao Zhang
How Well Does GPT-4o Understand Vision? Evaluating Multimodal Foundation Models on Standard Computer Vision Tasks by Rahul Ramachandran, Ali Garjani, Roman Bachmann, Andrei Atanov, Oğuzhan Fatih Kar, Amir Zamir
Can Transformers Really Do It All? On the Compatibility of Inductive Biases Across Tasks by Damien Teney, Liangze Jiang, Hemanth Saratchandran, Simon Lucey
AbstRaL: Augmenting LLMs' Reasoning by Reinforcing Abstract Thinking by Silin Gao, Antoine Bosselut, Samy Bengio, Emmanuel Abbe
Dataset Distillation for Memorized Data: Soft Labels can Leak Held-Out Teacher Knowledge by Freya Behrens, Lenka Zdeborová
Statistical Advantage of Softmax Attention: Insights from Single-Location Regression by Odilon Duranthon, Pierre Marion, Claire Boyer, Bruno Loureiro, Lenka Zdeborová
Control Tax: The Price of Keeping AI in Check by Mikhail Terekhov, Zhen Ning David Liu, Caglar Gulcehre, Samuel Albanie
Adaptive Attacks on Trusted Monitors Subvert AI Control Protocols by Mikhail Terekhov, Alexander Panfilov, Daniil Dzenhaliou, Caglar Gulcehre, Maksym Andriushchenko, Ameya Prabhu, Jonas Geiping
Geometry-aware Policy Imitation by Yiming Li, Nael Darwiche, Amirreza Razmjoo, Sichao Liu, Yilun Du, Auke Ijspeert, Sylvain Calinon
The Hot Mess of AI: How Does Misalignment Scale With Model Intelligence and Task Complexity? by Alexander Hägele, Aryo Pradipta Gema, Henry Sleight, Ethan Perez, Jascha Sohl-Dickstein

∗Shared first authorship and equal contributions.

27.01.26

Links

ICLR 2026

News

Subscription

Receive an email for each new article

Share on