Hey there! I'm Giovanni, a second-year Computer Science PhD student at Cornell, advised by Yoav Artzi.
My research in Natural Language Processing investigates how to make Large Language Models more adaptive and efficient. Specifically, I focus on enabling LLMs to solve problems more efficiently (e.g., through KV cache compression or speculative decoding) and learn dynamically through interaction (e.g., through in-context reinforcement learning).
Previously, I earned my Master's in Computer Science at EPFL, where I worked with Bob West. I have interned as a researcher at Microsoft Research with Michel Galley, at Apple MLR with Edouard Grave, and as a data engineer at Amazon. I hold a Bachelor's from Politecnico di Milano.
*equal contribution
Breadcrumbs Reasoning: Memory-Efficient Reasoning with Compression Beacons ER @ NeurIPS 2025 Giovanni Monea, Yair Feldman, Shankar Padmanabhan, Kianté Brantley, Yoav Artzi
LLMs Are In-Context Bandit Reinforcement Learners COLM 2025 Giovanni Monea, Antoine Bosselut, Kianté Brantley, Yoav Artzi
PaSS: Parallel Speculative Sampling ENLSP @ NeurIPS 2023 Giovanni Monea, Armand Joulin, Edouard Grave
Breadcrumbs Reasoning: Memory-Efficient Reasoning with Compression Beacons ER @ NeurIPS 2025 Giovanni Monea, Yair Feldman, Shankar Padmanabhan, Kianté Brantley, Yoav Artzi
LLMs Are In-Context Bandit Reinforcement Learners COLM 2025 Giovanni Monea, Antoine Bosselut, Kianté Brantley, Yoav Artzi
Separating Tongue from Thought: Activation Patching Reveals Language-Agnostic Concept Representations in Transformers ACL 2025 Clément Dumas, Chris Wendler, Veniamin Veselovsky, Giovanni Monea, Robert West
Controllable Context Sensitivity and the Knob Behind It ICLR 2025 Julian Minder, Kevin Du, Niklas Stoehr, Giovanni Monea, Chris Wendler, Robert West, Ryan Cotterell
How Do Llamas Process Multilingual Text? A Latent Exploration through Activation Patching MI @ ICML 2024 Clément Dumas, Veniamin Veselovsky, Giovanni Monea, Robert West, Chris Wendler
A Glitch in the Matrix? Locating and Detecting Language Model Grounding with Fakepedia ACL 2024 Giovanni Monea, Maxime Peyrard, Martin Josifoski, Vishrav Chaudhary, Jason Eisner, Emre Kıcıman, Hamid Palangi, Barun Patra, Robert West
Do Llamas Work in English? On the Latent Language of Multilingual Transformers ACL 2024 🏆 Outstanding Paper Award Chris Wendler*, Veniamin Veselovsky*, Giovanni Monea*, Robert West*
PaSS: Parallel Speculative Sampling ENLSP @ NeurIPS 2023 Giovanni Monea, Armand Joulin, Edouard Grave