I am a postdoctoral researcher at the Bethge Lab (Tübingen AI Center). My research is focused on advancing:
- Metacognitive Continual Learning: Developing agents with stateful inference that can reflect on their own reasoning. My focus is on: (i) building systems that monitor their internal reasoning process to identify stagnation, backtrack, and self-correct; and (ii) designing memory mechanisms that learn from past mistakes without catastrophic forgetting.
- Post-Training for Self-Improvement: While pretraining crystallizes existing knowledge, I hypothesize that post-training can be a general method for discovering new knowledge. I want to design better long-horizon RL trajectories as a principled, scalable approach to online data curation even in superhuman settings. My focus is on: (i) dense reward structures, (ii) intrinsic motivation by forecasting consequences of exploration.
- Long-Horizon Reasoning: I conceptually structure discovery of new things broadly needing (i) hierarchy and (ii) planning. Inspired by the DSPy philosophy, I want to automatically engineer scaffolds that: (i) hierarchical delegation to subagents, (ii) select best next steps by dense rewards, enabling on-the-fly planning guided by metacognition to enable effective exploration of massive search spaces.
- The Science of Benchmarking: We need to rethink benchmarks from being mere leaderboards to providing actionable diagnostics for model developers. By utilizing sample-wise evaluation and clustering failures, we can pinpoint actionable areas for improvement where: (i) your model fails, (ii) competitor models succeed. This provides diagnostics for identifying gaps in real-world capabilities and provides fine-grained measurements of capability progression and regression.
- LM-Agent-driven Forecasting: Can AI translate vague research intuitions into concrete, verifiable predictions about things we haven't discovered yet? I aim to post-train LLMs to weigh conflicting evidence in literature under high uncertainty to develop principled ways of forecasting the most viable paths for future scientific breakthroughs (or generally larger goals).
I'm currently on the job market; please contact me at ameya.prabhu@bethgelab.org.
Values and research philosophy
- Robust, empirical research: Results which just work in other contexts; avoiding toy setups and unscalable algorithms.
- Simple, principled algorithms: Papers which provide clarity over trial-and-error, validated in real settings — not under the streetlight of convenience.
- Design better incentives via mechanism design, so technology trends toward a fairer world without defaulting to gatekeepers.
Outside of research, I am a techno DJ and am currently learning to shuffle. I advocate for animal welfare, open research culture (JMLR statement, Cost of Knowledge). I believe the underlying open source philosophy, vital for maintaining non-extractive digital institutions (inspiration).