Socratic Tutoring LLM via Multi-Stage Policy Optimization
Fine-tuning open-source LLMs in multiple stages (SFT, offline DPO, online GRPO) to align them as Socratic tutors that guide students without leaking answers prematurely.
An internal RAG chatbot powered by a local LLM for confidential data handling, designed to assist employees with personalized services.
Won the User Experience Award at the Vakıfbank Hack to the Future Hackathon. A hub for organizational memory with local document and voice memo querying.
Won the 2025 Cherry New Grad Hackathon. An interactive map-based interface to help consumers find merchants offering Cherry’s services.
Applied RLHF techniques (GRPO, PPO) to optimize LLMs for low-resource languages, outperforming SFT and standard baselines. Ranked 4th out of 35.
Published as an arXiv preprint, 2025
A comprehensive ethical evaluation framework for open-source generative LLMs, assessing performance in Turkish and English.
Recommended citation: Yıldırım Özen, Erinç Çetin, Kaan Engür, Elif Naz Demiryılmaz, Çağrı Toraman. (2025). "OpenEthics: A Comprehensive Ethical Evaluation of Open-Source Generative Large Language Models." arXiv preprint.
Graduate program, ETH Zurich, MAS AID Program, 2025
Teaching Assistant for the ETH Zurich MAS in Applied Information and Data Science (AID) program, which is designed to enhance managers’ technical understanding of machine learning.