ICML 2024 Blog

Bilgehan Sel

bsel AT vt DOT edu

CV Google Scholar Github

I am a second-year PhD student in the Electrical and Computer Engineering Department at Virginia Tech advised by Ming Jin.

My research focuses on enhancing decision-making (e.g., question-answering, reinforcement learning agents, recommendation systems) with safety and ethical considerations in foundational models (e.g., multi-modal LLMs, robotic foundational models, retrieval-augmented generation models). I routinely use clusters with multi-GPU setups to speed up experiments and train/finetune large models, as seen in the publications section.



Selected Projects

Algorithm of Thoughts: Enhancing Exploration of Ideas in Large Language Models
Bilgehan Sel, Ahmad Al-Tawaha, Vanshaj Khattar, Ruoxi Jia, Ming Jin
Internation Conference on Machine Learning (ICML), 2024.
Skin-in-the-Game: Decision Making via Multi-Stakeholder Alignment in LLMs
Bilgehan Sel, Priya Shanmugasundaram, Mohammad Kachuee, Kun Zhou, Ruoxi Jia, Ming Jin
Association for Computational Linguistics (ACL), 2024.
Balance Reward and Safety Optimization for Safe Reinforcement Learning: A Perspective of Gradient Manipulation
Shangding Gu, Bilgehan Sel, Yuhao Ding, Lei Wang, Qingwei Lin, Ming Jin, Alois Knoll
Association for the Advancement of Artificial Intelligence (AAAI), 2024


Preprints

A Human-on-the-Loop Optimization Autoformalism Approach for Sustainability
Ming Jin, Bilgehan Sel, Hardeep, Wotao Yin
Preprint

All Publications

2023 Learning-to-Learn to Guide Random Search: Derivative-Free Meta Blackbox Optimization on Manifold
Bilgehan Sel, Ahmad Al-Tawaha, Yuhao Ding, Ruoxi Jia, Bo Ji, Javad Lavaei, Ming Jin
Learning for Dynamics and Control (L4DC), 2023
A CMDP-within-online framework for Meta-Safe Reinforcement Learning
Vanshaj Khattar, Yuhao Ding, Bilgehan Sel, Javad Lavaei, Ming Jin
International Conference on Learning Representations (ICLR), 2023
On Solution Functions of Optimization: Universal Approximation and Covering Number Bounds
Ming Jin, Vanshaj Khattar, Hardeep Kaushik, Bilgehan Sel, Ruoxi Jia
Association for the Advancement of Artificial Intelligence (AAAI), 2023
Decision-Focused Learning for Inverse Noncooperative Games: Generalization Bounds and Convergence Analysis
Ahmad Al-Tawaha, Hardeep Kaushik, Bilgehan Sel, Ruoxi Jia, Ming Jin
International Federation of Automatic Control (IFAC), 2023
Dynamic Modeling and Trajectory Tracking of a Quadcopter via Linear and Backstepping Controller
Uygar Gunes, Artun Sel, Bilgehan Sel, Cosku Kasnakoglu
AIAA SCITECH, 2023
2022 Magnetic field mapping of inaccessible regions using physics-informed neural networks
Umit H Coskun, Bilgehan Sel, Bradley Plaster
Scientific Reports, 2022
SOS-Based Nonlinear Observer Design for Simultaneous State and Disturbance Estimation Designed for a PMSM Model
Artun Sel, Bilgehan Sel, Umit Coskun, Cosku Kasnakoglu
Sustainability, 2022
2021 Comparative study of an EKF-based parameter estimation and a nonlinear optimization-based estimation on PMSM system identification
Artun Sel, Bilgehan Sel, Umit Coskun, Cosku Kasnakoglu
Energies, 2021
GLSDC based parameter estimation algorithm for a PMSM model
Artun Sel, Bilgehan Sel, Cosku Kasnakoglu
Energies, 2021