Follow
Alec Koppel
Alec Koppel
Research Lead, JP Morgan AI Research
Verified email at jpmchase.com - Homepage
Title
Cited by
Cited by
Year
Global convergence of policy gradient methods to (almost) locally optimal policies
K Zhang, A Koppel, H Zhu, T Basar
SIAM Journal on Control and Optimization 58 (6), 3586-3612, 2020
2262020
A saddle point algorithm for networked online convex optimization
A Koppel, FY Jakubiec, A Ribeiro
IEEE Transactions on Signal Processing 63 (19), 5149-5164, 2015
1952015
A Class of Prediction-Correction Methods for Time-Varying Convex Optimization
A Simonetto, A Mokhtari, A Koppel, G Leus, A Ribeiro
IEEE Transactions on Signal Processing (submitted), 0
163*
Variational policy gradient method for reinforcement learning with general utilities
J Zhang, A Koppel, AS Bedi, C Szepesvari, M Wang
Advances in Neural Information Processing Systems 33, 4572-4583, 2020
1582020
On the sample complexity of actor-critic method for reinforcement learning with function approximation
H Kumar, A Koppel, A Ribeiro
Machine Learning 112 (7), 2433-2467, 2023
1192023
Proximity without consensus in online multi-agent optimization
A Koppel, BM Sadler, A Ribeiro
Proc. Int. Conf. Accoustics Speech Signal Proces (submitted),, 2016
922016
A Decentralized Prediction-Correction Method for Networked Time-Varying Convex Optimization
A Simonetto, A Mokhtari, A Koppel, G Leus, A Ribeiro
Computational Advances in Multi-Sensor Adaptive Processing, IEEE …, 2015
882015
Achieving zero constraint violation for constrained reinforcement learning via primal-dual approach
Q Bai, AS Bedi, M Agarwal, A Koppel, V Aggarwal
Proceedings of the AAAI Conference on Artificial Intelligence 36 (4), 3682-3689, 2022
752022
Decentralized online learning with kernels
A Koppel, S Paternain, C Richard, A Ribeiro
IEEE Transactions on Signal Processing 66 (12), 3240-3255, 2018
652018
Parsimonious online learning with kernels via sparse projections in function space
A Koppel, G Warnell, E Stump, A Ribeiro
The Journal of Machine Learning Research 20 (1), 83-126, 2019
59*2019
Parsimonious online learning with kernels via sparse projections in function space
A Koppel, G Warnell, E Stump, A Ribeiro
2017 IEEE International Conference on Acoustics, Speech and Signal …, 2017
472017
Consistent online gaussian process regression without the sample complexity bottleneck
A Koppel, H Pradhan, K Rajawat
Statistics and Computing 31, 1-18, 2021
402021
Asynchronous and parallel distributed pose graph optimization
Y Tian, A Koppel, AS Bedi, JP How
IEEE Robotics and Automation Letters 5 (4), 5819-5826, 2020
402020
D4L: Decentralized Dynamic Discrminative Dictionary Learning
A Koppel, G Warnell, E Stump, A Ribeiro
IEEE Transactions on Signal and Info. Processing over Networks, 2015
402015
Cautious reinforcement learning via distributional risk in the dual domain
J Zhang, AS Bedi, M Wang, A Koppel
IEEE Journal on Selected Areas in Information Theory 2 (2), 611-626, 2021
372021
MaxMin-RLHF: Towards equitable alignment of large language models with diverse human preferences
S Chakraborty, J Qiu, H Yuan, A Koppel, F Huang, D Manocha, AS Bedi, ...
arXiv preprint arXiv:2402.08925, 2024
342024
Policy Evaluation in Continuous MDPs with Efficient Kernelized Gradient Temporal Difference
A Koppel, G Warnell, E Stump, P Stone, A Ribeiro.
IEEE Transactions on Automatic Control 66 (4), 2020
33*2020
Asynchronous online learning in multi-agent systems with proximity constraints
AS Bedi, A Koppel, K Rajawat
IEEE Transactions on Signal and Information Processing over Networks 5 (3 …, 2019
302019
Asynchronous Decentralized Stochastic Optimization in Heterogeneous Networks
AS Bedi, A Koppel, K Rajawat
IEEE Trans. Signal Process (submitted)., 2017
29*2017
A variational approach to dual methods for constrained convex optimization
M Fazlyab, A Koppel, VM Preciado, A Ribeiro
2017 American Control Conference (ACC), 5269-5275, 2017
272017
The system can't perform the operation now. Try again later.
Articles 1–20