Akifumi Wachi
Chief Research Scientist, LY Corporation
Safe Reinforcement Learning in Constrained Markov Decision Processes
A Wachi, Y Sui
International Conference on Machine Learning (ICML), 2020
Safe Exploration and Optimization of Constrained MDPs Using Gaussian Processes
A Wachi, Y Sui, Y Yue, M Ono
AAAI Conference on Artificial Intelligence (AAAI), 6548-6556, 2018
Failure-scenario maker for rule-based agent using multi-agent adversarial reinforcement learning and its application to autonomous driving
A Wachi
International Joint Conference on Artificial Intelligence (IJCAI), 6006-6012, 2019
Neuro-symbolic reinforcement learning with first-order logic
D Kimura, M Ono, S Chaudhury, R Kohita, A Wachi, DJ Agravante, ...
arXiv preprint arXiv:2110.10963, 2021
Verbosity bias in preference labeling by large language models
K Saito, A Wachi, K Wataoka, Y Akimoto
arXiv preprint arXiv:2310.10076, 2023
Reinforcement learning with external knowledge by using logical neural networks
D Kimura, S Chaudhury, A Wachi, R Kohita, A Munawar, M Tatsubori, ...
arXiv preprint arXiv:2103.02363, 2021
Integral design method for simple and small Mars lander system using membrane aeroshell
R Sakagami, R Takahashi, A Wachi, Y Koshiro, H Maezawa, Y Kasai, ...
Acta Astronautica 144, 103-118, 2018
Safe policy optimization with local generalized linear function approximations
A Wachi, Y Wei, Y Sui
Advances in Neural Information Processing Systems 34, 20759-20771, 2021
LOA: Logical optimal actions for text-based interaction games
D Kimura, S Chaudhury, M Ono, M Tatsubori, DJ Agravante, A Munawar, ...
arXiv preprint arXiv:2110.10973, 2021
Mars entry, descent, and landing by small THz spacecraft via membrane aeroshell
A Wachi, R Takahashi, R Sakagami, Y Koshiro, Y Kasai, S Nakasuka
AIAA SPACE and Astronautics Forum and Exposition, 5313, 2017
Language-based general action template for reinforcement learning agents
R Kohita, A Wachi, D Kimura, S Chaudhury, M Tatsubori, A Munawar
Findings of the Association for Computational Linguistics: ACL-IJCNLP 2021 …, 2021
The conceptual design of a novel, small and simple Mars lander
R Takahashi, R Sakagami, A Wachi, Y Kasai, S Nakasuka
IEEE Aerospace Conference, 1-10, 2018
Safe exploration in reinforcement learning: A generalized formulation and algorithms
A Wachi, W Hashimoto, X Shen, K Hashimoto
Advances in Neural Information Processing Systems 36, 2024
Adversarial input generation using variational autoencoder
A Wachi
US Patent 11,715,016, 2023
Polar Embedding
R Iwamoto, R Kohita, A Wachi
Proceedings of the 25th Conference on Computational Natural Language …, 2021
Q-learning with language model for edit-based unsupervised summarization
R Kohita, A Wachi, Y Zhao, R Tachibana
arXiv preprint arXiv:2010.04379, 2020
Safe exploration in Markov decision processes with time-variant safety using spatio-temporal Gaussian process
A Wachi, H Kajino, A Munawar
arXiv preprint arXiv:1809.04232, 2018
Mars Micro-Satellite for Terahertz Remote Sensing
R Larsson, Y Kasai, T Kuroda, H Maezawa, T Manabe, T Nishibori, ...
EGU General Assembly Conference Abstracts, 18645, 2017
Low-Thrust Trajectory Design to Improve Overall Mission Success Probability Incorporating Target Changes in Case of Engine Failures
A Wachi
International Symposium on Space Flight Dynamics, 2017
Stepwise Alignment for Constrained Language Model Policy Optimization
A Wachi, TQ Tran, R Sato, T Tanabe, Y Akimoto
arXiv preprint arXiv:2404.11049, 2024