Find link

language:

jump to random article

Find link is a tool written by Edward Betts.

searching for Deep reinforcement learning 71 found (89 total)

alternate case: deep reinforcement learning

Q-learning (3,785 words) [view diff] case mismatch in snippet view article find links to article

2015). "Deep Reinforcement Learning with Double Q-learning". arXiv:1509.06461 [cs.LG]. van Hasselt, Hado; Guez, Arthur; Silver, David (2015). "Deep reinforcement
Dharshan Kumaran (187 words) [view diff] exact match in snippet view article find links to article
more than 20,000 articles, is '"Human-level control through deep reinforcement learning" which he co-authored in 2015 with others including V Mnih, K
Adversarial machine learning (7,402 words) [view diff] exact match in snippet view article find links to article
information to the structure and type of model being used. Adversarial deep reinforcement learning is an active area of research in reinforcement learning focusing
Ansatz (656 words) [view diff] exact match in snippet view article find links to article
; Prati, E. (2019). "Coherent transport of quantum states by deep reinforcement learning". Communications Physics. 2 (1): 61. arXiv:1901.06603. Bibcode:2019CmPhy
Intelligent control (458 words) [view diff] exact match in snippet view article find links to article
supposed to capture the dynamics of a system. For the control part, deep reinforcement learning has shown its ability to control complex systems. Bayesian probability
Timothy Lillicrap (912 words) [view diff] case mismatch in snippet view article find links to article
David Silver, Daan Wierstra (2015). Continuous Control with Deep Reinforcement Learning. arXiv:1509.02971 Nicolas Heess, Jonathan J. Hunt, Timothy Lillicrap
Baher Abdulhai (1,928 words) [view diff] exact match in snippet view article find links to article
the impacts of AVs on the capacities of highway systems. Using deep reinforcement learning and high dimensional sensory inputs, he performed a case study
Cognitive architecture (1,252 words) [view diff] case mismatch in snippet view article find links to article
Wierstra, Daan; Riedmiller, Martin (2013). "Playing Atari with Deep Reinforcement Learning". arXiv:1312.5602 [cs.LG]. Mnih, Volodymyr; Kavukcuoglu, Koray;
ACM Prize in Computing (104 words) [view diff] exact match in snippet view article find links to article
to robot learning, including learning from demonstrations and deep reinforcement learning for robotic control. 2020 Scott Aaronson For groundbreaking contributions
Palletizer (539 words) [view diff] case mismatch in snippet view article find links to article
position on the pallet. In recent years, some research has utilized Deep Reinforcement Learning, where robotic agents aim to learn an optimal placement position
Dorothy Okello (1,176 words) [view diff] exact match in snippet view article find links to article
published in the 2020 IST-Africa Conference (IST-Africa) (3) A deep reinforcement learning-based algorithm for reliability-aware multi-domain service deployment
Machine learning in video games (3,879 words) [view diff] exact match in snippet view article find links to article
state of the art machine learning techniques such as relational deep reinforcement learning, long short-term memory, auto-regressive policy heads, pointer
Lit pool (261 words) [view diff] exact match in snippet view article find links to article
making and incentives design in the presence of a dark pool: a deep reinforcement learning approach". arXiv:1912.01129 [q-fin.MF]. Palmer, Max (2010-03-20)
David Silver (computer scientist) (713 words) [view diff] exact match in snippet view article
Silver; et al. (25 February 2015). "Human-level control through deep reinforcement learning". Nature. 518 (7540): 529–533. doi:10.1038/NATURE14236. ISSN 1476-4687
Maluuba (1,266 words) [view diff] exact match in snippet view article find links to article
Maluuba published a research paper learning dialogue policies with deep reinforcement learning. In 2016, Maluuba also freely released the Frames dataset, which
Networked-loan (3,353 words) [view diff] exact match in snippet view article find links to article
with deep reinforcement learning integrated with high-order graph message-passing networks. It uses the framework of deep reinforcement learning to learn
RFM (market research) (866 words) [view diff] case mismatch in snippet view article
Tkachenko, Yegor. Autonomous CRM Control via CLV Approximation with Deep Reinforcement Learning in Discrete and Continuous Action Space. (April 8, 2015). arXiv
Apprenticeship learning (1,336 words) [view diff] exact match in snippet view article find links to article
Leike, J., Brown, T., Martic, M., Legg, S., & Amodei, D. (2017). Deep reinforcement learning from human preferences. In Advances in Neural Information Processing
Beam tilt (753 words) [view diff] case mismatch in snippet view article find links to article
"Online Antenna Tuning in Heterogeneous Cellular Networks With Deep Reinforcement Learning". IEEE Transactions on Cognitive Communications and Networking
Demis Hassabis (4,994 words) [view diff] exact match in snippet view article find links to article
learning and reinforcement learning, and pioneered the field of deep reinforcement learning which combines these two methods. Hassabis has predicted that
DeepStack (675 words) [view diff] exact match in snippet view article find links to article
Bakhtin, Anton; Lerer, Adam; Gong, Qucheng (2020). "Combining deep reinforcement learning and search for imperfect-information games". Advances in Neural
Daniel Kroening (589 words) [view diff] case mismatch in snippet view article find links to article
"Deepsynth: Automata Synthesis for Automatic Task Segmentation in Deep Reinforcement Learning". AAAI 2020, Vol. 35, No. 9, pages 7647-7656. Vijay D’Silva,
Language creation in artificial intelligence (769 words) [view diff] case mismatch in snippet view article find links to article
Batra, D. (2017). Learning Cooperative Visual Dialog Agents with Deep Reinforcement Learning. arXiv preprint arXiv:1703.06585. Johnson, M., Schuster, M.,
Swarm robotics (2,264 words) [view diff] case mismatch in snippet view article find links to article
Multi-Robot Autonomous Exploration in Unknown Environments via Deep Reinforcement Learning" IEEE Transactions on Vehicular Technology, 2020. Hu, J.; Turgut
Microswimmer (14,639 words) [view diff] exact match in snippet view article find links to article
trapped in certain flow structures by learning smart gravitaxis. Deep reinforcement learning has been used to explore microswimmer navigation problems in
Seega (game) (905 words) [view diff] case mismatch in snippet view article
Player for Seejeh (A.K.A Seega, Siga, Kharbga) Board Game with Deep Reinforcement Learning". Procedia Computer Science. 160: 241–247. doi:10.1016/j.procs
Chainer (856 words) [view diff] exact match in snippet view article find links to article
previous record held by Facebook. ChainerRL adds state of art deep reinforcement learning algorithms, and ChainerUI is a management and visualization tool
Paul Christiano (researcher) (1,172 words) [view diff] case mismatch in snippet view article
single charity. At OpenAI, Christiano co-authored the paper "Deep Reinforcement Learning from Human Preferences" (2017) and other works developing reinforcement
InterQuest Group Ltd (1,073 words) [view diff] case mismatch in snippet view article find links to article
Conference". Retrieved 27 November 2019. "Step into the AI Era: Deep Reinforcement Learning Workshop". Retrieved 27 November 2019. "UX Sessions". Retrieved
Proximal policy optimization (2,082 words) [view diff] case mismatch in snippet view article find links to article
05477 “A Beginner’s Guide to deep Reinforcement learning,” Pathmind. https://wiki.pathmind.com/deep-reinforcement-learning#reward Q. T. Luu, “Q-learning
Active learning (machine learning) (2,358 words) [view diff] case mismatch in snippet view article
https://arxiv.org/abs/2303.01560v2 Learning how to Active Learn: A Deep Reinforcement Learning Approach, Meng Fang, Yuan Li, Trevor Cohn, https://arxiv.org/abs/1708
Montezuma's Revenge (video game) (1,392 words) [view diff] exact match in snippet view article
Petersen, Stig (February 2015). "Human-level control through deep reinforcement learning". Nature. 518 (7540): 529–533. Bibcode:2015Natur.518..529M. doi:10
Princeton Plasma Physics Laboratory (2,137 words) [view diff] exact match in snippet view article find links to article
Egemen (2024). "Avoiding fusion plasma tearing instability with deep reinforcement learning". Nature. 626 (8000): 746–751. doi:10.1038/s41586-024-07024-9
Dorin Comaniciu (834 words) [view diff] case mismatch in snippet view article find links to article
Andreas; Hornegger, Joachim; Comaniciu, Dorin (2019). "Multi-Scale Deep Reinforcement Learning for Real-Time 3D-Landmark Detection in CT Scans". IEEE Transactions
Artificial intelligence (22,441 words) [view diff] exact match in snippet view article find links to article
against four of the world's best Gran Turismo drivers using deep reinforcement learning. Finance is one of the fastest growing sectors where applied
Nested sampling algorithm (2,160 words) [view diff] exact match in snippet view article find links to article
framework for uncertainty quantification, optimization, and deep reinforcement learning, which also implements nested sampling. Since nested sampling
Mahjong and artificial intelligence (555 words) [view diff] case mismatch in snippet view article find links to article
Yang; Li Zhao; Tao Qin; Tie-Yan Liu; Hsiao-Wuen Hon (2020-04-01). "Suphx: Mastering Mahjong with Deep Reinforcement Learning". arXiv:2003.13590 [cs.AI].
Customer lifetime value (2,890 words) [view diff] case mismatch in snippet view article find links to article
Tkachenko, Yegor. Autonomous CRM Control via CLV Approximation with Deep Reinforcement Learning in Discrete and Continuous Action Space. (April 8, 2015). arXiv
Distributional Soft Actor Critic (321 words) [view diff] case mismatch in snippet view article find links to article
et al. (2018). "Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor". ICML. Wang, Wenxuan; et al. (2023)
Google Brain (3,833 words) [view diff] exact match in snippet view article find links to article
ISSN 1941-0468. Gu, S.; Holly, E.; Lillicrap, T.; Levine, S. (May 2017). "Deep reinforcement learning for robotic manipulation with asynchronous off-policy updates"
Evaluation function (2,438 words) [view diff] case mismatch in snippet view article find links to article
ICCA Journal Lai, Matthew (4 September 2015), Giraffe: Using Deep Reinforcement Learning to Play Chess, arXiv:1509.01549v1 "Neural network topology".
Gregory Dudek (1,273 words) [view diff] exact match in snippet view article find links to article
decision-making under uncertainty, using techniques including deep reinforcement learning and probabilistic modelling. Dudek has participated in the organization
Convolutional neural network (15,064 words) [view diff] exact match in snippet view article find links to article
research described an application to Atari 2600 gaming. Other deep reinforcement learning models preceded it. Convolutional deep belief networks (CDBN)
Pushmeet Kohli (1,045 words) [view diff] exact match in snippet view article find links to article
(February 2022). "Magnetic control of tokamak plasmas through deep reinforcement learning". Nature. 602 (7897): 414–419. Bibcode:2022Natur.602..414D. doi:10
AI alignment (11,625 words) [view diff] case mismatch in snippet view article find links to article
Jacob; Krueger, David (June 28, 2022). "Goal Misgeneralization in Deep Reinforcement Learning". Proceedings of the 39th International Conference on Machine
IIT Madras (6,959 words) [view diff] exact match in snippet view article find links to article
one of the country's largest groups in network analytics and deep reinforcement learning. Google has granted IIT Madras $1 million for setting up India's
Edward Y. Chang (2,492 words) [view diff] exact match in snippet view article find links to article
, Chang, E. Y. (2018). Refuel: Exploring sparse features in deep reinforcement learning for fast disease diagnosis. In Advances in Neural Information
AlphaDev (1,132 words) [view diff] exact match in snippet view article find links to article
Silver, David (2023). "Faster sorting algorithms discovered using deep reinforcement learning". Nature. 618: 257–263. doi:10.1038/s41586-023-06004-9. PMC 10247365
MuJoCo (318 words) [view diff] case mismatch in snippet view article find links to article
Jorge Pena; Westerlund, Tomi (2020). "Sim-to-Real Transfer in Deep Reinforcement Learning for Robotics: A Survey". 2020 IEEE Symposium Series on Computational
Machine learning (14,693 words) [view diff] case mismatch in snippet view article find links to article
Multi-Robot Autonomous Exploration in Unknown Environments via Deep Reinforcement Learning". IEEE Transactions on Vehicular Technology. 69 (12): 14413–14423
OpenAI (15,431 words) [view diff] exact match in snippet view article find links to article
(MOBA) games and how OpenAI Five has demonstrated the use of deep reinforcement learning (DRL) agents to achieve superhuman competence in Dota 2 matches
Rubik's Cube (10,383 words) [view diff] case mismatch in snippet view article find links to article
Prati (2021). "Solving Rubik's Cube via Quantum Mechanics and Deep Reinforcement Learning". Journal of Physics A: Mathematical and Theoretical. 54 (5):
List of volunteer computing projects (4,255 words) [view diff] exact match in snippet view article find links to article
2018-03-04 Software testing, chess Trains chess neural networks with deep reinforcement learning. Experiments with training parameters and net architectures No
Mahjong (13,142 words) [view diff] case mismatch in snippet view article find links to article
Hsiao-Wuen (31 March 2020). "Suphx: Mastering Mahjong with Deep Reinforcement Learning". arXiv:2003.13590 [cs.AI]. "Top-grossing". Facebook. Retrieved
Internet of things (19,751 words) [view diff] exact match in snippet view article find links to article
driving force for autonomous IoT. An approach in this context is deep reinforcement learning where most of IoT systems provide a dynamic and interactive environment
Curriculum learning (1,366 words) [view diff] exact match in snippet view article find links to article
Curriculum learning for heterogeneous star network embedding via deep reinforcement learning. pp. 468–476. doi:10.1145/3159652.3159711. hdl:2142/101634.
Reward hacking (1,505 words) [view diff] exact match in snippet view article find links to article
Yuval Tassa, Tom Erez, and Martin Riedmiller. "Data-efficient deep reinforcement learning for dexterous manipulation." arXiv preprint arXiv:1704.03073
Quantum machine learning (10,293 words) [view diff] case mismatch in snippet view article find links to article
Xiaoli; Goan, Hsi-Sheng (2020). "Variational Quantum Circuits for Deep Reinforcement Learning". IEEE Access. 8: 141007–141024. arXiv:1907.00397. Bibcode:2020IEEEA
Hover (behaviour) (1,809 words) [view diff] exact match in snippet view article
2023). "Exploring storm petrel pattering and sea-anchoring using deep reinforcement learning". Bioinspiration & Biomimetics. 18 (6). University of Portland
Tokamak (14,070 words) [view diff] exact match in snippet view article find links to article
(February 2024). "Avoiding fusion plasma tearing instability with deep reinforcement learning". Nature. 626 (8000): 746–751. Bibcode:2024Natur.626..746S. doi:10
Generative adversarial network (14,084 words) [view diff] exact match in snippet view article find links to article
enforce the alignment of the latent feature space, such as in deep reinforcement learning. This works by feeding the embeddings of the source and target
Fusion power (20,836 words) [view diff] exact match in snippet view article find links to article
address fusion heating, measurement, and power production. A deep reinforcement learning system has been used to control a tokamak-based reactor. The
Federated learning (5,963 words) [view diff] case mismatch in snippet view article find links to article
Guo, Weisi; Nallanathan, Arumugam; Wu, Qihui (2021). "Green Deep Reinforcement Learning for Radio Resource Management: Architecture, Algorithm Compression
Occupant-centric building controls (1,910 words) [view diff] exact match in snippet view article find links to article
associated with thermal comfort and indoor air control via a deep reinforcement learning algorithm". Building and Environment. 155: 105–117. doi:10.1016/j
Timeline of computing 2020–present (23,329 words) [view diff] exact match in snippet view article find links to article
Scaramuzza, Davide (August 2023). "Champion-level drone racing using deep reinforcement learning". Nature. 620 (7976): 982–987. Bibcode:2023Natur.620..982K. doi:10
Drones in wildfire management (4,517 words) [view diff] case mismatch in snippet view article find links to article
Mousavi, Seyed Sajad; Schukat, Michael; Howley, Enda (2018). "Deep Reinforcement Learning: An Overview". Proceedings of SAI Intelligent Systems Conference
Glossary of engineering: M–Z (31,123 words) [view diff] case mismatch in snippet view article find links to article
Multi-Robot Autonomous Exploration in Unknown Environments via Deep Reinforcement Learning" IEEE Transactions on Vehicular Technology, 2020. Feynman, Richard
Applications of artificial intelligence (20,753 words) [view diff] exact match in snippet view article find links to article
Hassabis, Demis (26 February 2015). "Human-level control through deep reinforcement learning". Nature. 518 (7540): 529–533. Bibcode:2015Natur.518..529M. doi:10
2023 in science (44,482 words) [view diff] exact match in snippet view article find links to article
Scaramuzza, Davide (August 2023). "Champion-level drone racing using deep reinforcement learning". Nature. 620 (7976): 982–987. Bibcode:2023Natur.620..982K. doi:10
AI safety (9,544 words) [view diff] case mismatch in snippet view article find links to article
Jacob; Krueger, David (2022-06-28). "Goal Misgeneralization in Deep Reinforcement Learning". Proceedings of the 39th International Conference on Machine
Reinforcement learning from human feedback (4,911 words) [view diff] case mismatch in snippet view article find links to article
Brown, Tom; Martic, Miljan; Legg, Shane; Amodei, Dario (2017). "Deep Reinforcement Learning from Human Preferences". Advances in Neural Information Processing