Find link

language:

Find link is a tool written by Edward Betts.

searching for Deep reinforcement learning 71 found (89 total)

alternate case: deep reinforcement learning

Q-learning (3,785 words) [view diff] case mismatch in snippet view article find links to article

2015). "Deep Reinforcement Learning with Double Q-learning". arXiv:1509.06461 [cs.LG]. van Hasselt, Hado; Guez, Arthur; Silver, David (2015). "Deep reinforcement

Dharshan Kumaran (187 words) [view diff] exact match in snippet view article find links to article

more than 20,000 articles, is '"Human-level control through deep reinforcement learning" which he co-authored in 2015 with others including V Mnih, K

Adversarial machine learning (7,402 words) [view diff] exact match in snippet view article find links to article

information to the structure and type of model being used. Adversarial deep reinforcement learning is an active area of research in reinforcement learning focusing

Ansatz (656 words) [view diff] exact match in snippet view article find links to article

; Prati, E. (2019). "Coherent transport of quantum states by deep reinforcement learning". Communications Physics. 2 (1): 61. arXiv:1901.06603. Bibcode:2019CmPhy

Intelligent control (458 words) [view diff] exact match in snippet view article find links to article

supposed to capture the dynamics of a system. For the control part, deep reinforcement learning has shown its ability to control complex systems. Bayesian probability

Timothy Lillicrap (912 words) [view diff] case mismatch in snippet view article find links to article

David Silver, Daan Wierstra (2015). Continuous Control with Deep Reinforcement Learning. arXiv:1509.02971 Nicolas Heess, Jonathan J. Hunt, Timothy Lillicrap

Baher Abdulhai (1,928 words) [view diff] exact match in snippet view article find links to article

the impacts of AVs on the capacities of highway systems. Using deep reinforcement learning and high dimensional sensory inputs, he performed a case study

Cognitive architecture (1,252 words) [view diff] case mismatch in snippet view article find links to article

Wierstra, Daan; Riedmiller, Martin (2013). "Playing Atari with Deep Reinforcement Learning". arXiv:1312.5602 [cs.LG]. Mnih, Volodymyr; Kavukcuoglu, Koray;

ACM Prize in Computing (104 words) [view diff] exact match in snippet view article find links to article

to robot learning, including learning from demonstrations and deep reinforcement learning for robotic control. 2020 Scott Aaronson For groundbreaking contributions

Palletizer (539 words) [view diff] case mismatch in snippet view article find links to article

position on the pallet. In recent years, some research has utilized Deep Reinforcement Learning, where robotic agents aim to learn an optimal placement position

Dorothy Okello (1,176 words) [view diff] exact match in snippet view article find links to article

published in the 2020 IST-Africa Conference (IST-Africa) (3) A deep reinforcement learning-based algorithm for reliability-aware multi-domain service deployment

Machine learning in video games (3,879 words) [view diff] exact match in snippet view article find links to article

state of the art machine learning techniques such as relational deep reinforcement learning, long short-term memory, auto-regressive policy heads, pointer

Lit pool (261 words) [view diff] exact match in snippet view article find links to article

making and incentives design in the presence of a dark pool: a deep reinforcement learning approach". arXiv:1912.01129 [q-fin.MF]. Palmer, Max (2010-03-20)

David Silver (computer scientist) (713 words) [view diff] exact match in snippet view article

Silver; et al. (25 February 2015). "Human-level control through deep reinforcement learning". Nature. 518 (7540): 529–533. doi:10.1038/NATURE14236. ISSN 1476-4687

Maluuba (1,266 words) [view diff] exact match in snippet view article find links to article

Maluuba published a research paper learning dialogue policies with deep reinforcement learning. In 2016, Maluuba also freely released the Frames dataset, which

Networked-loan (3,353 words) [view diff] exact match in snippet view article find links to article

with deep reinforcement learning integrated with high-order graph message-passing networks. It uses the framework of deep reinforcement learning to learn

RFM (market research) (866 words) [view diff] case mismatch in snippet view article

Tkachenko, Yegor. Autonomous CRM Control via CLV Approximation with Deep Reinforcement Learning in Discrete and Continuous Action Space. (April 8, 2015). arXiv

Apprenticeship learning (1,336 words) [view diff] exact match in snippet view article find links to article

Leike, J., Brown, T., Martic, M., Legg, S., & Amodei, D. (2017). Deep reinforcement learning from human preferences. In Advances in Neural Information Processing

Beam tilt (753 words) [view diff] case mismatch in snippet view article find links to article

"Online Antenna Tuning in Heterogeneous Cellular Networks With Deep Reinforcement Learning". IEEE Transactions on Cognitive Communications and Networking

Demis Hassabis (4,994 words) [view diff] exact match in snippet view article find links to article

learning and reinforcement learning, and pioneered the field of deep reinforcement learning which combines these two methods. Hassabis has predicted that

DeepStack (675 words) [view diff] exact match in snippet view article find links to article

Bakhtin, Anton; Lerer, Adam; Gong, Qucheng (2020). "Combining deep reinforcement learning and search for imperfect-information games". Advances in Neural

Daniel Kroening (589 words) [view diff] case mismatch in snippet view article find links to article

"Deepsynth: Automata Synthesis for Automatic Task Segmentation in Deep Reinforcement Learning". AAAI 2020, Vol. 35, No. 9, pages 7647-7656. Vijay D’Silva,

Language creation in artificial intelligence (769 words) [view diff] case mismatch in snippet view article find links to article

Batra, D. (2017). Learning Cooperative Visual Dialog Agents with Deep Reinforcement Learning. arXiv preprint arXiv:1703.06585. Johnson, M., Schuster, M.,

Swarm robotics (2,264 words) [view diff] case mismatch in snippet view article find links to article

Multi-Robot Autonomous Exploration in Unknown Environments via Deep Reinforcement Learning" IEEE Transactions on Vehicular Technology, 2020. Hu, J.; Turgut

Microswimmer (14,639 words) [view diff] exact match in snippet view article find links to article

trapped in certain flow structures by learning smart gravitaxis. Deep reinforcement learning has been used to explore microswimmer navigation problems in

Seega (game) (905 words) [view diff] case mismatch in snippet view article

Player for Seejeh (A.K.A Seega, Siga, Kharbga) Board Game with Deep Reinforcement Learning". Procedia Computer Science. 160: 241–247. doi:10.1016/j.procs

Chainer (856 words) [view diff] exact match in snippet view article find links to article

previous record held by Facebook. ChainerRL adds state of art deep reinforcement learning algorithms, and ChainerUI is a management and visualization tool

Paul Christiano (researcher) (1,172 words) [view diff] case mismatch in snippet view article

single charity. At OpenAI, Christiano co-authored the paper "Deep Reinforcement Learning from Human Preferences" (2017) and other works developing reinforcement

InterQuest Group Ltd (1,073 words) [view diff] case mismatch in snippet view article find links to article

Conference". Retrieved 27 November 2019. "Step into the AI Era: Deep Reinforcement Learning Workshop". Retrieved 27 November 2019. "UX Sessions". Retrieved

Proximal policy optimization (2,082 words) [view diff] case mismatch in snippet view article find links to article

05477 “A Beginner’s Guide to deep Reinforcement learning,” Pathmind. https://wiki.pathmind.com/deep-reinforcement-learning#reward Q. T. Luu, “Q-learning

Active learning (machine learning) (2,358 words) [view diff] case mismatch in snippet view article

https://arxiv.org/abs/2303.01560v2 Learning how to Active Learn: A Deep Reinforcement Learning Approach, Meng Fang, Yuan Li, Trevor Cohn, https://arxiv.org/abs/1708

Montezuma's Revenge (video game) (1,392 words) [view diff] exact match in snippet view article

Petersen, Stig (February 2015). "Human-level control through deep reinforcement learning". Nature. 518 (7540): 529–533. Bibcode:2015Natur.518..529M. doi:10

Princeton Plasma Physics Laboratory (2,137 words) [view diff] exact match in snippet view article find links to article

Egemen (2024). "Avoiding fusion plasma tearing instability with deep reinforcement learning". Nature. 626 (8000): 746–751. doi:10.1038/s41586-024-07024-9

Dorin Comaniciu (834 words) [view diff] case mismatch in snippet view article find links to article

Andreas; Hornegger, Joachim; Comaniciu, Dorin (2019). "Multi-Scale Deep Reinforcement Learning for Real-Time 3D-Landmark Detection in CT Scans". IEEE Transactions

Artificial intelligence (22,441 words) [view diff] exact match in snippet view article find links to article

against four of the world's best Gran Turismo drivers using deep reinforcement learning. Finance is one of the fastest growing sectors where applied

Nested sampling algorithm (2,160 words) [view diff] exact match in snippet view article find links to article

framework for uncertainty quantification, optimization, and deep reinforcement learning, which also implements nested sampling. Since nested sampling

Mahjong and artificial intelligence (555 words) [view diff] case mismatch in snippet view article find links to article

Yang; Li Zhao; Tao Qin; Tie-Yan Liu; Hsiao-Wuen Hon (2020-04-01). "Suphx: Mastering Mahjong with Deep Reinforcement Learning". arXiv:2003.13590 [cs.AI].

Customer lifetime value (2,890 words) [view diff] case mismatch in snippet view article find links to article

Tkachenko, Yegor. Autonomous CRM Control via CLV Approximation with Deep Reinforcement Learning in Discrete and Continuous Action Space. (April 8, 2015). arXiv

Distributional Soft Actor Critic (321 words) [view diff] case mismatch in snippet view article find links to article

et al. (2018). "Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor". ICML. Wang, Wenxuan; et al. (2023)

Google Brain (3,833 words) [view diff] exact match in snippet view article find links to article

ISSN 1941-0468. Gu, S.; Holly, E.; Lillicrap, T.; Levine, S. (May 2017). "Deep reinforcement learning for robotic manipulation with asynchronous off-policy updates"

Evaluation function (2,438 words) [view diff] case mismatch in snippet view article find links to article

ICCA Journal Lai, Matthew (4 September 2015), Giraffe: Using Deep Reinforcement Learning to Play Chess, arXiv:1509.01549v1 "Neural network topology".

Gregory Dudek (1,273 words) [view diff] exact match in snippet view article find links to article

decision-making under uncertainty, using techniques including deep reinforcement learning and probabilistic modelling. Dudek has participated in the organization

Convolutional neural network (15,064 words) [view diff] exact match in snippet view article find links to article

research described an application to Atari 2600 gaming. Other deep reinforcement learning models preceded it. Convolutional deep belief networks (CDBN)

Pushmeet Kohli (1,045 words) [view diff] exact match in snippet view article find links to article

(February 2022). "Magnetic control of tokamak plasmas through deep reinforcement learning". Nature. 602 (7897): 414–419. Bibcode:2022Natur.602..414D. doi:10

AI alignment (11,625 words) [view diff] case mismatch in snippet view article find links to article

Jacob; Krueger, David (June 28, 2022). "Goal Misgeneralization in Deep Reinforcement Learning". Proceedings of the 39th International Conference on Machine

IIT Madras (6,959 words) [view diff] exact match in snippet view article find links to article

one of the country's largest groups in network analytics and deep reinforcement learning. Google has granted IIT Madras $1 million for setting up India's

Edward Y. Chang (2,492 words) [view diff] exact match in snippet view article find links to article

, Chang, E. Y. (2018). Refuel: Exploring sparse features in deep reinforcement learning for fast disease diagnosis. In Advances in Neural Information

AlphaDev (1,132 words) [view diff] exact match in snippet view article find links to article

Silver, David (2023). "Faster sorting algorithms discovered using deep reinforcement learning". Nature. 618: 257–263. doi:10.1038/s41586-023-06004-9. PMC 10247365

MuJoCo (318 words) [view diff] case mismatch in snippet view article find links to article

Jorge Pena; Westerlund, Tomi (2020). "Sim-to-Real Transfer in Deep Reinforcement Learning for Robotics: A Survey". 2020 IEEE Symposium Series on Computational

Machine learning (14,693 words) [view diff] case mismatch in snippet view article find links to article

Multi-Robot Autonomous Exploration in Unknown Environments via Deep Reinforcement Learning". IEEE Transactions on Vehicular Technology. 69 (12): 14413–14423

OpenAI (15,431 words) [view diff] exact match in snippet view article find links to article

(MOBA) games and how OpenAI Five has demonstrated the use of deep reinforcement learning (DRL) agents to achieve superhuman competence in Dota 2 matches

Rubik's Cube (10,383 words) [view diff] case mismatch in snippet view article find links to article

Prati (2021). "Solving Rubik's Cube via Quantum Mechanics and Deep Reinforcement Learning". Journal of Physics A: Mathematical and Theoretical. 54 (5):

List of volunteer computing projects (4,255 words) [view diff] exact match in snippet view article find links to article

2018-03-04 Software testing, chess Trains chess neural networks with deep reinforcement learning. Experiments with training parameters and net architectures No

Mahjong (13,142 words) [view diff] case mismatch in snippet view article find links to article

Hsiao-Wuen (31 March 2020). "Suphx: Mastering Mahjong with Deep Reinforcement Learning". arXiv:2003.13590 [cs.AI]. "Top-grossing". Facebook. Retrieved

Internet of things (19,751 words) [view diff] exact match in snippet view article find links to article

driving force for autonomous IoT. An approach in this context is deep reinforcement learning where most of IoT systems provide a dynamic and interactive environment

Curriculum learning (1,366 words) [view diff] exact match in snippet view article find links to article

Curriculum learning for heterogeneous star network embedding via deep reinforcement learning. pp. 468–476. doi:10.1145/3159652.3159711. hdl:2142/101634.

Reward hacking (1,505 words) [view diff] exact match in snippet view article find links to article

Yuval Tassa, Tom Erez, and Martin Riedmiller. "Data-efficient deep reinforcement learning for dexterous manipulation." arXiv preprint arXiv:1704.03073

Quantum machine learning (10,293 words) [view diff] case mismatch in snippet view article find links to article

Xiaoli; Goan, Hsi-Sheng (2020). "Variational Quantum Circuits for Deep Reinforcement Learning". IEEE Access. 8: 141007–141024. arXiv:1907.00397. Bibcode:2020IEEEA

Hover (behaviour) (1,809 words) [view diff] exact match in snippet view article

2023). "Exploring storm petrel pattering and sea-anchoring using deep reinforcement learning". Bioinspiration & Biomimetics. 18 (6). University of Portland

Tokamak (14,070 words) [view diff] exact match in snippet view article find links to article

(February 2024). "Avoiding fusion plasma tearing instability with deep reinforcement learning". Nature. 626 (8000): 746–751. Bibcode:2024Natur.626..746S. doi:10

Generative adversarial network (14,084 words) [view diff] exact match in snippet view article find links to article

enforce the alignment of the latent feature space, such as in deep reinforcement learning. This works by feeding the embeddings of the source and target

Fusion power (20,836 words) [view diff] exact match in snippet view article find links to article

address fusion heating, measurement, and power production. A deep reinforcement learning system has been used to control a tokamak-based reactor. The

Federated learning (5,963 words) [view diff] case mismatch in snippet view article find links to article

Guo, Weisi; Nallanathan, Arumugam; Wu, Qihui (2021). "Green Deep Reinforcement Learning for Radio Resource Management: Architecture, Algorithm Compression

Occupant-centric building controls (1,910 words) [view diff] exact match in snippet view article find links to article

associated with thermal comfort and indoor air control via a deep reinforcement learning algorithm". Building and Environment. 155: 105–117. doi:10.1016/j

Timeline of computing 2020–present (23,329 words) [view diff] exact match in snippet view article find links to article

Scaramuzza, Davide (August 2023). "Champion-level drone racing using deep reinforcement learning". Nature. 620 (7976): 982–987. Bibcode:2023Natur.620..982K. doi:10

Drones in wildfire management (4,517 words) [view diff] case mismatch in snippet view article find links to article

Mousavi, Seyed Sajad; Schukat, Michael; Howley, Enda (2018). "Deep Reinforcement Learning: An Overview". Proceedings of SAI Intelligent Systems Conference

Glossary of engineering: M–Z (31,123 words) [view diff] case mismatch in snippet view article find links to article

Multi-Robot Autonomous Exploration in Unknown Environments via Deep Reinforcement Learning" IEEE Transactions on Vehicular Technology, 2020. Feynman, Richard

Applications of artificial intelligence (20,753 words) [view diff] exact match in snippet view article find links to article

Hassabis, Demis (26 February 2015). "Human-level control through deep reinforcement learning". Nature. 518 (7540): 529–533. Bibcode:2015Natur.518..529M. doi:10

2023 in science (44,482 words) [view diff] exact match in snippet view article find links to article

Scaramuzza, Davide (August 2023). "Champion-level drone racing using deep reinforcement learning". Nature. 620 (7976): 982–987. Bibcode:2023Natur.620..982K. doi:10

AI safety (9,544 words) [view diff] case mismatch in snippet view article find links to article

Jacob; Krueger, David (2022-06-28). "Goal Misgeneralization in Deep Reinforcement Learning". Proceedings of the 39th International Conference on Machine

Reinforcement learning from human feedback (4,911 words) [view diff] case mismatch in snippet view article find links to article

Brown, Tom; Martic, Miljan; Legg, Shane; Amodei, Dario (2017). "Deep Reinforcement Learning from Human Preferences". Advances in Neural Information Processing