Find link is a tool written by Edward Betts. Searching for Deep reinforcement learning: 71 found (89 total)
alternate case: deep reinforcement learning
Q-learning (3,785 words) case mismatch in snippet
2015). "Deep Reinforcement Learning with Double Q-learning". arXiv:1509.06461 [cs.LG]. van Hasselt, Hado; Guez, Arthur; Silver, David (2015). "Deep reinforcement

Dharshan Kumaran (187 words) exact match in snippet
more than 20,000 articles, is "Human-level control through deep reinforcement learning" which he co-authored in 2015 with others including V Mnih, K

Adversarial machine learning (7,402 words) exact match in snippet
information to the structure and type of model being used. Adversarial deep reinforcement learning is an active area of research in reinforcement learning focusing

Ansatz (656 words) exact match in snippet
; Prati, E. (2019). "Coherent transport of quantum states by deep reinforcement learning". Communications Physics. 2 (1): 61. arXiv:1901.06603. Bibcode:2019CmPhy

Intelligent control (458 words) exact match in snippet
supposed to capture the dynamics of a system. For the control part, deep reinforcement learning has shown its ability to control complex systems. Bayesian probability

Timothy Lillicrap (912 words) case mismatch in snippet
David Silver, Daan Wierstra (2015). Continuous Control with Deep Reinforcement Learning. arXiv:1509.02971 Nicolas Heess, Jonathan J. Hunt, Timothy Lillicrap

Baher Abdulhai (1,928 words) exact match in snippet
the impacts of AVs on the capacities of highway systems. Using deep reinforcement learning and high dimensional sensory inputs, he performed a case study

Cognitive architecture (1,252 words) case mismatch in snippet
Wierstra, Daan; Riedmiller, Martin (2013). "Playing Atari with Deep Reinforcement Learning". arXiv:1312.5602 [cs.LG]. Mnih, Volodymyr; Kavukcuoglu, Koray;

ACM Prize in Computing (104 words) exact match in snippet
to robot learning, including learning from demonstrations and deep reinforcement learning for robotic control. 2020 Scott Aaronson For groundbreaking contributions

Palletizer (539 words) case mismatch in snippet
position on the pallet. In recent years, some research has utilized Deep Reinforcement Learning, where robotic agents aim to learn an optimal placement position

Dorothy Okello (1,176 words) exact match in snippet
published in the 2020 IST-Africa Conference (IST-Africa) (3) A deep reinforcement learning-based algorithm for reliability-aware multi-domain service deployment

Machine learning in video games (3,879 words) exact match in snippet
state of the art machine learning techniques such as relational deep reinforcement learning, long short-term memory, auto-regressive policy heads, pointer

Lit pool (261 words) exact match in snippet
making and incentives design in the presence of a dark pool: a deep reinforcement learning approach". arXiv:1912.01129 [q-fin.MF]. Palmer, Max (2010-03-20)

David Silver (computer scientist) (713 words) exact match in snippet
Silver; et al. (25 February 2015). "Human-level control through deep reinforcement learning". Nature. 518 (7540): 529–533. doi:10.1038/NATURE14236. ISSN 1476-4687

Maluuba (1,266 words) exact match in snippet
Maluuba published a research paper learning dialogue policies with deep reinforcement learning. In 2016, Maluuba also freely released the Frames dataset, which

Networked-loan (3,353 words) exact match in snippet
with deep reinforcement learning integrated with high-order graph message-passing networks. It uses the framework of deep reinforcement learning to learn

RFM (market research) (866 words) case mismatch in snippet
Tkachenko, Yegor. Autonomous CRM Control via CLV Approximation with Deep Reinforcement Learning in Discrete and Continuous Action Space. (April 8, 2015). arXiv

Apprenticeship learning (1,336 words) exact match in snippet
Leike, J., Brown, T., Martic, M., Legg, S., & Amodei, D. (2017). Deep reinforcement learning from human preferences. In Advances in Neural Information Processing

Beam tilt (753 words) case mismatch in snippet
"Online Antenna Tuning in Heterogeneous Cellular Networks With Deep Reinforcement Learning". IEEE Transactions on Cognitive Communications and Networking

Demis Hassabis (4,994 words) exact match in snippet
learning and reinforcement learning, and pioneered the field of deep reinforcement learning which combines these two methods. Hassabis has predicted that

DeepStack (675 words) exact match in snippet
Bakhtin, Anton; Lerer, Adam; Gong, Qucheng (2020). "Combining deep reinforcement learning and search for imperfect-information games". Advances in Neural

Daniel Kroening (589 words) case mismatch in snippet
"Deepsynth: Automata Synthesis for Automatic Task Segmentation in Deep Reinforcement Learning". AAAI 2020, Vol. 35, No. 9, pages 7647-7656. Vijay D’Silva,

Language creation in artificial intelligence (769 words) case mismatch in snippet
Batra, D. (2017). Learning Cooperative Visual Dialog Agents with Deep Reinforcement Learning. arXiv preprint arXiv:1703.06585. Johnson, M., Schuster, M.,

Swarm robotics (2,264 words) case mismatch in snippet
Multi-Robot Autonomous Exploration in Unknown Environments via Deep Reinforcement Learning" IEEE Transactions on Vehicular Technology, 2020. Hu, J.; Turgut

Microswimmer (14,639 words) exact match in snippet
trapped in certain flow structures by learning smart gravitaxis. Deep reinforcement learning has been used to explore microswimmer navigation problems in

Seega (game) (905 words) case mismatch in snippet
Player for Seejeh (A.K.A Seega, Siga, Kharbga) Board Game with Deep Reinforcement Learning". Procedia Computer Science. 160: 241–247. doi:10.1016/j.procs

Chainer (856 words) exact match in snippet
previous record held by Facebook. ChainerRL adds state of art deep reinforcement learning algorithms, and ChainerUI is a management and visualization tool

Paul Christiano (researcher) (1,172 words) case mismatch in snippet
single charity. At OpenAI, Christiano co-authored the paper "Deep Reinforcement Learning from Human Preferences" (2017) and other works developing reinforcement

InterQuest Group Ltd (1,073 words) case mismatch in snippet
Conference". Retrieved 27 November 2019. "Step into the AI Era: Deep Reinforcement Learning Workshop". Retrieved 27 November 2019. "UX Sessions". Retrieved

Proximal policy optimization (2,082 words) case mismatch in snippet
05477 “A Beginner’s Guide to deep Reinforcement learning,” Pathmind. https://wiki.pathmind.com/deep-reinforcement-learning#reward Q. T. Luu, “Q-learning

Active learning (machine learning) (2,358 words) case mismatch in snippet
https://arxiv.org/abs/2303.01560v2 Learning how to Active Learn: A Deep Reinforcement Learning Approach, Meng Fang, Yuan Li, Trevor Cohn, https://arxiv.org/abs/1708

Montezuma's Revenge (video game) (1,392 words) exact match in snippet
Petersen, Stig (February 2015). "Human-level control through deep reinforcement learning". Nature. 518 (7540): 529–533. Bibcode:2015Natur.518..529M. doi:10

Princeton Plasma Physics Laboratory (2,137 words) exact match in snippet
Egemen (2024). "Avoiding fusion plasma tearing instability with deep reinforcement learning". Nature. 626 (8000): 746–751. doi:10.1038/s41586-024-07024-9

Dorin Comaniciu (834 words) case mismatch in snippet
Andreas; Hornegger, Joachim; Comaniciu, Dorin (2019). "Multi-Scale Deep Reinforcement Learning for Real-Time 3D-Landmark Detection in CT Scans". IEEE Transactions

Artificial intelligence (22,441 words) exact match in snippet
against four of the world's best Gran Turismo drivers using deep reinforcement learning. Finance is one of the fastest growing sectors where applied

Nested sampling algorithm (2,160 words) exact match in snippet
framework for uncertainty quantification, optimization, and deep reinforcement learning, which also implements nested sampling. Since nested sampling

Mahjong and artificial intelligence (555 words) case mismatch in snippet
Yang; Li Zhao; Tao Qin; Tie-Yan Liu; Hsiao-Wuen Hon (2020-04-01). "Suphx: Mastering Mahjong with Deep Reinforcement Learning". arXiv:2003.13590 [cs.AI].

Customer lifetime value (2,890 words) case mismatch in snippet
Tkachenko, Yegor. Autonomous CRM Control via CLV Approximation with Deep Reinforcement Learning in Discrete and Continuous Action Space. (April 8, 2015). arXiv

Distributional Soft Actor Critic (321 words) case mismatch in snippet
et al. (2018). "Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor". ICML. Wang, Wenxuan; et al. (2023)

Google Brain (3,833 words) exact match in snippet
ISSN 1941-0468. Gu, S.; Holly, E.; Lillicrap, T.; Levine, S. (May 2017). "Deep reinforcement learning for robotic manipulation with asynchronous off-policy updates"

Evaluation function (2,438 words) case mismatch in snippet
ICCA Journal Lai, Matthew (4 September 2015), Giraffe: Using Deep Reinforcement Learning to Play Chess, arXiv:1509.01549v1 "Neural network topology".

Gregory Dudek (1,273 words) exact match in snippet
decision-making under uncertainty, using techniques including deep reinforcement learning and probabilistic modelling. Dudek has participated in the organization

Convolutional neural network (15,064 words) exact match in snippet
research described an application to Atari 2600 gaming. Other deep reinforcement learning models preceded it. Convolutional deep belief networks (CDBN)

Pushmeet Kohli (1,045 words) exact match in snippet
(February 2022). "Magnetic control of tokamak plasmas through deep reinforcement learning". Nature. 602 (7897): 414–419. Bibcode:2022Natur.602..414D. doi:10

AI alignment (11,625 words) case mismatch in snippet
Jacob; Krueger, David (June 28, 2022). "Goal Misgeneralization in Deep Reinforcement Learning". Proceedings of the 39th International Conference on Machine

IIT Madras (6,959 words) exact match in snippet
one of the country's largest groups in network analytics and deep reinforcement learning. Google has granted IIT Madras $1 million for setting up India's

Edward Y. Chang (2,492 words) exact match in snippet
, Chang, E. Y. (2018). Refuel: Exploring sparse features in deep reinforcement learning for fast disease diagnosis. In Advances in Neural Information

AlphaDev (1,132 words) exact match in snippet
Silver, David (2023). "Faster sorting algorithms discovered using deep reinforcement learning". Nature. 618: 257–263. doi:10.1038/s41586-023-06004-9. PMC 10247365

MuJoCo (318 words) case mismatch in snippet
Jorge Pena; Westerlund, Tomi (2020). "Sim-to-Real Transfer in Deep Reinforcement Learning for Robotics: A Survey". 2020 IEEE Symposium Series on Computational

Machine learning (14,693 words) case mismatch in snippet
Multi-Robot Autonomous Exploration in Unknown Environments via Deep Reinforcement Learning". IEEE Transactions on Vehicular Technology. 69 (12): 14413–14423

OpenAI (15,431 words) exact match in snippet
(MOBA) games and how OpenAI Five has demonstrated the use of deep reinforcement learning (DRL) agents to achieve superhuman competence in Dota 2 matches

Rubik's Cube (10,383 words) case mismatch in snippet
Prati (2021). "Solving Rubik's Cube via Quantum Mechanics and Deep Reinforcement Learning". Journal of Physics A: Mathematical and Theoretical. 54 (5):

List of volunteer computing projects (4,255 words) exact match in snippet
2018-03-04 Software testing, chess Trains chess neural networks with deep reinforcement learning. Experiments with training parameters and net architectures No

Mahjong (13,142 words) case mismatch in snippet
Hsiao-Wuen (31 March 2020). "Suphx: Mastering Mahjong with Deep Reinforcement Learning". arXiv:2003.13590 [cs.AI]. "Top-grossing". Facebook. Retrieved

Internet of things (19,751 words) exact match in snippet
driving force for autonomous IoT. An approach in this context is deep reinforcement learning where most of IoT systems provide a dynamic and interactive environment

Curriculum learning (1,366 words) exact match in snippet
Curriculum learning for heterogeneous star network embedding via deep reinforcement learning. pp. 468–476. doi:10.1145/3159652.3159711. hdl:2142/101634.

Reward hacking (1,505 words) exact match in snippet
Yuval Tassa, Tom Erez, and Martin Riedmiller. "Data-efficient deep reinforcement learning for dexterous manipulation." arXiv preprint arXiv:1704.03073

Quantum machine learning (10,293 words) case mismatch in snippet
Xiaoli; Goan, Hsi-Sheng (2020). "Variational Quantum Circuits for Deep Reinforcement Learning". IEEE Access. 8: 141007–141024. arXiv:1907.00397. Bibcode:2020IEEEA

Hover (behaviour) (1,809 words) exact match in snippet
2023). "Exploring storm petrel pattering and sea-anchoring using deep reinforcement learning". Bioinspiration & Biomimetics. 18 (6). University of Portland

Tokamak (14,070 words) exact match in snippet
(February 2024). "Avoiding fusion plasma tearing instability with deep reinforcement learning". Nature. 626 (8000): 746–751. Bibcode:2024Natur.626..746S. doi:10

Generative adversarial network (14,084 words) exact match in snippet
enforce the alignment of the latent feature space, such as in deep reinforcement learning. This works by feeding the embeddings of the source and target

Fusion power (20,836 words) exact match in snippet
address fusion heating, measurement, and power production. A deep reinforcement learning system has been used to control a tokamak-based reactor. The

Federated learning (5,963 words) case mismatch in snippet
Guo, Weisi; Nallanathan, Arumugam; Wu, Qihui (2021). "Green Deep Reinforcement Learning for Radio Resource Management: Architecture, Algorithm Compression

Occupant-centric building controls (1,910 words) exact match in snippet
associated with thermal comfort and indoor air control via a deep reinforcement learning algorithm". Building and Environment. 155: 105–117. doi:10.1016/j

Timeline of computing 2020–present (23,329 words) exact match in snippet
Scaramuzza, Davide (August 2023). "Champion-level drone racing using deep reinforcement learning". Nature. 620 (7976): 982–987. Bibcode:2023Natur.620..982K. doi:10

Drones in wildfire management (4,517 words) case mismatch in snippet
Mousavi, Seyed Sajad; Schukat, Michael; Howley, Enda (2018). "Deep Reinforcement Learning: An Overview". Proceedings of SAI Intelligent Systems Conference

Glossary of engineering: M–Z (31,123 words) case mismatch in snippet
Multi-Robot Autonomous Exploration in Unknown Environments via Deep Reinforcement Learning" IEEE Transactions on Vehicular Technology, 2020. Feynman, Richard

Applications of artificial intelligence (20,753 words) exact match in snippet
Hassabis, Demis (26 February 2015). "Human-level control through deep reinforcement learning". Nature. 518 (7540): 529–533. Bibcode:2015Natur.518..529M. doi:10

2023 in science (44,482 words) exact match in snippet
Scaramuzza, Davide (August 2023). "Champion-level drone racing using deep reinforcement learning". Nature. 620 (7976): 982–987. Bibcode:2023Natur.620..982K. doi:10

AI safety (9,544 words) case mismatch in snippet
Jacob; Krueger, David (2022-06-28). "Goal Misgeneralization in Deep Reinforcement Learning". Proceedings of the 39th International Conference on Machine

Reinforcement learning from human feedback (4,911 words) case mismatch in snippet
Brown, Tom; Martic, Miljan; Legg, Shane; Amodei, Dario (2017). "Deep Reinforcement Learning from Human Preferences". Advances in Neural Information Processing