Entropy-regularized diffusion policy with Q-ensembles for offline reinforcement learning
Uppsala University, Disciplinary Domain of Science and Technology, Mathematics and Computer Science, Department of Information Technology, Division of Systems and Control. Uppsala University, Disciplinary Domain of Science and Technology, Mathematics and Computer Science, Department of Information Technology, Automatic control.
Uppsala University, Disciplinary Domain of Science and Technology, Mathematics and Computer Science, Department of Information Technology, Division of Systems and Control. Uppsala University, Disciplinary Domain of Science and Technology, Mathematics and Computer Science, Department of Information Technology, Artificial Intelligence. ORCID iD: 0000-0003-3334-8655
Uppsala University, Disciplinary Domain of Science and Technology, Mathematics and Computer Science, Department of Information Technology, Division of Systems and Control. Uppsala University, Disciplinary Domain of Science and Technology, Mathematics and Computer Science, Department of Information Technology, Artificial Intelligence. ORCID iD: 0000-0002-9099-3522
Uppsala University, Disciplinary Domain of Science and Technology, Mathematics and Computer Science, Department of Information Technology, Division of Systems and Control. Uppsala University, Disciplinary Domain of Science and Technology, Mathematics and Computer Science, Department of Information Technology, Automatic control. Uppsala University, Disciplinary Domain of Science and Technology, Mathematics and Computer Science, Department of Information Technology, Artificial Intelligence. ORCID iD: 0000-0001-5183-234X
2024 (English). In: Advances in Neural Information Processing Systems 37 (NeurIPS 2024), 2024, Vol. 37. Conference paper, Published paper (Refereed)
Place, publisher, year, edition, pages
2024. Vol. 37
National Category
Other Computer and Information Science
Research subject
Machine learning
Identifiers
URN: urn:nbn:se:uu:diva-545831
OAI: oai:DiVA.org:uu-545831
DiVA, id: diva2:1923473
Conference
Neural Information Processing Systems
Funder
Wallenberg AI, Autonomous Systems and Software Program (WASP)
Kjell and Marta Beijer Foundation
Swedish Research Council, 2021-04301
Swedish Research Council, 2023-04546
Available from: 2024-12-25. Created: 2024-12-25. Last updated: 2025-01-09. Bibliographically approved

Open Access in DiVA

No full text in DiVA

Authority records

Zhang, Ruoqi; Luo, Ziwei; Sjölund, Jens; Schön, Thomas B.; Mattsson, Per
