Skip to content
#

embodied-agent

Here are 29 public repositories matching this topic...

[NeurIPS 2024] GenRL: Multimodal foundation world models allow grounding language and video prompts into embodied domains, by turning them into sequences of latent world model states. Latent state sequences can be decoded using the decoder of the model, allowing visualization of the expected behavior, before training the agent to execute it.

  • Updated Jul 31, 2024
  • Python

Improve this page

Add a description, image, and links to the embodied-agent topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the embodied-agent topic, visit your repo's landing page and select "manage topics."

Learn more