This repository is the official implemetation of the paper "FlowDreamer: A RGB-D World Model with Flow-based Motion Representations for Robot Manipulation". The code has been tested on Ubuntu 22.04, ...
Abstract: Text-to-image diffusion models have made remarkable progress in generating high-quality images, but achieving fine-grained control over visual text remains a challenge. Existing methods fall ...