Skip to content

Perception Matters: Enhancing Embodied AI with Uncertainty-Aware Semantic Segmentation. Project Website: http://semantic-search.cs.uni-freiburg.de

License

Notifications You must be signed in to change notification settings

robot-learning-freiburg/Semantic-Search

Repository files navigation

Perception Matters: Enhancing Embodied AI with Uncertainty-Aware Semantic Segmentation

arXiv | website

Repository providing the source code for the paper

Perception Matters: Enhancing Embodied AI with Uncertainty-Aware Semantic Segmentation Sai Prasanna, Daniel Honerkamp* Kshitij Sirohi*, Tim Welschehold, Wolfram Burgard and Abhinav Valada

Please cite the paper as follows:

@article{prasanna2024perception,
  title={Perception Matters: Enhancing Embodied AI with Uncertainty-Aware Semantic Segmentation},
  author={Sai Prasanna and Daniel Honerkamp and Kshitij Sirohi and Tim Welschehold and Wolfram Burgard and Abhinav Valada},
  journal={Proceedings of the International Symposium on Robotics Research (ISRR)},
  year={2024}
}

Setup

  1. You have to obtain the API user name and token for hm3d dataset from matterport by following their instructions. Set these as environment variables export USERNAME=<API_TOKEN_USER_ID> export PASSWORD=<API_TOKEN>.
  2. Run the setup.sh to create the conda environment.
  3. Download the EMSANet checkpoint from https://drive.google.com/uc?id=1LD4_g-jL4KJPRUmCGgXxx2xGQ7TNZ_o2 and extract it tar -xvf checkpoint.tar.gz -C ./third_party/trained_models/

Evaluating aggregation approaches with the Shortest path policy

To evaluate the aggregation approaches with the shortest path policy, run

./scripts/eval_sp_policy_emsanet.sh
./scripts/eval_sp_policy_maskrcnn.sh
./scripts/eval_sp_policy_segformer.sh

Training and evaluating RL Policy

To train the RL policy on ground truth semantics and evaluate it with different semantic models and aggregation approaches, run

./scripts/train_rl_policy.sh
./scripts/eval_rl_policy_emsanet.sh
./scripts/eval_rl_policy_maskrcnn.sh
./scripts/eval_rl_policy_segformer.sh

Misc

Calibrating the perception model

  1. Collect the data for calibrating the perception model. Run
python -m sem_objnav.obj_nav.collect_seg_data --output_dir calibation_dataset
  1. Check the notebooks sem_objnav/notebooks/emsanet_scaling_temp.ipynb and sem_objnav/notebooks/segformer_scaling_temp.ipynb for calibation.

Stubborn

To collect data and train the models used in stubborn, run ./scripts/train_stubborn.sh.

Hyperparameter optimization

To find optimal hyperparameters for the aggregation strategies, run ./scripts/htune.sh.

About

Perception Matters: Enhancing Embodied AI with Uncertainty-Aware Semantic Segmentation. Project Website: http://semantic-search.cs.uni-freiburg.de

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published