chenfei-wu · chat1q2w3e4r5t · Mar 11, 2023
diff --git a/README.md b/README.md
@@ -4,39 +4,21 @@
 
 See our paper: [<font size=5>Visual ChatGPT: Talking, Drawing and Editing with Visual Foundation Models</font>](https://arxiv.org/abs/2303.04671)
 
-## Demo 
-<img src="./assets/demo_short.gif" width="750">
-
-##  System Architecture 
-
-
-<p align="center"><img src="./assets/figure.jpg" alt="Logo"></p>
-
+## Intro
+I implemented a Google Colab version that runs in the standard Colab GPU environment.
+Because GPU memory there is limited, I load only two models, `T2I` and `ImageCaption`, to process images.
+You can try my Colab notebook here:
 
-## Quick Start
-
-```
-# create a new environment
-conda create -n visgpt python=3.8
-
-# activate the new environment
-conda activate visgpt
-
-#  prepare the basic environments
-pip install -r requirement.txt
+[![Open 2k image generation in Colab](https://colab.research.google.com/assets/colab-badge.svg)](https://colab.research.google.com/drive/1bl-JAgrUru9GlGsb9hrcZCqj3uI6ev_O?usp=sharing)
+## Demo 
+`T2I`
 
-# download the visual foundation models
-bash download.sh
+<img src="./assets/dog-meme.jpg" width="750">
 
-# prepare your private openAI private key
-export OPENAI_API_KEY={Your_Private_Openai_Key}
+`ImageCaption`
 
-# create a folder to save images
-mkdir ./image
+<img src="./assets/football.jpg" width="750">
 
-# Start Visual ChatGPT !
-python visual_chatgpt.py
-```
 
 ## GPU memory usage
 Here we list the GPU memory usage of each visual foundation model, one can modify ``self.tools`` with fewer visual foundation models to save your GPU memory:

diff --git a/assets/dog-meme.jpg b/assets/dog-meme.jpg
diff --git a/assets/football.jpg b/assets/football.jpg
diff --git a/download.sh b/download.sh
@@ -3,12 +3,12 @@ ln -s ControlNet/ldm ./ldm
 ln -s ControlNet/cldm ./cldm
 ln -s ControlNet/annotator ./annotator
 cd ControlNet/models
-wget https://huggingface.co/lllyasviel/ControlNet/resolve/main/models/control_sd15_canny.pth
-wget https://huggingface.co/lllyasviel/ControlNet/resolve/main/models/control_sd15_depth.pth
-wget https://huggingface.co/lllyasviel/ControlNet/resolve/main/models/control_sd15_hed.pth
-wget https://huggingface.co/lllyasviel/ControlNet/resolve/main/models/control_sd15_mlsd.pth
-wget https://huggingface.co/lllyasviel/ControlNet/resolve/main/models/control_sd15_normal.pth
-wget https://huggingface.co/lllyasviel/ControlNet/resolve/main/models/control_sd15_openpose.pth
-wget https://huggingface.co/lllyasviel/ControlNet/resolve/main/models/control_sd15_scribble.pth
-wget https://huggingface.co/lllyasviel/ControlNet/resolve/main/models/control_sd15_seg.pth
+#wget https://huggingface.co/lllyasviel/ControlNet/resolve/main/models/control_sd15_canny.pth
+#wget https://huggingface.co/lllyasviel/ControlNet/resolve/main/models/control_sd15_depth.pth
+#wget https://huggingface.co/lllyasviel/ControlNet/resolve/main/models/control_sd15_hed.pth
+#wget https://huggingface.co/lllyasviel/ControlNet/resolve/main/models/control_sd15_mlsd.pth
+#wget https://huggingface.co/lllyasviel/ControlNet/resolve/main/models/control_sd15_normal.pth
+#wget https://huggingface.co/lllyasviel/ControlNet/resolve/main/models/control_sd15_openpose.pth
+#wget https://huggingface.co/lllyasviel/ControlNet/resolve/main/models/control_sd15_scribble.pth
+#wget https://huggingface.co/lllyasviel/ControlNet/resolve/main/models/control_sd15_seg.pth
 cd ../../
diff --git a/requirement.txt b/requirement.txt
@@ -3,7 +3,7 @@ torchvision==0.13.1
 numpy==1.23.1
 transformers==4.26.1
 albumentations==1.3.0
-opencv-contrib-python==4.3.0.36
+opencv-python==4.5.1.48
 imageio==2.9.0
 imageio-ffmpeg==0.4.2
 pytorch-lightning==1.5.0