Clinical Camel

⚠️ Upcoming Code Release ⚠️

The codebase related to model inference, training, evaluation, and DBKE is under preparation and will be released soon. Please stay tuned for updates!

Model Description

The Clinical Camel model is a transformer-based language model trained on the LLaMA 13B architecture. It is specifically designed as a research-focused medical conversational model.

Access the live demo, which corresponds to an earlier version of the Clinical Camel model.

Model Conversion

Delta weights for the model are provided. The apply_delta script from FastChat can be used to convert LLaMA-13B to Clinical Camel.

The delta weights can be found at this Hugging Face link.
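Conceptually, applying delta weights means adding the released deltas element-wise to the matching base LLaMA-13B parameters. The sketch below illustrates that idea only; it is not FastChat's actual implementation, and the parameter names are hypothetical (plain lists stand in for real tensors).

```python
# Illustrative sketch of delta-weight merging: target = base + delta,
# matched by parameter name. The real conversion is done with
# FastChat's apply_delta script, not this toy function.

def apply_delta(base_weights, delta_weights):
    """Return target weights: base + delta, matched by parameter name."""
    if base_weights.keys() != delta_weights.keys():
        raise ValueError("base and delta checkpoints must share parameter names")
    return {
        name: [b + d for b, d in zip(base, delta_weights[name])]
        for name, base in base_weights.items()
    }

# Toy example with tiny "tensors" (hypothetical parameter names).
base = {"layer0.weight": [1.0, 2.0], "layer0.bias": [0.0, 0.0]}
delta = {"layer0.weight": [0.5, -1.0], "layer0.bias": [0.01, 0.02]}
target = apply_delta(base, delta)
```

In practice, FastChat exposes this as a module entry point (`python3 -m fastchat.model.apply_delta`) that takes paths to the base model, the delta weights, and the output target model; consult the FastChat documentation for the exact flags of your installed version.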

Data

The model was trained on a diverse dataset that includes:

  • 100,000 synthetic dialogues produced via dialogue-based knowledge encoding (DBKE).
  • 10,187 USMLE questions converted via DBKE.
  • Data from ShareGPT, adding further conversational diversity to the training data.

Training

The training code and inference model are based on FastChat. We would like to extend our gratitude to the developers of FastChat for making their code available for use.

The model was trained for 2 epochs. The specific training parameters and configurations used for the Clinical Camel model are detailed below:

| Parameter | Value |
| --- | --- |
| Learning Rate | 2e-5 |
| Batch Size | 4 |
| Epochs | 2 |
| Optimizer | AdamW (Torch) |
| Max Gradient Norm | 1 |
| Weight Decay | 0 |
| Warm-up Steps | 0 |
| Warm-up Ratio | 0.03 |
| Gradient Accumulation Steps | 8 |
| Per Device Training Batch Size | 4 |
| Maximum Sequence Length | 2048 |
| Learning Rate Scheduler | Cosine |
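Two of these settings interact: with a per-device batch size of 4 and 8 gradient accumulation steps, each optimizer update sees 32 examples per device (times the number of GPUs). The scheduler row can also be made concrete. Below is a sketch of the common "linear warmup, then cosine decay to zero" schedule, using the table's base learning rate (2e-5) and warmup ratio (0.03); this mirrors the usual Hugging Face cosine schedule and is an assumption about the training code, not taken from the repository.

```python
import math

def lr_at_step(step, total_steps, base_lr=2e-5, warmup_ratio=0.03):
    """Linear warmup for the first warmup_ratio of training,
    then cosine decay from base_lr down to 0.
    (Sketch of a standard cosine-with-warmup schedule; the exact
    behaviour of the Clinical Camel training code may differ.)"""
    warmup_steps = int(total_steps * warmup_ratio)
    if step < warmup_steps:
        # Ramp linearly from 0 up to base_lr.
        return base_lr * step / max(1, warmup_steps)
    # Cosine decay over the remaining steps.
    progress = (step - warmup_steps) / max(1, total_steps - warmup_steps)
    return base_lr * 0.5 * (1.0 + math.cos(math.pi * progress))
```

For a hypothetical 1,000-step run, the rate climbs to 2e-5 over the first 30 steps (3%), then decays smoothly toward zero by the final step.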

Model Comparison

The Clinical Camel model was compared to several other popular models in a variety of benchmarks. These include USMLE self-assessment scores, performance on multi-step management problems, and responses to standardized safety questions.

USMLE Self-Assessment Scores

In the USMLE Self-Assessment, Clinical Camel achieved the highest scores on Steps 1 and 3 among the compared models.

| | GPT-3.5 (%) | Clinical Camel (%) | Chat Doctor (%) | PMC-LLaMA (%) | MedAlpaca 13b (%) | Vicuna (%) |
| --- | --- | --- | --- | --- | --- | --- |
| Step 1 | 36.1 | 53.2 | 11.7 | 1.1 | 12.2 | 21.3 |
| Step 2 | 56.9 | 51.4 | 18.5 | 2.7 | 27.5 | 20.4 |
| Step 3 | 55.7 | 58.2 | 14.8 | 3.3 | 26.2 | 30.3 |

Multi-Step Management Problems

Clinical Camel shows competitive performance in multi-step management problems.

| | GPT-3.5 (%) | Clinical Camel (%) | Chat Doctor (%) | PMC-LLaMA (%) | MedAlpaca 13b (%) | Vicuna (%) |
| --- | --- | --- | --- | --- | --- | --- |
| CFPC EM | 81.3 | 74.9 | 53.2 | 13.9 | 31.2 | 68.6 |
| CFPC FM | 85.0 | 82.1 | 67.0 | 14.3 | 44.7 | 74.8 |

Publication

The work related to the Clinical Camel model has been published on arXiv. Feel free to check out the paper for a more comprehensive understanding of the model and its performance.

Future Updates

We are in the process of preparing the remainder of the code used in this project for publication. Please check back in the near future for additional updates and resources. Your patience is appreciated as we work to provide a comprehensive and usable repository.

License

This project is licensed under AGPL-3.0. Please see the LICENSE file for more details.
