数据来源:https://github.com/zhangsheng93/cMedQA
- questions.csv All Questions and their content.
- answers.csv All Answers and their content.
- train_candidates.txt dev_candidates.txt test_candidates.txt The split of training set, development set and test set respectively.
这里我们主要使用了questions.csv和answers.csv两个文件的信息,提取了医疗问题和相应的回答
注意:这里只是做了一个最简单的医疗问答的demo,后面还有很多需要优化的地方