This project enables automated scientific research using Large Language Models (LLMs) and search APIs. I've deliberately kept this readme simple. More discussion on this topic can be found in the linked Medium community science post.
git clone https://github.com/jd-coderepos/deep-research.git
cd deep-research
# Windows
python -m venv venv
venv\Scripts\activate
# macOS/Linux
python3 -m venv venv
source venv/bin/activate
pip install -r requirements.txt
Create a .env
file in the project root with the following content:
OPENAI_API_KEY=your-openai-api-key
FIRECRAWL_API_KEY=your-firecrawl-api-key
⚠️ Do not commit .env to version control.
To start the research process:
python src/main.py
deep-research/
├── src/ # Main source code
├── scripts/ # Research evaluation pipeline
├── data/ # Research data and reports
├── requirements.txt # Python dependencies
└── README.md # Setup instructions
After generating research reports, you can evaluate their quality using the qualitative analysis pipeline:
cd scripts
python qualitative_analysis_pipeline.py
The pipeline provides comprehensive quality assessment across different depth-breadth configurations, generating:
- Multi-dimensional quality metrics
- Publication-ready visualizations
- Comparative analysis reports
- Statistical evaluation of research effectiveness
For detailed documentation on the evaluation pipeline, see scripts/README.md
.
This project is a Python reimplementation of the original deep-research repository by David Zhang, developed in TypeScript. Credit goes to the original author for the concept and design of the deep-research workflow.