GitHub - josk0/essaygrader: Use Claude to grade philosophy essays

A prompt and a primitive class to grade philosophy essays.

put some essays in /essays_input (as .txt or .md) your Anthropic API key in .env and run grade_essays.py

Things that don't work

Model generates results in thinking_text block instead of response_text block. This usually happens when Claude gets into a discussion with the "student" and justifies the grade. The fields (such as <score1>) that populate the dataframe (and the csv) are hence then not where we expect them and you have to go find them in the output files when rows in the CSV are empty.
Grading is somewhat indeterministic: Although variance in final letter grades is low (for multiple runs over the same essay), I should check for variance in qualitative assessment and the feedback given to students

Improve the prompt to shut down excessive reasoning (such as discussions with the student)
~~extract fields from both thinking_block as well as response_block~~
Experiment with temperature parameter
~~Better error handling. Esp since API runs sometimes fail (httpx.ReadTimeout in _send_to_api): should save dataframe at the end either way~~

~~load prompt from file~~
parametrize number of grading criteria in rubric (so that you can exchange rubric)
Make this tool easier to use by others: Load files that are not text or markdown files; make grade-essays.py a CLI tool

~~Require statement and reward clarity of terms and definitions~~
~~Require focus on one line of argument (e.g. on objectivity essay question, people don't just focus on the measurement problem but also talk about fairness etc)~~
include essay question in prompt?
~~make clear that it's not required to give references~~
~~explain to the model to ignore spelling mistakes that are due to OCR or conversion problems~~

Name		Name	Last commit message	Last commit date
Latest commit History 11 Commits
config		config
src/essaygrader		src/essaygrader
.gitignore		.gitignore
README.md		README.md
concat-result-files.py		concat-result-files.py
grade-essays.py		grade-essays.py
requirements.txt		requirements.txt