Valhala
Wrote complex biology questions with multiple-choice answers and generated an AI response to each. I evaluated each response for reasoning errors, even when the LLM reached the correct answer. Where I found an error, I highlighted the incorrect reasoning step, explained why it was wrong, and rewrote it correctly. I then supplied the corrected steps and asked the LLM to finish the solution, repeating this process until the LLM reached the correct final answer with sound reasoning.
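The review loop described above can be sketched roughly as follows. This is only an illustration under stated assumptions: `refine`, `generate`, and `review` are hypothetical names, with `generate` standing in for the LLM call and `review` standing in for my manual evaluation. None of them come from the actual project tooling.

```python
from typing import Callable, List, Optional, Tuple

# Hypothetical types for illustration: a solution is a list of reasoning steps.
Steps = List[str]
# generate(question, verified_prefix) returns a full step list that starts with the prefix.
Generate = Callable[[str, Steps], Steps]
# review(steps) returns None if every step is sound, else (index of first bad step, corrected step).
Review = Callable[[Steps], Optional[Tuple[int, str]]]


def refine(question: str, generate: Generate, review: Review, max_rounds: int = 10) -> Steps:
    """Repeat generate -> review -> correct until the reasoning is sound."""
    verified: Steps = []  # steps already checked or corrected by the reviewer
    for _ in range(max_rounds):
        steps = generate(question, verified)
        finding = review(steps)
        if finding is None:
            return steps  # all steps sound, so the final answer is accepted
        bad_index, corrected = finding
        # Keep everything before the bad step, swap in the corrected step,
        # and let the model finish the solution from there on the next round.
        verified = steps[:bad_index] + [corrected]
    raise RuntimeError("no sound solution within max_rounds")
```

The `max_rounds` cap is an assumption added so the sketch always terminates; in practice I simply repeated the process until the reasoning was sound.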