DeepSeek vs. ChatGPT: Who Bests China’s Pharmacist Exam?

DeepSeek vs. ChatGPT: Who Bests China’s Pharmacist Exam?

Which AI better helps pharmacists-in-training? Researchers tested DeepSeek-R1 and ChatGPT-4o on 2,306 real, text-only questions (2017–2021) from China’s pharmacist licensure exam, scoring exact-answer accuracy in Chinese.

  • DeepSeek-R1: 90.0% accuracy vs. ChatGPT-4o: 76.1% (p<0.001)
  • Advantages held across units, especially foundational and clinical synthesis
  • Year-by-year multiple-choice trends favored DeepSeek, but unit-year gaps weren’t statistically significant
  • Methods: Pearson’s Chi-squared for overall; Fisher’s exact for yearly unit checks

Takeaway: Domain-focused models can outperform general chatbots on high-stakes, specialized exams—useful for study aids and formative feedback. Still, AI should complement—not replace—licensed experts and oversight in clinical, legal, and ethical decisions.

Study: https://arxiv.org/abs/2511.20526v1

Paper: https://arxiv.org/abs/2511.20526v1

Register: https://www.AiFeta.com

AI LLM Pharmacy Healthcare MedEd EdTech China Exams DeepSeek ChatGPT

Read more