DeepSeek vs. ChatGPT: Who Bests China’s Pharmacist Exam?
Which AI better helps pharmacists-in-training? Researchers tested DeepSeek-R1 and ChatGPT-4o on 2,306 real, text-only questions (2017–2021) from China’s pharmacist licensure exam, scoring exact-answer accuracy in Chinese.
- DeepSeek-R1: 90.0% accuracy vs. ChatGPT-4o: 76.1% (p<0.001)
- Advantages held across units, especially foundational and clinical synthesis
- Year-by-year multiple-choice trends favored DeepSeek, but unit-year gaps weren’t statistically significant
- Methods: Pearson’s Chi-squared for overall; Fisher’s exact for yearly unit checks
Takeaway: Domain-focused models can outperform general chatbots on high-stakes, specialized exams—useful for study aids and formative feedback. Still, AI should complement—not replace—licensed experts and oversight in clinical, legal, and ethical decisions.
Study: https://arxiv.org/abs/2511.20526v1
Paper: https://arxiv.org/abs/2511.20526v1
Register: https://www.AiFeta.com
AI LLM Pharmacy Healthcare MedEd EdTech China Exams DeepSeek ChatGPT