Teaching LLMs Music Theory with In-Context Learning and Chain-of-Thought Prompting: Pedagogical Strategies for Machines
Abstract
This study evaluates the baseline ability of Large Language Models (LLMs) such as ChatGPT, Claude, and Gemini to learn music theory concepts through in-context learning and chain-of-thought prompting. Using carefully designed prompts (in-context learning) and step-by-step worked examples (chain-of-thought prompting), we explore how LLMs can be taught increasingly complex material and how pedagogical strategies developed for human learners translate to teaching machines. Performance is evaluated on questions from an official Canadian Royal Conservatory of Music (RCM) Level 6 examination, which covers a broad range of topics, including interval and chord identification, key detection, cadence classification, and metrical analysis. We also evaluate the suitability of four music encoding formats for these tasks: ABC, Humdrum, MEI, and MusicXML. All experiments were run both with and without contextual prompts. Without context, ChatGPT with MEI achieves the best score, 52%; with context, Claude with MEI achieves the best score, 75%. Future work will refine the prompts and extend the approach to more advanced music theory concepts. This research contributes to the broader understanding of how to teach LLMs and has applications for educators, students, and developers of AI music tools.
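To make the two experimental conditions concrete, the following is a minimal sketch of how a prompt might be assembled with and without context. It is an illustration only: the question, the guide text, the worked example, and the `build_prompt` helper are all hypothetical and are not taken from the study's actual prompts or from the RCM examination.

```python
# Hypothetical sketch: none of these names or prompt texts come from the study.

# An illustrative question with the music given in ABC notation (C up to G).
QUESTION = "Name the interval between the two notes in this ABC fragment: C G"

# Illustrative context: a short guide (in-context learning) followed by a
# step-by-step worked example (chain-of-thought prompting).
CONTEXT_GUIDE = (
    "An interval is named by its number (letter names spanned, counting both "
    "ends) and its quality (determined by the number of semitones)."
)
WORKED_EXAMPLE = (
    "Example: C up to E.\n"
    "Step 1: Count letter names: C, D, E gives a third.\n"
    "Step 2: Count semitones: C to E is 4 semitones.\n"
    "Step 3: A third spanning 4 semitones is a major third.\n"
    "Answer: major third"
)


def build_prompt(question: str, with_context: bool) -> str:
    """Return the prompt text for one question, with or without context."""
    parts = []
    if with_context:
        # Context condition: prepend the guide and the worked example.
        parts.append(CONTEXT_GUIDE)
        parts.append(WORKED_EXAMPLE)
    # Both conditions end with the question itself.
    parts.append("Question: " + question)
    return "\n\n".join(parts)


if __name__ == "__main__":
    print(build_prompt(QUESTION, with_context=False))
    print("-----")
    print(build_prompt(QUESTION, with_context=True))
```

In this sketch the only difference between the two conditions is whether the guide and worked example are prepended, so any change in a model's answer can be attributed to the added context. The same question could be re-encoded in Humdrum, MEI, or MusicXML to compare formats.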