摘要 Abstract
针对K-12学生的教育材料常常包含多种模态(如文本和图像),这对模型全面理解这些材料中的细微信息提出了挑战。本文提出了一种名为UniEDU的统一语言与视觉辅助系统,该系统能够服务于各种教育应用场景,包括知识推荐、知识追踪、时间成本预测以及用户答案预测等,并在一个模型中实现这些功能。与传统的任务专用模型不同,UniEDU提供了一种在多个教育任务中表现优异且具备强大泛化能力的统一解决方案。其适应性强,非常适合在多样化的学习环境中实际部署。此外,通过显著降低计算开销,UniEDU实现了约300%的效率提升,同时与经过完全微调的模型相比,性能下降极小,保持了竞争力。这项工作标志着朝着创建适应教育需求变化的多功能AI系统迈出了重要一步。
Education materials for K-12 students often consist of multiple modalities, such as text and images, posing challenges for models to fully understand nuanced information in these materials. In this paper, we propose a unified language and vision assistant UniEDU designed for various educational applications, including knowledge recommendation, knowledge tracing, time cost prediction, and user answer prediction, all within a single model. Unlike conventional task-specific models, UniEDU offers a unified solution that excels across multiple educational tasks while maintaining strong generalization capabilities. Its adaptability makes it well-suited for real-world deployment in diverse learning environments. Furthermore, UniEDU is optimized for industry-scale deployment by significantly reducing computational overhead-achieving approximately a 300\% increase in efficiency-while maintaining competitive performance with minimal degradation compared to fully fine-tuned models. This work represents a significant step toward creating versatile AI systems tailored to the evolving demands of education.