Data - 大模型笔记

大模型笔记

理论#

[2026.01] From Entropy to Epiplexity: Rethinking Information for Computationally Bounded Intelligence

数据清洗#

[2025.04] Meta-rater: A Multi-dimensional Data Selection Method for Pre-training Language Models 四维质量评估（PRRC）; Meta-rater 方法训练多个代理小模型从多个维度打分，最后选出综合质量更高的数据。
[2024.02] Clustering and Ranking: Diversity-preserved Instruction Selection through Expert-aligned Quality Estimation 用rm对qa对打分然后排序。pca降维，kmeans聚类。
[2023.12] What Makes Good Data for Alignment? A Comprehensive Study of Automatic Data Selection in Instruction Tuning deita。complexity, quality, and diversity。用gpt来给指令和QA对打复杂度和质量分，用emb_sim来评估相似度。
- 论文解读：如何自动选择SFT数据
- LLM模型之高质量数据选择和微调方法
[2023.08] InsTag: Instruction Tagging for Analyzing Supervised Fine-tuning of Large Language Models instag

主动学习#

« Previous Next »