数据清洗#
- [2024.02] Clustering and Ranking: Diversity-preserved Instruction Selection through Expert-aligned Quality Estimation 用rm对qa对打分然后排序。pca降维,kmeans聚类。
- [2023.12] What Makes Good Data for Alignment? A Comprehensive Study of Automatic Data Selection in Instruction Tuning deita。complexity, quality, and diversity。用gpt来给指令和QA对打复杂度和质量分,用emb_sim来评估相似度。
- [2023.08] InsTag: Instruction Tagging for Analyzing Supervised Fine-tuning of Large Language Models instag