Building instruction-tuning datasets from human-written instructions with open-weight large language models

Youmi Ma, Sakae Mizuki, Kazuki Fujii, Taishi Nakamura, Masanari Ohi, Hinari Shimada, Taihei Shiotani, Koshiro Saito, Koki Maeda, Kakeru Hattori, Takumi Okamoto, Shigeki Ishida, Rio Yokota, Hiroya Takamura, Naoaki Okazaki

The 2nd Conference on Language Modeling (COLM) · July 2025

BibTeX

@inproceedings{ma2025instruction,
  title={Building instruction-tuning datasets from human-written instructions with open-weight large language models},
  author={Youmi Ma and Sakae Mizuki and Kazuki Fujii and Taishi Nakamura and Masanari Ohi and Hinari Shimada and Taihei Shiotani and Koshiro Saito and Koki Maeda and Kakeru Hattori and Takumi Okamoto and Shigeki Ishida and Rio Yokota and Hiroya Takamura and Naoaki Okazaki},
  booktitle={The 2nd Conference on Language Modeling (COLM)},
  year={2025}
}