Training a large language model (LLM) on responses from other LLMs has some potential benefits but also notable risks:
Potential Benefits:
- Data Augmentation: Responses from other LLMs can serve as an additional data source, increasing the volume of data and helping fill gaps in specific areas.
- Faster Training: Using pre-generated responses from other models could reduce time spent collecting or curating human-generated data.
- Task Specialization: If the other LLM is specialized (e.g., in coding or customer support), training on its outputs might help improve performance in that domain.