GitHub describes this training data as inputs, outputs, code snippets, and associated context, but the fine print goes into ...