GitHub is updating its data usage policy for GitHub Copilot, allowing the AI coding assistant to leverage user interaction data for training its models. Starting April 24, inputs, outputs, code snippets, and associated context from Copilot Free, Pro, and Pro+ users will be collected for model improvement unless users actively opt out. This move aims to enhance the AI's accuracy and contextual awareness, drawing parallels to earlier improvements seen from using Microsoft employee data.
The policy change, detailed in a blog post, specifies that data such as accepted or modified suggestions, code context, comments, and interaction patterns will be used. This aligns with industry trends in leveraging real-world data to refine AI capabilities. Users who previously opted out will retain their preferences, ensuring their data is not used unless they choose to opt back in.
