UK Biobank Health Data Leaked on GitHub 80 Times Despite Takedown Requests
Researchers have accidentally uploaded confidential UK Biobank health data to GitHub at least 80 times, forcing the organization to issue legal takedown requests. The data leaks continued despite removal efforts, with some files remaining available on archive websites.

UK Biobank, which holds health data from over 500,000 people, has been fighting a losing battle against accidental data leaks. Researchers working with the sensitive information have repeatedly uploaded datasets to GitHub, a popular code-sharing platform, exposing confidential medical records.
The problem became clear in 2024 when investigations revealed the extent of the breaches. UK Biobank has issued at least 80 legal takedown requests to remove the exposed data from GitHub. Despite these efforts, many files remained accessible through code archive websites even after the original uploads were removed.
The leaks appear to follow a pattern. Data from one monitoring system shows requests stopped during January, February, and most of March 2026, suggesting either better security measures or gaps in detection rather than an end to accidental uploads.
Experts now question whether UK Biobank can fully regain control of the released information. Once health data spreads online, it becomes nearly impossible to completely erase from all platforms and archives.
Your health information could be at risk if you're part of medical research. These leaks show how personal medical data can end up online by mistake, potentially affecting insurance or employment if it falls into the wrong hands.
Watch for new data protection measures from UK Biobank and whether legal action follows against researchers who leaked the information.
Was this article helpful?
0 people found this helpful