Research Registry
Browse the research registry
Research registry ID
Date submitted
Project lead
Title
Improved methods for storing and processing large-scale genetic variation data
Community 1
Community 2
Community 3
Lay summary
Current large-scale genetic variation datasets have enormous potential for advancing human health, but storing and analysing them poses major challenges for existing data formats. For example, a simple query to find the genome coordinates of all genetic variants can take several hours to complete. Delays like this hinder researcher productivity and carry a substantial economic cost. We develop improved computational methods that store genetic data in a form that is much faster and cheaper to store and process, and we showcase these improvements on the 100,000 Genomes dataset.