COVID-19 Genomics UK Consortium
The COVID-19 Genomics UK (COG-UK) consortium was a group of academic institutions and public health agencies in the United Kingdom created in April 2020[1][2][3] to collect, sequence and analyse genomes of SARS-CoV-2 at scale, as part of COVID-19 pandemic response.
Established
The genome data generated by COG-UK was integrated with epidemiological data and patient health records to monitor introductions into the UK, community transmission and outbreaks of SARS-CoV-2; to assess changes in transmissibility and virulence; and to evaluate the impact of treatments and non-pharmaceutical interventions. COG-UK members also undertook research that integrated human genomic and health data to understand the biology of SARS-CoV-2 and its impact on those infected.[4]
The consortium identified the SARS-CoV-2 Alpha variant (at the time, referred to as Variant of Concern 202012/01) in November 2020, which became the subject of subsequent investigations by the UK public health agencies, coordinated by Public Health England and supported by COG-UK.[5][6]
Between April and September 2021, SARS-CoV-2 sequencing transitioned to become a public health-led national service[7] after which COG-UK focused on data linkage, research and international training.[8]
The consortium formally closed at the end of March 2023.[9]
Funding
COG-UK was supported by £20 million funding from the Department of Health and Social Care, UK Research and Innovation (UKRI), and the Wellcome Sanger Institute.[1]
The consortium received a further £12.2 million from the Department of Health and Social Care's Testing Innovation Fund in November 2020 to facilitate the genome sequencing capacity needed to meet the increasing number of COVID-19 cases in the UK over the 2020-2021 winter period.[10]
Together with Wellcome Connecting Science (WCS), COG-UK was also awarded a Foreign, Commonwealth & Development Office/Wellcome Epidemic Preparedness Coronavirus grant of nearly £1 million to develop COG-Train, a learning programme to support the global scientific and public health community in SARS-CoV-2 genome sequencing.[11]
Structure[edit]
The consortium comprised the four UK public health agencies, the Wellcome Sanger Institute, sixteen academic partners, and National Health Service organisations. It adopted a Hub and Spoke model with a centralised administrative hub (University of Cambridge), a national sequencing hub (Sanger Institute), and sequencing and analysis capacity regionally distributed across all partners.[12]
Samples from COVID-19 patients were collected by NHS laboratories, public health laboratories and national coronavirus testing centres and sent to partner sites for sequencing. Viral genome sequencing was undertaken by the four UK public health agencies, the Wellcome Sanger Institute, the Quadram Institute, and fifteen universities including Queen's University Belfast, the University of Birmingham, Cardiff University, the University of Cambridge, the University of Edinburgh, the University of Exeter, the University of Glasgow, the University of Liverpool, Northumbria University, the University of Nottingham, the University of Oxford, the University of Portsmouth, University College London, Imperial College London and the University of Sheffield.[13]
The rapid launch and delivery of SARS-CoV-2 genome sequencing data was enabled by use of the pre-existing infrastructure and expertise of COG-UK partners and members. For example, COG-UK relied on the Cloud Infrastructure for Microbial Bioinformatics (CLIMB),[14] which was launched in 2014 with support from the Medical Research Council. CLIMB is an open, cloud-based computing infrastructure for developing and sharing datasets and bioinformatics software, tools and methods to interpret ‘big data’.[15] CLIMB is a partnership between the Universities of Bath, Birmingham, Cardiff, Leicester, Swansea and Warwick, the London School of Hygiene and Tropical Medicine and the Quadram Institute.[16]
Genome data for each positive sample sequenced was linked to the person who provided the sample using an anonymous code. These were uploaded to CLIMB-COVID where the genomes could be characterised, which included assigning each genome to a lineage and detecting mutations. Public health agencies were then able to access this information and link it to detailed public health information.[17]
Key people[edit]
The executive director and chair of the consortium was Sharon Peacock, a professor and microbiologist at the University of Cambridge.[18][19]
More than 600 consortium members contributed to the work of COG-UK between April 2020 and the end of March 2023[1], with key roles fulfilled by researchers from across the consortium partners [2], and a management team based at the University of Cambridge [3].
COG-UK pioneered early and large-scale coordinated national sequencing of SARS-CoV-2 viral genomes, along with the open and rapid sharing of genomic data.
By December 2020, COG-UK had sequenced the genomes of more than 150,000 samples of SARS-CoV-2 virus and uploaded them to GISAID, representing around 5% of all UK COVID-19 cases. Approximately 60% of these were sequenced at the Wellcome Sanger Institute. By December 2021, the UK had sequenced over 1.8 million SARS-CoV-2 genomes.
The insights arising from analysis of these genome data had a range of impacts during the COVID-19 pandemic response:[20]
By September 2021, COG-UK pivoted to focus on three strategic areas:[8]
Data linkage & Analysis
By linking and analysing different types of data, it is possible to undertake more in depth research and generate new understanding about the differences in the transmission of SARS-CoV-2 variants and the disease that they cause.
COG-UK collaborated with partners through the UK Health Data Research Alliance and the Outbreak Data Analysis Platform (ODAP) to link viral genome data with other complex datasets.[35][36]
Through these collaborations, SARS-CoV-2 sequencing data has been linked to a number of different datasets including:
All such linked data are stored securely with controlled access through approved pathways to maintain patient privacy and ensure appropriate usage and attribution.[37]
Research
COG-UK provided seed funding for consortium members to undertake research projects that would generate novel insights relevant to the COVID-19 pandemic, and to prepare for future pandemics.[38]
These projects focussed on:
COG-UK also provided seed funding for consortium members to support early career scientists reinvigorate research projects delayed owing to time spent on the pandemic response.[39]
Training
Together with Wellcome Connecting Science, COG-UK established a free international online learning programme in using genomics to investigate SARS-CoV-2 and other infectious diseases.
Known as COG-Train, the programme sought to ‘facilitate an increase global genome sequencing and analysis capacity, reduce sequencing inequality and enhance pathogen surveillance’. Its training courses were built through partnerships with international researchers, public health experts and surveillance networks.
Outputs of COG-Train included a series of massive online, open-access courses on all aspects of SARS-CoV-2 sequencing, as well as week-long intensive virtual training courses, short expert workshops and concurrent distributed classrooms.[40]
Funded by the Wellcome Trust and the Foreign, Commonwealth & Development Office, COG-Train delivered free online training to thousands of learners in more than 100 countries worldwide. Its three week course ‘The Power of Genomics to Understand the COVID-19 Pandemic’ remains available through the FutureLearn platform.
Protocols and tools developed by COG-UK consortium members were used globally during the first three years of the COVID-19 pandemic and continue to be of utility today. These include:
By December 2020, the number of sequences uploaded to GISAID by COG-UK was just under 5% of all UK COVID-19 cases, compared to 3.2% for the United States and 60% for Australia.[19] Approximately 60% of these were sequenced at the Wellcome Sanger Institute[18] and the COG-UK consortium was reported to have understood 'the genetic history of more than 150,000 samples of SARS-CoV-2 virus'.[49]
According to the COG-UK website, by December 2021, the UK had sequenced over 1.8 million SARS-CoV-2 genomes.[50]
Seminars and showcases
COG-UK ran a series of events to share the advances and discoveries arising from the work of consortium members and to provide forums for discussion. These included seminars and science showcases,[51][52] where consortium members have shared their analyses and insights with each other and the public.
In October 2021, the COG-UK Together event saw members from across the consortium join a mixed online/in person event to share their experiences during the first 18 months of the pandemic.[53]
Women in COG
Consortium members also organised and hosted the ‘Women in COG’ interview series, which showcased the lives and work of inspirational women (and male supporters/allies) from the COG-UK network and outside of the consortium. Interviewees included:
This series was also distilled into a book titled ‘Snapshots of Women in COG: Scientific Excellence during the COVID-19 pandemic’.