CerealsDB: A community resource for wheat genomics

Lead Research Organisation: The Genome Analysis Centre
Department Name: Research Faculty

Abstract

CerealsDB ( http://www.cerealsdb.uk.net/) is one of the most highly accessed wheat web sites with over 5,000 unique accesses per month.

The reasons for CerealsDB's success are clear; it provides the community with sequence and single nucleotide polymorphism (SNP)-based information in a form accessible to and understandable by non-experts.

While CerealsDB is successful, to remain of value to its users, it needs to develop in conjunction with the community's expectations. In addition, CerealsDB needs to expand its remit to include further interaction with other web sites such as the internationally based Wheat Initiative. This development is new to this application as in our previous work we developed tools specifically for the UK community rather than the global wheat community. However, although our previous work did focus on the UK community, the easy to use style of CerealsDB has already resulted in the web site becoming international recognised, with many of its users being from non-UK organisations.

In this further development, CerealsDB will continue work with UK users to develop sequence and SNP-based resources; however, it will also extend its horizons to include interactions with international web-based resources and the exploitation of genomics resources such as the wheat reference sequence. This extension to its remit will result in an expanded audience for CerealsDB and more extensive tools and datasets being made available to the community.

Technical Summary

"A community resource for wheat functional genomics" was one of the first projects to be funded by the BBR initiative. The resources enabled Bristol and Rothamsted Research to develop "CerealsDB" and "Monogram". CerealsDB and Monogram are accessed directly or via other web sites such as the Wheat Improvement Strategic Program (WISP), the International Wheat Initiative and GrainGenes. CerealsDB hosts tools and resources to access and analyse sequence and SNP-based datasets for wheat (Triticum aestivum) and related species. CerealsDB assists breeders and academics in exploring the wheat genome and selecting strategies for genotyping and marker assisted selection. CerealsDB includes a database in excess of 100,000 varietal SNPs, of which several thousand have been validated and mapped on one of three mapping populations. CerealsDB also contains information on DArT markers and Expressed Tagged Sequences (ESTs), and was the first site to host the 5x genome sequence for the variety Chinese Spring released in 2010. Since launch, CerealsDB has been updated regularly, with the last update on the 1st May 2013 with the addition of a further ~1,500 mapped SNP markers and a mapping function which assigns unmapped SNPs to putative homoeologous specific chromosome arms.

To ensure that CerealsDB remains on of the most useful wheat-based web sites and to increase the communities awareness of the resources hosted, we intend to develop CerealsDB to included further tools such as an expanded SNP/genotype database, including iSelect and Axiom based datasets along with facilities to identify and characterise functional SNPs. In addition, new releases will include graphic tools to enable breeders and academics to link mapped and non-mapped SNPs/genes across a variety of wheat related species. Finally, although CerealsDB has always been designed to be easy to use, we will provide training across the range of activities included within the web site.

Planned Impact

The BBSRC web site states that "BBSRC is the principal funder of food-related research in the UK and has food security as one of its key strategic drivers." In the UK wheat is, by a significant margin, the most valuable arable crop and therefore not surprisingly the status of the UK wheat crop is a national issue often reported in the press. While BBSRC funds a large number of wheat-related projects, for the most part, it is the translation of this research into wheat varieties with increased yield or improved qualities such as improved bread making qualities that provides the impact required by BBSRC if it is to deliver its strategic objectives. Translation of wheat research in line with BBSRC's strategic priorities, almost always involves either mapping and cloning genes or crossing alleles of interest into elite varieties. In all cases, since its launch, CerealsDB has helped researchers to achieve impact by providing wheat geneticists with unencumbered sequences and SNP-based information which is easy to access and free from IP or MTAs.

The development of CerealsDB has led to strategic changes in the approaches taken by various wheat breeders, for instance many regard the development of CerealsDB as a significant factor in their decision to convert their marker labs to the SNP-based markers.

In accordance with BBSRC strategy to achieving impact, Prof. Edwards has actively engaged with industry, for instance through the CIRC program and via his participation in a BBSRC science interchange programme with Advanta-Nickerson (now Limagrain) Seeds to exchange technologies and ideas. In addition the university base of the Bristol group and the institutional base of TGAC allows each to incorporate its research directly into undergraduate, post-graduate and post doc teaching. This approach has been remarkably successful and has resulted in numerous young researchers being placed within agricultural institutes or companies as PhD students, researchers or employees.

The facilities offered by CerealsDB are unique. We know that this because our users tell us. Although CerealsDB is international recognised as being a primary source of wheat sequences and SNP-based markers, we believe that by adding the facilities described, Bristol will ensure that it increases its international profile and continues to provide a valuable (almost) one stop shop for wheat functional genomics. In so far as we are aware no similar commercially available resources exist, however, it is important to note that our web site is used by both academic and the industrial sectors with several companies relying on it for a substantial part of their business and many of our improvements have been carried out at the suggestion of the UK wheat breeding sector.

Publications


10 25 50
Wilkinson PA (2016) CerealsDB 3.0: expansion of resources and data integration. in BMC bioinformatics
 
Description Not currently applicable
Exploitation Route Not currently applicable
Sectors Agriculture, Food and Drink
URL http://www.cerealsdb.uk.net
 
Title Grassroots API 
Description The Grassroots Infrastructure project aims to create an easily-deployable suite of computing middleware tools to help users and developers gain access to scientific data infrastructure that can easily be interconnected. With the data-generative approaches that are increasingly common in modern life science research, it is vital that the data and metadata produced by these efforts can be shared and reused. The Grassroots Infrastructure project wraps up industry-standard software tools with a consistent API that can be federated on a number of levels. This means institutions and groups can deploy a simple lightweight virtual machine, expose local data, connect up any existing data services, and federate their instance of the Grassroots with others out-of-the-box. 
Type Of Technology Software 
Year Produced 2015 
Open Source License? Yes  
Impact The Grassroots API powers the public BLAST service that runs at TGAC, predominantly for the currently available wheat assemblies including the recently released TGAC v1 w2rap assembly (in preparation). We have served over 4000 unique users with over 6000 BLAST jobs since November 2015. It also underpins the Field Pathgenomics project (BBSRC IPA award 2015, PI - Saunders D., TGAC/JIC fellow), a web portal that represents the detection and subsequent phenotyping and genotyping of the wheat yellow rust pathogen. The site aims to enable researchers and breeders to track rust epidemics over variety and time, allowing for a more proactive approach to wheat crop breeding and farming. Finally, we are working with the CerealsDB group at Univ. Bristol to deploy the Grassroots infrastructure alongside the CerealsDB web portal, allowing a federation of searching, datasets, analysis and dissemination of markers, genotypes and associated feature and literature information. 
URL https://wheatis.tgac.ac.uk/grassroots/api/