Abstract
The sequence of the rice genome holds fundamental information for its biology, including physiology, genetics, development, and evolution, as well as information on many beneficial phenotypes of economic significance. Using a “whole genome shotgun” approach, we have produced a draft rice genome sequence of Oryza sativa ssp. indica, the major crop rice subspecies in China and many other regions of Asia. The draft genome sequence is constructed from over 4.3 million successful sequencing traces with a cumulative total length of 2214.9 Mb. The initial assembly of the non-redundant sequences reached 409.76 Mb in length, based on 3.30 million successful sequencing traces with a total length of 1797.4 Mb from an indica variant, cultivar 93-11, giving an estimated coverage of 95.29% of the rice genome with an average base accuracy greater than 99%. The coverage of the draft sequence, the randomness of the sequence distribution, and the consistency of BIG-ASSEMBLER, a custom-designed software package used for the initial assembly, were verified rigorously by comparison against finished BAC clone sequences from both indica and japanica strains available in the public databases. Overall, 96.3% of full-length cDNAs, 96.4% of STS, STR, and RFLP markers, 94.0% of ESTs, and 94.9% of unigene clusters were identified in the draft sequence. Our preliminary analysis of the data set shows that our rice draft sequence is consistent with the common standard accepted by the genome sequencing community. The unconditional release of the draft to the public also undoubtedly provides a fundamental resource for the international scientific communities to facilitate genomic and genetic studies on rice biology.
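The figures quoted in the abstract can be cross-checked with some back-of-envelope arithmetic. The sketch below is illustrative only and is not the authors' method: the function names are hypothetical, and it assumes that the 95.29% coverage can be read as assembly length divided by total genome size, and that sequence redundancy is simply total read length divided by assembly length.

```python
# Back-of-envelope checks on the numbers quoted in the abstract.
# Assumption (not stated in the source): coverage = assembly length / genome size.

def implied_genome_size_mb(assembly_mb: float, coverage_percent: float) -> float:
    """Genome size implied by an assembly length and a coverage percentage."""
    return assembly_mb / (coverage_percent / 100.0)


def sequence_redundancy(read_total_mb: float, assembly_mb: float) -> float:
    """Average number of times each assembled base was sequenced (fold redundancy)."""
    return read_total_mb / assembly_mb


def mean_read_length_bp(read_total_mb: float, n_traces: float) -> float:
    """Mean usable length per sequencing trace, in base pairs."""
    return read_total_mb * 1e6 / n_traces


if __name__ == "__main__":
    # Values taken from the abstract (cultivar 93-11 data set).
    assembly_mb = 409.76
    coverage_percent = 95.29
    read_total_mb = 1797.4
    n_traces = 3.30e6

    print(f"Implied genome size: {implied_genome_size_mb(assembly_mb, coverage_percent):.1f} Mb")
    print(f"Fold redundancy:     {sequence_redundancy(read_total_mb, assembly_mb):.2f}x")
    print(f"Mean read length:    {mean_read_length_bp(read_total_mb, n_traces):.0f} bp")
```

Under these assumptions the quoted figures are mutually consistent: they imply a genome of roughly 430 Mb, sequenced to a bit more than fourfold redundancy, with usable reads averaging around 545 bp.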
Yu, J., Hu, S., Wang, J. et al. A draft sequence of the rice (Oryza sativa ssp. indica) genome. Chin. Sci. Bull. 46, 1937–1942 (2001). https://doi.org/10.1007/BF02901901