Brazilian coffee genome project: an EST-based genomic resource.
Vieira, LGE; ANDRADE, AC; COLOMBO, CA; MORAES, AHA; METHA, A; OLIVEIRA, AC; LABATE, CA; MARINO, CL; MONTEIRO-VITORELLO, CB; MONTE, DC; GIGLIOTI, E; FORMIGHIERI, EF; CARAZZOLLE, MF; PEREIRA, GAG Brazilian Journal of Plant Physiology, v. 18, p. 95-108. 2006
Coffee is one of the most valuable agricultural commodities and ranks second on international trade exchanges. The genus Coffea belongs to the Rubiaceae family which includes other important plants. The genus contains about 100 species but commercial production is based only on two species, Coffea arabica and Coffea canephora that represent about 70 % and 30 % of the total coffee market, respectively. The Brazilian Coffee Genome Project was designed with the objective of making
modern genomics resources available to the coffee scientific community, working on different aspects of the coffee production chain. We have single-pass sequenced a total of 214,964 randomly picked clones from 37 cDNA libraries of C. arabica, C. canephora and C. racemosa, representing specific stages of cells and plant development that after trimming resulted in 130,792, 12,381 and 10,566 sequences for each species, respectively. The ESTs clustered into 17,982 clusters and 32,155 singletons. Blast analysis of these sequences revealed that 22 % had no significant matches to sequences in the National Center for Biotechnology Information database (of known or unknown function). The generated coffee EST database resulted in the identification of close to 33,000 different unigenes. Annotated sequencing results have been stored in an online database at http://www.lge.ibi.unicamp.br/cafe. Resources developed in this project provide genetic and genomic tools that may hold the key to the sustainability, competitiveness and future viability of the coffee industry in local and international markets.
Apoio financeiro: CBP&D-Café, EMBRAPA e FAPESP.