Saturday 13 July 2024
  • La Cimbali

illycaffè and Lavazza release first full genome sequence for coffee

Must read


A partnership led by illycaffè and Lavazza, together with Istituto di Genomica Applicata, IGA Technology Services, DNA Analytica, and the universities of Trieste,  Udine, Padova, and Verona, has released the results of the Coffea arabica Genome Sequencing Project.

This unique achievement in genome research will accelerate scientific efforts to ensure the future of coffee agriculture, which is threatened by climate change.

World Coffee Research (global coffee industry R&D nonprofit organization) is making the genome sequence available to the public via its website today (

This is the first genome sequence for Coffea arabica fully available to the public.

The study was coordinated by professor Giorgio Graziosi of DNA Analytica Srl. The partnership of Italian researchers, along with world coffee market leaders illycaffè and Lavazza, is an important example of collaboration between the private and public sectors.

Coffee is worth about $160 billion annually and employs more than 25 million families of farmers globally, according to the International Coffee Organization.

Public-private collaboration to address threats of climate change

“The genome research is not only a brilliant example of public-private sector collaboration,” says Andrea Illy, Chairman of illycaffè, “but is also an important step to support coffee growers around the world, who are already facing the damages of climate change.

As indicated by another common research project, also developed with the colleagues from Lavazza together with Earth Institute of Columbia University led by Jeffrey Sachs in 2015, the suitable land for Arabica coffee may be reduced by as much as half by 2050 due to climate change, at the same time as global demand is expected to nearly double.

Research and innovation is one of the most important ways to fight this threat. Making the research results available to everyone is the right thing to do and will maximize the impact of the global effort to make coffee more sustainable.”

Better coffee on the horizon

“We are proud of having contributed to the unveil of the arabica DNA, an incredibly important initiative for the coffee sector conducted by a multi-functional team of researchers. The results of this project highlight the importance of working in a pre-competitive approach, which in turn will help improve the entire coffee production supply chain,” comments Giuseppe Lavazza, Lavazza Vice Chairman.

“The sequencing of the coffee genome gives us the ability to ‘read’ the plant and precisely identify its origins as well as determine, for example, the genes that give it a certain resistance to diseases or infections. This could result in a superior quality coffee end product based on objective criteria. Indeed, excellent quality is the ultimate goal our company has always pursued, and which is the focus for this research project.”

Coffea arabica is one of two species in the Coffea family consumed globally. It is renowned for its high quality and represents over 60% of coffee production in the world.

For the benefit of coffee producers

“We are thrilled to be able to convey this Arabica genome to the global coffee and research community freely and openly,” says Tim Schilling, CEO and founder of World Coffee Research.

“Advanced genetics research is essential to coffee’s future as a sustainable crop, and to exploring the thrilling diversity of flavors found in coffee. Having access to a whole sequenced genome is an essential precursor to unlocking the potential of genetics research to transform coffee production. Utilizing advancements in DNA science for the benefit of coffee producers around the world is the very reason that a collaborative industry research nonprofit like World Coffee Research exists. Our scientists are looking forward to working with other organizations, countries and governments to make use of the treasures within this genome to make coffee more profitable to farmers and better tasting for consumers.”

In many tropical countries where coffee is grown, coffee production is an essential contributor to the national economy, as well as being one of the world’s most important agricultural commodities. Additionally, it supports the livelihoods of an estimated over 25 million smallholder farmers around the world (source ICO). Improvements in coffee production due to advanced applied genetics will have an important positive economic impact globally.

 Advancing sustainable coffee production

According to the Food and Agriculture Organization, “Genome sequencing has the power to revolutionize food security and sustainable agriculture.” The sequencing of the Arabica coffee genome enables coffee breeders to accelerate efforts to increase the productivity, quality, and profitability of coffee growing worldwide. Most countries that grow predominantly Arabica coffee have already seen declining production over recent decades.

Having a genome sequence available enables researchers to understand and target key agronomic traits that matter to farmers and coffee drinkers—for example, better or novel flavors, uniform fruit ripening, resistance to disease, adaptation for the hotter, drier climates of the future, or adaptability to growth under shade. Such efforts are expected to have considerable benefits for coffee farmers, coffee drinkers, and the environment.

Genome sequences for other crops have allowed breeders to identify the genetic and molecular mechanisms that synchronize the ripening process, improve plants’ resistance to devastating diseases and infections, and to adapt plants for changing climate conditions, including rising temperatures, prolonged droughts, and heavy flooding.

Untangling a complex mystery

“This highly ambitious research project has resulted in the sequencing and reconstruction of the genome of Coffea arabica, an usual plant which has a duplicate set of chromosomes compared with other main cultivated species, Coffea canephora (also commonly called Robusta),” explains professor Michele Morgante, scientific director of the Istituto di Genomica Applicata.

“One of the main difficulties was to distinguish between the sequences derived from the two progenitor genomes of Arabica, Coffea canephora and Coffea eugenioides, which are extremely similar. To tackle this problem, we used a hierarchical sequencing approach, in which the genome is divided into relatively small portions before being reconstructed. We are extremely proud to be the first to release the sequence of the Arabica genome and make it available to the scientific community without any restriction on its use.”

Coffea arabica is a genetically complex species, carrying four copies of each of 11 chromosomes (44 total). Scientists call this a tetraploid species. Arabica is the only tetraploid species in the Coffea family. Technically it is described as an allotetraploid genome, the result of a hybridization between diploid parents Coffea canephora and Coffea eugenioides.

This genome sequence was derived from a Coffea arabica plant of the Red Bourbon variety. Fresh ripe coffee cherries were sourced from a coffee plantation in the Ahuachapan region of El Salvador and used as starting material for DNA extraction.

The genome was sequenced with Illumina technology at the Istituto di Genomica Applicata in Udine, Italy. Given the inherent complexity of working with a tetraploid genome, it was sequenced using a “hierarchical” approach instead of the more common whole genome shotgun approach. The genome was annotated from the Universities of Padova and Verona. The annotation was supported by RNA sequencing from 12 different samples derived from 8 different organs.

Key numbers and facts:

  • 36,864 genomic fragments were cloned into bacterial artificial chromosomes (BACs) and sequenced in 96 pools of 384 clones
  • 488 billion base pairs were produced, corresponding to 132 genome equivalents
    The genome size was estimated to be 1.3 Gb, based on a k-mers analysis
  • 96 independent assemblies were generated, using the software programs ABySS and SSPACE, and then merged to generate a multifasta file.
  • The sequence contains 1.51 billion base pairs, divided into 164,254 scaffold sequences
  • 78.311 genes were predicted and functionally annotated in Coffea arabica
    The first fully open open-access Arabica genome

This is the first time that the raw data for a Coffea arabica genome is being made publicly accessible. T

he open-access data files can be downloaded by researchers anywhere in the world on the World Coffee Research website ( Tools to enable browsing of the genome will be released in the weeks ahead.

  • Franke Mytico
  • TME - Cialdy Evo

Latest article