Welcome to the
Draft Map Portal
Cannabis science is still in it's infancy.
We're changing that.
Here you will find the current status and access to:
The Cannabis Proteome Draft Map
The Cannabis Protein 3D Structural Library
The Cannabis Metabolome Draft Map
The first complete, annotated, and publicly available cannabis genome (completed by NCBI 9/6/19!)
What is Proteomics?
Akin to gene-omics (genomics) protein-omics (proteomics) is the study of all the proteins produced by an organism.
We have utilized high pressure ultra high resolution mass spectrometry (LC-MS/MS) to acquire millions of mass spectra from proteins of dozens of cannabis materials. To date, we have identified over 17,000 unique cannabis gene products present in publicly available "next gen" sequencing sets. You can read about our early results in this biorxiV paper.
What is Metabolomics?
As above, metabolism-omics (metabolomics) is the aim of identifying and quantifying all of the small molecules an organism produces under various conditions.
To date, we have identified over 1,000 distinct small molecules from extracts of various cannabis plants. You can read about some applications of metabolomics in cannabis in this biorxIV paper.
Why do we care about protein 3D structures?
The structure of a protein determines the final role of that gene, transcript, and ultimately of that protein.
Imagine the difference between cooked and uncooked egg white. Egg white consists of Albumin protein. Liquid egg white can carry help support the life of a growing chick within an egg. When egg white is cooked, Albumins lose their natural 3D structure and can not function as they do within the egg.
Very few cannabis proteins have solved 3D structures. By using the Cannabis proteome draft map we can find similar proteins from other plants that have been modeled and make a much larger hypothetical 3D structural library reference. We hope that other researchers can use our library to make informed hypotheses of what these proteins do to better understand the biochemistry of these important plants.
Annotated genome now available!
Very little modern science occurs in isolation. We must build on the work of others and a big part of that is making data publicly available. In July, we approached the US NCBI about assembling all publicly available Cannabis genomes into fully annotated information to facilitate further study into these plants. These files were completed and made publicly available on 9/6/19. We have fully reviewed these and are humbled by the quality of the work. NCBI Canabis Sativa Release 100 is available here.
We are currently updating our Proteomic resources with these new files.
Current Result Summaries
Proteomics Current Progress
Current Status of This Project:
2.3 Million High Resolution MS/MS spectra are available for download
7 Million new spectra will be available soon
Our first pass annotated FASTA contains 13,850 sequences and is available from our download page.
The NCBI Cannabis sativa Release 100 FASTA with 29,232 entries will be live on 9/9/19.
Metabolomics Current Progress
We have identified over 1,000 distinct chemical small molecule features. To date we have assigned hypothetical structural predictions to over 500 of these molecules. Further annotation, characterization, and compound purification is currently underway. An online database of the most recent molecules and identifications is available on our portal access page. All current identifications and supporting information may be downloaded from the Full Data Sets Page
3D Protein Structural Libary
We have identified, by homology and peptide identification, over 900 proteins with characterized 3D protein structures. Get your 3D glasses ready. A file with direct links to 965 online interactive SwissModel proteins may be downloaded from our Full Data Sets page.