Data files to integrate¶
All data required to build an InterMine is included in biotestmine/data/malaria-data.tar.gz. Copy this file to your local directory and extract from the archive.
cp biotestmine/data/malaria-data.tar.gz DATA_DIR
cd DATA_DIR
tar -zxvf malaria-data.tar.gz
Edit the project.xml file so that all occurrences of ‘’DATA_DIR’’ point to the your local data directory location.
Data sources¶
malaria-genome¶
The malaria genome as gff3 and fasta, originally downloaded from PlasmoDB
uniprot¶
UniProt XML with protein information and sequences from SwissProt and Trembl. Downloaded from uniprot.org and filtered on taxon id 36329.
gene_ontology¶
The Gene Ontology structure. Downloaded from http://www.geneontology.org/
go_annotation¶
GO term assignments for P. falciparum. Downloaded from http://www.geneontology.org/