File Downloads

Checksums

HistoBase_downloads.md5sum
Text file of MD5 checksums for all of the files on this page. Useful for verifying downloads and checking that you have the latest versions of the files (e.g., using md5sum on Linux or md5 on OS X).

GSC Histoplasma capsulatum G217B assembly and gene predictions

These are the G217B genome assembly and gene predictions referenced in the paper Experimental annotation of the human pathogen Histoplasma capsulatum transcribed regions using high-resolution tiling arrays BMC Microbiology 11:216. They were generated by the Genome Sequencing Center at Washington University and are mirrored here by permission.

F_HCG217B.fasta.041130.gz
11/30/2004 version of the G217B genome assembly, in gzipped FASTA format.
G217B_predicted.gff3
Gene, exon, and CDS coordinates for the 9/21/2005 version of the GSC predicted gene set, relative to the assembly FASTA file, in GFFv3 format.
G217B_predicted.fasta
Translated protein sequences for the 9/21/2005 version of the GSC predicted gene set, in FASTA format.

ucsf_hc.01_1 Transcriptome Assemblies

These are the G217B, G186AR, H88, and H143 transcriptome assemblies generated in the paper Genome-wide reprogramming of transcript architecture by temperature specifies the developmental states of the human pathogen Histoplasma (Gilmore et al, PLoS Genet. 11:e1005395). All assemblies are based on strand-specific paired-end sequencing reads of biological replicates from H. capsulatum growing as yeast or hyphae (4 samples total for each strain).

ucsf_hc.01_1.G217B.gff3
Gene, exon, and CDS coordinates for G217B in GFFv3 format. Coordinates are relative to the GSC genome assembly above (F_HCG217B.fasta.041130.gz)
ucsf_hc.01_1.G217B.transcripts.fasta
cDNA sequences for G217B in FASTA format.
ucsf_hc.01_1.G217B.proteins.fasta
Translated protein sequences for G217B in FASTA format.
ucsf_hc.01_1.G186AR.gff3
Gene, exon, and CDS coordinates for G186AR in GFFv3 format. Coordinates are relative to the BROAD G186AR assembly (as of 6/15/2011)
ucsf_hc.01_1.G186AR.transcripts.fasta
cDNA sequences for G186AR in FASTA format.
ucsf_hc.01_1.G186AR.proteins.fasta
Translated protein sequences for G186AR in FASTA format.
ucsf_hc.01_1.H88.gff3
Gene, exon, and CDS coordinates for H88 in GFFv3 format. Coordinates are relative to the BROAD H88 assembly (as of 6/15/2011)
ucsf_hc.01_1.H88.transcripts.fasta
cDNA sequences for H88 in FASTA format.
ucsf_hc.01_1.H88.proteins.fasta
Translated protein sequences for H88 in FASTA format.
ucsf_hc.01_1.H143.gff3
Gene, exon, and CDS coordinates for H143 in GFFv3 format. Coordinates are relative to the BROAD H143 assembly (as of 6/15/2011)
ucsf_hc.01_1.H143.transcripts.fasta
cDNA sequences for H143 in FASTA format.
ucsf_hc.01_1.H143.proteins.fasta
Translated protein sequences for H143 in FASTA format.

Broad predicted transcriptome

This is the WU24 predicted transcriptome referenced in the paper Chromosome-level genome assembly of a human fungal pathogen reveals clustering of transcriptionally co-regulated genes. (in preparation) . It was downloaded from the Broad Institute on 6/15/2011 and is mirrored here by permission.

histoplasma_capsulatum_nam1_1_transcripts.fasta
Transcript sequences for WU24 in FASTA format.