File Downloads
Checksums
- HistoBase_downloads.md5sum
- Text file of MD5 checksums for all of the files on this page. Useful for verifying downloads and checking that you have the latest versions of the files (e.g., using md5sum on Linux or md5 on OS X).
GSC Histoplasma capsulatum G217B assembly and gene predictions
These are the G217B genome assembly and gene predictions referenced in the paper Experimental annotation of the human pathogen Histoplasma capsulatum transcribed regions using high-resolution tiling arrays BMC Microbiology 11:216. They were generated by the Genome Sequencing Center at Washington University and are mirrored here by permission.
- F_HCG217B.fasta.041130.gz
- 11/30/2004 version of the G217B genome assembly, in gzipped FASTA format.
- G217B_predicted.gff3
- Gene, exon, and CDS coordinates for the 9/21/2005 version of the GSC predicted gene set, relative to the assembly FASTA file, in GFFv3 format.
- G217B_predicted.fasta
- Translated protein sequences for the 9/21/2005 version of the GSC predicted gene set, in FASTA format.
ucsf_hc.01_1 Transcriptome Assemblies
These are the G217B, G186AR, H88, and H143 transcriptome assemblies generated in the paper Genome-wide reprogramming of transcript architecture by temperature specifies the developmental states of the human pathogen Histoplasma (Gilmore et al, PLoS Genet. 11:e1005395). All assemblies are based on strand-specific paired-end sequencing reads of biological replicates from H. capsulatum growing as yeast or hyphae (4 samples total for each strain).
- ucsf_hc.01_1.G217B.gff3
- Gene, exon, and CDS coordinates for G217B in GFFv3 format. Coordinates are relative to the GSC genome assembly above (F_HCG217B.fasta.041130.gz)
- ucsf_hc.01_1.G217B.transcripts.fasta
- cDNA sequences for G217B in FASTA format.
- ucsf_hc.01_1.G217B.proteins.fasta
- Translated protein sequences for G217B in FASTA format.
- ucsf_hc.01_1.G186AR.gff3
- Gene, exon, and CDS coordinates for G186AR in GFFv3 format. Coordinates are relative to the BROAD G186AR assembly (as of 6/15/2011)
- ucsf_hc.01_1.G186AR.transcripts.fasta
- cDNA sequences for G186AR in FASTA format.
- ucsf_hc.01_1.G186AR.proteins.fasta
- Translated protein sequences for G186AR in FASTA format.
- ucsf_hc.01_1.H88.gff3
- Gene, exon, and CDS coordinates for H88 in GFFv3 format. Coordinates are relative to the BROAD H88 assembly (as of 6/15/2011)
- ucsf_hc.01_1.H88.transcripts.fasta
- cDNA sequences for H88 in FASTA format.
- ucsf_hc.01_1.H88.proteins.fasta
- Translated protein sequences for H88 in FASTA format.
- ucsf_hc.01_1.H143.gff3
- Gene, exon, and CDS coordinates for H143 in GFFv3 format. Coordinates are relative to the BROAD H143 assembly (as of 6/15/2011)
- ucsf_hc.01_1.H143.transcripts.fasta
- cDNA sequences for H143 in FASTA format.
- ucsf_hc.01_1.H143.proteins.fasta
- Translated protein sequences for H143 in FASTA format.
Broad predicted transcriptome
This is the WU24 predicted transcriptome referenced in the paper Chromosome-level genome assembly of a human fungal pathogen reveals clustering of transcriptionally co-regulated genes. (in preparation) . It was downloaded from the Broad Institute on 6/15/2011 and is mirrored here by permission.