Gene PICST_86063 details


Gene Information

Locus tag: PICST_86063
Symbol: HMX1
ID: 4850975
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009068
Strand:
Start bp: 601200
End bp: 602345
Gene Length: 1146 bp
Protein Length: 292 aa
Translation table:
GC content: 43%
IMG OID: 640392683
Product: heme binding protein
Protein accession: XP_001387756
Protein GI: 126273936
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG5398] Heme oxygenase
TIGRFAM ID:
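
The listed lengths can be cross-checked against the coordinates with simple arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive (which is consistent with the listed Gene Length) and that the coding region ends in a single stop codon; the function names `span_length` and `cds_length` are illustrative and not part of any database API.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def cds_length(protein_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode a protein, optionally counting one stop codon."""
    codons = protein_aa + (1 if include_stop else 0)
    return 3 * codons


gene_len = span_length(601200, 602345)  # values from the record above
cds_len = cds_length(292)               # Protein Length from the record above

print(gene_len)            # 1146
print(cds_len)             # 879
print(gene_len - cds_len)  # 267 bp of the listed span lie outside the coding region
```

Under these assumptions the 1146 bp span is longer than the 879 nt needed for a 292 aa protein plus stop codon, so the listed gene sequence would include non-coding sequence flanking the open reading frame.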


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.507414
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0416913
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
CAGAAAACTG CCAATATCCG TTTGACATTT GACTTTTTTG TCTGTAATTT TCAAACATTA 
GCTAGCAGAA TTTTCCCCTG ATCGACTTTC CCATATCTCG ATCTTTTTCG CATCTTAAAT
CGCATAGTAT CATAATGTCC AAGGTACAAA ACTCCGGCGC CACCACCAAG CTTTCTCAAC
ACGAGATTCT TCCGGCCAAG AACGACATTG GCGCTCTTGC CAACAGAATC AACTCCGAAA
CAAGATCTCT CCATGACAAA GTCGACAAGT TGGTCACCCT CAAGATGGCC CTCGCACTCA
GAGACGGCAA GATCTACAGA CAGGGCTTGC AGAGTTTCTA CCATGTTTTT GCATCCATCG
AAAAATCGCT CCACGCCCAG CTCGAGAAGG ACGACGAATG GACGCCAATG TTGAAGAGCG
TGTGGAAGCC AGAAATTGCT AGACGTGAGA AGGCAGAACA AGACTTGTTG TTCTACTACG
ATGACAGAAA GGAAAAGTTT GTCAACCCTA TCATGCCAGA GCAGATCGCA TTTGCCAATC
ACATCTTGGA AGGCACTGCC GAAAAGCCAT ACTTGCTCTT TGCCTACTTG CATGTTATGT
ACTTGGCCTT GTTTGCCGGT GGAAGAATCA TGAGATCGTC TTTCGCAAAG GCTACTGGCT
TGTTCCCACA CAAGAACGGC TTGTCCCACG AAGAAATCGT TAAGTTGGGA ACGAATTTCT
TCACGTTCGA TGTTGCTGAC GAGAACTTGC TCAGAATGAT CTACAAGAGA GACTACGAGC
TTGTCACCAG AAACGGTCTT ACTGAAGAAC AAAAATTGGA AATCATTGAA GAATCAAAGT
ATATTTTTGA ACAGAACGCT AAGTGTATAG TCGAGCTTGA AGCCCACAAC ATGGCCAGAT
TAAAGCTGAA GTGGTCCTAC TTGGCTGTCA CTAAGGGTTA CCAAGCCCTC TTGGTTATCC
TTGCCTTGCT TGCATTGGAA TACGTCAGAA GATTCATCTA CAGCTTTGCT TAGAGAATAT
TGAATATTCA ATCCACAGCA AACAAATTTC AGTTCTATTT CTTTTTATTA TTAGTTATAC
CTTTTGATTG CATATCTATT CACCAGTGAG GTTTGGTTGG GATGGTGTTT AATATAAACT
GCCAGT
 
Protein sequence
MSKVQNSGAT TKLSQHEILP AKNDIGALAN RINSETRSLH DKVDKLVTLK MALALRDGKI 
YRQGLQSFYH VFASIEKSLH AQLEKDDEWT PMLKSVWKPE IARREKAEQD LLFYYDDRKE
KFVNPIMPEQ IAFANHILEG TAEKPYLLFA YLHVMYLALF AGGRIMRSSF AKATGLFPHK
NGLSHEEIVK LGTNFFTFDV ADENLLRMIY KRDYELVTRN GLTEEQKLEI IEESKYIFEQ
NAKCIVELEA HNMARLKLKW SYLAVTKGYQ ALLVILALLA LEYVRRFIYS FA
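
The sequences above are displayed in blocks of ten characters, so they need to be joined before use in most tools. The following is a minimal sketch of one way to do that and to confirm the residue count against the listed Protein Length; the helper name `join_blocks` is hypothetical.

```python
import re

# Protein sequence copied from the record, still in its 10-character display blocks.
PROTEIN_BLOCKS = """
MSKVQNSGAT TKLSQHEILP AKNDIGALAN RINSETRSLH DKVDKLVTLK MALALRDGKI
YRQGLQSFYH VFASIEKSLH AQLEKDDEWT PMLKSVWKPE IARREKAEQD LLFYYDDRKE
KFVNPIMPEQ IAFANHILEG TAEKPYLLFA YLHVMYLALF AGGRIMRSSF AKATGLFPHK
NGLSHEEIVK LGTNFFTFDV ADENLLRMIY KRDYELVTRN GLTEEQKLEI IEESKYIFEQ
NAKCIVELEA HNMARLKLKW SYLAVTKGYQ ALLVILALLA LEYVRRFIYS FA
"""


def join_blocks(text: str) -> str:
    """Strip all whitespace (spaces and line breaks) and upper-case the sequence."""
    return re.sub(r"\s+", "", text).upper()


protein = join_blocks(PROTEIN_BLOCKS)

print(len(protein))   # 292, matching the listed Protein Length
print(protein[:10])   # MSKVQNSGAT
```

The same helper works unchanged on the nucleotide blocks of the gene sequence.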