Gene PICST_36859 details

Gene Information

Locus tag: PICST_36859
Symbol: HEM15
ID: 4840160
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009046
Strand:
Start bp: 1563846
End bp: 1564910
Gene Length: 1065 bp
Protein Length: 354 aa
Translation table: 12
GC content: 44%
IMG OID: 640391475
Product: ferrochelatase precursor
Protein accession: XP_001385655
Protein GI: 150866158
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0276] Protoheme ferro-lyase (ferrochelatase)
TIGRFAM ID: [TIGR00109] ferrochelatase
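
The coordinates and lengths listed above agree with each other, which a little arithmetic can confirm. The sketch below assumes the start and end positions are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon; the variable names are illustrative only and are not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 1563846
end_bp = 1564910

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the CDS is unspliced and its final codon is a stop codon,
# so the protein has one residue fewer than the number of codons.
codons = gene_length_bp // 3
protein_length_aa = codons - 1

print(gene_length_bp, protein_length_aa)  # 1065 354
```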


Plasmid Coverage information

Num covering plasmid clones: 37
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAATATGG GAGGTCCTTC TACAGTGTCT GAAACTCACG ACTTTCTTTT CAGGCTCTTT 
TCAGATGGAG ACTTAATTCC GTTTGGACCT TTCCAAAATA TCCTTGCCAA ATGGATTGCC
AGAAGAAGAA CACCCAAGAT CGAGGAGCAT TATAAAGAGA TTGGTGGAGG TTCTCCTATT
CGTTACTGGT CCGAATTCCA GTGCAAGAGA GTGTGTGAGA TTCTAGACAA ATCGAATCCA
GAAACAGCTC CTCATAAACC ATACGTAGCT TTCAGATATG CAAAACCATT GACGGAAGAT
ACGCTCCAAC AGATGTTGGA TGACGGGGTT AAGCGTGCTG TGGCCTTCTC CCAGTATCCC
CAGTTCTCAT ACTCTACGAC TGGCTCTTCT ATCAACGAAT TGTATAGACA AACATTGCAA
CTCGATCCAG ACCGTAGGAT CAACTGGTCT GTAATCGACA GATGGCCCAA GGATAAAGGT
TTAGTTTCAG CCTTTTGTAC TCACATCAAC GACAAACTCA CTGAGTTCCC TGCCGAAGAC
AGAGACAAAA TTGTTTTGTT GTTTTCGGCT CATTCATTGC CAATGGAGAT CGTCAACAAG
GGTGATTCGT ATCCTGCTGA GGTGGCGGCC ACAGTCTACG CCATCATGGA AAAGTTGAAA
TTCCTGTTGC CCTACAGATT GGTATGGCAA TCACAAGTTG GACCCAAGCC TTGGTTGGGT
GGGCAAACTG CTAAGATCAC AGGAAAGCTC GATCTCAGAG ACGATATCAA AGGCATCATC
TTAGTTCCTG TAGCGTTCAC TTCTGACCAC ATTGAAACGC TCCATGAGTT GGATATCGAG
TTGGTGGAAG ATCTCAAGAA TCCAGAAAAG GTTAAAAGAG CCGCTTCATT GAACGGAAGT
GAGATATTTA TTGAAGGTTT GGCTGATTTG GTCAAGAATC ACTTGCAGTC GGGTAAATTG
TACTCCACAC AATTGGAATT GGATTGTAAG TTAGGTGGAG AAACCGCGAA GAATACTTTC
AAGCATCCCA GCGAGTTATT TGGAGACCAC TCGAAATCCC TGTAA
 
Protein sequence
MNMGGPSTVS ETHDFLFRLF SDGDLIPFGP FQNILAKWIA RRRTPKIEEH YKEIGGGSPI 
RYWSEFQCKR VCEILDKSNP ETAPHKPYVA FRYAKPLTED TLQQMLDDGV KRAVAFSQYP
QFSYSTTGSS INELYRQTLQ LDPDRRINWS VIDRWPKDKG LVSAFCTHIN DKLTEFPAED
RDKIVLLFSA HSLPMEIVNK GDSYPAEVAA TVYAIMEKLK FSLPYRLVWQ SQVGPKPWLG
GQTAKITGKL DLRDDIKGII LVPVAFTSDH IETLHELDIE LVEDLKNPEK VKRAASLNGS
EIFIEGLADL VKNHLQSGKL YSTQLELDCK LGGETAKNTF KHPSELFGDH SKSS
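
The record lists translation table 12, NCBI's alternative yeast nuclear code, in which CTG is read as serine rather than leucine. This is why the final CTG codon of the gene sequence appears as S at the end of the protein sequence. The sketch below is a minimal, self-contained illustration using only the first 30 and last 15 bases of the gene sequence; the helper `translate` and the table names are hypothetical, not part of any database tool.

```python
from itertools import product

# Standard genetic code (NCBI table 1), with codons ordered T, C, A, G.
BASES = "TCAG"
STANDARD = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = ["".join(c) for c in product(BASES, repeat=3)]
TABLE_1 = dict(zip(CODONS, STANDARD))

# Table 12 differs from the standard code only in CTG, which is read as Ser.
TABLE_12 = dict(TABLE_1, CTG="S")


def translate(dna, table):
    """Translate a DNA string codon by codon; spaces are ignored."""
    dna = dna.replace(" ", "").upper()
    usable = len(dna) - len(dna) % 3  # drop any trailing partial codon
    return "".join(table[dna[i:i + 3]] for i in range(0, usable, 3))


# Fragments copied from the gene sequence above.
first_30 = "ATGAATATGG GAGGTCCTTC TACAGTGTCT"
last_15 = "TCGAAATCCC TGTAA"

print(translate(first_30, TABLE_12))  # MNMGGPSTVS
print(translate(last_15, TABLE_12))   # SKSS* (matches the listed protein end)
print(translate(last_15, TABLE_1))    # SKSL* (standard code, for comparison)
```

Translating the full gene would work the same way; only short fragments are used here to keep the example easy to check by eye.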