Gene EcSMS35_0520 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_0520 
SymbolhemH 
ID6143840 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp526264 
End bp527226 
Gene Length963 bp 
Protein Length320 aa 
Translation table11 
GC content54% 
IMG OID641615413 
Productferrochelatase 
Protein accessionYP_001742620 
Protein GI170680326 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0276] Protoheme ferro-lyase (ferrochelatase) 
TIGRFAM ID[TIGR00109] ferrochelatase 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.0674477 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones61 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGTCAGA CTAAAACCGG TATCCTGCTG GCAAACCTGG GTACGCCCGA TGCCCCCACA 
CCTGAAGCGG TAAAACGCTA TCTGAAACAA TTTTTAAGCG ACAGACGCGT GGTTGATACC
TCACGGTTGT TATGGTGGCC GTTGCTGCGC GGCGTGATTT TGCCGCTGCG CTCGCCGCGT
GTGGCGAAGC TGTATGCCTC TGTCTGGATG GAAGGTGGCT CGCCGCTGAT GGTTTACAGC
CGCCAGCAAC AGCAGGCGCT GGCACAACGT TTACCGGAGA CGCCCGTAGC GCTGGGAATG
AGCTACGGCT CGCCATCACT GGAAAGCGCC GTAGATGAAC TTCTGGCAGA GCATGTAGAT
CATATTGTGG TGCTGCCGCT TTATCCGCAA TACTCCTGTT CAACTGTCGG TGCGGTATGG
GATGAACTGG CGCGAATTCT GGCGCGCAAA CGTAGCATTC CGGGGATATC GTTTATTCGT
GATTACGCCG ATAACCACGA TTATATTAAT GCACTGGCGA ACAGCGTACG CGCTTCTTTT
GCCAAACATG GCGAACCGGA TCTGCTGTTG CTCTCTTATC ATGGCATTCC CCAGCGTTAT
GCAGATGAAG GCGATGATTA TCCGCAACAT TGCCGCACAA CGACTCGCGA ACTGGCTTCC
GCACTGGAGA TGGCACCGGA AAAAGTGATG ATGACCTTTC AGTCGCGCTT TGGGCGGGAA
CCCTGGCTGA TGCCTTATAC CGACGAAACG CTGAAAATGC TCGGAGAAAA AGGCGTAGGT
CATATTCAGG TGATGTGCCC GGGCTTTGCT GCGGATTGTC TGGAGACGCT GGAAGAGATT
GCCGAGCAAA ACCGTGAGGT CTTCCTCGGT GCGGGCGGGA AAAAATATGA ATATATTCCG
GCGCTTAATG CCACGCCAGA ACATATCGAA ATGATGGCTA ATCTTGTTGC CGCGTATCGC
TAA
 
Protein sequence
MRQTKTGILL ANLGTPDAPT PEAVKRYLKQ FLSDRRVVDT SRLLWWPLLR GVILPLRSPR 
VAKLYASVWM EGGSPLMVYS RQQQQALAQR LPETPVALGM SYGSPSLESA VDELLAEHVD
HIVVLPLYPQ YSCSTVGAVW DELARILARK RSIPGISFIR DYADNHDYIN ALANSVRASF
AKHGEPDLLL LSYHGIPQRY ADEGDDYPQH CRTTTRELAS ALEMAPEKVM MTFQSRFGRE
PWLMPYTDET LKMLGEKGVG HIQVMCPGFA ADCLETLEEI AEQNREVFLG AGGKKYEYIP
ALNATPEHIE MMANLVAAYR