Gene EcolC_3141 details


Gene Information

Locus tag: EcolC_3141
Symbol: hemH
ID: 6066432
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli ATCC 8739
Kingdom: Bacteria
Replicon accession: NC_010468
Strand:
Start bp: 3442543
End bp: 3443505
Gene Length: 963 bp
Protein Length: 320 aa
Translation table: 11
GC content: 54%
IMG OID: 641602557
Product: ferrochelatase
Protein accession: YP_001726091
Protein GI: 170021137
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0276] Protoheme ferro-lyase (ferrochelatase)
TIGRFAM ID: [TIGR00109] ferrochelatase
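
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are illustrative and not part of the record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(3442543, 3443505)
print(length, protein_length(length))  # 963 320
```

Under these assumptions the result agrees with the listed Gene Length (963 bp) and Protein Length (320 aa).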


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.692979
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00475646
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGCGTCAGA CTAAAACCGG TATCCTGCTG GCAAACCTGG GTACGCCCGA TGCCCCCACA 
CCTGAAGCGG TAAAACGCTA TCTGAAACAA TTTTTAAGCG ACAGACGCGT GGTTGATACC
TCACGGTTGT TATGGTGGCC ATTGCTGCGC GGCGTGATTT TGCCGCTGCG CTCGCCGCGT
GTGGCGAAGC TGTATGCCTC TGTCTGGATG GAAGGTGGCT CGCCGCTGAT GGTTTACAGC
CGTCAGCAAC AGCAGGCGCT GGCACAACGT TTACCGGAGA CGCCCGTAGC GCTGGGAATG
AGCTACGGCT CGCCATCACT GGAAAGCGCC GTAGATGAAC TCCTGGCAGA GCATGTAGAT
CATATTGTGG TGCTGCCGCT TTATCCGCAA TACTCCTGTT CAACGGTCGG TGCGGTATGG
GATGAACTGG CACGCATTCT GGCGCGCAAA CGTAGCATTC CGGGGATATC GTTTATTCGT
GATTACGCTG ATAACCACGA TTACATTAAT GCACTGGCGA ACAGCGTACG CGCTTCTTTT
GCCAAACATG GCGAACCGGA TCTGCTGCTG CTCTCTTATC ATGGCATTCC CCAGCGTTAT
GCAGATGAAG GCGATGATTA CCCGCAACGT TGCCGCACAA CGACTCGCGA ACTGGCTTCC
GCACTGGGGA TGGCACCGGA AAAAGTGATG ATGACCTTTC AGTCGCGCTT TGGTCGGGAA
CCCTGGCTGA TGCCTTATAC CGACGAAACG CTGAAAATGC TCGGAGAAAA AGGCGTAGGT
CATATACAGG TGATGTGCCC GGGCTTTGCT GCGGATTGTC TGGAGACGCT GGAAGAGATT
GCCGAGCAAA ACCGTGAGGT CTTCCTCGGT GCCGGCGGGA AAAAATATGA ATATATTCCA
GCGCTTAATG CCACGCCGGA ACATATTGAA ATGATGGCTA ATCTTGTTGC CGCGTATCGC
TAA
 
Protein sequence
MRQTKTGILL ANLGTPDAPT PEAVKRYLKQ FLSDRRVVDT SRLLWWPLLR GVILPLRSPR 
VAKLYASVWM EGGSPLMVYS RQQQQALAQR LPETPVALGM SYGSPSLESA VDELLAEHVD
HIVVLPLYPQ YSCSTVGAVW DELARILARK RSIPGISFIR DYADNHDYIN ALANSVRASF
AKHGEPDLLL LSYHGIPQRY ADEGDDYPQR CRTTTRELAS ALGMAPEKVM MTFQSRFGRE
PWLMPYTDET LKMLGEKGVG HIQVMCPGFA ADCLETLEEI AEQNREVFLG AGGKKYEYIP
ALNATPEHIE MMANLVAAYR
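
The sequences above are printed in blocks of ten characters, so the whitespace has to be removed before any analysis. The following is a minimal sketch of how the gene sequence could be cleaned, its GC percentage computed, and its codons translated. It assumes the amino-acid assignments of translation table 11, which are the same as those of the standard code, and it does not model table 11's alternative start codons. The helper names `clean`, `gc_percent` and `translate` are illustrative only.

```python
from itertools import product

# Codons in TCAG order paired with their one-letter amino acids ("*" = stop).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display; upper-case the rest."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First line of the gene sequence as printed in the record.
first_line = "ATGCGTCAGA CTAAAACCGG TATCCTGCTG GCAAACCTGG GTACGCCCGA TGCCCCCACA"
print(translate(first_line))  # MRQTKTGILLANLGTPDAPT
```

The printed residues match the first twenty residues of the protein sequence. Passing the full gene sequence to `gc_percent` and `translate` gives values that can be compared with the GC content and protein sequence listed in the record.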