Gene ECH74115_5466 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_5466 
SymbolhemE 
ID6971373 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp5109654 
End bp5110718 
Gene Length1065 bp 
Protein Length354 aa 
Translation table11 
GC content56% 
IMG OID643389112 
Producturoporphyrinogen decarboxylase 
Protein accessionYP_002273513 
Protein GI209398331 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0407] Uroporphyrinogen-III decarboxylase 
TIGRFAM ID[TIGR01464] uroporphyrinogen decarboxylase 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones53 
Fosmid unclonability p-value0.843277 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGAAC TTAAAAACGA TCGTTATCTG CGGGCGCTGC TGCGCCAGCC CGTTGATGTC 
ACTCCTGTAT GGATGATGCG CCAGGCGGGT CGCTATCTAC CGGAATATAA AGCCACGCGC
GCCCAGGCGG GCGATTTTAT GTCGCTGTGC AAAAACGCCG AGCTGGCGTG TGAAGTGACC
TTGCAACCGC TGCGTCGCTA CCCGCTGGAT GCGGCGATCC TCTTTTCCGA TATCCTCACC
GTCCCGGACG CGATGGGGTT AGGGCTCTAT TTTGAAGCCG GAGAAGGTCC GCGTTTTACC
TCGCCAGTCA CCTGCAAAGC TGACGTCGAT AAACTGCCAA TTCCGGACCC GGAAGATGAA
CTGGGGTACG TGATGAACGC GGTGCGTACC ATTCGTCGCG AACTGAAAGG CGAAGTGCCG
CTGATTGGTT TTTCCGGCAG CCCGTGGACG CTGGCGACCT ACATGGTGGA AGGCGGCAGC
AGCAAAGCCT TCACCGTGAT CAAAAAAATG ATGTATGCCG ATCCGCAGGC GCTGCACGCT
CTACTCGATA AACTGGCGAA AAGCGTCGCT TTGTATCTGA ATGCGCAGAT TAAAGCCGGT
GCTCAGGCTG TGATGATTTT CGACACCTGG GGCGGCGTGC TTACCGGGCG CGATTATCAA
CAGTTCTCGC TTTATTACAT GCATAAAATT GTTGATGGTT TACTGCGTGA AAACGACGGT
CGCCGCGTAC CGGTCACGCT GTTTACCAAA GGCGGCGGAC AGTGGCTGGA AGCCATGGCA
GAAACCGGTT GCGATGCGCT GGGCCTCGAC TGGACAACGG ACATCGCCGA TGCGCGCCGC
CGTGTGGGCA ATAAAGTCGC GTTGCAGGGT AATATGGATC CGTCGATGCT GTACGCGCCG
CCTGCCCGCA TTGAAGAAGA AGTAGCGACT ATACTTGCAG GTTTCGGTCA CGGCGAAGGT
CATGTCTTTA ACCTTGGTCA CGGCATTCAT CAGGATGTGC CGCCAGAACA TGCTGGCGTG
TTCGTGGAGG CAGTGCATCG ACTGTCTGAA CAGTATCACC GCTAA
 
Protein sequence
MTELKNDRYL RALLRQPVDV TPVWMMRQAG RYLPEYKATR AQAGDFMSLC KNAELACEVT 
LQPLRRYPLD AAILFSDILT VPDAMGLGLY FEAGEGPRFT SPVTCKADVD KLPIPDPEDE
LGYVMNAVRT IRRELKGEVP LIGFSGSPWT LATYMVEGGS SKAFTVIKKM MYADPQALHA
LLDKLAKSVA LYLNAQIKAG AQAVMIFDTW GGVLTGRDYQ QFSLYYMHKI VDGLLRENDG
RRVPVTLFTK GGGQWLEAMA ETGCDALGLD WTTDIADARR RVGNKVALQG NMDPSMLYAP
PARIEEEVAT ILAGFGHGEG HVFNLGHGIH QDVPPEHAGV FVEAVHRLSE QYHR