Gene ECH74115_5424 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_5424 
Symbol 
ID6971475 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp5070870 
End bp5072036 
Gene Length1167 bp 
Protein Length388 aa 
Translation table11 
GC content53% 
IMG OID643389076 
Producthippuricase 
Protein accessionYP_002273485 
Protein GI209397697 
COG category[R] General function prediction only 
COG ID[COG1473] Metal-dependent amidase/aminoacylase/carboxypeptidase 
TIGRFAM ID[TIGR01891] amidohydrolase 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones48 
Fosmid unclonability p-value0.33905 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCGAAC ACTATATTCG AGGGTTTGAG GAAGAACTGC GTGAGATCCG CCATCAGATC 
CACGAAAACC CGGAGCTGGG ATTACAAGAA TTTAAAACCA GTGCTTTGGT TGCGGAAAAG
CTGCGTCAGT GGGGCTATGA AGTAGAGCAA GGACTGGCGA CGACGGGGAT AGTGGCCACG
TTAAAAGTGG GCGACGGAGA GAAAAGTATT GGCCTGCGGG CGGATATGGA TGCGCTGCCG
ATTTATGAAA ATAGTGGCAA GCCCTGGGCC AGTAAACATC CTGGTTTAAT GCACGCCTGC
GGGCATGACG GGCATACCAC CATACTGCTG GGGGCCGCGC GTTATTTTGC CGAAACCCGC
CGTTTTAACG GCACGTTACG TCTTATCTTC CAGCCTGCGG AAGAGATGAT TAATGGCGGT
GAGATCATGG TTAAAGAGGG GCTTTTTGAT CATTTCCCCT GCGATGTCAT TTTCGGCATG
CATAACATGC CCGGTCTGCC GGTGGGTAAG TTTTATTTCC AGCCAGGGGC ACTGATGGCG
TCAATGGATC AGTTCCATAT TACAGTTCGT GGTTGTGGCG GACACGGTGC GATCCCGCAC
AAAGCCATCG ATCCGGTGCT AGTTGCCGCA CATATCACCA CCGCATTACA AAGCATTGTG
TCACGCAATG TCGATCCGCT GGAAGCGGCG GTAATTACCG TCGGCAGCAT TGTTGCGGGT
GAGGCGGCTA ACGTCATTCC TGACAGTGCT GAGATGAAAA TCAGCGTCCG CTCTCTTAGC
CGCGATACCC GACAACTTCT TCTGACCCGT ATTCCTGCTC TGGCACAAGC CCAGGCTGCC
AGCTTTGGCG CTACGGCTGA GGTTACGCAT GTTAACGGTA CACCAGTATT AGTAAATGAC
GAGGAGATGG CGCGCTTCGC CTGGCAGGTG GCGTGTAAAA CGTTTGGTGA AGATCGGGCC
GAGTTTGGCA TCAAGCCCCT GATGGGCAGT GAAGATTTCT CTTTCATGCT GGAAGCCCAA
CCCAAAGGCG GTTTCCTGTT ATTCGGTAAT GGCGATGTCG GGGAAGGTTC CTGCATGGTG
CATAACCCGG GTTACGACTT CAACGATGCA AGTCTGGTCC CGGCAAGCAG CTACTGGGGC
GCACTGGTAG AAGCCTGGCT GCAATAA
 
Protein sequence
MIEHYIRGFE EELREIRHQI HENPELGLQE FKTSALVAEK LRQWGYEVEQ GLATTGIVAT 
LKVGDGEKSI GLRADMDALP IYENSGKPWA SKHPGLMHAC GHDGHTTILL GAARYFAETR
RFNGTLRLIF QPAEEMINGG EIMVKEGLFD HFPCDVIFGM HNMPGLPVGK FYFQPGALMA
SMDQFHITVR GCGGHGAIPH KAIDPVLVAA HITTALQSIV SRNVDPLEAA VITVGSIVAG
EAANVIPDSA EMKISVRSLS RDTRQLLLTR IPALAQAQAA SFGATAEVTH VNGTPVLVND
EEMARFAWQV ACKTFGEDRA EFGIKPLMGS EDFSFMLEAQ PKGGFLLFGN GDVGEGSCMV
HNPGYDFNDA SLVPASSYWG ALVEAWLQ