Gene ECH74115_2352 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_2352 
SymbolanmK 
ID6968418 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp2224515 
End bp2225624 
Gene Length1110 bp 
Protein Length369 aa 
Translation table11 
GC content55% 
IMG OID643386226 
Productanhydro-N-acetylmuramic acid kinase 
Protein accessionYP_002270710 
Protein GI209397320 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG2377] Predicted molecular chaperone distantly related to HSP70-fold metalloproteases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0170825 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones56 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAATCGG GCCGCTTTAT TGGCGTTATG TCAGGCACCA GCCTTGATGG TGTTGATGTT 
GTGTTGGCGA CAATTGATGA ACACCGGGTC GCACAGCTGG CAAGTTTGAG CTGGCCGATC
CCGGTGTCTC TGAAACAGGC TGTACTGGAT ATTTGCCAGG GCCAGCAGCT TACACTTTCG
CAGTTTGGAC AGCTTGATAC TCAACTCGGG CGACTTTTTG CTGATGCGGT CAAGGCCTTG
CTTAAGGAAC AAAACCTGCA GGCGAGAGAT ATAGTTGCGA TCGGCTGTCA CGGTCAAACC
GTCTGGCATG AACCGACGGG CGTGGCACCA CACACTTTAC AGATTGGCGA TAACAATCAA
ATTGTGGCAC GCACCGGAAT TACGGTTGTC GGTGATTTTC GCCGTCGCGA TATTGCCTTG
GGAGGACAAG GCGCACCGCT GGTACCTGCG TTCCATCATG CCTTGCTGGC TCACCCAACC
GAGCGACGAA TGGTGCTCAA TATTGGCGGC ATCGCCAATC TGTCACTGCT CATTCCTGGG
CAGCCGGTTG GCGGCTACGA TACCGGTCCT GGTAACATGC TGATGGATGC CTGGATCTGG
CGTCAGGCCG GTAAACCTTA CGATAAAGAT GCCGAGTGGG CACGGGCGGG TAAAGTTATT
CTCCCACTGC TGCAAAATAT GCTCAGCGAC CCATATTTCT CGCAACCTGC ACCGAAAAGC
ACCGGACGCG AATACTTTAA CTACGGCTGG CTGGAGCGCC ATTTACGCCA TTTTCCGGGT
GTTGATCCCC GTGATGTTCA GGCCACACTG GCAGAACTCA CCGCCGTGAC CATTTCTGAA
CAAGTTTTGT TGAGCGGTGG CTGCGAACGA TTGATGGTAT GTGGTGGAGG TAGTCGTAAT
CCGCTACTCA TGGCGCGTCT GGCGGCATTA CTGCCAGGCA CAGAAGTCAC CACCACCGAT
GCCGTTGGCA TTAGTGGCGA TGACATGGAA GCATTGGCTT TCGCTTGGCT TGCCTGGCGG
ACGCTGGCGG GATTACCAGG AAATCTGCCT TCCGTCACTG GCGCAAGCCA GGAGACGGTT
CTGGGGGCTA TTTTCCCCGC CAACTCGTGA
 
Protein sequence
MKSGRFIGVM SGTSLDGVDV VLATIDEHRV AQLASLSWPI PVSLKQAVLD ICQGQQLTLS 
QFGQLDTQLG RLFADAVKAL LKEQNLQARD IVAIGCHGQT VWHEPTGVAP HTLQIGDNNQ
IVARTGITVV GDFRRRDIAL GGQGAPLVPA FHHALLAHPT ERRMVLNIGG IANLSLLIPG
QPVGGYDTGP GNMLMDAWIW RQAGKPYDKD AEWARAGKVI LPLLQNMLSD PYFSQPAPKS
TGREYFNYGW LERHLRHFPG VDPRDVQATL AELTAVTISE QVLLSGGCER LMVCGGGSRN
PLLMARLAAL LPGTEVTTTD AVGISGDDME ALAFAWLAWR TLAGLPGNLP SVTGASQETV
LGAIFPANS