Gene ECH74115_5166 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_5166 
SymbolglmU 
ID6968564 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp4810370 
End bp4811740 
Gene Length1371 bp 
Protein Length456 aa 
Translation table11 
GC content53% 
IMG OID643388832 
Productbifunctional N-acetylglucosamine-1-phosphate uridyltransferase/glucosamine-1-phosphate acetyltransferase 
Protein accessionYP_002273258 
Protein GI209399085 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1207] N-acetylglucosamine-1-phosphate uridyltransferase (contains nucleotidyltransferase and I-patch acetyltransferase domains) 
TIGRFAM ID[TIGR01173] UDP-N-acetylglucosamine diphosphorylase/glucosamine-1-phosphate N-acetyltransferase 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones44 
Fosmid unclonability p-value0.136822 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTGAATA ATGCTATGAG CGTAGTGATC CTTGCCGCAG GTAAAGGCAC GCGCATGTAT 
TCCGATCTTC CGAAAGTGCT GCATACCCTT GCCGGGAAAG CGATGGTTCA GCATGTCATT
GATGCTGCGA ATGAATTAGG CGCAGCGCAC GTTCACCTGG TGTACGGTCA CGGTGGCGAT
CTGCTAAAAC AGGCGCTGAA AGACGACAAC CTGAACTGGG TGCTTCAGGC AGAGCAGCTG
GGTACGGGTC ATGCCATGCA GCAGGCCGCA CCTTTCTTTG CCGATGATGA AGACATTTTA
ATGCTCTACG GCGACGTGCC GCTGATCTCT GTCGAAACAC TCCAGCGTCT GCGTGATGCT
AAACCGCAGG GTGGCATTGG TCTGCTGACG GTGAAACTGG ATGATCCGAC CGGTTATGGA
CGTATCACCC GTGAAAACGG CAAAGTTACC GGCATTGTTG AGCACAAAGA TGCCACCGAC
GAGCAGCGTC AGATTCAGGA GATCAACACC GGCATTCTGA TCGCTAACGG CGCAGATATG
AAACGCTGGC TGGCGAAGCT GACCAACAAT AATGCGCAGG GCGAATACTA CATCACCGAC
ATTATTGCGC TGGCGTATCA GGAAGGGCGT GAAATCGTCG CCGTTCATCC GCAACGTTTA
AGCGAAGTAG AAGGCGTGAA TAACCGCCTG CAACTCTCCC GTCTGGAGCG CGTTTACCAG
TCCGAACAGG CTGAAAAACT GCTGTTAGCA GGCGTTATGC TGCGCGATCC GGCGCGTTTT
GATCTGCGCG GTACGCTTAC TCACGGGCGC GATGTTGAAA TTGATACTAA CGTTATCATC
GAGGGCAACG TGACTCTCGG CCATCGCGTG AAAATCGGCT CCGGTTGCGT GATTAAAAAC
AGCGTGATTG GCGATGATTG CGAAATTAGC CCATATACCG TCGTGGAAGA CGCGAATCTG
GCGGCGGCCT GTACCATTGG CCCGTTTGCC CGTTTGCGTC CTGGTGCTGA GTTGTTGGAA
GGTGCACACG TCGGTAACTT TGTTGAGATG AAAAAAGCAC GTCTGGGTAA AGGCTCGAAA
GCTGGTCATC TGACTTACCT TGGCGATGCG GAAATTGGCG ATAACGTTAA CATCGGCGCG
GGAACCATTA CCTGCAACTA CGATGGTGCG AATAAATTTA AGACCATTAT CGGCGACGAT
GTGTTTGTCG GTTCCGACAC TCAGCTGGTG GCCCCGGTAA CAGTAGGCAA AGGCGCGACC
ATTGCTGCGG GTACAACTGT GACGCGTAAT GTCGGCGAAA ATGCATTAGC TATCAGCCGT
GTGCCGCAGA CTCAGAAAGA AGGCTGGCGT CGTCCGGTAA AGAAAAAGTG A
 
Protein sequence
MLNNAMSVVI LAAGKGTRMY SDLPKVLHTL AGKAMVQHVI DAANELGAAH VHLVYGHGGD 
LLKQALKDDN LNWVLQAEQL GTGHAMQQAA PFFADDEDIL MLYGDVPLIS VETLQRLRDA
KPQGGIGLLT VKLDDPTGYG RITRENGKVT GIVEHKDATD EQRQIQEINT GILIANGADM
KRWLAKLTNN NAQGEYYITD IIALAYQEGR EIVAVHPQRL SEVEGVNNRL QLSRLERVYQ
SEQAEKLLLA GVMLRDPARF DLRGTLTHGR DVEIDTNVII EGNVTLGHRV KIGSGCVIKN
SVIGDDCEIS PYTVVEDANL AAACTIGPFA RLRPGAELLE GAHVGNFVEM KKARLGKGSK
AGHLTYLGDA EIGDNVNIGA GTITCNYDGA NKFKTIIGDD VFVGSDTQLV APVTVGKGAT
IAAGTTVTRN VGENALAISR VPQTQKEGWR RPVKKK