Gene EcSMS35_4098 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_4098 
SymbolglmU 
ID6145586 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp4193633 
End bp4195003 
Gene Length1371 bp 
Protein Length456 aa 
Translation table11 
GC content53% 
IMG OID641618922 
Productbifunctional N-acetylglucosamine-1-phosphate uridyltransferase/glucosamine-1-phosphate acetyltransferase 
Protein accessionYP_001746060 
Protein GI170679754 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1207] N-acetylglucosamine-1-phosphate uridyltransferase (contains nucleotidyltransferase and I-patch acetyltransferase domains) 
TIGRFAM ID[TIGR01173] UDP-N-acetylglucosamine diphosphorylase/glucosamine-1-phosphate N-acetyltransferase 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones41 
Fosmid unclonability p-value0.251907 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTGAATA ATGCTATGAG CGTAGTGATC CTTGCCGCAG GTAAAGGCAC GCGCATGTAT 
TCCGATCTTC CGAAAGTGCT GCATACCCTT GCCGGGAAAG CGATGGTTCA GCATGTCATT
GATGCTGCGA ATGAATTAGG CGCAGCGCAC GTTCACCTGG TGTACGGTCA CGGCGGCGAT
CTGCTAAAAC AGGCGCTGAA AGACGACAAC CTGAACTGGG TGCTTCAGGC AGAGCAGCTG
GGTACGGGTC ATGCGATGCA GCAGGCCGCA CCTTTCTTTG CCGATGATGA AGACATTTTA
ATGCTCTACG GCGACGTGCC GCTGATCTCT GTCGAAACAC TCCAGCGCCT GCGTGATGCT
AAACCGCAGG GTGGCATTGG TCTGCTGACG GTAAAACTGG ATGATCCGAC CGGTTATGGA
CGTATCACCC GTGAAAACGG CAAAGTTACC GGCATTGTTG AGCACAAAGA CGCCACCGAC
GAGCAGCGTC AGATTCAGGA GATCAACACC GGCATTCTGA TTGCCAACGG CGCAGATATG
AAACGCTGGC TGGCGAAGCT GACCAACAAT AACGCGCAGG GTGAATACTA CATCACCGAC
ATTATTGCGC TGGCGTATCA GGAAGGACGT GAAATCGTCG CCGTTCATCC GCAACGTTTA
AGCGAAGTAG AAGGCGTGAA TAACCGCCTG CAACTCTCCC GACTGGAGCG CGTTTACCAG
TCCGAACAGG CTGAAAAACT GCTGTTAGCA GGCGTTATGC TGCGCGATCC GGCGCGTTTT
GATCTGCGCG GTACGCTTAC TCACGGGCGC GATGTTGAAA TTGATACTAA CGTTATCATC
GAGGGCAACG TGACTCTCGG TCATCGCGTG AAAATCGGCA CCGGTTGTGT GATTAAAAAC
AGCGTGATTG GCGATGATTG CGAAATTAGC CCGTATACCG TTGTGGAAGA CGCGAATCTG
GCGGCGGCCT GTACCATTGG CCCGTTTGCC CGTCTGCGTC CTGGTGCTGA GTTGCTGGAA
GGTGCACACG TTGGTAACTT CGTTGAGATG AAAAAAGCGC GTCTGGGTAA AGGCTCGAAA
GCTGGTCATC TGACTTACCT GGGCGATGCG GAAATTGGCG ATAACGTTAA CATCGGCGCG
GGAACCATTA CCTGCAACTA CGATGGTGCG AATAAATTTA AGACTATTAT CGGCGACGAT
GTGTTTGTCG GTTCCGACAC TCAGCTGGTG GCCCCGGTAA CAGTAGGCAA AGGCGCGACC
ATTGCTGCGG GTACAACTGT GACGCGTAAT GTCGGCGAAA ACGCCCTGGC GATCAGCCGT
GTGCCGCAGA CTCAAAAAGA AGGCTGGCGT CGTCCGGTAA AGAAAAAGTA A
 
Protein sequence
MLNNAMSVVI LAAGKGTRMY SDLPKVLHTL AGKAMVQHVI DAANELGAAH VHLVYGHGGD 
LLKQALKDDN LNWVLQAEQL GTGHAMQQAA PFFADDEDIL MLYGDVPLIS VETLQRLRDA
KPQGGIGLLT VKLDDPTGYG RITRENGKVT GIVEHKDATD EQRQIQEINT GILIANGADM
KRWLAKLTNN NAQGEYYITD IIALAYQEGR EIVAVHPQRL SEVEGVNNRL QLSRLERVYQ
SEQAEKLLLA GVMLRDPARF DLRGTLTHGR DVEIDTNVII EGNVTLGHRV KIGTGCVIKN
SVIGDDCEIS PYTVVEDANL AAACTIGPFA RLRPGAELLE GAHVGNFVEM KKARLGKGSK
AGHLTYLGDA EIGDNVNIGA GTITCNYDGA NKFKTIIGDD VFVGSDTQLV APVTVGKGAT
IAAGTTVTRN VGENALAISR VPQTQKEGWR RPVKKK