Gene EcolC_4264 details

Gene Information

Locus tag: EcolC_4264
Symbol: glmU
ID: 6068016
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli ATCC 8739
Kingdom: Bacteria
Replicon accession: NC_010468
Strand:
Start bp: 4714788
End bp: 4716158
Gene Length: 1371 bp
Protein Length: 456 aa
Translation table: 11
GC content: 53%
IMG OID: 641603701
Product: bifunctional N-acetylglucosamine-1-phosphate uridyltransferase/glucosamine-1-phosphate acetyltransferase
Protein accession: YP_001727187
Protein GI: 170022233
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1207] N-acetylglucosamine-1-phosphate uridyltransferase (contains nucleotidyltransferase and I-patch acetyltransferase domains)
TIGRFAM ID: [TIGR01173] UDP-N-acetylglucosamine diphosphorylase/glucosamine-1-phosphate N-acetyltransferase
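
The start and end coordinates, gene length, and protein length listed above are mutually consistent, which a few lines of arithmetic can confirm. The sketch below assumes 1-based inclusive coordinates (an assumption, though it is the convention that reproduces the listed 1371 bp) and a CDS that ends in a single stop codon. The function names are illustrative and are not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 4714788
END_BP = 4716158


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residues encoded by a CDS, assuming exactly one terminal stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1371
print(protein_length(length_bp))  # 456
```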


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.855705
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value: 0.99098
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTGAATA ATGCTATGAG CGTAGTGATC CTTGCCGCAG GCAAAGGCAC GCGCATGTAT 
TCCGATCTTC CGAAAGTGCT GCATACCCTT GCCGGGAAAG CGATGGTTCA GCATGTCATT
GATGCTGCGA ATGAATTAGG CGCAGCGCAC GTTCACCTGG TGTACGGTCA CGGCGGCGAT
CTGCTAAAAC AGGCGCTGAA AGACGACAAC CTTAACTGGG TGCTTCAGGC AGAGCAGCTG
GGTACGGGTC ATGCAATGCA GCAGGCCGCA CCTTTCTTTG CCGATGATGA AGACATTTTA
ATGCTCTACG GCGACGTGCC GCTGATCTCT GTCGAAACAC TCCAGCGTCT GCGTGATGCT
AAACCGCAGG GTGGCATTGG TCTGCTGACG GTGAAACTGG ATGATCCGAC CGGTTATGGA
CGTATCACCC GTGAAAACGG CAAAGTTACC GGCATTGTTG AGCACAAAGA TGCCACCGAC
GAGCAGCGTC AGATTCAGGA GATCAACACC GGCATTCTGA TTGCCAACGG CGCAGATATG
AAACGCTGGC TGGCGAAGCT GACCAACAAT AATGCTCAGG GCGAATACTA CATCACCGAC
ATTATTGCGC TGGCGTATCA GGAAGGGCGT GAAATCGTCG CCGTTCATCC GCAACGTTTA
AGCGAAGTAG AAGGCGTGAA TAACCGCCTG CAACTCTCCC GTCTGGAGCG TGTTTATCAG
TCCGAACAGG CTGAAAAACT GCTGTTAGCA GGCGTTATGC TGCGCGATCC AGCGCGTTTT
GATCTGCGTG GTACGCTAAC TCACGGGCGC GATGTTGAAA TTGATACTAA CGTTATCATC
GAGGGCAACG TGACTCTCGG TCATCGCGTG AAAATTGGCA CCGGTTGCGT GATTAAAAAC
AGCGTGATTG GCGATGATTG CGAAATCAGT CCGTATACCG TTGTGGAAGA TGCGAATCTG
GCAGCGGCCT GTACCATTGG CCCGTTTGCC CGTTTGCGTC CTGGTGCTGA GTTGCTGGAA
GGTGCTCACG TCGGTAACTT CGTTGAGATG AAAAAAGCGC GTCTGGGTAA AGGCTCGAAA
GCTGGTCATC TGACTTACCT GGGCGATGCG GAAATTGGCG ATAACGTTAA CATCGGCGCG
GGAACCATTA CCTGCAACTA CGATGGTGCG AATAAATTTA AGACCATTAT CGGCGACGAT
GTGTTTGTTG GTTCCGACAC TCAGCTGGTG GCCCCGGTAA CAGTAGGCAA AGGCGCGACC
ATTGCTGCGG GTACAACTGT GACGCGTAAT GTCGGCGAAA ATGCATTAGC TATCAGCCGT
GTGCCGCAGA CTCAGAAAGA AGGCTGGCGT CGTCCGGTAA AGAAAAAGTG A
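
The GC content reported in the Gene Information section is the percentage of G and C bases in the gene sequence. A minimal sketch of that calculation follows; `gc_percent` is a hypothetical helper name, and for brevity it is applied only to the first 60-base row of the sequence above, so its result will not match the whole-gene figure. Pasting in all 1371 bases would be needed to check the listed 53%.

```python
def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence.

    Whitespace (the 10-base grouping used above) is ignored.
    """
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


# First row of the gene sequence above (60 of the 1371 bases).
first_row = "ATGTTGAATA ATGCTATGAG CGTAGTGATC CTTGCCGCAG GCAAAGGCAC GCGCATGTAT"
print(round(gc_percent(first_row), 1))
```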
 
Protein sequence
MLNNAMSVVI LAAGKGTRMY SDLPKVLHTL AGKAMVQHVI DAANELGAAH VHLVYGHGGD 
LLKQALKDDN LNWVLQAEQL GTGHAMQQAA PFFADDEDIL MLYGDVPLIS VETLQRLRDA
KPQGGIGLLT VKLDDPTGYG RITRENGKVT GIVEHKDATD EQRQIQEINT GILIANGADM
KRWLAKLTNN NAQGEYYITD IIALAYQEGR EIVAVHPQRL SEVEGVNNRL QLSRLERVYQ
SEQAEKLLLA GVMLRDPARF DLRGTLTHGR DVEIDTNVII EGNVTLGHRV KIGTGCVIKN
SVIGDDCEIS PYTVVEDANL AAACTIGPFA RLRPGAELLE GAHVGNFVEM KKARLGKGSK
AGHLTYLGDA EIGDNVNIGA GTITCNYDGA NKFKTIIGDD VFVGSDTQLV APVTVGKGAT
IAAGTTVTRN VGENALAISR VPQTQKEGWR RPVKKK
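
The protein sequence is the translation of the gene sequence under translation table 11. The sketch below builds the codon table from the standard NCBI amino-acid string (table 11 assigns the same amino acid to each codon as the standard code and differs only in its permitted start codons) and translates the first and last rows of the gene sequence. The helper name `translate` is illustrative. Translating single rows works here only because each 60-base row starts in frame; alternative start codons are not handled, which is harmless for this gene because it begins with ATG.

```python
from itertools import product

# Codon order follows the NCBI genetic-code layout: bases in the order
# T, C, A, G, with the first codon position varying slowest.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(seq: str) -> str:
    """Translate an in-frame nucleotide string, stopping at the first stop codon."""
    bases = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(bases) - 2, 3):
        aa = CODON_TABLE[bases[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First and last rows of the gene sequence above.
first_row = "ATGTTGAATA ATGCTATGAG CGTAGTGATC CTTGCCGCAG GCAAAGGCAC GCGCATGTAT"
last_row = "GTGCCGCAGA CTCAGAAAGA AGGCTGGCGT CGTCCGGTAA AGAAAAAGTG A"

print(translate(first_row))  # MLNNAMSVVILAAGKGTRMY
print(translate(last_row))   # VPQTQKEGWRRPVKKK
```

Both outputs agree with the start and end of the protein sequence listed above.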