Gene B21_03558 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagB21_03558 
SymbolglmU 
ID8115541 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli BL21 
KingdomBacteria 
Replicon accessionNC_012892 
Strand
Start bp3800708 
End bp3802078 
Gene Length1371 bp 
Protein Length456 aa 
Translation table11 
GC content53% 
IMG OID644849726 
Producthypothetical protein 
Protein accessionYP_003001299 
Protein GI251786995 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1207] N-acetylglucosamine-1-phosphate uridyltransferase (contains nucleotidyltransferase and I-patch acetyltransferase domains) 
TIGRFAM ID[TIGR01173] UDP-N-acetylglucosamine diphosphorylase/glucosamine-1-phosphate N-acetyltransferase 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTGAATA ATGCTATGAG CGTAGTGATC CTTGCCGCAG GCAAAGGCAC GCGCATGTAT 
TCCGATCTTC CGAAAGTGCT GCATACCCTT GCCGGGAAAG CGATGGTTCA GCATGTCATT
GATGCTGCGA ATGAATTAGG CGCAGCGCAC GTTCACCTGG TGTACGGTCA CGGCGGCGAT
CTGCTAAAAC AGGCGCTGAA AGACGACAAC CTTAACTGGG TGCTTCAGGC AGAGCAGCTG
GGTACGGGTC ATGCAATGCA GCAGGCCGCA CCTTTCTTTG CCGATGATGA AGACATTTTA
ATGCTCTACG GCGACGTGCC GCTGATCTCT GTCGAAACAC TCCAGCGTCT GCGTGATGCT
AAACCGCAGG GTGGCATTGG TCTGCTGACG GTGAAACTGG ATGATCCGAC CGGTTATGGA
CGTATCACCC GTGAAAACGG CAAAGTTACC GGCATTGTTG AGCACAAAGA TGCCACCGAC
GAGCAGCGTC AGATTCAGGA GATCAACACC GGCATTCTGA TTGCCAACGG CGCAGATATG
AAACGCTGGC TGGCGAAGCT GACCAACAAT AATGCTCAGG GCGAATACTA CATCACCGAC
ATTATTGCGC TGGCATATCA GGAAGGGCGT GAAATCGTCG CCGTTCATCC GCAACGTTTA
AGCGAAGTAG AAGGCGTGAA TAACCGCCTG CAACTCTCCC GTCTGGAGCG TGTTTATCAG
TCCGAACAGG CTGAAAAACT GCTGTTAGCA GGCGTTATGC TGCGCGATCC GGCGCGTTTT
GATCTGCGTG GTACGCTTAC TCACGGGCGC GATGTTGAAA TTGATACTAA CGTTATCATC
GAGGGCAACG TGACTCTCGG TCATCGCGTG AAAATCGGCA CCGGTTGCGT GATTAAAAAC
AGCGTGATTG GCGATGATTG CGAAATTAGC CCGTATACCG TTGTGGAAGA TGCGAATCTG
GCAGCGGCCT GTACCATTGG CCCGTTTGCC CGTTTGCGTC CTGGTGCTGA GTTGCTGGAA
GGTGCACACG TCGGTAACTT CGTTGAGATG AAAAAAGCGC GTCTGGGTAA AGGCTCCAAA
GCTGGTCATC TGACTTACCT GGGCGATGCG GAAATTGGCG ATAACGTTAA CATCGGCGCG
GGAACCATTA CCTGCAACTA CGATGGTGCG AATAAATTTA AGACCATTAT CGGCGACGAT
GTGTTTGTTG GTTCCGACAC TCAGCTGGTG GCCCCGGTAA CAGTAGGCAA AGGCGCGACC
ATTGCTGCGG GGACAACTGT GACGCGTAAT GTCGGCGAAA ATGCATTAGC TATCAGCCGT
GTGCCGCAGA CTCAGAAAGA AGGCTGGCGT CGTCCGGTAA AGAAAAAGTG A
 
Protein sequence
MLNNAMSVVI LAAGKGTRMY SDLPKVLHTL AGKAMVQHVI DAANELGAAH VHLVYGHGGD 
LLKQALKDDN LNWVLQAEQL GTGHAMQQAA PFFADDEDIL MLYGDVPLIS VETLQRLRDA
KPQGGIGLLT VKLDDPTGYG RITRENGKVT GIVEHKDATD EQRQIQEINT GILIANGADM
KRWLAKLTNN NAQGEYYITD IIALAYQEGR EIVAVHPQRL SEVEGVNNRL QLSRLERVYQ
SEQAEKLLLA GVMLRDPARF DLRGTLTHGR DVEIDTNVII EGNVTLGHRV KIGTGCVIKN
SVIGDDCEIS PYTVVEDANL AAACTIGPFA RLRPGAELLE GAHVGNFVEM KKARLGKGSK
AGHLTYLGDA EIGDNVNIGA GTITCNYDGA NKFKTIIGDD VFVGSDTQLV APVTVGKGAT
IAAGTTVTRN VGENALAISR VPQTQKEGWR RPVKKK