Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_4098 |
Symbol | glmU |
ID | 6145586 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 4193633 |
End bp | 4195003 |
Gene Length | 1371 bp |
Protein Length | 456 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641618922 |
Product | bifunctional N-acetylglucosamine-1-phosphate uridyltransferase/glucosamine-1-phosphate acetyltransferase |
Protein accession | YP_001746060 |
Protein GI | 170679754 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1207] N-acetylglucosamine-1-phosphate uridyltransferase (contains nucleotidyltransferase and I-patch acetyltransferase domains) |
TIGRFAM ID | [TIGR01173] UDP-N-acetylglucosamine diphosphorylase/glucosamine-1-phosphate N-acetyltransferase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 41 |
Fosmid unclonability p-value | 0.251907 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTGAATA ATGCTATGAG CGTAGTGATC CTTGCCGCAG GTAAAGGCAC GCGCATGTAT TCCGATCTTC CGAAAGTGCT GCATACCCTT GCCGGGAAAG CGATGGTTCA GCATGTCATT GATGCTGCGA ATGAATTAGG CGCAGCGCAC GTTCACCTGG TGTACGGTCA CGGCGGCGAT CTGCTAAAAC AGGCGCTGAA AGACGACAAC CTGAACTGGG TGCTTCAGGC AGAGCAGCTG GGTACGGGTC ATGCGATGCA GCAGGCCGCA CCTTTCTTTG CCGATGATGA AGACATTTTA ATGCTCTACG GCGACGTGCC GCTGATCTCT GTCGAAACAC TCCAGCGCCT GCGTGATGCT AAACCGCAGG GTGGCATTGG TCTGCTGACG GTAAAACTGG ATGATCCGAC CGGTTATGGA CGTATCACCC GTGAAAACGG CAAAGTTACC GGCATTGTTG AGCACAAAGA CGCCACCGAC GAGCAGCGTC AGATTCAGGA GATCAACACC GGCATTCTGA TTGCCAACGG CGCAGATATG AAACGCTGGC TGGCGAAGCT GACCAACAAT AACGCGCAGG GTGAATACTA CATCACCGAC ATTATTGCGC TGGCGTATCA GGAAGGACGT GAAATCGTCG CCGTTCATCC GCAACGTTTA AGCGAAGTAG AAGGCGTGAA TAACCGCCTG CAACTCTCCC GACTGGAGCG CGTTTACCAG TCCGAACAGG CTGAAAAACT GCTGTTAGCA GGCGTTATGC TGCGCGATCC GGCGCGTTTT GATCTGCGCG GTACGCTTAC TCACGGGCGC GATGTTGAAA TTGATACTAA CGTTATCATC GAGGGCAACG TGACTCTCGG TCATCGCGTG AAAATCGGCA CCGGTTGTGT GATTAAAAAC AGCGTGATTG GCGATGATTG CGAAATTAGC CCGTATACCG TTGTGGAAGA CGCGAATCTG GCGGCGGCCT GTACCATTGG CCCGTTTGCC CGTCTGCGTC CTGGTGCTGA GTTGCTGGAA GGTGCACACG TTGGTAACTT CGTTGAGATG AAAAAAGCGC GTCTGGGTAA AGGCTCGAAA GCTGGTCATC TGACTTACCT GGGCGATGCG GAAATTGGCG ATAACGTTAA CATCGGCGCG GGAACCATTA CCTGCAACTA CGATGGTGCG AATAAATTTA AGACTATTAT CGGCGACGAT GTGTTTGTCG GTTCCGACAC TCAGCTGGTG GCCCCGGTAA CAGTAGGCAA AGGCGCGACC ATTGCTGCGG GTACAACTGT GACGCGTAAT GTCGGCGAAA ACGCCCTGGC GATCAGCCGT GTGCCGCAGA CTCAAAAAGA AGGCTGGCGT CGTCCGGTAA AGAAAAAGTA A
|
Protein sequence | MLNNAMSVVI LAAGKGTRMY SDLPKVLHTL AGKAMVQHVI DAANELGAAH VHLVYGHGGD LLKQALKDDN LNWVLQAEQL GTGHAMQQAA PFFADDEDIL MLYGDVPLIS VETLQRLRDA KPQGGIGLLT VKLDDPTGYG RITRENGKVT GIVEHKDATD EQRQIQEINT GILIANGADM KRWLAKLTNN NAQGEYYITD IIALAYQEGR EIVAVHPQRL SEVEGVNNRL QLSRLERVYQ SEQAEKLLLA GVMLRDPARF DLRGTLTHGR DVEIDTNVII EGNVTLGHRV KIGTGCVIKN SVIGDDCEIS PYTVVEDANL AAACTIGPFA RLRPGAELLE GAHVGNFVEM KKARLGKGSK AGHLTYLGDA EIGDNVNIGA GTITCNYDGA NKFKTIIGDD VFVGSDTQLV APVTVGKGAT IAAGTTVTRN VGENALAISR VPQTQKEGWR RPVKKK
|
| |