Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_0701 |
Symbol | nagE |
ID | 6146947 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 703324 |
End bp | 705270 |
Gene Length | 1947 bp |
Protein Length | 648 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641615590 |
Product | PTS system N-acetyl glucosamine specific transporter subunits IIABC |
Protein accession | YP_001742789 |
Protein GI | 170681159 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1263] Phosphotransferase system IIC components, glucose/maltose/N-acetylglucosamine-specific [COG2190] Phosphotransferase system IIA components |
TIGRFAM ID | [TIGR00826] PTS system, glucose-like IIB component [TIGR00830] PTS system, glucose subfamily, IIA component [TIGR00852] PTS system, maltose and glucose-specific subfamily, IIC component [TIGR01998] PTS system, N-acetylglucosamine-specific IIBC component |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.179782 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 51 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATATTT TAGGTTTTTT CCAGCGACTC GGTAGGGCGT TACAGCTCCC TATCGCGGTG CTGCCGGTGG CGGCGCTGTT GCTGCGATTC GGTCAGCCAG ATTTACTTAA CGTTGCGTTT ATTGCCCAGG CGGGCGGTGC GATTTTTGAT AACCTCGCAT TAATCTTCGC CATCGGTGTG GCATCCAGCT GGTCGAAAGA CAGCGCAGGT GCGGCGGCAC TGGCGGGTGC GGTAGGTTAC TTTGTGTTAA CCAAAGCGAT GGTGACCATC AACCCAGAAA TTAACATGGG TGTACTGGCG GGTATCATTA CCGGTCTGGT TGGTGGCGCA GCTTATAACC GTTGGTCCGA TATTAAACTG CCGGACTTCC TGAGTTTCTT CGGCGGCAAA CGCTTTGTGC CGATTGCCAC CGGCTTCTTC TGCCTGGTGC TGGCGGCCAT TTTTGGTTAC GTCTGGCCGC CGGTACAGCA CGCTATCCAT GCAGGCGGCG AGTGGATCGT TTCTGCGGGC GCGCTGGGTT CCGGTATCTT TGGTTTCATC AACCGTCTGC TGATCCCAAC CGGTCTGCAT CAGGTACTGA ACACCATCGC CTGGTTCCAG ATTGGTGAAT TCACCAACGC GGCGGGTACG GTTTTTCACG GTGACATCAA CCGCTTCTAT GCCGGTGACG GCACCGCGGG GATGTTCATG TCCGGCTTCT TCCCGATCAT GATGTTCGGT CTGCCGGGTG CGGCGCTGGC GATGTACTTT GCAGCACCGA AAGAGCGCCG TCCGATGGTT GGCGGGATGC TGCTTTCTGT TGCTGTTACT GCGTTCCTGA CCGGTGTGAC TGAGCCGCTG GAATTCCTGT TCATGTTCCT TGCACCGCTG CTGTACCTCC TGCACGCACT GCTGACCGGT ATCAGCCTGT TTGTGGCAAC GTTGCTGGGT ATCCATGCCG GCTTCTCTTT CTCTGCGGGG GCTATCGACT ACGCGTTGAT GTATAACCTG CCGGCCGCCA GCCAGAACGT CTGGATGCTG CTGGTGATGG GGGTTGTCTT CTTCGCTATC TACTTCGTGG TGTTCAGTTT GGTTATCCGC ATGTTCAACC TGAAAACGCC AGGTCGTGAA GATAAAGAAG ACGAGATCGT TACTGAAGAG GCTAACAGCA ATACTGAAGA AGGTCTCAAT CAACTGGCAA CCAACTATAT TGCTGCGGTT GGCGGCACTG ACAACCTGAA AGCAATTGAC GCCTGTATCA CCCGTCTGCG CCTTACAGTG GCTGACTCTG CCCGCGTTAA CGATACGATG TGTAAACGTC TGGGTGCTTC TGGGGTAGTG AAACTGAACA AACAGACTAT TCAGGTGATT GTTGGCGCGA AAGCAGAATC CATCGGCGAT GCGATGAAGA AAGTCGTTGC CCGCGGTCCG GTAGCCGCTG CCTCTGCTGA AACTGCTCCG GCAACTGCCG CGCCTGTAGC AAAACCGCAG GCTGTACCAA ACGCAGTATC TATCGCGGAG CTGGTATCGC CGATTACCGG TGATGTTGTG GCACTGGATC AGGTTCCTGA CGAAGCATTC GCCAGCAAAG CGGTGGGTGA CGGTGTGGCG GTGAAACCGA CAGATAAAAT CGTCGTATCA CCAGCCGCAG GGACTATCGT GAAAATCTTC AACACCAACC ACGCATTCTG CCTGGAAACT GAAAAAGGCG CGGAGATTGT CGTCCATATG GGTATCGACA CCGTAGCGCT GGAAGGTAAA GGCTTTAAAC GTCTGGTGGA AGAGGGCGCG CAAGTAAGCG CAGGGCAACC GATTCTGGAA ATGGATCTGG ATTATCTGAA CGAAAACGCC CGCTCGATGA TTAGCCCGGT GGTTTGCAGC AATATCGACG ATTTCAGCGG CTTGATCATT AAAGCTCAGG GCCATGTCGT GGCGGGTCAA ACGCCGCTGT ATGAAATCAA AAAGTAA
|
Protein sequence | MNILGFFQRL GRALQLPIAV LPVAALLLRF GQPDLLNVAF IAQAGGAIFD NLALIFAIGV ASSWSKDSAG AAALAGAVGY FVLTKAMVTI NPEINMGVLA GIITGLVGGA AYNRWSDIKL PDFLSFFGGK RFVPIATGFF CLVLAAIFGY VWPPVQHAIH AGGEWIVSAG ALGSGIFGFI NRLLIPTGLH QVLNTIAWFQ IGEFTNAAGT VFHGDINRFY AGDGTAGMFM SGFFPIMMFG LPGAALAMYF AAPKERRPMV GGMLLSVAVT AFLTGVTEPL EFLFMFLAPL LYLLHALLTG ISLFVATLLG IHAGFSFSAG AIDYALMYNL PAASQNVWML LVMGVVFFAI YFVVFSLVIR MFNLKTPGRE DKEDEIVTEE ANSNTEEGLN QLATNYIAAV GGTDNLKAID ACITRLRLTV ADSARVNDTM CKRLGASGVV KLNKQTIQVI VGAKAESIGD AMKKVVARGP VAAASAETAP ATAAPVAKPQ AVPNAVSIAE LVSPITGDVV ALDQVPDEAF ASKAVGDGVA VKPTDKIVVS PAAGTIVKIF NTNHAFCLET EKGAEIVVHM GIDTVALEGK GFKRLVEEGA QVSAGQPILE MDLDYLNENA RSMISPVVCS NIDDFSGLII KAQGHVVAGQ TPLYEIKK
|
| |