Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SbBS512_E0571 |
Symbol | nagE |
ID | 6272576 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shigella boydii CDC 3083-94 |
Kingdom | Bacteria |
Replicon accession | NC_010658 |
Strand | - |
Start bp | 547969 |
End bp | 549915 |
Gene Length | 1947 bp |
Protein Length | 648 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 641724776 |
Product | PTS system N-acetyl glucosamine specific transporter subunits IIABC |
Protein accession | YP_001879323 |
Protein GI | 187730563 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1263] Phosphotransferase system IIC components, glucose/maltose/N-acetylglucosamine-specific [COG2190] Phosphotransferase system IIA components |
TIGRFAM ID | [TIGR00826] PTS system, glucose-like IIB component [TIGR00830] PTS system, glucose subfamily, IIA component [TIGR00852] PTS system, maltose and glucose-specific subfamily, IIC component [TIGR01998] PTS system, N-acetylglucosamine-specific IIBC component |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 32 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATATTT TAGGTTTTTT CCAGCGACTC GGTAGGGCGT TACAGCTCCC TATCGCGGTG CTGCCGGTGG CGGCACTGTT GCTGCGATTC GGTCAGCCAG ATTTACTTAA CGTTGCGTTT ATTGCCCAGG CGGGCGGTGC GATTTTTGAT AACCTCGCAT TAATCTTCGC CATCGGTGTG GCATCCAGCT GGTCGAAAGA CAGCGCAGGT GCGGCGGCGC TGGCGGGTGC GGTAGGTTAC TTTGTGTTAA CCAAAGCGAT GGTGACCATC AACCCAGAAA TAAACATGGG TGTACTGGCG GGTATCATTA CCGGTCTGGT TGGTGGCGCA GCCTATAACC GTTGGTCCGA TATTAAACTG CCGGACTTCC TGAGCTTCTT CGGCGGCAAA CGCTTTGTGC CGATTGCCAC CGGCTTCTTC TGCCTGGTGC TGGCGGCCAT TTTTGGTTAC GTCTGGCCGC CGGTACAGCA CGCTATCCAT GCAGGAGGCG AGTGGATCGT TTCTGCGGGC GCGCTGGGTT CCGGTATCTT TGGTTTCATC AACCGTCTGC TGATCCCAAC CGGTCTGCAT CAGGTGCTGA ACACCATCGC CTGGTTCCAG ATTGGTGAAT TCACCAACGC GGCGGGTACG GTTTTCCACG GTGATATCAA CCGCTTCTAT GCCGGTGACG GCACCGCGGG GATGTTCATG TCCGGCTTCT TCCCGATTAT GATGTTCGGT CTGCCGGGTG CGGCGCTGGC GATGTACTTC GCAGCACCGA AAGAGCGTCG TCCGATGGTT GGCGGGATGC TGCTTTCTGT TGCTGTTACT GCGTTCCTGA CCGGTGTGAC TGAGCCGCTG GAATTCCTGT TCATGTTCCT TGCACCGCTG CTGTACCTCC TGCACGCACT GCTGACCGGT ATCAGCCTGT TTGTGGCAAC ACTGCTGGGT ATCCATGCTG GCTTCTCTTT CTCTGCGGGG GCTATCGACT ACGCGTTGAT GTATAATCTG CCGGCCGCCA GCCAGAACGT CTGGATGCTG CTGGTGATGG GTGTTGTCTT CTTCGCTATC TACTTCGTGG TGTTCAGTTT GGTTATCCGC ATGTTCAACC TGAAAACGCC GGGTCGTGAA GATAAAGAAG ACGAGATCGT TACTGAAGAA GCCAACAGCA ACACTGAAGA AGGCCTCAAT CAACTGGCGA CCAACTATAT TGCTGCGGTT GGTGGCACTG ACAACCTGAA AGCAATTGAC GCCTGTATCA CCCGTCTGCG CCTGACCGTG GCTGACTCTG CCCGCGTCAA CGATACGATG TGTAAACGTC TGGGTGCTTC TGGTGTAGTG AAACTGAACA AACAGACTAT TCAGGTGATT GTTGGCGCGA AAGCAGAATC CATCGGCGAT GCGATGAAGA AAGTCGTTGC CCGTGGTCCG GTAGCCGCTG CGTCAGCTGA AGAAACTCCG TCAACTGCCG CGCCTGTAGC AAAACCGCAG GCTGTACCAA ACGCGGTATC TATCGCGGAG CTGGTATCGC CGATTACCGG TGATGTCGTG GCACTGGATC AGGTTCCTGA CGAAGCATTC GCCAGCAAAG CGGTGGGTGA CGGTGTGGCG GTGAAACCGA CAGATAAAAT CGTCGTATCA CCAGCCGCAG GGACAATCGT GAAAATCTTC AACACCAACC ACGCGTTCTG CCTGGAAACC GAAAAAGGCG CGGAGATCGT CGTCCATATG GGTATCGACA CCGTAGCGCT GGAAGGTAAA GGCTTTAAAC GTCTGGTGGA AGAGGGTGCG CAGGTAAGCG CAGGGCAACC GATTCTGGAA ATGGATCTGG ATTACCTGAA CGCTAACGCC CGCTCGATGA TTAGCCCGGT GGTTTGCAGC AATATCGACG ATTTCAGTGG CTTGATCATT AAAGCTCAGG GCCATGTTGT GGCGGGTCAA ACACCGCTGT ATGAAATCAA AAAGTAA
|
Protein sequence | MNILGFFQRL GRALQLPIAV LPVAALLLRF GQPDLLNVAF IAQAGGAIFD NLALIFAIGV ASSWSKDSAG AAALAGAVGY FVLTKAMVTI NPEINMGVLA GIITGLVGGA AYNRWSDIKL PDFLSFFGGK RFVPIATGFF CLVLAAIFGY VWPPVQHAIH AGGEWIVSAG ALGSGIFGFI NRLLIPTGLH QVLNTIAWFQ IGEFTNAAGT VFHGDINRFY AGDGTAGMFM SGFFPIMMFG LPGAALAMYF AAPKERRPMV GGMLLSVAVT AFLTGVTEPL EFLFMFLAPL LYLLHALLTG ISLFVATLLG IHAGFSFSAG AIDYALMYNL PAASQNVWML LVMGVVFFAI YFVVFSLVIR MFNLKTPGRE DKEDEIVTEE ANSNTEEGLN QLATNYIAAV GGTDNLKAID ACITRLRLTV ADSARVNDTM CKRLGASGVV KLNKQTIQVI VGAKAESIGD AMKKVVARGP VAAASAEETP STAAPVAKPQ AVPNAVSIAE LVSPITGDVV ALDQVPDEAF ASKAVGDGVA VKPTDKIVVS PAAGTIVKIF NTNHAFCLET EKGAEIVVHM GIDTVALEGK GFKRLVEEGA QVSAGQPILE MDLDYLNANA RSMISPVVCS NIDDFSGLII KAQGHVVAGQ TPLYEIKK
|
| |