Gene Information |
Locus tag | ECH74115_0772 |
Symbol | nagE |
ID | 6968098 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 787242 |
End bp | 789188 |
Gene Length | 1947 bp |
Protein Length | 648 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 643384800 |
Product | PTS system N-acetyl glucosamine specific transporter subunits IIABC |
Protein accession | YP_002269306 |
Protein GI | 209398022 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1263] Phosphotransferase system IIC components, glucose/maltose/N-acetylglucosamine-specific; [COG2190] Phosphotransferase system IIA components |
TIGRFAM ID | [TIGR00826] PTS system, glucose-like IIB component; [TIGR00830] PTS system, glucose subfamily, IIA component; [TIGR00852] PTS system, maltose and glucose-specific subfamily, IIC component; [TIGR01998] PTS system, N-acetylglucosamine-specific IIBC component |
|
|
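The Start bp, End bp, Gene Length, and Protein Length fields agree with one another if the coordinates are read as 1-based and inclusive and the protein length excludes a single terminal stop codon. The sketch below reproduces that arithmetic under those two assumptions; the function names are illustrative and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one terminal stop codon
    that is not counted as a residue."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length = gene_length(787242, 789188)
print(length, protein_length(length))  # prints: 1947 648
```

Both values match the Gene Length (1947 bp) and Protein Length (648 aa) fields of this record.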
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.131412 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 68 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATATTT TAGGTTTTTT CCAGCGACTC GGTAGGGCGT TACAGCTCCC TATCGCGGTG CTGCCGGTGG CGGCACTGTT GCTGCGATTC GGTCAGCCAG ATTTACTTAA CGTTGCGTTT ATTGCCCAGG CGGGCGGTGC GATTTTTGAT AACCTCGCAT TAATCTTCGC CATCGGTGTG GCATCCAGCT GGTCGAAAGA CAGCGCAGGT GCGGCGGCGC TGGCGGGGGC GGTAGGTTAC TTTGTGTTAA CCAAAGCGAT GGTGACCATC AACCCAGAAA TTAACATGGG TGTACTGGCG GGTATCATTA CCGGTCTGGT TGGTGGCGCA GCCTATAACC GTTGGTCCGA TATTAAACTG CCGGACTTCC TGAGCTTCTT CGGCGGCAAA CGCTTTGTGC CGATCGCCAC CGGCTTCTTC TGTCTGGTGC TGGCGGCCAT TTTTGGTTAC GTCTGGCCGC CGGTACAGCA CGCTATCCAT GCAGGCGGCG AGTGGATCGT TTCTGCGGGC GCGCTGGGTT CCGGTATCTT TGGTTTCATC AACCGTCTGT TGATCCCAAC CGGTCTGCAT CAGGTGCTGA ACACCATCGC CTGGTTCCAG ATTGGTGAAT TCACCAACGC GGCGGGTACG GTTTTCCACG GCGACATCAA CCGTTTCTAC GCTGGTGACG GCACCGCGGG GATGTTCATG TCCGGCTTCT TCCCGATCAT GATGTTCGGC CTGCCGGGTG CGGCGCTGGC GATGTACTTC GCAGCACCGA AAGAGCGTCG TCCGATGGTT GGCGGGATGC TGCTTTCTGT TGCTGTTACT GCGTTCCTGA CCGGTGTGAC TGAGCCGCTG GAATTCCTGT TCATGTTCCT TGCTCCGCTG CTGTACCTCC TGCACGCACT GCTGACCGGT ATCAGCCTGT TTGTGGCAAC GCTGCTGGGT ATCCATGCCG GCTTCTCTTT CTCTGCGGGG GCTATCGACT ACGCGTTGAT GTATAACCTG CCGGCCGCCA GCCAGAACGT CTGGATGCTG CTGGTTATGG GTGTTGTCTT CTTCGCTATC TACTTCGTGG TGTTCAGTTT GGTTATCCGC ATGTTCAACC TGAAAACGCC GGGTCGTGAA GATAAAGAAG ACGAGATCGT TACTGAAGAA GCCAACAGCA ACACTGAAGA AGGCCTCAAT CAACTGGCGA CCAACTATAT TGCTGCGGTT GGCGGCACTG ACAACCTGAA AGCAATTGAC GCCTGTATCA CCCGTCTGCG CCTTACCGTG GCTGACTCTG CCCGCGTTAA CGATACGATG TGTAAACGTC TGGGGGCTTC TGGGGTAGTA AAACTGAACA AACAGACTAT TCAGGTGATT GTTGGCGCGA AAGCAGAATC CATCGGCGAT GCGATGAAGA AAGTCGTTGC CCGTGGCCCG GTAGCAGCTG CCTCTGCTGA AGCAACTCCG GCAACTGCCG CTCCTGTAGC AAAACCGCAG GCTGTACCAA ACGCGGTATC TATCGCGGAG CTGGTATCGC CGATTACCGG TGATGTCGTG GCACTGGATC AGGTTCCTGA CGAAGCATTC GCCAGCAAAG CGGTGGGTGA CGGTGTGGCG GTGAAACCGA CAGATAAAAT CGTCGTATCA CCAGCCGCAG GAACAATCGT GAAAATCTTC AACACCAACC ACGCGTTCTG CCTGGAAACC GAAAAAGGCG CGGAGATTGT CGTTCACATG GGTATCGATA CCGTAGCGCT GGAAGGTAAA GGCTTTAAAC GTCTGGTGGA AGAGGGCGCG CAGGTAAGCG CAGGGCAACC GATTCTGGAA 
ATGGATCTGG ATTACCTGAA CGCTAACGCC CGTTCGATGA TTAGCCCGGT GGTTTGCAGC AATATCGACG ATTTCAGTGG CTTGATCATT AAAGCTCAGG GCCATGTTGT GGCGGGTCAA ACACCGCTGT ATGAAATCAA AAAGTAA
|
Protein sequence | MNILGFFQRL GRALQLPIAV LPVAALLLRF GQPDLLNVAF IAQAGGAIFD NLALIFAIGV ASSWSKDSAG AAALAGAVGY FVLTKAMVTI NPEINMGVLA GIITGLVGGA AYNRWSDIKL PDFLSFFGGK RFVPIATGFF CLVLAAIFGY VWPPVQHAIH AGGEWIVSAG ALGSGIFGFI NRLLIPTGLH QVLNTIAWFQ IGEFTNAAGT VFHGDINRFY AGDGTAGMFM SGFFPIMMFG LPGAALAMYF AAPKERRPMV GGMLLSVAVT AFLTGVTEPL EFLFMFLAPL LYLLHALLTG ISLFVATLLG IHAGFSFSAG AIDYALMYNL PAASQNVWML LVMGVVFFAI YFVVFSLVIR MFNLKTPGRE DKEDEIVTEE ANSNTEEGLN QLATNYIAAV GGTDNLKAID ACITRLRLTV ADSARVNDTM CKRLGASGVV KLNKQTIQVI VGAKAESIGD AMKKVVARGP VAAASAEATP ATAAPVAKPQ AVPNAVSIAE LVSPITGDVV ALDQVPDEAF ASKAVGDGVA VKPTDKIVVS PAAGTIVKIF NTNHAFCLET EKGAEIVVHM GIDTVALEGK GFKRLVEEGA QVSAGQPILE MDLDYLNANA RSMISPVVCS NIDDFSGLII KAQGHVVAGQ TPLYEIKK
|
| |
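The gene and protein sequences are displayed in space-separated blocks of ten characters. A minimal sketch of how such a field could be turned back into a contiguous string, checked for GC content, and translated is given below. It assumes the sense-codon assignments of NCBI translation table 11, which coincide with those of the standard code, and it assumes the input contains only A, C, G, and T. The helper names `ungroup`, `gc_fraction`, and `translate` are hypothetical.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def ungroup(seq: str) -> str:
    """Remove the block spacing and line breaks from a displayed sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = ungroup(seq)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame from the first base, stopping at the first stop
    codon; a trailing partial codon is ignored. Raises KeyError on any
    codon containing a character other than A, C, G, or T."""
    seq = ungroup(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First two blocks of the gene sequence shown above.
head = "ATGAATATTT TAGGTTTTTT"
print(translate(head))  # prints: MNILGF
```

The result `MNILGF` matches the first six residues of the protein sequence field. Applying `gc_fraction` to the full gene sequence would give a value to compare against the GC content field.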