Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECD_00636 |
Symbol | nagE |
ID | 0 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli BL21(DE3) |
Kingdom | Bacteria |
Replicon accession | CP001509 |
Strand | + |
Start bp | 663818 |
End bp | 665764 |
Gene Length | 1947 bp |
Protein Length | 648 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | |
Product | fused N-acetyl glucosamine specific PTS enzyme: IIC, IIB, and IIA components |
Protein accession | ACT42512 |
Protein GI | 253976842 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.794808 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATATTT TAGGTTTTTT CCAGCGACTC GGTAGGGCGT TACAGCTCCC TATCGCGGTG CTGCCGGTGG CGGCGCTGTT GCTGCGATTC GGTCAGCCAG ATTTACTTAA CGTTGCGTTT ATTGCCCAGG CGGGCGGTGC GATTTTTGAT AACCTCGCAT TAATCTTCGC CATCGGTGTG GCATCCAGCT GGTCGAAAGA CAGCGCAGGT GCGGCGGCGC TGGCGGGTGC GGTAGGTTAC TTTGTGTTAA CCAAAGCGAT GGTGACCATC AACCCAGAAA TTAACATGGG TGTACTGGCG GGTATCATTA CCGGTCTGGT TGGTGGCGCA GCCTATAACC GTTGGTCCGA TATTAAACTG CCTGACTTCC TGAGCTTCTT CGGCGGCAAA CGCTTTGTGC CGATTGCCAC CGGCTTCTTC TGCCTGGTGC TGGCGGCCAT TTTTGGTTAC GTCTGGCCGC CGGTACAGCA CGCTATCCAT GCAGGCGGCG AGTGGATCGT TTCTGCGGGC GCGCTGGGTT CCGGTATCTT TGGTTTCATC AACCGTCTGC TGATCCCAAC CGGTCTGCAT CAGGTGCTGA ACACCATCGC CTGGTTCCAG ATTGGTGAAT TCACCAACGC GGCGGGTACG GTTTTCCACG GCGACATCAA CCGTTTCTAC GCTGGTGACG GCACCGCGGG GATGTTCATG TCCGGCTTCT TCCCGATCAT GATGTTCGGT CTGCCGGGTG CGGCGCTGGC GATGTACTTC GCAGCACCGA AAGAGCGTCG TCCGATGGTT GGCGGGATGC TGCTTTCTGT TGCTGTTACT GCGTTCCTGA CCGGTGTGAC TGAGCCGCTG GAATTCCTGT TCATGTTCCT TGCGCCGCTG CTGTACCTCC TGCACGCACT GCTGACCGGT ATCAGCCTGT TTGTGGCAAC GCTGCTGGGT ATCCACGCGG GCTTCTCCTT CTCTGCGGGG GCTATCGACT ACGCGTTGAT GTATAACCTG CCGGCCGCCA GCCAGAACGT CTGGATGCTG CTGGTTATGG GTGTTGTCTT CTTCGCTATC TACTTCGTGG TGTTCAGTTT GGTTATCCGT ATGTTCAACC TGAAAACGCC GGGCCGTGAA GATAAAGAAG ACGAGATCGT TACTGAAGAA GCCAACAGCA ACACAGAAGA AGGTCTGACT CAACTGGCGA CCAACTATAT TGCTGCGGTG GGCGGTACTG ACAACCTGAA AGCAATTGAC GCCTGTATTA CTCGTCTGCG CCTGACCGTG GTTGACTCAG CTCGCGTCAA CGATGCGATG TGTAAACGTC TGGGTGCTTC TGGGGTAGTG AAACTGAACA AACAGACTAT TCAGGTGATT GTTGGCGCGA AAGCGGAATC CATTGGCGAT GCAATGAAGA AAGTTGTTGC CCGTGGTCCG GTAGCCGCTG CGTCAGCTGA AGCAACTCCG GCAACTGCCG CTCCTGTAGC AAAACCGCAG GCTGTACCAA ACGCGGTGTC TATCGCGGAG CTGGTATCGC CGATTACCGG TGATGTTGTG GCACTGGATC AGGTTCCTGA CGAAGCATTC GCCAGCAAAG CGGTGGGTGA CGGTGTGGCG GTGAAACCGA CAGATAAAAT CGTCGTATCA CCAGCCGCAG GGACAATCGT GAAAATCTTC AACACCAACC ACGCGTTCTG CCTGGAAACC GAAAAAGGCG CGGAGATCGT CGTCCATATG GGTATCGACA CCGTAGCGCT GGAAGGTAAA GGCTTTAAAC GTCTGGTGGA AGAGGGGGCG CAGGTAAGCG CAGGGCAACC GATTCTGGAA ATGGATCTGG ATTACCTGAA CGCTAACGCC CGCTCGATGA TTAGCCCGGT GGTTTGCAGC AATATCGACG ATTTCAGTGG CTTGATCATT AAAGCTCAGG GCCATGTTGT GGCGGGTCAA ACACCGCTGT ATGAAATCAA AAAGTAA
|
Protein sequence | MNILGFFQRL GRALQLPIAV LPVAALLLRF GQPDLLNVAF IAQAGGAIFD NLALIFAIGV ASSWSKDSAG AAALAGAVGY FVLTKAMVTI NPEINMGVLA GIITGLVGGA AYNRWSDIKL PDFLSFFGGK RFVPIATGFF CLVLAAIFGY VWPPVQHAIH AGGEWIVSAG ALGSGIFGFI NRLLIPTGLH QVLNTIAWFQ IGEFTNAAGT VFHGDINRFY AGDGTAGMFM SGFFPIMMFG LPGAALAMYF AAPKERRPMV GGMLLSVAVT AFLTGVTEPL EFLFMFLAPL LYLLHALLTG ISLFVATLLG IHAGFSFSAG AIDYALMYNL PAASQNVWML LVMGVVFFAI YFVVFSLVIR MFNLKTPGRE DKEDEIVTEE ANSNTEEGLT QLATNYIAAV GGTDNLKAID ACITRLRLTV VDSARVNDAM CKRLGASGVV KLNKQTIQVI VGAKAESIGD AMKKVVARGP VAAASAEATP ATAAPVAKPQ AVPNAVSIAE LVSPITGDVV ALDQVPDEAF ASKAVGDGVA VKPTDKIVVS PAAGTIVKIF NTNHAFCLET EKGAEIVVHM GIDTVALEGK GFKRLVEEGA QVSAGQPILE MDLDYLNANA RSMISPVVCS NIDDFSGLII KAQGHVVAGQ TPLYEIKK
|
| |