Gene Information |
Locus tag | BURPS668_0538 |
Symbol | nagE |
ID | 4884064 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 668 |
Kingdom | Bacteria |
Replicon accession | NC_009074 |
Strand | + |
Start bp | 509566 |
End bp | 511326 |
Gene Length | 1761 bp |
Protein Length | 586 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 640126466 |
Product | PTS system, N-acetylglucosamine-specific IIBC component |
Protein accession | YP_001057591 |
Protein GI | 126438375 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1263] Phosphotransferase system IIC components, glucose/maltose/N-acetylglucosamine-specific [COG1264] Phosphotransferase system IIB components |
TIGRFAM ID | [TIGR00826] PTS system, glucose-like IIB component [TIGR01998] PTS system, N-acetylglucosamine-specific IIBC component |
| |
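The coordinate and length fields in the Gene Information table are mutually consistent, which a little arithmetic confirms. This is a minimal sketch in Python. It assumes the coordinates are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. Neither convention is stated in the record itself, but the listed values agree under that reading.

```python
# Values copied from the Gene Information table.
start_bp = 509566
end_bp = 511326

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length)     # 1761
print(protein_length)  # 586
```

Both results match the Gene Length (1761 bp) and Protein Length (586 aa) fields.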
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGACGGAA ATCCGTTTCT GAAAATACAG AGCCTCGGCA GGGCGCTGAT GCTGCCGATC GCGGTGCTGC CGGTGGCGGG CATCCTGCTG CGCCTCGGGC AGCAGGACGT GCTCAACATC AAGATGATCG CCGACGCGGG CGGCGCGATC TTCGAGAACC TGCCGCTGCT GTTCGCGATC GGCGTCGCGG TCGGCTTCGC GAAGGACAAC AACGGCGTGG CGGCGCTCGC GGGCGCGATC GGCTATCTGA TCGAAGTCGC GATCATGAAG GACATCGATC CGAAGCTGAA CATGGGCGTG CTGTCCGGGA TCATCGCGGG CGTCGTCGCG GGGCTGCTGT ACAACCGCTA CAAGGACATC AAGCTGCCCG ACTACCTCGC GTTCTTCGGC GGCAAGCGCT TCGTGCCGAT CATCACGGGG CTCGCGTGCG TCGTGCTCGG GATCGTGTTC GGCTACGTAT GGCAGCCGGT GCAGCACGCG ATCGACGCGG TCGGCCAGTG GCTGCTGACG GCGGGCGCGA TCGGCACGTT CGTCTACGGG TTCCTGAACC GCCTGTTGCT CGTCACGGGG CTGCACCACA TCATCAATTC GCTCGTCTGG TTCGTGTTCG GCACGTTCAC GCCGGCGGGC GGCGCTGCGG TGACGGGCGA TCTGCATCGC TTCTTCGCGG GCGATCCGAG CGCGGGCGGC TTCATGGCGG GCTTCTTCCC GATCATGATG TTCGGCCTGC CGGCCGCGTG CCTCGCGATG TTTCACGAGG CGCCGAAGGC GCGCCGCGCG ATCGTCGGCG GCCTGCTGTT CTCGATGGCG CTCACCTCGT TCCTGACGGG CGTGACCGAG CCGATCGAGT TCAGCTTCAT GTTCCTCGCG CCGGTGCTGT ACGTGATCCA CGCGGTGCTC ACGGGCCTTT CGCTCGCGAT CTGCCAGTTG CTCGGCGTGA AGCTCGGCTT CACGTTCTCG GCGGGCGCGA TCGACTATGT GCTGAACTAC GGGCTGTCGA CGAAGGGCTG GATCGCGATC CCGCTCGGCC TTGCGTACGG TCTCGCCTAC TACGGCCTCT TCCGCTTCTT CATCCGCAAG TTCAACATGG CGACGCCGGG CCGCGAGCCC GCGGGCGCGG ACGCGCAGGC GCAGTCGTTC GCGTCGGGCG GTTTCGTCGC GCCGACGGCG GGCGCATCGG TGCCGCGCGC GCAGCGCTAC ATCGCGGCGC TCGGCGGCGC GGCGAACCTG TCGGTCGTCG ATGCGTGCAC GACGCGGCTG CGTCTTTCCG TCGTCGATCC CGAGAAGGTG TCCGAAGCGG ATCTGCGCAC GATCGGCGCG CGCGGCGTGC TCAAACGCGG CGGCAGCAGC GTGCAGGTGA TCATCGGGCC GGAGGCGGAC CTCATCGCCG ATGAGATTCG CGCGACGCTC GGCAGCGGCG CGGCGGTGCC CGCGGCTGCG GCTGCCGCGG CGCCTGCGGT GGCGGCAACG GCGGCGGGCG CGCAGTCGGG CCCGCTCGAT CCGGAGCCGA CGCGCTGGCT CGCGGTGTTC GGCGGCGCGA CGAACGTCGC TTCGCTCGAC GCGGTCGCGG CGACGCGCCT GCGCGTCGTC GTGCGCGATC CGTCGGCGGT CGATCGCCAG CGCCTCGCGA CGCTTGACGT CGCCTGGGTC GCGAGCGACA CGTTCCATAT CGTCTGCGGC CAGTCGGCGC CGCGCTATGC GCAGCAGCTC GCCGCGCGCC TGCCGTCGTC CGACGGCGGC ACGGCGGCCC AGCCCGCCTG A
|
Protein sequence | MDGNPFLKIQ SLGRALMLPI AVLPVAGILL RLGQQDVLNI KMIADAGGAI FENLPLLFAI GVAVGFAKDN NGVAALAGAI GYLIEVAIMK DIDPKLNMGV LSGIIAGVVA GLLYNRYKDI KLPDYLAFFG GKRFVPIITG LACVVLGIVF GYVWQPVQHA IDAVGQWLLT AGAIGTFVYG FLNRLLLVTG LHHIINSLVW FVFGTFTPAG GAAVTGDLHR FFAGDPSAGG FMAGFFPIMM FGLPAACLAM FHEAPKARRA IVGGLLFSMA LTSFLTGVTE PIEFSFMFLA PVLYVIHAVL TGLSLAICQL LGVKLGFTFS AGAIDYVLNY GLSTKGWIAI PLGLAYGLAY YGLFRFFIRK FNMATPGREP AGADAQAQSF ASGGFVAPTA GASVPRAQRY IAALGGAANL SVVDACTTRL RLSVVDPEKV SEADLRTIGA RGVLKRGGSS VQVIIGPEAD LIADEIRATL GSGAAVPAAA AAAAPAVAAT AAGAQSGPLD PEPTRWLAVF GGATNVASLD AVAATRLRVV VRDPSAVDRQ RLATLDVAWV ASDTFHIVCG QSAPRYAQQL AARLPSSDGG TAAQPA
|
| |
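The sequences above are displayed in space-separated blocks of ten residues. The following Python sketch shows one way such a field could be normalized and sanity-checked. The helper names `join_blocks` and `gc_fraction` are hypothetical, and only the first and last display blocks of the gene sequence are used here for brevity rather than the full 1761 bp.

```python
def join_blocks(field: str) -> str:
    """Remove the whitespace that separates the 10-residue display blocks."""
    return "".join(field.split())


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C (case-insensitive)."""
    dna = dna.upper()
    return sum(base in "GC" for base in dna) / len(dna)


# First two display blocks of the gene sequence in the record.
head = join_blocks("ATGGACGGAA ATCCGTTTCT")
# Last display block plus the trailing single base.
tail = join_blocks("AGCCCGCCTG A")

print(head[:3])           # ATG, the start codon
print(tail[-3:])          # TGA, a stop codon in translation table 11
print(gc_fraction(head))  # GC fraction of the first 20 bases only
```

Passing the complete Gene sequence field through `join_blocks` and then `gc_fraction` gives a value that can be compared against the GC content field of the record.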