Gene BMA3172 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBMA3172 
SymbolnagE 
ID3089782 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia mallei ATCC 23344 
KingdomBacteria 
Replicon accessionNC_006348 
Strand
Start bp3277103 
End bp3278869 
Gene Length1767 bp 
Protein Length588 aa 
Translation table11 
GC content70% 
IMG OID637563731 
ProductPTS system, N-acetylglucosamine-specific IIABC component 
Protein accessionYP_104651 
Protein GI53724497 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1263] Phosphotransferase system IIC components, glucose/maltose/N-acetylglucosamine-specific 
TIGRFAM ID[TIGR00826] PTS system, glucose-like IIB component
[TIGR01998] PTS system, N-acetylglucosamine-specific IIBC component 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGACGGAA ATCCGTTTCT GAAAATACAG AGCCTCGGCA GGGCGCTGAT GCTGCCGATC 
GCGGTGCTGC CGGTGGCGGG CATCCTGCTG CGCCTCGGGC AGCAGGACGT GCTCAACATC
AAGATGATCG CCGACGCGGG CGGCGCGATC TTCGAGAACC TGCCGCTGCT GTTCGCGATC
GGCGTCGCGG TCGGCTTCGC GAAGGACAAC AACGGCGTGG CGGCGCTCGC GGGCGCGATC
GGCTATCTGA TCGAAGTCGC GATCATGAAG GACATCGATC CGAAGCTGAA CATGGGCGTG
CTGTCCGGGA TCATCGCGGG CGTCGTCGCG GGGCTGCTGT ACAACCGCTA CAAGGACATC
AAGCTGCCCG ACTACCTCGC GTTCTTCGGC GGCAAGCGCT TCGTGCCGAT CATCACGGGG
CTCGCGTGCG TCGTGCTCGG GATCGTGTTC GGCTACGTAT GGCAGCCGGT GCAGCACGCG
ATCGACGCGG TCGGCCAGTG GCTGCTGACG GCGGGCGCGA TCGGCACGTT CGTCTACGGG
TTCCTGAACC GCCTGTTGCT CGTCACGGGG CTGCACCACA TCATCAATTC GCTCGTCTGG
TTCGTGTTCG GCACGTTCAC GCCGGCGGGC GGCGCCGCGG TGACGGGCGA TCTGCATCGC
TTCTTCGCGG GCGATCCGAG CGCGGGCGGC TTCATGGCGG GCTTCTTCCC GATCATGATG
TTCGGCCTGC CGGCCGCGTG CCTCGCGATG TTTCACGAGG CGCCGAAGGC GCGCCGCGCG
ATCGTCGGCG GCCTGCTGTT CTCGATGGCG CTCACCTCGT TCCTGACGGG CGTGACCGAG
CCGATCGAGT TCAGCTTCAT GTTCCTCGCG CCGGTGCTGT ACGTGATCCA CGCGGTGCTC
ACGGGCCTTT CGCTCGCGAT CTGCCAGTTG CTCGGCGTGA AGCTCGGCTT CACGTTCTCG
GCGGGCGCGA TCGACTATGT GCTGAACTAC GGGCTGTCGA CGAAGGGCTG GATCGCGATC
CCGCTCGGCC TCGCGTACGG TCTCGCCTAC TACGGCCTCT TCCGCTTCTT CATCCGCAAG
TTCAACATGG CGACGCCGGG CCGCGAGCCC GCGGGCGCGG ACGCGCAGGC GCAGTCGTTC
GCGTCGGGCG GTTTCGTCGC GCCGACGGCG GGCGCATCGG TGCCGCGCGC GCAGCGCTAC
ATCGCGGCGC TCGGCGGCGC GGCGAACCTG TCGGTCGTCG ATGCGTGCAC GACTCGGCTG
CGTCTTTCCG TCGTCGATCC CGAGAAGGTG TCCGAAGCGG ATCTGCGCAC GATCGGCGCG
CGCGGCGTGC TCAAGCGCGG CGGCAGCAGC GTGCAGGTGA TCATCGGGCC GGAGGCGGAC
CTCATCGCCG ATGAGATTCG CGCGACGCTC GGCAGCGGCG CGGCGGCGCC CGCGGCTGCG
GCTGCCGCGG CGCCTGCGGC GGCGGCAACG GCAACGGCGG CGGGCGCGCA GTCGGGCCCG
CTCGATCCGG AGCCGACGCG CTGGCTCGCG GTGTTCGGCG GCGCGACGAA CGTCGCTTCG
CTCGACGCGG TCGCGGCGAC GCGCCTGCGC GTCGTCGTAC GCGATCCGTC GGCGGTCGAT
CGCCAGCGCC TCGCGACGCT TGACGTCGCC TGGGTCGCGA GCGACACGTT CCATATCGTC
TGCGGCCAGT CGGCGCCGCG CTATGCGCAG CAGCTCGCCG CGCGCCTGCC GTCGTCCGAC
GGCGGCACGG CGGCCCAGCC CGCCTGA
 
Protein sequence
MDGNPFLKIQ SLGRALMLPI AVLPVAGILL RLGQQDVLNI KMIADAGGAI FENLPLLFAI 
GVAVGFAKDN NGVAALAGAI GYLIEVAIMK DIDPKLNMGV LSGIIAGVVA GLLYNRYKDI
KLPDYLAFFG GKRFVPIITG LACVVLGIVF GYVWQPVQHA IDAVGQWLLT AGAIGTFVYG
FLNRLLLVTG LHHIINSLVW FVFGTFTPAG GAAVTGDLHR FFAGDPSAGG FMAGFFPIMM
FGLPAACLAM FHEAPKARRA IVGGLLFSMA LTSFLTGVTE PIEFSFMFLA PVLYVIHAVL
TGLSLAICQL LGVKLGFTFS AGAIDYVLNY GLSTKGWIAI PLGLAYGLAY YGLFRFFIRK
FNMATPGREP AGADAQAQSF ASGGFVAPTA GASVPRAQRY IAALGGAANL SVVDACTTRL
RLSVVDPEKV SEADLRTIGA RGVLKRGGSS VQVIIGPEAD LIADEIRATL GSGAAAPAAA
AAAAPAAAAT ATAAGAQSGP LDPEPTRWLA VFGGATNVAS LDAVAATRLR VVVRDPSAVD
RQRLATLDVA WVASDTFHIV CGQSAPRYAQ QLAARLPSSD GGTAAQPA