Gene BURPS1106A_0555 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1106A_0555 
SymbolnagE 
ID4899738 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1106a 
KingdomBacteria 
Replicon accessionNC_009076 
Strand
Start bp524319 
End bp526079 
Gene Length1761 bp 
Protein Length586 aa 
Translation table11 
GC content70% 
IMG OID640133785 
ProductPTS system, N-acetylglucosamine-specific IIBC component 
Protein accessionYP_001064838 
Protein GI126452617 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1263] Phosphotransferase system IIC components, glucose/maltose/N-acetylglucosamine-specific
[COG1264] Phosphotransferase system IIB components 
TIGRFAM ID[TIGR00826] PTS system, glucose-like IIB component
[TIGR01998] PTS system, N-acetylglucosamine-specific IIBC component 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.723185 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGACGGAA ATCCGTTTCT GAAAATACAG AGCCTCGGCA GGGCGCTGAT GCTGCCGATC 
GCGGTGCTGC CGGTGGCGGG CATCCTGCTG CGCCTCGGGC AGCAGGACGT GCTCAACATC
AAGATGATCG CCGACGCGGG CGGCGCGATC TTCGAGAACC TGCCGCTGCT GTTCGCGATC
GGCGTCGCGG TCGGCTTCGC GAAGGACAAC AACGGCGTGG CGGCGCTCGC GGGCGCGATC
GGCTATCTGA TCGAAGTCGC GATCATGAAG GACATCGATC CGAAGCTGAA CATGGGCGTG
CTGTCCGGGA TCATCGCGGG CGTCGTCGCG GGGCTGCTGT ACAACCGCTA CAAGGACATC
AAGCTGCCCG ACTACCTCGC GTTCTTCGGC GGCAAGCGCT TCGTGCCGAT CATCACGGGG
CTCGCGTGCG TCGTGCTCGG GATCGTGTTC GGCTACGTAT GGCAGCCGGT GCAGCACGCG
ATCGACGCGG TCGGCCAGTG GCTGCTGACG GCGGGCGCGA TCGGCACGTT CGTCTACGGG
TTCCTGAACC GCCTGCTGCT CGTCACGGGG CTGCACCACA TCATCAATTC GCTCGTCTGG
TTCGTGTTCG GCACGTTCAC GCCGGCGGGC GGCGCCGCGG TGACGGGCGA TCTGCATCGC
TTCTTCGCGG GCGATCCGAG CGCGGGCGGC TTCATGGCGG GCTTCTTCCC GATCATGATG
TTCGGCCTGC CGGCCGCGTG CCTCGCGATG TTTCACGAGG CGCCGAAGGC GCGCCGCGCG
ATCGTCGGCG GCCTGCTGTT CTCGATGGCG CTCACCTCGT TCCTGACGGG CGTGACCGAG
CCGATCGAGT TCAGCTTCAT GTTCCTCGCG CCGGTGCTGT ACGTGATCCA CGCGGTGCTC
ACGGGCCTTT CGCTCGCGAT CTGCCAGTTG CTCGGCGTGA AGCTCGGCTT CACGTTCTCG
GCGGGCGCGA TCGACTATGT GCTGAACTAC GGGCTGTCGA CGAAGGGCTG GATCGCGATC
CCGCTCGGCC TTGCGTACGG TCTCGCCTAC TACGGCCTCT TCCGCTTCTT CATCCGCAAG
TTCAACATGG CGACGCCGGG CCGCGAGCCC GCGGGCGCGG ACGCGCAGGC GCAGTCGTTC
GCGTCGGGCG GTTTCGTCGC GCCGACGGCG GGCGCATCGG TGCCGCGCGC GCAGCGCTAC
ATCGCGGCGC TCGGCGGCGC GGCGAACCTG TCGGTCGTCG ATGCGTGCAC GACGCGGCTG
CGTCTTTCCG TCGTCGATCC CGAGAAGGTG TCCGAAGCGG ATCTGCGCAC GATCGGCGCG
CGCGGCGTGC TCAAACGCGG CGGCAGCAGC GTGCAGGTGA TCATCGGGCC GGAGGCGGAC
CTCATCGCCG ATGAGATTCG CGCGACGCTC GGCAGCGGCG CGGCGGCGCC CGCGGCTGCG
GCTGCCGCGG CGCCTGCGGC GGCGGCAACG GCGGCGGGCG CGCAGTCGGG CCCGCTCGAT
CCGGAGCCGA CGCGCTGGCT CGCGGTGTTC GGCGGCGCGA CGAACGTCGC TTCGCTCGAC
GCGGTCGCGG CGACGCGCCT GCGCGTCGTC GTGCGCGATC CGTCGGCGGT CGATCGCCAG
CGCCTCGCGA CGCTTGACGT CGCCTGGGTC GCGAGCGACA CGTTCCATAT CGTCTGCGGC
CAGTCGGCGC CGCGCTATGC GCAGCAGCTC GCCGCGCGCC TGCCGTCGTC CGACGGCGGC
ACGGCGGCCC AGCCCGCCTG A
 
Protein sequence
MDGNPFLKIQ SLGRALMLPI AVLPVAGILL RLGQQDVLNI KMIADAGGAI FENLPLLFAI 
GVAVGFAKDN NGVAALAGAI GYLIEVAIMK DIDPKLNMGV LSGIIAGVVA GLLYNRYKDI
KLPDYLAFFG GKRFVPIITG LACVVLGIVF GYVWQPVQHA IDAVGQWLLT AGAIGTFVYG
FLNRLLLVTG LHHIINSLVW FVFGTFTPAG GAAVTGDLHR FFAGDPSAGG FMAGFFPIMM
FGLPAACLAM FHEAPKARRA IVGGLLFSMA LTSFLTGVTE PIEFSFMFLA PVLYVIHAVL
TGLSLAICQL LGVKLGFTFS AGAIDYVLNY GLSTKGWIAI PLGLAYGLAY YGLFRFFIRK
FNMATPGREP AGADAQAQSF ASGGFVAPTA GASVPRAQRY IAALGGAANL SVVDACTTRL
RLSVVDPEKV SEADLRTIGA RGVLKRGGSS VQVIIGPEAD LIADEIRATL GSGAAAPAAA
AAAAPAAAAT AAGAQSGPLD PEPTRWLAVF GGATNVASLD AVAATRLRVV VRDPSAVDRQ
RLATLDVAWV ASDTFHIVCG QSAPRYAQQL AARLPSSDGG TAAQPA