Gene BURPS668_0540 details


Gene Information

Locus tag: BURPS668_0540
Symbol:
ID: 4885556
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 668
Kingdom: Bacteria
Replicon accession: NC_009074
Strand:
Start bp: 511967
End bp: 514468
Gene Length: 2502 bp
Protein Length: 833 aa
Translation table: 11
GC content: 72%
IMG OID: 640126468
Product: glycosyl hydrolase family protein
Protein accession: YP_001057593
Protein GI: 126440590
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG3525] N-acetyl-beta-hexosaminidase
TIGRFAM ID:
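
The coordinate and length fields above agree with each other, which a little arithmetic confirms. The following is a minimal Python sketch, assuming the start and end positions are inclusive 1-based coordinates and that the CDS ends in a single stop codon. The function names are illustrative and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming inclusive 1-based coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(511967, 514468)
print(length, protein_length(length))  # prints: 2502 833
```

Both results match the Gene Length and Protein Length fields in the record.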


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.152646
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAACCGAA TTTCGCATTC CCTGTGCGCC GCGCTATTGG CCGCCGCGAC GCTGTTGCCC 
ACGGCCTCGC GCGCGCAACT GCCCGCGCGC CCCACGGCCG GCGCGGCCGC GCCCGCGACG
GCCGCGCCCG TGCGGCCCGC GTCCACGCCG GCCGAGCTCG CCGCGCGGCT CGCCGACGGC
CTCGCGGTGC GCGTGGCCGT CGACAACAAT CACGCGGCAT CGGCCGGCGT GCCGTGCGCC
GATCTCGGCG CGGATTGGGC GAGCTGCGCG ACGGGCCGCC TGATCCTGCA GAATCGCGGC
CACTCGCCCC TCACCGACGG CGGCTGGAAG CTCTATCTGC ACAGCATCCG CCGGCTGCTC
CGAATCGACC GCCCCGGCTT CACGCTGCGC CATCTGACGG GCGATCTGTA CGAGCTGACG
CCGCAGCCCG GCACGGTAAG GCTCGCGCAG GGCGAGCGCA TCGAGCTGCC GTTCGTCGCC
GAATACTGGC TGCGCCGCTA CAGCGACGTG ATCCCGCGCC CGTACGTGGT CGTCGACGGC
GCGGCGCCCG CGGTGCTGCG CTACGACGAT ACCGACGACG AGCTGCGCTA CGTCGAAACG
CTGCCCGCCG ACGCGCAGAA CAATTCGCCC GGCAATGCGC CGCCCGCCGC CGCGCAGCCG
GTGGCGAACC GCGCGCTGCC GAGCGTGAAG CGGCAGCGCG CGCTGCCCGG CGCGCTCGAT
CTGCGCGGCG TCGAGCTGAC GCTGCCGGAG CTGCCGTCCG CGCAGGTCGC GGCGCTGCGC
GAACGCGCGG GCACGCTCGG CCTGGACGGC GCGCGCGTGC CGGTGTGGGG CGTCGTCGCG
CCGCGCCGGC TGCCCGCCGA CATCGCGGTG CCGGGCGGCT ACCGGCTCGC GATCGGCCCG
CGCGGCGCGT TCATCGAGGG GGCCGATCGC GCGGGCCTCT ACTACGGCGT GCAGACGCTC
TTCTCGCTCG TGCCGGCCGG CGGCGCGACG GTGCCCGCGA TGCTGATCGA AGACGCGCCG
CGCTTCACGC ACCGCGGGAT GCACGTCGAT CTCGCGCGCA ACTTCAAGCC GCCCGCCACG
CTGCGCCGGC TGATCGACCA GATGAGCGCG TACAAGCTCA ACCGGCTGCA TCTGCACCTG
TCCGACGACG AGGGCTGGCG CATCGAGATT CCCGGCCTGC CCGAGCTGAC CGACGTCGGC
GCGCGCCGCT GCCACGACCC GAGCGAGACG CGCTGCCTGC TGCCGCAGCT CGGCTCGGGG
CCCGACGATC GTTCGGGCGG CGGCTACCTG ACGCGCGACG ACTACGTCGC GCTGTTGCGC
TACGCGGCCG AGCGCTTCGT CGAAGTGATC CCCGAGATCG ACATGCCCGC GCACTCGCGC
GCGGCCGTCG TATCGATGGA GGCGCGCTAT CGCCGCCTGC ACGCGGCGGG CCGCGAGCGG
GAAGCGAACG CGTATCGGCT GCTCGATGCG CAGGACACGT CGAACCTGCT GACCGTGCAG
TTCTACGACC GGCGCAGCGA TCTGAACCCG TGCATGCCGG GCGCGCTGAA CTTCGCGTCG
AAGGTGCTCC GCGAGATCGC GTCGATGCAC GCGGACGCGC AAGCGCCGCT GCGGATCTGG
CACTTCGGCG GCGACGAGGC GAAGAACATC CTGCTCGGCG CGGGCTTCCA GCCGCTCGAC
GGCGCCGATC CCGGCAAGGG CCGCGTCGAT CTCGCCGCGC AGGACAAGCC GTGGGCGCGC
TCGCCCGCCT GTACGGCGCT GCTTCGGCGC GGCGAGATCA AATCGATCGA TGAATTGCCG
ACGCGCTTCG CGAAGCAGGT CAGCGCGATC GTGAACGCGA ACGGAATCGG CACGATGGCC
GCGTGGCAGG ACGGCATCAA GCACGCGAGC GGGCCGCGGG AGTTCAGCAC GCGGCACGTG
ATGGTGTCGC TGTGGGACAC CATCTTCTGG GGCGCGTCCG ACAGCGCGCG CGATCTGAGC
GCGAAGGGCT ACCGGACCGT GCTCGCGCTG CCCGATTACC TGTACTTCGA TTTCCCGTAC
ACGCGCAATC CGCGCGAGCG CGGCTATTAC TGGGGCTCGC AGGCGACGGA CGAGTACAAG
GTGTTCTCGC TCGCGCCGGA GAACCTGCCG CAGAACGCCG AGGTGTTCGG CGATCGCGAC
GGCAACCCGT TCGAGGTGAC GAGTGCGGGC GCGGCGCCGA GCATCGAGGG CATCCAGGGG
CAGGCGTGGG GCGAGGTGAT GCGCAACGGG CAACTGCTCG AATACATGGT GTATCCGCGC
CTTCTCGCGC TCGCCGAGCG CGCGTGGCAC AAGGCTGACT GGGAACTGCC CTACGCGGCC
GGCGTGCGCT ACAAGCTCGG CGACACGCAT CACGTCGACA CGGCCGCGCT CGAGCGCGAC
TGGGCGGGCT TCGCGACGGT GCTCAAGCAG CGCGAACTGC CGAAGCTCGA GCGTGCGGGC
ATCGGGTATC GCAAGCCGAC GTTTACGCTG ACGGGCGAAT GA
 
Protein sequence
MNRISHSLCA ALLAAATLLP TASRAQLPAR PTAGAAAPAT AAPVRPASTP AELAARLADG 
LAVRVAVDNN HAASAGVPCA DLGADWASCA TGRLILQNRG HSPLTDGGWK LYLHSIRRLL
RIDRPGFTLR HLTGDLYELT PQPGTVRLAQ GERIELPFVA EYWLRRYSDV IPRPYVVVDG
AAPAVLRYDD TDDELRYVET LPADAQNNSP GNAPPAAAQP VANRALPSVK RQRALPGALD
LRGVELTLPE LPSAQVAALR ERAGTLGLDG ARVPVWGVVA PRRLPADIAV PGGYRLAIGP
RGAFIEGADR AGLYYGVQTL FSLVPAGGAT VPAMLIEDAP RFTHRGMHVD LARNFKPPAT
LRRLIDQMSA YKLNRLHLHL SDDEGWRIEI PGLPELTDVG ARRCHDPSET RCLLPQLGSG
PDDRSGGGYL TRDDYVALLR YAAERFVEVI PEIDMPAHSR AAVVSMEARY RRLHAAGRER
EANAYRLLDA QDTSNLLTVQ FYDRRSDLNP CMPGALNFAS KVLREIASMH ADAQAPLRIW
HFGGDEAKNI LLGAGFQPLD GADPGKGRVD LAAQDKPWAR SPACTALLRR GEIKSIDELP
TRFAKQVSAI VNANGIGTMA AWQDGIKHAS GPREFSTRHV MVSLWDTIFW GASDSARDLS
AKGYRTVLAL PDYLYFDFPY TRNPRERGYY WGSQATDEYK VFSLAPENLP QNAEVFGDRD
GNPFEVTSAG AAPSIEGIQG QAWGEVMRNG QLLEYMVYPR LLALAERAWH KADWELPYAA
GVRYKLGDTH HVDTAALERD WAGFATVLKQ RELPKLERAG IGYRKPTFTL TGE
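
The sequences above are laid out in space-separated blocks of ten characters, so they need to be joined before any computation. The sketch below, in Python, shows one way to do that and to compute a GC percentage. It is an assumption that the GC content field is a plain G+C fraction over the whole gene sequence, and `clean_sequence` and `gc_percent` are hypothetical helper names.

```python
def clean_sequence(block_text: str) -> str:
    """Join a block-formatted sequence by dropping all whitespace."""
    return "".join(block_text.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    gc = sum(1 for base in dna if base in "GC")
    return 100.0 * gc / len(dna)


# First line of the gene sequence shown above, used as a small example.
first_line = "ATGAACCGAA TTTCGCATTC CCTGTGCGCC GCGCTATTGG CCGCCGCGAC GCTGTTGCCC"
dna = clean_sequence(first_line)
print(len(dna), round(gc_percent(dna), 1))
```

Passing the full gene sequence to `clean_sequence` and then `gc_percent` gives a value to compare against the GC content field.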