Gene BURPS1106A_0556 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1106A_0556 
Symbol 
ID4901239 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1106a 
KingdomBacteria 
Replicon accessionNC_009076 
Strand
Start bp526713 
End bp529214 
Gene Length2502 bp 
Protein Length833 aa 
Translation table11 
GC content72% 
IMG OID640133786 
Productglycosy hydrolase family protein 
Protein accessionYP_001064839 
Protein GI126451570 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3525] N-acetyl-beta-hexosaminidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACCGAA TTTCGCATTC CCTGTGCGCC GCGCTATTGG CCGCCGCGAC GCTGTTGCCC 
ACGGCCTCGC GCGCGCAACT GCCCGCGCGC CCCATGGCCG GCGCGGCCGC GCCCGCGACG
GCCGCGCCCG TGCGGCCCGC GTCCACGCCG GCCGAGCTCG CCGCGCGGCT CGCCAACGGC
CTCGCGGTGC GCGTGGCCGT CGACAACAAT CACGCGGCAT CGGCCGGCGT GCCGTGCGCC
GACCTCGGCG CGGACTGGGC GAGCTGCGCG ACGGGCCGCC TGATCCTGCA GAATCGCGGC
CACTCGCCCC TCACCGACGG CGGCTGGAAG CTCTATCTGC ACAGCATCCG CCGGCTGCTC
CGAATCGACC GCCCCGGCTT CACGCTGCGC CATCTGACGG GCGATCTATA CGAGCTGACG
CCGCAGCCCG GCACGGTAAG GCTCGCGCAG GGCGAGCGCA TCGAGCTGCC GTTCGTCGCC
GAATACTGGC TGCGCCGCTA CAGCGACGTG ATCCCGCGCC CGTACGTGGT CGTCGACGGC
GCGGCGCCCG CGGTGCTGCG CTACGACGAT ACCGACGACG AGCTGCGCTA CGTCGAAACG
CTGCCCGCCG ACGCGCAGAA CAATTCGCCC GGCAATGCGC CGCCCGCCGC CGCGCAGCCG
GTGGCGAACC GCGCGCTGCC CAGCGTGAAG CGGCAGCGCG CGCTGCCCGG CGCGCTCGAT
CTGCGCGGCG TCGAGCTGAC GCTGCCGGAG CTGCCGTCCG CGCAGGTCGC GGCGCTGCGC
GAACGTGCGG GCACGCTCGG CCTGGACGGC GCGCGCGTGC CGGTGTGGGG CGTCGTCGCG
CCGCGCCGGC TGCCCGCCGA CATCGCGGTG CCGGGCGGCT ACCGGCTCGC GATCGGCCCG
CGCGGCGCGT TCATCGAGGG GGCCGATCGC GCGGGCCTCT ACTACGGCGT GCAGACGCTC
TTCTCGCTCG TGCCGGCCGG CGGCGCGACG GTGCCCGCGA TGCTGATCGA AGACGCGCCG
CGCTTCACGC ACCGCGGGAT GCACGTCGAT CTCGCGCGCA ACTTCAAGCC GGCCGCCACG
CTGCGCCGGC TGATCGACCA GATGAGCGCG TACAAGCTCA ACCGGCTGCA TCTGCACCTG
TCCGACGACG AGGGCTGGCG CATCGAGATT CCCGGCCTGC CCGAGCTGAC CGACGTCGGC
GCGCGCCGCT GCCACGACCC GAGCGAGACG CGCTGCCTGC TGCCGCAGCT CGGCTCGGGG
CCCGACGATC GTTCGGGCGG CGGCTACCTG ACGCGCGACG ACTACGTCGC GCTGCTGCGC
TACGCGGCCG AGCGCTTCGT CGAAGTGATC CCCGAGATCG ACATGCCCGC GCACTCGCGC
GCGGCCGTCG TATCGATGGA GGCGCGCTAT CGCCGCCTGC ACGCGGCGGG CCGCGAGCGG
GAAGCGAACG CGTATCGGCT GCTCGATGCG CAGGACACGT CGAACCTGCT GACCGTGCAG
TTCTACGACC GGCGCAGCGA TCTGAACCCG TGCATGCCGG GCGCGCTGAA CTTCGCGTCG
AAGGTGATCC GCGAGATCGC GTCGATGCAC GCGGACGCGC AAGCGCCGCT GCGGATCTGG
CACTTCGGCG GCGACGAGGC GAAGAACATC CTGCTCGGCG CGGGCTTCCA GCCGCTCGAC
GGCGCCGATC CCGGCAAGGG CCGCGTCGAT CTCGCCGCGC AGGACAAGCC GTGGGCGCGC
TCGCCCGCCT GTACGGCGCT GCTTCGGCGC GGCGAGATCA AATCGATCGA CGAATTGCCG
ACGCGCTTCG CGAAGCAGGT CAGCGCGATC GTGAACGCGA ACGGAATCGG CACGATGGCC
GCGTGGCAGG ACGGCATCAA GCACGCGAGC GGGCCGCGGG AGTTCAGCAC GCGGCACGTG
ATGGTGTCGC TGTGGGACAC CATCTTCTGG GGCGCGTCCG ACAGCGCGCG CGATCTGAGC
GCGAAGGGCT ACCGGACCGT GCTCGCGCTG CCCGATTACC TGTACTTCGA TTTCCCGTAC
ACGCGCAATC CGCGCGAGCG CGGCTATTAC TGGGGCTCGC AGGCGACGGA CGAGTACAAG
GTGTTCTCGC TCGCGCCGGA GAACCTGCCG CAGAACGCCG AGGTGTTCGG CGATCGCGAC
GGCAACCCGT TCGAGGTGAC GAGCGCAGGC GCGGCGCCGA GCATCGAGGG CATCCAGGGG
CAGGCGTGGG GCGAGGTGAT GCGCAACGGG CAACTGCTCG AATACATGGT GTATCCGCGC
CTTCTCGCGC TCGCCGAGCG CGCGTGGCAC AAGGCCGACT GGGAACTGCC CTACGCGGCC
GGCGTGCGCT ACAAGCTCGG CGACACGCAT CACGTCGACA CGGCCGCGCT CGAGCGCGAC
TGGGCGGGCT TCGCGACGGT GCTCAAGCAG CGCGAACTGC CGAAGCTCGA GCGTGCGGGC
ATCGGGTATC GCAAGCCGAC GTTTACGCTG ACGGGCGAAT GA
 
Protein sequence
MNRISHSLCA ALLAAATLLP TASRAQLPAR PMAGAAAPAT AAPVRPASTP AELAARLANG 
LAVRVAVDNN HAASAGVPCA DLGADWASCA TGRLILQNRG HSPLTDGGWK LYLHSIRRLL
RIDRPGFTLR HLTGDLYELT PQPGTVRLAQ GERIELPFVA EYWLRRYSDV IPRPYVVVDG
AAPAVLRYDD TDDELRYVET LPADAQNNSP GNAPPAAAQP VANRALPSVK RQRALPGALD
LRGVELTLPE LPSAQVAALR ERAGTLGLDG ARVPVWGVVA PRRLPADIAV PGGYRLAIGP
RGAFIEGADR AGLYYGVQTL FSLVPAGGAT VPAMLIEDAP RFTHRGMHVD LARNFKPAAT
LRRLIDQMSA YKLNRLHLHL SDDEGWRIEI PGLPELTDVG ARRCHDPSET RCLLPQLGSG
PDDRSGGGYL TRDDYVALLR YAAERFVEVI PEIDMPAHSR AAVVSMEARY RRLHAAGRER
EANAYRLLDA QDTSNLLTVQ FYDRRSDLNP CMPGALNFAS KVIREIASMH ADAQAPLRIW
HFGGDEAKNI LLGAGFQPLD GADPGKGRVD LAAQDKPWAR SPACTALLRR GEIKSIDELP
TRFAKQVSAI VNANGIGTMA AWQDGIKHAS GPREFSTRHV MVSLWDTIFW GASDSARDLS
AKGYRTVLAL PDYLYFDFPY TRNPRERGYY WGSQATDEYK VFSLAPENLP QNAEVFGDRD
GNPFEVTSAG AAPSIEGIQG QAWGEVMRNG QLLEYMVYPR LLALAERAWH KADWELPYAA
GVRYKLGDTH HVDTAALERD WAGFATVLKQ RELPKLERAG IGYRKPTFTL TGE