Gene BMASAVP1_A0143 details

Gene Information

Locus tag: BMASAVP1_A0143
Symbol:
ID: 4678944
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia mallei SAVP1
Kingdom: Bacteria
Replicon accession: NC_008785
Strand:
Start bp: 153003
End bp: 155504
Gene Length: 2502 bp
Protein Length: 833 aa
Translation table: 11
GC content: 72%
IMG OID: 639844421
Product: putative beta-N-acetylhexosaminidase
Protein accession: YP_991494
Protein GI: 121600904
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG3525] N-acetyl-beta-hexosaminidase
TIGRFAM ID:
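
The coordinate and length fields in this record can be cross-checked against each other. The sketch below is only an illustration: it assumes the start and end coordinates are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 153003
END_BP = 155504
GENE_LENGTH_BP = 2502
PROTEIN_LENGTH_AA = 833


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive endpoints."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons hold for the values in this record.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 155504 - 153003 + 1 gives 2502 bp, and 2502 / 3 - 1 gives 833 residues, matching the listed gene and protein lengths.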


Plasmid Coverage information

Num covering plasmid clones: 33
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAACCGAA TCTCGCATTC CCTGTGCGCC GCGCTATTGG CCGCCGCGAC GCTGTTGCCC 
ACGGCCTCGC GCGCGCAACT GCCCGCGCGC CCCACGGCCG GCGCGGGCGC GCCCGCGACG
GCCGCGCCCG TGCGGCCCGC GTCCACGCCG GCCGAGCTCG CCGCGCGGCT CGCCAACGGC
CTCGCGGTGC GCGTGGCCGT CGACAACAAT CACGCGGCAT CGGCCGGCGT GCCGTGCGCC
GACCTCGGCG CGGACTGGGC GAGCTGCGCG ACGGGCCGCC TGATCCTGCA GAATCGCGGC
CACTCGCCCC TCACCGACGG CGGCTGGAAG CTCTATCTGC ACAGCATCCG CCGGCTGCTC
CGAATCGACC GCCCCGGCTT CACGCTGCGC CATCTGACGG GCGATCTATA CGAGCTGACG
CCGCAGCCCG GCACGGTAAG GCTCGCGCAG GGCGAGCGCA TCGAGCTGCC GTTCGTCGCC
GAATACTGGC TGCGCCGCTA CAGCGACGTG ACCCCGCGCC CGTACGTGGT CGTCGACGGC
GCGGCGCCCG CGGTGCTGCG CTACGACGAT ACCGACGACG AGCTGCGCTA CGTCGAAACG
CTGCCCGCCG ACGCGCAGAA CAACTCGCCC GGCAATGCGC CGCCCGCCGC CGCGCAGCCG
GTGGCGAACC GCGCGCTGCC GAGCGTGAAG CGGCAGCGCG CGCTGCCCGG CGCGCTCGAT
CTGCGCGGCG TCGAGCTGAC GCTGCCGGAG CTGCCGTCCG CGCAGGTCGC GGCGCTGCGC
GAACGCGCGG GCACGCTCGG CCTGGACGGC GCGCGCGTGC CGGTGTGGGG CGTCGTCGCG
CCGCGCCGGC TGCCCGCCGA CATCGCGGTG CCGGGCGGCT ACCGGCTCGC GATCGGCCCG
CGCGGCGCGT TCATCGAGGG GGCCGATCGC GCGGGCCTCT ACTACGGCGT GCAGACGCTC
TTCTCGCTCG TGCCGGCCGG CGGCGCGACG GTGCCCGCGA TGCTGATCGA AGACGCGCCG
CGCTTCACGC ACCGCGGGAT GCACGTCGAT CTCGCGCGCA ACTTCAAGCC GCCCGCCACG
CTGCGCCGGC TGATCGACCA GATGAGCGCG TACAAGCTCA ACCGGCTGCA TCTGCACCTG
TCCGACGACG AGGGCTGGCG CATCGAGATT CCCGGCCTGC CCGAGCTGAC CGACGTCGGC
GCGCGCCGCT GCCACGACCC GAGCGAGACG CGCTGCCTGC TGCCGCAGCT CGGCTCGGGG
CCCGACGATC GTTCGGGCGG CGGCTACCTG ACGCGCGACG ACTACGTCGC GCTGCTGCGC
TACGCGGCCG AGCGCTTCGT CGAAGTGATC CCCGAGATCG ACATGCCCGC GCACTCGCGC
GCGGCCGTCG TATCGATGGA GGCGCGCTAT CGCCGCCTGC ACGCGGCGGG CCGCGAGCGG
GAAGCGAACG CGTATCGGCT GCTCGATGCG CAGGACACGT CGAACCTGCT GACCGTGCAG
TTCTACGACC GGCGCAGCGA TCTGAACCCG TGCATGCCGG GCGCGCTGAA CTTCGCGTCG
AAGGTGATCC GCGAGATCGC GTCGATGCAC GCGGACGCGC AAGCGCCGCT GCGGATCTGG
CACTTCGGCG GCGACGAGGC GAAGAACATC CTGCTCGGCG CGGGCTTCCA GCCGCTCGAC
GGCGCCGATC CCGGCAAGGG CCGCGTCGAT CTCGCCGCGC AGGACAAGCC GTGGGCGCGC
TCGCCCGCCT GTACGGCGCT GCTTCGGCGC GGCGAGATCA AATCGATCGA CGAATTGCCG
ACGCGCTTCG CGAAGCAGGT CAGCGCGATC GTGAACGCGA ACGGAATCGG CACGATGGCC
GCGTGGCAGG ACGGCATCAA GCACGCGAGC GGGCCGCGGG AGTTCAGCAC GCGGCACGTG
ATGGTGTCGC TGTGGGACAC CATCTTCTGG GGCGCGTCCG ACAGCGCGCG CGATCTGAGC
GCGAAGGGCT ACCGGACCGT GCTCGCGCTG CCCGATTACC TGTACTTCGA TTTCCCGTAC
ACGCGCAATC CGCGCGAGCG CGGCTATTAC TGGGGCTCGC AGGCGACGGA CGAGTACAAG
GTGTTCTCGC TCGCGCCGGA GAACCTGCCG CAGAACGCCG AGGTGTTCGG CGATCGCGAC
GGCAACCCGT TCGAGGTGAC GAGCGCAGGC GCGGCGCCGA GCATCGAGGG CATCCAGGGG
CAGGCGTGGG GCGAGGTGAT GCGCAACGGG CAACTGCTCG AATACATGGT GTATCCGCGC
CTTCTCGCGC TCGCCGAGCG CGCGTGGCAC AAGGCCGACT GGGAACTGCC CTACGCGGCC
GGCGTGCGCT ACAAGCTCGG CGACACGCAT CACGTCGACA CGGCCGCGCT CGAGCGCGAC
TGGGCGGGCT TCGCGACGGT GCTCAAGCAG CGCGAACTGC CGAAGCTCGA GCGTGCGGAC
ATCGGGTATC GCAAGCCGAC GTTTACGCTG ACGGGCGAAT GA
 
Protein sequence
MNRISHSLCA ALLAAATLLP TASRAQLPAR PTAGAGAPAT AAPVRPASTP AELAARLANG 
LAVRVAVDNN HAASAGVPCA DLGADWASCA TGRLILQNRG HSPLTDGGWK LYLHSIRRLL
RIDRPGFTLR HLTGDLYELT PQPGTVRLAQ GERIELPFVA EYWLRRYSDV TPRPYVVVDG
AAPAVLRYDD TDDELRYVET LPADAQNNSP GNAPPAAAQP VANRALPSVK RQRALPGALD
LRGVELTLPE LPSAQVAALR ERAGTLGLDG ARVPVWGVVA PRRLPADIAV PGGYRLAIGP
RGAFIEGADR AGLYYGVQTL FSLVPAGGAT VPAMLIEDAP RFTHRGMHVD LARNFKPPAT
LRRLIDQMSA YKLNRLHLHL SDDEGWRIEI PGLPELTDVG ARRCHDPSET RCLLPQLGSG
PDDRSGGGYL TRDDYVALLR YAAERFVEVI PEIDMPAHSR AAVVSMEARY RRLHAAGRER
EANAYRLLDA QDTSNLLTVQ FYDRRSDLNP CMPGALNFAS KVIREIASMH ADAQAPLRIW
HFGGDEAKNI LLGAGFQPLD GADPGKGRVD LAAQDKPWAR SPACTALLRR GEIKSIDELP
TRFAKQVSAI VNANGIGTMA AWQDGIKHAS GPREFSTRHV MVSLWDTIFW GASDSARDLS
AKGYRTVLAL PDYLYFDFPY TRNPRERGYY WGSQATDEYK VFSLAPENLP QNAEVFGDRD
GNPFEVTSAG AAPSIEGIQG QAWGEVMRNG QLLEYMVYPR LLALAERAWH KADWELPYAA
GVRYKLGDTH HVDTAALERD WAGFATVLKQ RELPKLERAD IGYRKPTFTL TGE
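
The sequences above are printed in space-separated blocks of 10, so they need whitespace removed before any computation. The following sketch shows one way to do that, compute a GC fraction, and translate a coding sequence. It is an illustration with hypothetical helper names, not the method the source database uses. Translation table 11 assigns the same amino acids to codons as the standard code and differs in which codons may act as start codons; because this gene begins with ATG, the sketch does not handle alternative starts.

```python
from itertools import product

# Codon-to-amino-acid assignments shared by the standard code and table 11,
# listed in the conventional TCAG order ("*" marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the block-of-10 layout."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon.

    Codons containing anything other than A, C, G, T raise KeyError.
    """
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    first_blocks = "ATGAACCGAA TCTCGCATTC CCTGTGCGCC"
    print(translate(first_blocks))
    print(round(gc_fraction(first_blocks), 3))
```

Applied to the first three blocks of the gene sequence, `translate` returns the first ten residues of the protein sequence, MNRISHSLCA. Passing the full gene sequence to `gc_fraction` is one way to compare against the listed GC content of 72%.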