Gene BURPS1106A_2835 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1106A_2835 
Symbol 
ID4901047 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1106a 
KingdomBacteria 
Replicon accessionNC_009076 
Strand
Start bp2790070 
End bp2791098 
Gene Length1029 bp 
Protein Length342 aa 
Translation table11 
GC content70% 
IMG OID640136061 
Productbeta-hexosaminidase 
Protein accessionYP_001067082 
Protein GI126452419 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1472] Beta-glucosidase-related glycosidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.261076 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAACTGT CCCCCGGTCC GGTGATGCTC GACGTCGCCG GCACGACGCT CACGCGCGAC 
GACGCGCGCC GCCTCGCGCA TCCGCACACG GGCGGCGTGA TCCTGTTCGC GCGCCACTTC
GAGAGCCGCG CGCAACTCGT CGCGCTGACC GAGGCGATCC GGGCGATCCG CGACGGCATC
CTGATCGCGG TCGATCACGA GGGCGGCCGC GTGCAGCGCT TTCGCACCGA CGGCTTCACC
GTGCTGCCGG CGATGCGCCG GCTCGGCGAG CTGTGGGACA AGGACGTGCT GCACGCGACG
AAGGCGGCGA CCGCGCTCGG CTATGTGCTC GCTTCCGAGC TGCGCGCGTG CGGCATCGAC
ATGAGCTTCA CGCCCGTGCT CGATCTCGAT TACGGCCGCT CGAAGGTGAT CGGCGATCGC
GCGTTCCATC GCGATCCGCG CGTCGTCGCG TTGCTCGCGA AGAGCGTCAA CCACGGGCTC
GCGCTCGCCG GGATGGCGAA CTGCGGCAAG CATTTTCCCG GCCACGGCTT CGCGCAGGCC
GATTCGCACG TCGCGCTGCC GACCGACGAT CGTCCGCTCG ACGAGATCCT CGCGAACGAC
GCGGCGCCGT ACGACTGGCT CGGGCTGTCG TTGTCGGCCG TTATCCCGGC GCACGTGATC
TACACGCAGG TCGATTCGAA GCCGGCCGGC TTCTCGCGCG TGTGGTTGCA GGACGTGCTG
CGCGGCCGGC TGCGCTTTGC GGGCGCCGTG TTCAGCGACG ATCTGTCGAT GGAGGCCGCG
CGCGAGGGCG GCACGCTCGC GCAGTCGGCG CAGGCCGCGC TCGAGGCGGG CTGCGACATG
GTGCTCGTGT GCAACCAGCC GGATGCGGCG GAGCGGGTGC TCGACGAGCT GCGCACGACG
GCGTCGCGCG AATCGTCGCG GCGGATCAAG CAGATGCGGC CGCGCGGCAA GGCGCTCGAG
TGGCGCAAGC TGATGCGCGA GCCGCGCTAT CTGAATGCGC AGGGCCTGTT GCGCAGCACG
TTCGCCTGA
 
Protein sequence
MKLSPGPVML DVAGTTLTRD DARRLAHPHT GGVILFARHF ESRAQLVALT EAIRAIRDGI 
LIAVDHEGGR VQRFRTDGFT VLPAMRRLGE LWDKDVLHAT KAATALGYVL ASELRACGID
MSFTPVLDLD YGRSKVIGDR AFHRDPRVVA LLAKSVNHGL ALAGMANCGK HFPGHGFAQA
DSHVALPTDD RPLDEILAND AAPYDWLGLS LSAVIPAHVI YTQVDSKPAG FSRVWLQDVL
RGRLRFAGAV FSDDLSMEAA REGGTLAQSA QAALEAGCDM VLVCNQPDAA ERVLDELRTT
ASRESSRRIK QMRPRGKALE WRKLMREPRY LNAQGLLRST FA