Gene BURPS668_0638 details


Gene Information

Locus tag: BURPS668_0638
Symbol:
ID: 4882323
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 668
Kingdom: Bacteria
Replicon accession: NC_009074
Strand:
Start bp: 610694
End bp: 613213
Gene Length: 2520 bp
Protein Length: 839 aa
Translation table: 11
GC content: 73%
IMG OID: 640126566
Product: glycosy hydrolase family protein
Protein accession: YP_001057690
Protein GI: 126442197
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG3250] Beta-galactosidase/beta-glucuronidase
TIGRFAM ID:
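
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon (the gene sequence in this record ends in TGA, which is consistent with that). The function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of amino acids for a CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(610694, 613213)
print(length, protein_length(length))  # prints: 2520 839
```

Both values agree with the Gene Length and Protein Length fields of this record.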


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAATCCG CGCCGGATCG CGTGGCGCGC GGCGCCGCCC AATGGACGTT GATCGCGACG 
CCCGCTGGCG CGATCGCGCG ACCGAGCGAG CTCGGCGAAG CCGGCTGGTG CGCGGCAAGC
GTGCCCGGCA CCGTCGCGCA GGCGCTCGCC GCCGCGCGCC GCTTCGATCC CGCGCATCCG
TACCCGCTCG GCGACAGCGA CTACTGGTAC CGCACGACGC TGCACGGCGC GGGGCCGCGC
ATCGTCCGGC TCAACGGCCT CGCGACGATC GCCGAGGTCT GGCTCGACGA CACGCTGCTG
CTCTGCTCGG ACAACATGTA CGTCGCGCAC GATCTGCCCG TGACGCTCGG CGGCGCGCAC
CGTCTCGCGC TCTGCTTTCG CTCGCTCGAC CGGCACCTCG CCGAGCACCC GCCGCGCGGC
CGCGCGCGCT GGCGCACGCG CCTCGTCGAT ACGCCCGCGC TGCGCGGCGT GCGCGCGACG
TTCCTCGGCC GAATGCCGGG CTGGTTCCCG GCGATCGAGC CGGTCGGCCC GTGGCGTCCG
ATCGACATCG TGAATCCGGC CGGGGCGCCG ACGATCGTGC GCGACACGCT GCGCGCGACG
CTCGACGGGC GCGACGGCGT GCTCGACGCG ACGCTCGAGT TCGCCGCGCC GCTTCCGAGA
ACGGCGCGCG CGCAGCTCGT CTGCGGCGAG CATGCGGCGC CGCTCGAAGC GACCGGTCCG
CGCACCGCGC GCGCAACGCT CAGGATCGCG AACGTCACGC CGTGGTGGCC GCATACGCAC
GGCGAACCCG CGTTGTACGA CGTCGGCGTG GCAATCGGCG GCGCAACGAT TGCGCTCGCC
AAAACGGGCT TTCGCACGCT CGCCGTCGAG CGCGGCGCGG ACGGCCGCGG CTTCGCGCTG
TCGGTCAACG GCACGCCGCT TTTCGCGCGC GGCGCATGCT GGACGAGCGC CGATCCCGTC
GGGCTGCACG CCGATGCGCC CGCCTATCGC CGCGCGCTCG CGCTCGCGCG CGACGCCGGC
TGCAACATGA TCCGCGTCGG CGGCACGATG ATCTACGAGG CCGACGCGTT CTACGCGCTC
TGCGACGAGT TGGGGCTGCT CGTGTGGCAA GATTTCATGC TCGCGAACCT CGACTATCCG
TCGAACGATC CGCGCTTTGC CGAATCGCTC AAGCGCGAGG CCGAGCAGTT CCTCGGCCGG
CACATGGCGC GGCCGTCGAT CGCGGTGCTG TGCGGCGGCA GCGAGATCGC GCAGCAGGCC
GCGATGGTCG GCCTCGCGCC CGACGAGCGC CGCGTGCCCG CCACCGAGCA ATGGCTCGCC
GAACTGTGCG CCGCGCATCG CCCCGATGCC GCGTACGTCA GCGATTCGCC GCACGGCGGC
GTGCTGCCGT TCGCGCCGCG CGAGGGCGTT ACGCACTACT ACGGCGTCGG CGCGTACCTG
CGCCCTCCCG AGGATGCACG CCGCGCCGGC GTGCGCTTCG CGAGCGAGTG CCTCGCGTTC
GCGAACGTGC CGTGCGACGC GACGCTCGCC TCGATCGGCT CGCCCGCCGC GCACGAGCCG
GCCTGGAAAC GCGCGGTGCC GCGCGATCCC GGCGCGCCGT GGGATTTCGA CGACGTGCGC
GATCACTACC TGCGCACGCT GTACGGCGTC GAGCCCGCGC GCTTGCGCAG CATCGATCCC
GCCCGCTATC TGACGCTGTC GCGCGCCGTC GTCGCCGATC TCGTCGGGGA GACGCTCGCC
GAGTGGCGCC GCGTCGGCTC CTCGTGCGCG GGCGCGCTCG TCTGGCAGTT CCAGGACGTG
ATGCCGGGCG CGGGCTGGGG CCTCGTCGAC GCGCACGGCC GGCCGAAATC CGCATGGCAT
GCGTTGCGGC GCGTATCGCA GCCGCGGCAG ATCCTGCTGA CCGACGAAGG GCTCAACGGC
CTCGACGTGC ACGTGCTCAA CGATGCGCCC GCGCCGCTCG AAGCCCGCAT CGAGCTCGTC
GCGCTACGCG ACGGCAAGAC GCCGGTCGCG CGCGCGGCCC GCACGGTCCA CGTCGCCGCC
CACGCGGGCC AATGCGTGAA TTCGGCCGAC CTGCTCGGCC GATTCTTCGA TTTCACCTAT
GCGTACCGCT TCGGCCCGCG CGAGCACGAC GTCGTGATCG CATCGCTGTA CGCGAGCGAC
GGCGCGCTGC TGTCGCAGGC GTTCCACTTT CCCGAACGCA CCGCGCCGAC CGTGTTCGAG
CGCGGCGACA TCGGTCTCGA GGCGAGCGCC GCGTATCGAG ACGGCCGCTG GTGCGTGCAG
GTGCAGACGC GCACGTTCGC GCGCTACGTG CATGTGTGCG CGCCGGGCCT GCTGCCCGAC
ATCGACTGGT TCCATCTCGC GCCGGGTGCC GCCGCGCGGA TCGAGTTCGC CGCCGACCCT
CATTCTCCCG CTCCCGACCA CCGCCCGCCC GCAGCCGACG CGGCCCACTG CGCCCCGCCC
GCGATCGAGG TGCGCGCACT CAATTCCAAC AAGACCATTC GCCCGAGGAT AGAAAATTGA
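
A GC content figure such as the one in the Gene Information section is conventionally the percentage of G and C bases in the sequence. The following is a minimal sketch of that calculation, assuming the sequence is pasted as text with the 10-base spacing and line breaks used above. The helper name `gc_content` is hypothetical, and the record does not state how the source database computed or rounded its own value.

```python
def gc_content(dna: str) -> float:
    """Percentage of G and C bases, ignoring whitespace and letter case."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Usage: pass the full gene sequence as a single string, spaces included.
print(gc_content("ATGC ATGC"))  # prints: 50.0
```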
 
Protein sequence
MKSAPDRVAR GAAQWTLIAT PAGAIARPSE LGEAGWCAAS VPGTVAQALA AARRFDPAHP 
YPLGDSDYWY RTTLHGAGPR IVRLNGLATI AEVWLDDTLL LCSDNMYVAH DLPVTLGGAH
RLALCFRSLD RHLAEHPPRG RARWRTRLVD TPALRGVRAT FLGRMPGWFP AIEPVGPWRP
IDIVNPAGAP TIVRDTLRAT LDGRDGVLDA TLEFAAPLPR TARAQLVCGE HAAPLEATGP
RTARATLRIA NVTPWWPHTH GEPALYDVGV AIGGATIALA KTGFRTLAVE RGADGRGFAL
SVNGTPLFAR GACWTSADPV GLHADAPAYR RALALARDAG CNMIRVGGTM IYEADAFYAL
CDELGLLVWQ DFMLANLDYP SNDPRFAESL KREAEQFLGR HMARPSIAVL CGGSEIAQQA
AMVGLAPDER RVPATEQWLA ELCAAHRPDA AYVSDSPHGG VLPFAPREGV THYYGVGAYL
RPPEDARRAG VRFASECLAF ANVPCDATLA SIGSPAAHEP AWKRAVPRDP GAPWDFDDVR
DHYLRTLYGV EPARLRSIDP ARYLTLSRAV VADLVGETLA EWRRVGSSCA GALVWQFQDV
MPGAGWGLVD AHGRPKSAWH ALRRVSQPRQ ILLTDEGLNG LDVHVLNDAP APLEARIELV
ALRDGKTPVA RAARTVHVAA HAGQCVNSAD LLGRFFDFTY AYRFGPREHD VVIASLYASD
GALLSQAFHF PERTAPTVFE RGDIGLEASA AYRDGRWCVQ VQTRTFARYV HVCAPGLLPD
IDWFHLAPGA AARIEFAADP HSPAPDHRPP AADAAHCAPP AIEVRALNSN KTIRPRIEN
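
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. As a rough sketch, the code below builds the standard codon assignments, which translation table 11 shares (table 11 differs only in its set of permitted start codons), and translates the first line of the gene sequence. The names `CODON_TABLE` and `translate` are hypothetical. The sketch does not handle alternative start codons, which is sufficient here because this gene starts with ATG.

```python
from itertools import product

# Codon assignments in NCBI order (bases T, C, A, G); shared by tables 1 and 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA codon by codon, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


first_line = "ATGAAATCCG CGCCGGATCG CGTGGCGCGC GGCGCCGCCC AATGGACGTT GATCGCGACG"
print(translate(first_line))  # prints: MKSAPDRVARGAAQWTLIAT
```

The output matches the first 20 residues of the protein sequence above.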