Gene BURPS1106A_A0244 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1106A_A0244 
SymbolclpB 
ID4904425 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1106a 
KingdomBacteria 
Replicon accessionNC_009078 
Strand
Start bp231358 
End bp234297 
Gene Length2940 bp 
Protein Length979 aa 
Translation table11 
GC content72% 
IMG OID640143351 
Productchaperone clpB 
Protein accessionYP_001074287 
Protein GI126456099 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0542] ATPases with chaperone activity, ATP-binding subunit 
TIGRFAM ID[TIGR03345] type VI secretion ATPase, ClpV1 family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.707407 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCGATCT CGCGTCAGGC GCTGTTCGGG AAACTCGGGG CCACGCTCTT CAGGGCGATC 
GAATCGGCCA CGGTTTTCTG CAAGCTGCGC GGCAATCCGT ACGTCGAGCT GGTGCACTGG
CTGCAGCAGT TGTTGCAGCA GTCCGACTCC GATCTGCACC GGATCGTGCG GCACGCCGGC
ATCGAGCGCG ACGCGCTCGA TCGCGACATC GCGCGCGCGC TCGCCACGCT GCCGGCCGGC
GCGGGCTCGA TCAGCGATTT TTCCCATCAC GTCGAAGCCG CGATCGAGCG CGCATGGGTG
CTCGCAACGC TGCGCTTCGG CGACCGGCGC ATCCGCGGCG CATGGCTCGT CGCCGCGCTC
GTCGATACGC CGGAACTGCG GCGCGTGCTG CTTTCCATTT CGCCGGCGTT CGCGAGGATT
CCTCACGACG ACGCTCTCGA CGACGTGCTG CCCGCGTGGA CGGCCGGTTC GCCGGAAGCG
GCGGACGCGC CGTACGACCA CGTCGATTCC GCGCCCGCTT CGCCGGGCGA GCCGTCCGGC
GCGACGCGCG CCGCGCCGAA CGGCTCGCCG CTCGAGCGAT ATTGCACGGA CCTGACCGCG
CGCGCGCGCG ACGGCGACAT CGACCCCGTC ATCGGGCGCG AGCTCGAAAT CCGCACGATG
ACCGACGTGC TGCTGCGGCG CAGGCAGAAC AATCCGCTGC TCACCGGCGA GGCGGGCGTG
GGCAAGACGG CCGTCGTCGA GGGCCTCGCG CTCGCGATCG CGAACGGCGA CGTGCCGCCG
AAGCTGGCCG ACGTGCGCCT GATGAGCGTC GACGTGGGCG CGCTGCTGGC CGGCGCGGGC
ATGAAAGGCG AATTCGAATC GCGCCTGAAA GGCGTGCTCG AGGCCGCGGC GAAATCCGTC
GCGCCCGTCA TCCTGTTCGT CGACGAGATT CACACGCTGA TCGGCGCGGG CGGACAGGCC
GGCACGGGCG ACGCGGCGAA CCTGCTCAAG CCCGCGCTCG CGCGCGGCAC GATCCGCACG
ATCGGCGCGA CGACATGGGC GGAGTACAAG CGGCACATCG AAAAGGATCC CGCATTGACC
CGCCGCTTTC AGGTGCTGCA AGTGCCGGAG CCCGAAGAGC CGGCCGCGGT GCACATGGTG
CGGGGCGTCG CGCGAGCGTT CGCGCGGCAC CACCGCGTGA CGGTGCGCGA CGAGGCGATC
CGCGCCGCCG TCGCGCTGTC GCACCGCTAC ATTCCGTCGC GGCATCTGCC GGACAAGGCG
ATCAGCCTCC TCGACACCGC ATGCGCGCGC GTCGCGCTCT CGCAGCACGC CGCGCCCGGC
GAACTGCAGC ACGTACGCCA GCGCTTGCTC GCGGCGCGCG CCGAGCGCGA TCTGCTCGAA
CAGGAGGCGC GCATCGGGCT CGACGCCGGG CAATCGCTCG CGGCGGTGCG CGAACGCATC
GAAGCGCTCG CGGCCGAGGA AGCGGCCGTC GACGCGCGCT GGAAGGCGCA GGCCGACGCG
GCGCGCGCGC TGCTCGCCGC GCGCGAGGCC GCACTCGCGG AATGTCACCG CGAGTCTTGC
TCCGAAACGC GCGCCGGCTC GCTCTCCGAA TCTCGTACCG AATCTCGTAC CGAATCGCGC
GCCAGATCAC ACATCGACTC CAGCGCCTAT GCTCACAGCG ACGTCCCGGC CGAAATGCAC
GTCGGCTCGC ACGCCGGCTC GCGCGCCGCA ACGTGTCCGG ACACGCACGC CGAAGCGCAC
GCCGCCCCCG CCTCGCCCCC GCCCGCCGCC GATACGCCGC ACGCCGGCGC CGCCCCCGGG
CTGCGCGAGC TCGAACGCGC GCTCGCGGCG GCCCAGGGCG ACGCACCGCT CGTGTTCCCG
GAAGTCGACG AGACGATCGT CGCGCAGATC GTCGCGGATT GGACCGGCAT TCCGGTCGGC
CGCATGATGA CCGACGAAGT CGCCGCCGTG CGCGCGCTGC CCGCGACGCT CGAGGCGCGC
GTGATCGGCC AGCCCGACGC GCTGCGGCAG ATCGGCGAGC GCGTGCAGAC CGCGCGCGCG
GGCCTCGCCG ATCCGAAGAA GCCGCTCGGC GTATTCCTGC TTGCGGGCCC GTCGGGCGTC
GGCAAGACCG AAACGGCGCT CGCGCTCGCC GAGGCGCTGT ACGGCGGCGA ACAGAGCCTG
ATCACGATCA ACATGAGCGA GTACCAGGAA GCCCACACCG TGTCGGGCCT CAAGGGCGCG
CCGCCCGGCT ATGTCGGCTA CGGCGAGGGC GGCGTGCTGA CCGAGGCGGT GCGGCGGCGG
CCGTACAGCG TCGTGCTGCT CGACGAGATC GAGAAGGCGC ACCGCGACGT GCACGAACTC
TTCTTCCAGG TCTTCGACAA GGGCTACATG GAAGACGGCG ACGGCCGCTA CATCGATTTC
CGCAACACGA CGATCCTGCT CACGAGCAAC GTCGGCGCGG AACTGAGCGC GAGCCTGTGT
GCCGACGCAT CGCTCGCGCC CGATGCCGCC GCGCTGCGCG ACGCGCTCAT GCCCGAACTG
CTGAAGGTCT TCCCCGCCGC GTTCCTCGGG CGCGTGAGCG TCGTGCCGTA CCGGCCGCTC
GAAGCGCGCG CGCTCGCGCG CATCGTGCGC CTGCATCTGG ATCGCGTCGT CGCGCGCATG
GCCGAGCGGC ACCGCATCGC GCTCGCGTAC GACGACGCCG TCGTCGACTA CGTCGTCGGG
CGTTGCCTCG TGCAGGAAAC CGGCGCGCGG CTGCTGATCG GATTCATCGA GCAGCACGTG
CTGCCTCGGC TGTCCGCGCT GTGGCTCGAC GCGTTCCCGT CGAAGGCGGC GCTCGCGCGC
ATCGACATCG GTGTGGCCGA CGCGGCCGCG CCCGCGGCGC GCGCGCTCGT CTTCCGGCCC
GGCCAAGCAA GCCGGGCGGG GCCGCCGAAC GCGCCGCTCA CCGCCGTGCA GGCCGGCTAG
 
Protein sequence
MAISRQALFG KLGATLFRAI ESATVFCKLR GNPYVELVHW LQQLLQQSDS DLHRIVRHAG 
IERDALDRDI ARALATLPAG AGSISDFSHH VEAAIERAWV LATLRFGDRR IRGAWLVAAL
VDTPELRRVL LSISPAFARI PHDDALDDVL PAWTAGSPEA ADAPYDHVDS APASPGEPSG
ATRAAPNGSP LERYCTDLTA RARDGDIDPV IGRELEIRTM TDVLLRRRQN NPLLTGEAGV
GKTAVVEGLA LAIANGDVPP KLADVRLMSV DVGALLAGAG MKGEFESRLK GVLEAAAKSV
APVILFVDEI HTLIGAGGQA GTGDAANLLK PALARGTIRT IGATTWAEYK RHIEKDPALT
RRFQVLQVPE PEEPAAVHMV RGVARAFARH HRVTVRDEAI RAAVALSHRY IPSRHLPDKA
ISLLDTACAR VALSQHAAPG ELQHVRQRLL AARAERDLLE QEARIGLDAG QSLAAVRERI
EALAAEEAAV DARWKAQADA ARALLAAREA ALAECHRESC SETRAGSLSE SRTESRTESR
ARSHIDSSAY AHSDVPAEMH VGSHAGSRAA TCPDTHAEAH AAPASPPPAA DTPHAGAAPG
LRELERALAA AQGDAPLVFP EVDETIVAQI VADWTGIPVG RMMTDEVAAV RALPATLEAR
VIGQPDALRQ IGERVQTARA GLADPKKPLG VFLLAGPSGV GKTETALALA EALYGGEQSL
ITINMSEYQE AHTVSGLKGA PPGYVGYGEG GVLTEAVRRR PYSVVLLDEI EKAHRDVHEL
FFQVFDKGYM EDGDGRYIDF RNTTILLTSN VGAELSASLC ADASLAPDAA ALRDALMPEL
LKVFPAAFLG RVSVVPYRPL EARALARIVR LHLDRVVARM AERHRIALAY DDAVVDYVVG
RCLVQETGAR LLIGFIEQHV LPRLSALWLD AFPSKAALAR IDIGVADAAA PAARALVFRP
GQASRAGPPN APLTAVQAG