Gene BURPS668_A0795 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS668_A0795 
Symbol 
ID4888701 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 668 
KingdomBacteria 
Replicon accessionNC_009075 
Strand
Start bp764590 
End bp767460 
Gene Length2871 bp 
Protein Length956 aa 
Translation table11 
GC content72% 
IMG OID640130735 
Productputative ATP-dependent Clp protease, ATP-binding subunit ClpB 
Protein accessionYP_001061794 
Protein GI126445313 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0542] ATPases with chaperone activity, ATP-binding subunit 
TIGRFAM ID[TIGR03345] type VI secretion ATPase, ClpV1 family 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAATCGCG AACGCATATT CAACTGTCTC GGCCGCACGA CCTATGCGGC CCTTGTCGAT 
GCGACGGCGC TGGGCCGATC GCGTCGGCAC GCCTTCATCG ATCTCGATCA CTGGGCGCTG
TGCCTGCTGC AGCGCGAGCA GAGCGATCTC GCGCGGCTGT TCGAGCTGTT CGGCAGCGAT
GTCGGCGAAG CGAAGCGCCG CATGGAGAAG GCGCTCGACG GCTTCGACGT GAGCGGCGAC
TCGCTGCGCG ACATCTCCAG CTCGCTGGAG CGCAGCGTGG GGCCGGCCGT CATCTGGAGC
CAGATCGCGG CGCGCGCGGG CAAGGTGCGC TCCGGCCACC TGCTGCTCGC GTGGCTCGAC
GAGGATCTGA CGCGCCGCTG GCTGCAGCAG CGCGTGCCGA GCGGCATCAC GTCGGTGGCG
CTCGACGACG TGGTGAAGCG CTATGAGGCG CTCGCGGCGG GCTGGCCGGA AGCCGACGAG
GCGCCGGCCG CACTGGACGG CGCGGCGCTG GGCGCGCAGG CGGGCGAAGC CGGCGCAGAC
GGCACGGGCG ACGCGCTCGC GAAGTGGGCG ACGTGCGTGA CGGAGCAGGC CGCGCGCGGC
GAGCTCGATC CGGTGGTGGG GCGCGACGAC GAATTGCGCA CGGTGATCGA CATTCTGTCG
CGCCGCCGGC AGAACAACCC GATCCTCGTG GGCGAGGCGG GGGTGGGCAA GACGGCCGTG
GTCGAGGCGC TCGCGCAGAA GATCCACGCG GGCGCTGTGC CGCCGGGGCT CGTGGGCGCG
CAGGTGTGGG CGCTCGACCT GGCGCGGATG CAGGCGGGCG CGGGGGTGCG CGGCGAGTTC
GAGCAGCGCC TGAAATCGCT GATCGACGCG GTGATCGCCT CGCCCGCGCC GATCATCCTG
TTCTGCGACG AGACGCACAC GCTGATCGGC GCGGGCGGCG CGGCGGGCAC GGGCGACGCG
GCCAACCTGA TCAAGCCGAT GCTCGCGCGC GGCCAATTGA GGATGGTCGC GGCGACGACG
TGGTCCGAAT ACAAGCAGTA CATCGAGCCG GACGCGGCGC TCGTGCGGCG CTTCCAGGCG
GTCGCGGTCG ACGAGCCGAG CGACGATGCG GCGGTCGACA TGCTGCGCAC GATCGCGCCG
CGCTTTGCCG CGCACCACGG CGTGCGCATC GTCGATTCGG CGCTGCGCGG CGCGGTCGAG
CTGTCGCGCC GCTATCTGCC CGCGCGGCAG TTGCCGGACA AGGCGATCAG CCTGCTCGAC
ACCGCGTGCG CGCGCGTGGC GATGAGCCAG AGCTGCGCGC CCGCGGAGCT CGAGCGCTTG
CAGCACCAGG CGTTCGCGAT CGGCCAGACG CTCGATTGGC GCGCGAGCGA CCGGCGCATG
GGCGTGCGCA CGCCGGGCGA CGAAGCCGAG CTCGAAGGCC GTCAGGCGAG CCTCGCGCAG
CAGGCGGCGA CGCTCGAGAC GGTCGTGGAC GCGCAGCGCG ACGAGGTGCG TGCGTGGCTC
GCGCGGCTGA ACGACGCCAC GCCGCAGGCG GCCGACGGCG ACGGCGCTGC GTTCGCCGCG
CGCATCGGCG CGAATCGCTG GGTGCGGCCA TGGGTCGACG AACACGTGGT GTCGGAGGTG
CTCGCCGAAT GGACGGGGGT GCCCGTCGCG CAGCTCGCGC AGGACGACGC GCAGCGCGTG
GTGGAGCTCG AGGCGGCATT GAACGCGGGC ATCCACGGGC AGACGGGCGC GATGCGCTCG
ATCGCGCAGG CGCTGCAGGT GTCGCACTCG GGGCTGAACG ATCCGCGCCG CCCGCTCGGC
GTGATGCTGC TCGCGGGGCC GACGGGCACG GGCAAGAGCC AGGCGGCCGC GAAGCTCGCC
GAGCTGCTGT TCGGCGGCGA GCGCAACCTG CTGCAGTTCA ACATGAACGA GTTCCAGGAA
GCGCACACGG TGTCGACGCT CAAGGGCGCG CCGCCGGGGT ACGTCGGCTA CGGCAAGGGC
GGTCGGCTGA CCGAGGCGGT GCGCAAGAAG CCGTACAGCG TGCTGCTGCT GGACGAATTC
GATCGCGCGC ATCCGGACAT TCATGAGGTG TTCTATCAAG TCTTCGATCA GGGGTGGATG
GAGGACGGCG AAGGCCGCCG GATCAGCTTC CGCAACTGCC TGATCCTGCT GACGAGCAAT
CTGGGGGAGG CGGAGATCGA AGCGGCGTGC AAGGCCGATC CGCGGATCTC GCAGGCGAAG
CTCGACAAGC TGGTGGGCGA GCGGCTGCAG GGGCGTTTCT CGCCGGCGCT GCTCGCGCGG
ATTCAGCTCG TGGCGTTCCG CACGCTCGAT GTCGACGCGT TGACGGGCAT CGCGACGCAG
GCGCTGGACG AGCTGGGCGA GCGCCTGGCG CAGAACGATC TGCAATGGCG CGCGGACGAA
GGCGTGGCGT CGTGGATCGC GCATGCGGTG TCGCAGCATC CGGCGAACGG ACGCGCGGTG
CGCGACCTGT TGCGCCAGCA CGTGATGCCG GCCGTGGCGC GCGGCGTGCT CGCCGCGCGT
GCGGAAGGGC GAGCGCTGAA GACGGTGCGG CTCGCGGCGA ACGAGAAGCT GTCGCTCGTG
TTCGACGAGG ACGCGTGGGA ACTGAGCGGC ACCGATGCGG CGTCGCTCGG CGAGCAGGCG
CAGGCGGTGG CGATGGCGCG CGAGGCGGAG GCGGTGGCCG CTGCCGTGGC GGCACGCGAA
GCGCACGGCG CGCACGGCAC GAATGAGGCG GATGGAGGCG GCGGCGCGCC GCACGCGGCG
AAAGCCGATG CGCATGTCGA TTCGGATGAC GAACGTCCAG GCGGCGCGCC TCATCCCGAT
GAAACGGCTT CGGCGAACGC CGGCACGACG GGAGAACCGT CATGCGTCTG A
 
Protein sequence
MNRERIFNCL GRTTYAALVD ATALGRSRRH AFIDLDHWAL CLLQREQSDL ARLFELFGSD 
VGEAKRRMEK ALDGFDVSGD SLRDISSSLE RSVGPAVIWS QIAARAGKVR SGHLLLAWLD
EDLTRRWLQQ RVPSGITSVA LDDVVKRYEA LAAGWPEADE APAALDGAAL GAQAGEAGAD
GTGDALAKWA TCVTEQAARG ELDPVVGRDD ELRTVIDILS RRRQNNPILV GEAGVGKTAV
VEALAQKIHA GAVPPGLVGA QVWALDLARM QAGAGVRGEF EQRLKSLIDA VIASPAPIIL
FCDETHTLIG AGGAAGTGDA ANLIKPMLAR GQLRMVAATT WSEYKQYIEP DAALVRRFQA
VAVDEPSDDA AVDMLRTIAP RFAAHHGVRI VDSALRGAVE LSRRYLPARQ LPDKAISLLD
TACARVAMSQ SCAPAELERL QHQAFAIGQT LDWRASDRRM GVRTPGDEAE LEGRQASLAQ
QAATLETVVD AQRDEVRAWL ARLNDATPQA ADGDGAAFAA RIGANRWVRP WVDEHVVSEV
LAEWTGVPVA QLAQDDAQRV VELEAALNAG IHGQTGAMRS IAQALQVSHS GLNDPRRPLG
VMLLAGPTGT GKSQAAAKLA ELLFGGERNL LQFNMNEFQE AHTVSTLKGA PPGYVGYGKG
GRLTEAVRKK PYSVLLLDEF DRAHPDIHEV FYQVFDQGWM EDGEGRRISF RNCLILLTSN
LGEAEIEAAC KADPRISQAK LDKLVGERLQ GRFSPALLAR IQLVAFRTLD VDALTGIATQ
ALDELGERLA QNDLQWRADE GVASWIAHAV SQHPANGRAV RDLLRQHVMP AVARGVLAAR
AEGRALKTVR LAANEKLSLV FDEDAWELSG TDAASLGEQA QAVAMAREAE AVAAAVAARE
AHGAHGTNEA DGGGGAPHAA KADAHVDSDD ERPGGAPHPD ETASANAGTT GEPSCV