Gene Information |
Locus tag | BURPS1106A_A0706 |
Symbol | |
ID | 4906108 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 1106a |
Kingdom | Bacteria |
Replicon accession | NC_009078 |
Strand | + |
Start bp | 690237 |
End bp | 693107 |
Gene Length | 2871 bp |
Protein Length | 956 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 640143812 |
Product | putative ATP-dependent Clp protease, ATP-binding subunit ClpB |
Protein accession | YP_001074742 |
Protein GI | 126457113 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0542] ATPases with chaperone activity, ATP-binding subunit |
TIGRFAM ID | [TIGR03345] type VI secretion ATPase, ClpV1 family |
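The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the stop codon; the function names are hypothetical and not part of the database.

```python
# Values copied from the record above.
start_bp = 690237
end_bp = 693107
gene_length_bp = 2871
protein_length_aa = 956


def span_length(start, end):
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Residue count of an unspliced CDS, excluding the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


computed_gene_length = span_length(start_bp, end_bp)
computed_protein_length = expected_protein_length(computed_gene_length)

# Compare the computed values with the values stated in the record.
print(computed_gene_length == gene_length_bp)
print(computed_protein_length == protein_length_aa)
```

Under these assumptions, 693107 - 690237 + 1 gives 2871 bp, and 2871 / 3 - 1 gives 956 aa, so the stated fields agree with each other.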
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAATCGCG AACGCATATT CAACTGTCTC GGCCGCACGA CCTATGCGGC CCTTGTCGAT GCGACGGCGC TGGGCCGATC GCGTCGGCAC GCCTTCATCG ATCTCGATCA CTGGGCGCTG TGCCTGCTGC AGCGCGAGCA GAGCGATCTC GCGCGGCTGT TCGAGCTGTT CGGCAGCGAT GCCGGCGAAG CGAAGCGCCG CATGGAGAAG GCGCTCGACG GCTTCGACGT GAGCGGCGAC TCGCTGCGCG ACATCTCCAG CTCGCTGGAG CGCAGCGTGG GGCCGGCCGT CATCTGGAGC CAGATCGCGG CGCGCGCGGG CAAGGTGCGC TCCGGCCACC TGCTGCTCGC GTGGCTCGAC GAGGATCTGA CGCGCCGCTG GCTGCAGCAG CGCGTGCCGA GCGGCATCAC GTCGGTGGCG CTCGACGACG TGGTGAAGCG CTATGAGGCG CTCGCGGCGG GCTGGCCGGA GGCCGACGAG GCGCCGGCCG CGCTGGACGG CGCGGCGCTG GGCGCGCAGG CGGGCGAAGC CGGCGCAGAC GGCACGGGCG ACGCGCTCGC GAAATGGGCG ACGTGCGTGA CGGAGCAGGC CGCGCGCGGC GAGCTCGATC CGGTGGTGGG GCGCGACGAC GAATTGCGCA CGGTGATCGA CATTCTGTCG CGCCGCCGGC AGAACAACCC GATCCTCGTG GGCGAGGCGG GGGTGGGCAA GACGGCCGTG GTCGAGGCGC TCGCGCAGAA GATCCACGCG GGCGCGGTGC CGCCGGGGCT CGTGGGCGCG CAGGTGTGGG CGCTCGACCT GGCGCGGATG CAGGCGGGCG CGGGGGTACG CGGCGAGTTC GAGCAGCGCC TGAAATCGCT GATCGACGCG GTGATCGCCT CGCCCGCGCC GATCATCCTG TTCTGCGACG AGACGCACAC GCTGATCGGC GCGGGCGGCG CGGCGGGCAC GGGCGACGCG GCCAACCTGA TCAAGCCGAT GCTCGCGCGC GGCCAATTGA GGATGGTCGC GGCGACGACG TGGTCCGAAT ACAAGCAGTA CATCGAGCCG GACGCGGCGC TCGTGCGGCG CTTCCAGGCG GTCGCGGTCG ACGAGCCGAG CGACGATGCG GCGGTCGACA TGCTGCGCAC GATCGCGCCG CGCTTTGCCG CGCACCACGG CGTGCGCATC GTCGATTCGG CGCTGCGCGG CGCGGTCGAG CTGTCGCGCC GCTATCTGCC CGCGCGGCAG TTGCCGGACA AGGCGATCAG CCTGCTCGAC ACCGCGTGCG CACGCGTGGC GATGAGCCAG AGCTGCGCGC CCGCGGAGCT CGAGCGCTTG CAGCACCAGG CGTTCGCGAT CGGCCAGACG CTCGATTGGC GCGCGAGCGA CCGGCGCATG GGCGTGCGCA CGCCGGGCGA CGAAGCCGAG CTCGAAGGCC GTCAGGCGAG CCTCGCGCAG CAGGCGGCGA CGCTCGAGAC GGTCGTGGAC GCGCAGCGCG ACGAGGTGCG TGCGTGGCTC GCGCGGCTGA ACGACGCCAC GCCGCAGGCG GCCGACGGCG ACGGCGCTGC GTTCGCCGCG CGCATCGGCG CGAATCGCTG GGTGCGGCCA TGGGTCGACG AACACGTGGT GTCGGAGGTG CTCGCCGAAT GGACGGGGGT GCCCGTCGCG CAGCTCGCGC AGGACGACGC GCAGCGCGTG GTGGAGCTCG AGGCGGCATT GAACGCGGGC ATCCACGGGC AGACGGGCGC GATGCGCTCG ATCGCGCAGG CGCTGCAGGT GTCGCACTCG GGGCTGAACG ATCCGCGCCG CCCGCTCGGC 
GTGATGCTGC TCGCGGGGCC GACGGGCACG GGCAAGAGCC AGGCGGCCGC GAAGCTCGCC GAGCTGCTGT TCGGCGGCGA GCGCAACCTG CTGCAGTTCA ACATGAACGA GTTCCAGGAA GCGCACACGG TGTCGACGCT CAAGGGCGCG CCGCCGGGGT ACGTCGGCTA CGGCAAGGGC GGTCGGCTGA CCGAGGCGGT GCGCAAGAAG CCGTACAGCG TGCTGCTGCT GGACGAATTC GATCGCGCGC ATCCGGACAT TCATGAGGTG TTCTATCAAG TCTTCGATCA GGGGTGGATG GAGGACGGCG AAGGCCGCCG GATCAGCTTC CGCAACTGCC TGATCCTGCT GACGAGCAAT CTGGGGGAGG CGGAGATCGA AGCGGCGTGC AAGGCCGATC CGCGGATCTC GCAGGCGAAG CTCGACAAGC TGGTGGGCGA GCGGCTGCAG GGGCGTTTCT CGCCGGCGCT GCTCGCGCGG ATTCAGCTCG TGGCGTTCCG CACGCTCGAT GTCGACGCGT TGACGGGCAT CGCGACGCAG GCGCTGGACG AGCTGGGCGA GCGCCTGGCG CAGAACGATC TGCAATGGCG CGCGGACGAA GGCGTGGCGT CGTGGATCGC GCATGCGGTG TCGCAGCATC CGGCGAACGG ACGCGCGGTG CGCGACCTGT TGCGCCAGCA CGTGATGCCG GCCGTGGCGC GCGGCGTGCT CGCCGCGCGT GCGGAAGGGC GAGCGCTGAA GACGGTGCGG CTCGCGGCGA ACGAGAAGCT GTCGCTCGTG TTCGACGAGG ACGCGTGGGA GCTGAGCGGC ACCGATGCGG CGTCGCTCGG CGAGCAGGCG CAGGCGGTGG CGATGGCGCG CGAGGCGGAG GCGGTGGCCG CTGCCGTGGC GGCACGCGAA GCGCACGGCG CGCACGGCAC GAATGAGGCG GATGGAGGCG GCGGCGCGCC GCACGCGGCG AAAGCCGACG CGCATGTCGA TTCGGATGAC GAACGTCCAG GCGGCGCGCC TCATCCCGAT GAAACGGCTT CGGCGAACGC CGGCACGACG GGAGAACCGT CATGCGTCTG A
|
Protein sequence | MNRERIFNCL GRTTYAALVD ATALGRSRRH AFIDLDHWAL CLLQREQSDL ARLFELFGSD AGEAKRRMEK ALDGFDVSGD SLRDISSSLE RSVGPAVIWS QIAARAGKVR SGHLLLAWLD EDLTRRWLQQ RVPSGITSVA LDDVVKRYEA LAAGWPEADE APAALDGAAL GAQAGEAGAD GTGDALAKWA TCVTEQAARG ELDPVVGRDD ELRTVIDILS RRRQNNPILV GEAGVGKTAV VEALAQKIHA GAVPPGLVGA QVWALDLARM QAGAGVRGEF EQRLKSLIDA VIASPAPIIL FCDETHTLIG AGGAAGTGDA ANLIKPMLAR GQLRMVAATT WSEYKQYIEP DAALVRRFQA VAVDEPSDDA AVDMLRTIAP RFAAHHGVRI VDSALRGAVE LSRRYLPARQ LPDKAISLLD TACARVAMSQ SCAPAELERL QHQAFAIGQT LDWRASDRRM GVRTPGDEAE LEGRQASLAQ QAATLETVVD AQRDEVRAWL ARLNDATPQA ADGDGAAFAA RIGANRWVRP WVDEHVVSEV LAEWTGVPVA QLAQDDAQRV VELEAALNAG IHGQTGAMRS IAQALQVSHS GLNDPRRPLG VMLLAGPTGT GKSQAAAKLA ELLFGGERNL LQFNMNEFQE AHTVSTLKGA PPGYVGYGKG GRLTEAVRKK PYSVLLLDEF DRAHPDIHEV FYQVFDQGWM EDGEGRRISF RNCLILLTSN LGEAEIEAAC KADPRISQAK LDKLVGERLQ GRFSPALLAR IQLVAFRTLD VDALTGIATQ ALDELGERLA QNDLQWRADE GVASWIAHAV SQHPANGRAV RDLLRQHVMP AVARGVLAAR AEGRALKTVR LAANEKLSLV FDEDAWELSG TDAASLGEQA QAVAMAREAE AVAAAVAARE AHGAHGTNEA DGGGGAPHAA KADAHVDSDD ERPGGAPHPD ETASANAGTT GEPSCV
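The gene sequence begins with GTG rather than ATG, yet the protein begins with M. This is consistent with translation table 11, where GTG can act as an initiator codon and is then read as methionine. The sketch below illustrates this on the first 30 bases and the last 21 bases of the gene sequence, copied from the record. The codon table is built by hand, and the set of initiator codons is an assumption based on the NCBI listing for table 11; the function and variable names are hypothetical.

```python
from itertools import product

# Amino acids for table 11, with codons ordered by base T, C, A, G
# (third position varying fastest). "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: initiator codons as listed by NCBI for translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate(dna, first_codon_is_start=True):
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    dna = dna.replace(" ", "").upper()
    residues = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        if i == 0 and first_codon_is_start and codon in START_CODONS:
            aa = "M"  # an initiator codon is read as methionine
        else:
            aa = CODON_TABLE[codon]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 and last 21 bases of the gene sequence in the record.
five_prime = "GTGAATCGCG AACGCATATT CAACTGTCTC"
three_prime = "GGAGAACCGT CATGCGTCTG A"

n_terminus = translate(five_prime)
c_terminus = translate(three_prime, first_codon_is_start=False)
print(n_terminus)  # MNRERIFNCL
print(c_terminus)  # GEPSCV
```

Both fragments match the ends of the protein sequence in the record (MNRERIFNCL ... GEPSCV), and the final codon TGA is a stop codon.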