Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS1710b_1114 |
Symbol | clpA |
ID | 3690172 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 1710b |
Kingdom | Bacteria |
Replicon accession | NC_007434 |
Strand | + |
Start bp | 1166621 |
End bp | 1169275 |
Gene Length | 2655 bp |
Protein Length | 884 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 637727569 |
Product | ATP-dependent Clp protease ATP-binding subunit clpA |
Protein accession | YP_332525 |
Protein GI | 76809374 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0542] ATPases with chaperone activity, ATP-binding subunit |
TIGRFAM ID | [TIGR02639] ATP-dependent Clp protease ATP-binding subunit clpA |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.0000000182282 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACAGGAT TGCGGGCCTG TCCGAGTTGT TTAGAATGGG TGTATGGCGA TTATCCCGGA CAAGCAGGAC AGCAGCGTCC TGGAACGCAA GGAACAGAAG CTCAAGCCGC CTTCGATGTA CAAGGTGGTG CTCCTGAACG ACGACTTCAC GCCAATGGAA TTCGTCGTGA TGGTCGTGCA GGAGTATTTC AAGAAAGATC GCGAAACGGC AACACAGATT ATGCTGAAGG TGCATCGCGA AGGGCGGGGA GTTTGTGGGG TCTATACGCG GGACATCGCG TCGACCAAAG TCGAGCAAGT CGTTACCCAT GCGCGGCAGG CCGGGCATCC GCTGCAGTGC GTGATGGAGG AAGCATGATT GCCCAGGAAT TGGAAGTCAG CCTGCACATG GCGTTCATGG AAGCGCGCCA GGCGCGGCAT GAGTTCATTA CGGTCGAGCA TCTTTTGTTG GCCCTGCTGG ATAATCCGAC GGCAGCCGAG GTGTTGCGCG CGTGCGCGGC GAACATCGAG GACCTGCGTC AGAACCTGCG CAATTTCATC CACGACAACA CGCCTACCGT TCCCGGCACC GACGATGTCG ACACGCAGCC GACGCTCGGC TTTCAGCGCG TGATCCAGCG GGCGATCATG CACGTGCAGT CGACGTCGAA CGGCAAGAAG GAAGTCACGG GCGCGAACGT GCTCGTCGCG ATCTTCGGCG AGAAGGATTC GCACGCGGTC TACTACTTGC AGCAGCAGGG CGTGACGCGC CTCGACGTCG TCAACTTCAT CTCGCACGGC ATCGCGAAGA CGAGCAGCGG CGAAGCCGCG AAGCCCGCCG ACGCGAACGC GGAAGGCGAG GACGCGGGCG CGCAGAAGGA AACGCCGCTC GCGCAGTTCA CGCAGAACCT GAACCAGATG GCGAAGGACG GGCGCATCGA TCCGCTGATC GGGCGCGAGT CCGAGGTCGA GCGCGTCGTG CAGGTGCTCT GCCGCCGCCG CAAGAACAAT CCGCTCCTCG TCGGCGAGGC GGGCGTCGGC AAGACGGCGA TCGCCGAAGG CCTCGCGTAC CGGATCACGC GCGGCGAGGT GCCGGACATC CTCGCGAACG CGCAGGTGTA TTCGCTCGAC ATGGGCGCGC TCCTCGCGGG CACGAAGTAC CGCGGCGATT TCGAGCAGCG TCTGAAGACG GTGCTGAAGG AGCTGAAGGA GCGGCCGCAC GCGATACTCT TCATCGACGA GATCCACACG CTGATCGGCG CGGGCGCCGC GTCGGGCGGC ACGCTCGATG CGTCGAACCT GCTGAAGCCG GCGCTGTCGT CGGGCACGCT CAAGTGCATT GGCGCGACGA CGTTTACCGA ATATCGCGGC ATCTTCGAGA AGGATGCGGC GCTGTCGCGG CGCTTCCAGA AGATCGACGT GACCGAGCCG AGCGTCGAGC AGACGGTCGC GATCCTGCGC GGCCTGAAGT CGCGCTTCGA GGAGCATCAC GGCGTCAAGT ATTCGTCGGG CGCGCTGTCG GCGGCCGCCG AACTGTCGGC GCGGTTCATC ACCGACCGCC ATCTGCCCGA CAAGGCGATC GACGTGATCG ACGAAGCGGG CGCCGCGCAG CGCGTGCTGC CGAAATCCAA GCAGAAGAAG ACGATCGGCA AGAGCGAGAT CGAGGAAATC ATCTCGAAGA TCGCGCGCGT GCCGCCGCAG AGCGTGTCGC AGGACGACCG CAGCAAGCTG CAGACGCTCG ACCGCGATCT GAAGAGCGTC GTGTTCGGCC AGGACCCGGC GATCGACGCG CTCGCCGCCG CGATCAAGAT GGCGCGCGCG GGCCTCGGCA AGCTCGACAA GCCGATCGGC GCGTTCCTGT TCTCCGGCCC GACGGGCGTC GGCAAGACCG AAGTGGCGCG CCAACTGGCG TTCACGCTCG GCATCGAGCT GATCCGCTTC GACATGTCGG AATACATGGA GCGTCACGCG GTGAGCCGCC TGATCGGCGC GCCGCCCGGA TACGTCGGGT TCGACCAGGG CGGCTTGCTG ACCGAGGCCG TCACGAAGAA GCCGCACTGC GTGCTGCTGC TCGACGAGAT CGAGAAGGCG CATCCGGACA TCTTCAACGT GCTGCTGCAG GTGATGGATC ACGGCACGCT GACGGACAAC AACGGCCGCA AGGCGGATTT CCGCAACGTC ATCATCATCA TGACGACGAA CGCGGGTGCG GAGTCGATGC AGAAGGCGAC GATCGGCTTC ACGACGCGGC GCGAAACCGG CGACGAGATG GCAGACATCA AGCGCCTGTT CACGCCCGAG TTCCGCAACC GGCTCGATGC GACGATCAGC TTCCGTTCGC TCGATGAGGA AATCATCATG CGGGTGGTCG ACAAGTTCCT GATCCAGCTC GAGGAGCAAC TGCACGAGAA GAAGGTCGAC GCGCTCTTCA CCGATGCGCT GCGCAAGCAT CTCGCGAAGC ACGGCTTCGA TCCGCTGATG GGCGCGCGAC CGATGCAGCG GTTGATCCAG GATACGATCC GGCGCGCGCT CGCCGACGAG TTGCTGTTCG GCAAGCTCGT CAACGGCGGG CACGTGACGG TCGACGTCGA CGAGAACGAC AAGGTGCTGC TGTCGTTCGA CAAGACGGCG ACGCCGCCCA GCAAGCCGAA CGAAGAGGCG GTCGAAGTCG AATAG
|
Protein sequence | MTGLRACPSC LEWVYGDYPG QAGQQRPGTQ GTEAQAAFDV QGGAPERRLH ANGIRRDGRA GVFQERSRNG NTDYAEGASR RAGSLWGLYA GHRVDQSRAS RYPCAAGRAS AAVRDGGSMI AQELEVSLHM AFMEARQARH EFITVEHLLL ALLDNPTAAE VLRACAANIE DLRQNLRNFI HDNTPTVPGT DDVDTQPTLG FQRVIQRAIM HVQSTSNGKK EVTGANVLVA IFGEKDSHAV YYLQQQGVTR LDVVNFISHG IAKTSSGEAA KPADANAEGE DAGAQKETPL AQFTQNLNQM AKDGRIDPLI GRESEVERVV QVLCRRRKNN PLLVGEAGVG KTAIAEGLAY RITRGEVPDI LANAQVYSLD MGALLAGTKY RGDFEQRLKT VLKELKERPH AILFIDEIHT LIGAGAASGG TLDASNLLKP ALSSGTLKCI GATTFTEYRG IFEKDAALSR RFQKIDVTEP SVEQTVAILR GLKSRFEEHH GVKYSSGALS AAAELSARFI TDRHLPDKAI DVIDEAGAAQ RVLPKSKQKK TIGKSEIEEI ISKIARVPPQ SVSQDDRSKL QTLDRDLKSV VFGQDPAIDA LAAAIKMARA GLGKLDKPIG AFLFSGPTGV GKTEVARQLA FTLGIELIRF DMSEYMERHA VSRLIGAPPG YVGFDQGGLL TEAVTKKPHC VLLLDEIEKA HPDIFNVLLQ VMDHGTLTDN NGRKADFRNV IIIMTTNAGA ESMQKATIGF TTRRETGDEM ADIKRLFTPE FRNRLDATIS FRSLDEEIIM RVVDKFLIQL EEQLHEKKVD ALFTDALRKH LAKHGFDPLM GARPMQRLIQ DTIRRALADE LLFGKLVNGG HVTVDVDEND KVLLSFDKTA TPPSKPNEEA VEVE
|
| |