Gene Information |
Locus tag | BURPS668_A2954 |
Symbol | |
ID | 4886819 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 668 |
Kingdom | Bacteria |
Replicon accession | NC_009075 |
Strand | - |
Start bp | 2804789 |
End bp | 2807455 |
Gene Length | 2667 bp |
Protein Length | 888 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 640132890 |
Product | putative ATP-dependent Clp protease, ATP-binding subunit ClpB |
Protein accession | YP_001063945 |
Protein GI | 126442624 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0542] ATPases with chaperone activity, ATP-binding subunit |
TIGRFAM ID | [TIGR03345] type VI secretion ATPase, ClpV1 family |
|
|
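The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes 1-based inclusive coordinates and that the CDS ends with a single stop codon not counted in the protein length; the function names are illustrative and do not come from the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len: int) -> int:
    """Residue count of a CDS, assuming one terminal stop codon that is not translated."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length = gene_length(2804789, 2807455)
print(length, protein_length(length))  # 2667 888
```

The strand (-) does not affect this calculation; it only determines which strand of the replicon is read between the two coordinates.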
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.400792 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCCGATA TCGGCCGAGT GACCTTGTTT GGAAAACTGA ACACTTTTCT CTACGAGACC CTGGAGCAGG CGACCGGCTT TTGCCGGCTG CGCGGCAATC CCTATGTCGA GCTCGCGCAC TGGCTGAACC AGATGCTCCG GCGCCCGGAC AGCGACGTGC ACCGGATCCT GCGCCGCTTC GACATCGACG CGGCGGCGAT CGACCGCGGG ATCGTTTCGG CGCTCGACCG GCTGCCGCGC GGCGCGGGTT CCGTGTCCGA TCTGTCCGCG CACATCGATG ATGCGGTCGA ACGCGCATGG GTGTACGCGA CGCTCAAGTA CGACGCGGCG CAGATTCGCG GCGCGGTGCT GCTGCTCGCG CTCGTGAAGA CGCCGCAGCT GCGCAACGTG CTGTACGCGA TCGCGCGGGA CTTCGAGCGC ATCGTGCCTG ACGTGCTTGC CGACGAGCTC GAACGGATCG TCACAGGCTC GCCCGAGGCG CCGTCGCCCG CCGCGCGCGC GGCGGCGGGG GCGATCGGCG ACGCGGGGGC GGCGCCGTCC GCCGCGCGCG AAGGCTCGGC GCTCGCGCGC TACGCGATCG ATCTGACCGC GCGTGCGCGC GCGGGCGAGA TCGATCCGGT GGTCGGCCGC GACGGCGAGA TTCGGCAGAT CGTCGACATC CTGCTGCGCC GCCGGCAGAA CAATCCGCTG CTCGTCGGCG AGGCGGGCGT CGGCAAGACC GCGGTCGCCG AAGGCTTCGC GCTGCGCATC GTCGCGGGCG ACGTGCCGCC GCCGCTGCGC GACGTCGAAC TGTATCTGCT CGACATCGGC TTGCTGCAGG CGGGCGCGAG CGTGAAGGGC GAATTCGAGA GCCGCCTGCG CGGCGTGATC GACGAGGCGA GTTCGAGCGA GCGGCCCGTC ATTCTGTTCA TCGACGAAGT ACACACGCTC GTCGGCGCGG GCGGCGCGGC GGGCACGGGC GACGCGGCGA ACCTGTTGAA GCCCGCGCTC GCGCGCGGCC TGCTGCGCAC GATCGGCGCG ACGACGTGGT CCGAATACAA GCAATACATC GAGAAGGACC CGGCGCTCAC GCGGCGCTTC CAGCTCGTGC AGGTGCGCGA GCCGGAGGAG GGCGCGGCGC TGGCGATGCT GCGCGGGCTC GCCGCGAAGC TGGAGGCGCA CCATCGCGTG CTCGTGCTCG ACGATGCGCT GCAGGCGGCC GTCACGCTGT CGCATCGCTA CATTCCCGCG CGGCAGTTGC CGGACAAGGC GATCAGCCTG CTCGACACCG CGTGCGCGCG CGTCGCGGTC AGCCAGCACG CGGTGCCCGC GCCGATCGAG GACGTGCGGC GGCGGATCGA CAGCCTGCGC GTCGAGCGCG AGCTGATCGC GCGCGAGTGC GCGCTCGGCG CGGGCGATGC GCAACGGCTC GATGCAATCG ATGCATCGAT CGCCGGCGAA CAGGCCACGC TCGATGCGCT CGACGCGCGC TGGCAGGCGG CGCGCGACGC GCTCGGCAAG ATCGTCGACT GGCGGGCCTC GCTGCTGGCC GACGATTCTT CGCGCGTGCT CGATGAGGCG GCGCGCGCGG ACGTGCAGGC GAAGCTCTCG GCCGCGCTGC GCGCGCTCGC CGAATTGCAG GGCGAGACGC CGCTCGTGCT GCCCGCGGTC GATACGCACG CGGTGGCCGC CGTGGTGTCC GACTGGACCG GCATCCCGCT CGGGCGGATG GTGCGCGACG AGATGCAATC GGTGCTGAAG CTCGCCGAGA CGCTCGCCGA GCGCGTGGTC GGCCAGCCGC ACGCGGTCGA GCTGATCGCC 
GAGCGCATTC AGACCGCGCG CGCGCGGCTC GACGATCCGG CCAAGCCGCA CGGCGTGTTC CTGCTGTGCG GGCCGTCCGG CGTCGGCAAG ACCGAGACGG CGCTCGCGCT CGCCGAGACG CTGTACGGCG GCGAGCACAA CGCGATCACG ATCAACATGA GCGAGTTTCA GGAGGCGCAC ACCGTATCGA CGCTCAAGGG CGCGCCGCCC GGCTACGTCG GCTACGGACA GGGCGGGGTG CTGACCGAGG CGGTGCGGCG GCGGCCGTAC AGTGTCGTGC TGCTCGACGA AATCGAGAAG GCGCACCGCG ACGTGCACGA GATCTTCTTT CAGGTGTTCG ACAAGGGCTG GATGGAAGAC GGCGAGGGGC GCTATATCGA CTTTCGCAAC ACGGTGATCC TCCTCACGTC GAACGTCGGT TCCGAGCGCG TGATGCAGCT GTGCCGCGAT CCGCAGCGCC TGCCCGATGC GCAGACCTTG ACCGATGCGC TGCGCGCGCC GCTGCGCGAG GTGTTTCCCG CCGCGTTGTT GGGACGCCTG ACGGTCGTTC CGTACTACCC GCTCACCGAC GAGATGCTCG CGCGGATCGT CGCATTGCAG CTCGCGCGCA TCGAGCGGCG CATCGAAGCG CACCACGGCA TCGCGTTGCG CTGCGCGGAT TCGGCGACCG CGCTGATCGC CGAGCGCTGC CGGACGATCG AATCCGGCGG CCGCATGGTC GACGCGATTC TCACGCACAC GGTGCTGCCG CGCATCAGCC AGGAGATCCT GCGCGCGACG ATCGAGGGGC GCGCGCTGCG GGCGATCGAC GTGAGCGCCG AAGACGGCCA GTTCGTTTAC CGATTCGAAG AGGAGGGCGC GACGTGA
|
Protein sequence | MSDIGRVTLF GKLNTFLYET LEQATGFCRL RGNPYVELAH WLNQMLRRPD SDVHRILRRF DIDAAAIDRG IVSALDRLPR GAGSVSDLSA HIDDAVERAW VYATLKYDAA QIRGAVLLLA LVKTPQLRNV LYAIARDFER IVPDVLADEL ERIVTGSPEA PSPAARAAAG AIGDAGAAPS AAREGSALAR YAIDLTARAR AGEIDPVVGR DGEIRQIVDI LLRRRQNNPL LVGEAGVGKT AVAEGFALRI VAGDVPPPLR DVELYLLDIG LLQAGASVKG EFESRLRGVI DEASSSERPV ILFIDEVHTL VGAGGAAGTG DAANLLKPAL ARGLLRTIGA TTWSEYKQYI EKDPALTRRF QLVQVREPEE GAALAMLRGL AAKLEAHHRV LVLDDALQAA VTLSHRYIPA RQLPDKAISL LDTACARVAV SQHAVPAPIE DVRRRIDSLR VERELIAREC ALGAGDAQRL DAIDASIAGE QATLDALDAR WQAARDALGK IVDWRASLLA DDSSRVLDEA ARADVQAKLS AALRALAELQ GETPLVLPAV DTHAVAAVVS DWTGIPLGRM VRDEMQSVLK LAETLAERVV GQPHAVELIA ERIQTARARL DDPAKPHGVF LLCGPSGVGK TETALALAET LYGGEHNAIT INMSEFQEAH TVSTLKGAPP GYVGYGQGGV LTEAVRRRPY SVVLLDEIEK AHRDVHEIFF QVFDKGWMED GEGRYIDFRN TVILLTSNVG SERVMQLCRD PQRLPDAQTL TDALRAPLRE VFPAALLGRL TVVPYYPLTD EMLARIVALQ LARIERRIEA HHGIALRCAD SATALIAERC RTIESGGRMV DAILTHTVLP RISQEILRAT IEGRALRAID VSAEDGQFVY RFEEEGAT
|
| |
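The sequences above are printed in space-separated blocks of ten characters. A minimal sketch of how such a record might be processed is given below: it strips the grouping whitespace, computes the GC fraction, and translates the coding strand. It assumes the listed gene sequence is already the coding strand (it begins with ATG and ends with TGA), and it uses the standard codon assignments, which translation table 11 shares for internal codons; table 11 differs mainly in which codons may act as start codons, which does not matter here because the gene starts with ATG. All names in the code are illustrative, and only the first 30 bases of the record are used in the example.

```python
from itertools import product

# Codon assignments in TCAG order; table 11 uses these same assignments
# for internal codons (assumption: no alternative start codon handling needed).
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the block-of-ten spacing and any line breaks."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        amino = CODON_TABLE[seq[i:i + 3]]
        if amino == "*":
            break
        protein.append(amino)
    return "".join(protein)


# First three blocks of the gene sequence from the record above.
prefix = clean("ATGTCCGATA TCGGCCGAGT GACCTTGTTT")
print(translate(prefix))  # MSDIGRVTLF
```

The translated prefix matches the first block of the listed protein sequence. Applying the same functions to the full gene sequence should reproduce the 888-residue protein and a GC content close to the reported 70%.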