Gene Information |
Locus tag | P9303_18321 |
Symbol | clpB |
ID | 4776078 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9303 |
Kingdom | Bacteria |
Replicon accession | NC_008820 |
Strand | - |
Start bp | 1589658 |
End bp | 1592249 |
Gene Length | 2592 bp |
Protein Length | 863 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 640087341 |
Product | ATP-dependent Clp protease, Hsp 100, ATP-binding subunit ClpB |
Protein accession | YP_001017839 |
Protein GI | 124023532 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0542] ATPases with chaperone activity, ATP-binding subunit |
TIGRFAM ID | [TIGR03346] ATP-dependent chaperone ClpB |
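The coordinate, gene length, and protein length fields above are mutually consistent, and the check can be sketched as follows. This is a minimal illustration, not part of the database record: it assumes the start and end coordinates are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values copied from the record above.
START_BP = 1589658
END_BP = 1592249

length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 2592
print(protein_length(length_bp))  # 863
```

Strand does not affect the arithmetic: a gene on the minus strand spans the same interval on the replicon.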
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.207261 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAACCTA CGGCGGATCA ATTTACCGAA AAAGGCTGGG CAGCAATCGT TTTGGCCCAA CAACTTGCCC AGCAACGCAA GCACCAGCAA CTGGAAACAG AGCATTTGTT GTTGTCACTG CTAGAGCAAA ATGCTTTGGC AGGAAGAATC CTAGAAAAAG CAGGTGTGTC GATCGGCAAC TTGCAAACAG CCGTGGAGGC ACACCTGCAT GAACAGCCCA CTCTGCAGGC AGCGCCAGAC TCTGTCTATC TGGGCAAAGG AGTCAATGAT CTGCTCGATC AAGCTGATAA ACATAAGCAA GCTTTTGGCG ATGGTTTCAT CTCAATTGAA CATCTGTTAC TTGCATTGGC CGGGGACAAT CGTTGCGGCC GCAAGCTGCT CAATCAAGCG GGCGTGGATG CGGGCAAACT AAAGGTTGCA ATTGACGCGG TACGCGGAAA CCAGAAGGTG ACCGATCAGA ACCCAGAAGG AACTTACGAA TCTCTCGAAA AATATGGCAG AGACCTGACT GCTGCTGCTC GCGAAGGCAA ACTCGACCCT GTCATCGGAA GGGACGATGA GATTCGACGC ACGATTCAGA TTCTTAGTAG GCGCACCAAA AACAACCCTG TCCTCATTGG GGAGCCAGGG GTGGGAAAAA CCGCAATTGT TGAAGGCCTC GCACAACGGA TTGTTAATGG CGATGTACCT GCAGCCCTTC AGAACAGACA GTTAATCGCC CTTGATATGG GTGCTTTGAT TGCAGGGGCA AAATATCGAG GTGAATTCGA AGAGCGACTC AAGGCAGTGC TCAAAGAAGT TACCGCCTCT GAAGGACAAA TCGTACTTTT TATTGATGAA ATCCATACCG TTGTAGGAGC TGGAGCTACC GGTGGGGCAA TGGATGCCAG CAATCTGCTT AAGCCAATGC TGGCTCGAGG GGAACTGCGC TGCATTGGTG CAACGACCCT CGATGAGCAC CGACAGCACA TCGAAAAAGA CCCCGCTTTA GAAAGAAGAT TCCAGCAAGT GCTTGTGGAT CAACCCACCG TGCAAGACAC GATCTCCATC CTGCGTGGTC TCAAAGAACG CTACGAAGTG CATCATGGTG TGCGCATTGC TGATAACGCC CTGGTTGCTG CAGCTGTGCT CAGTAGCCGA TATATTGCCG ATCGTTTTTT GCCAGATAAA GCCATCGATT TGATGGATGA ATCTGCTGCA CGACTAAAAA TGGAAATCAC CTCCAAGCCG GAGGAAATCG ATGAAATCGA TCGCAAGATT GTGCAGCTCG AGATGGAGAA GCTTTCCTTA GGACGTGAGT CCGACTCCGT TAGTAAGGAA AGATTGGAAA AACTAGAACG CGAGCTTGCT GAATTAGCAG AGCAACAAAG TGCGCTCAAT GCACAGTGGC AACAAGAAAA AGGTGCGATT GACGACCTTT CTTCGCTCAA AGAAGAAATC GAAAGGGTGC AATTGCAAGT TGAGCAAGCC AAGCGCAGCT ACGACCTCAA CAAAGCAGCT GAACTGGAGT ACGGAACTCT GGCGGGATTG CAGAAACAAC TCAGCGAGAA AGAAACTGCC CTGGCCCAAG ATGGAGAGGC AGGCGATAAA TCACTGCTGC GAGAGGAGGT TACGGAAGAC GATATTGCTG ATGTCATCGC TAAATGGACC GGGATTCCTG TCGCCAAGCT TGTGCAGTCG GAAATGGAAA AGCTGCTGGG ACTTGAAGCA GAACTCCACC AACGTGTGAT TGGCCAAGAA CAGGCTGTGC AAGCTGTTGC GGATGCGATT CAGCGTTCTA GGGCAGGTTT AAGTGATCCC 
AACCGCCCGA TCGCAAGTTT TTTATTTCTA GGCCCCACTG GTGTAGGTAA AACGGAGCTA TCCAAGGCAT TGGCCTCTCA ACTGTTTGAC AGCGAGGCGG CTTTGGTGCG AATCGATATG TCTGAGTATA TGGAGAAGCA CAGTGTAAGC AGACTGATCG GGGCACCCCC GGGTTATGTC GGCTATGAGG CTGGTGGACA GCTCACCGAA GCCGTACGCA GACGCCCTTA TGCGGTAATC CTGTTCGATG AAGTGGAGAA GGCACACCCA GATGTGTTCA ATGTGATGTT ACAGATCCTC GATGATGGAC GTGTTACCGA TGGTCAGGGT CGCACAGTTG ATTTTACAAA TACGGTGCTA ATTCTTACCA GTAATATTGG CAGCCAGTCG ATTCTCGACT TGGGAGGCGA TGATAGTCAA TACGGGGAAA TGGAGCGTCG AGTTCATGAT GCTTTGCACG CTCATTTTAG GCCTGAGTTT CTCAACCGCC TGGATGAAAC AATCATCTTT CATAGCCTCA GGCGCGAGGA ACTGCGTCAG ATCGTTGCCC TTCAGGTCAA TCGGCTGCGT GAGCGTCTTG GCGATCGCAA GCTTGGCCTA GAGATCAGTG ATACAGCAGC TGATTGGCTC GCTAATGCTG GCTATGACCC TGTTTATGGG GCCAGACCCC TCAAGCGAGC GATCCAACGC GAGCTTGAAA CTCCAATAGC CAAAAGTATC CTGGCTGGCT TTTATGGAGA TAGTCAGATT GTGCATGTGG ATGTGGACGA GGAGCGTCTG AGCTTTCGAT AA
|
Protein sequence | MQPTADQFTE KGWAAIVLAQ QLAQQRKHQQ LETEHLLLSL LEQNALAGRI LEKAGVSIGN LQTAVEAHLH EQPTLQAAPD SVYLGKGVND LLDQADKHKQ AFGDGFISIE HLLLALAGDN RCGRKLLNQA GVDAGKLKVA IDAVRGNQKV TDQNPEGTYE SLEKYGRDLT AAAREGKLDP VIGRDDEIRR TIQILSRRTK NNPVLIGEPG VGKTAIVEGL AQRIVNGDVP AALQNRQLIA LDMGALIAGA KYRGEFEERL KAVLKEVTAS EGQIVLFIDE IHTVVGAGAT GGAMDASNLL KPMLARGELR CIGATTLDEH RQHIEKDPAL ERRFQQVLVD QPTVQDTISI LRGLKERYEV HHGVRIADNA LVAAAVLSSR YIADRFLPDK AIDLMDESAA RLKMEITSKP EEIDEIDRKI VQLEMEKLSL GRESDSVSKE RLEKLERELA ELAEQQSALN AQWQQEKGAI DDLSSLKEEI ERVQLQVEQA KRSYDLNKAA ELEYGTLAGL QKQLSEKETA LAQDGEAGDK SLLREEVTED DIADVIAKWT GIPVAKLVQS EMEKLLGLEA ELHQRVIGQE QAVQAVADAI QRSRAGLSDP NRPIASFLFL GPTGVGKTEL SKALASQLFD SEAALVRIDM SEYMEKHSVS RLIGAPPGYV GYEAGGQLTE AVRRRPYAVI LFDEVEKAHP DVFNVMLQIL DDGRVTDGQG RTVDFTNTVL ILTSNIGSQS ILDLGGDDSQ YGEMERRVHD ALHAHFRPEF LNRLDETIIF HSLRREELRQ IVALQVNRLR ERLGDRKLGL EISDTAADWL ANAGYDPVYG ARPLKRAIQR ELETPIAKSI LAGFYGDSQI VHVDVDEERL SFR
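The GC content and protein sequence fields can in principle be rechecked from the gene sequence itself. The sketch below is a hedged illustration under a few assumptions: the gene sequence is listed in coding orientation (it begins with ATG and ends with TAA, so no reverse complement is applied), the space-separated blocks of ten are purely cosmetic, and the codon assignments of translation table 11 match the standard genetic code. The helper names `clean`, `gc_content`, and `translate` are hypothetical.

```python
BASES = "TCAG"
# Amino acids in NCBI order (first, second, third base each cycling T, C, A, G).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove whitespace (block separators, line breaks) and upper-case."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate a coding-strand DNA sequence, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for pos in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE.get(s[pos:pos + 3], "X")  # X for ambiguous codons
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence listed above.
fragment = "ATGCAACCTA CGGCGGATCA ATTTACCGAA"
print(translate(fragment))  # MQPTADQFTE
```

Passing the full gene sequence to `translate` should reproduce the protein sequence field, and `gc_content` on the same input can be compared with the reported GC content, which the record gives only as a rounded percentage.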
|