Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9303_27741 |
Symbol | clpB2 |
ID | 4778902 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9303 |
Kingdom | Bacteria |
Replicon accession | NC_008820 |
Strand | + |
Start bp | 2441230 |
End bp | 2444010 |
Gene Length | 2781 bp |
Protein Length | 926 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 640088297 |
Product | putative ATP-dependent Clp protease, Hsp 100, ATP-binding subunit ClpB |
Protein accession | YP_001018769 |
Protein GI | 124024462 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0542] ATPases with chaperone activity, ATP-binding subunit |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCATCCTG AGACCAGCCA AGCCGCTTCC GCCCAGGGAA GTCTCACAAG TGATCCTGAT CGATTCAGTG ATGAAGCTTG GGATTTGTTG CTTGTCGGTC AGGATGTCGC CCGCCGTTGG CGACATGGTC AGCTGGATGT GGAGCACTTG ATCCAGGTGC TTTTCAGTGA TCGGCGTTAC AGCGATTGGG TGTTGGCTCT GCCCATTGAT TCAACTGCGC TGCTGGATCG CTTGGAAGGT TTTTTGGCCG AGCAGCCTAT GGCTCGAGGC GATGAGTTGT TCATCGGTGA TGACCTTGAA GACCTCCTTG AAGAAGCAGA CCGTTCCAGA GTTCGCTGGG GATCTCGGCT GATTGATGTG TCTCATTTGC TGATTGCGTT GGGGCGCGAT CCGCGCATTG GATCCGAGCT ATTTGAAGAG TTAGGACTGC CCAGTGAACG CTTGGAAGCG GAGTTGAGAC GCTTGCCCAA AGGCAGGCGC CAGTTGCGTA CATCAACGCC GCCTTCTCCT CCTGCGCCCG ATACTGCGCC ACCTGCAGCG GTGGCCTCAC AAACGAGACC TGTCTCCTCT GCTACCCCGC CGATTATCCA AGAGCCTGCC ATCGCTGCTG GTGAGCCAAC TCCATTGCAA CTCCAGCAAG AACCTCAGGC TCTTGAGGCC TATGGTCGTG ACCTTACCGC TGCAGCTCAG GCAGGACAGC TAGATCCTGT GATCGGTAGA GATCCAGAGA TTCGCCGTTT GATCAAGGTT TTATCCCGTC GCGGCAAGAA CAATCCGGTG TTGATTGGAG CTCCTGGTGT TGGTAAAACG GCCATCGCCG AGCTGTTGGC TCAACGCATT GTTGCTGGGG AGGTTCCTGA TTCCCTGAAG GGTTTGCGCT TGATCTCCCT TGATCTTGGT GCACTGATTG CTGGGGCCAA GTTTCGTGGT CAATTTGAGG AACGGCTTCG GTCTGTGTTG AAAGAGGTGA GCGATCCAGA TGCAGGTGTG GTGTTGTTCA TCGATGAACT GCACACGGTG GTGAGCAGCG ATCGTTCCAG TGCGGATGCT GGAAGTTTGC TCAAGCCGGC TTTAGCAAGG GGAGACCTCC GCTGTATTGG GGCCACGACC CCTGAGAACT ATCGCCGCAC AGTGGAGAAG GATCAGGCAC TCAATCGGCG TTTTCAGCAG GTTTTAATCA AGGAGCCGAG TCTTGAATTG AGCGTTGAAA TCTTGCGAGG CCTGAAAGAG CGCTACGAAC TTCACCACGG CGTCACGATC ACCGATGGTG CGGTGACTGC TGCAAATCGC TTGGCTGATC GCTACATCAG TGACCGCTGC TTACCTGATA AGGCCATCGA TCTCATCGAT GAGGCTGCAG CCCAACTCAA GATGGATGTC ACCTCGAAGC CTCAGGTGGT TGAAGAGGCG GAAGCAGAGC TGCGTCGTGT CGAGCTTGCT TTGCTGGCGG CTGAGCAGGC ACCGGAGGTT GAACGGGTGC AGTTACAGGC TGCTCGAATT GCTGCGGCGT CTCAGCTCAC AGATTTACGA GAGCGCTGGC AGATCGAACG GGATCATCTG GCCGAGTTGC GGGATTTGTT GCAACAAGAC GAGGATCTCC GCAATGCCAT TGCTGAAGCA GAGCGACTCG GTGATCTAGA GGCGGCTGCA CGTCTTCAAT ACGACCAATT GCATAGGGTT CAGCAACGGC GGGCCGACCT TGAAGAAACG TTGGCTGTTG CCCAGGCATC TGGTTCGGCG CTCCTACGCG AGCAAGTGGA GGCTGAAGAC ATTGCCGATG TCGTTGCACG CTGGACTGGC ATTCCTGTTC AGCGGTTGCT GGCGGCTGAA CGGCAAAAGT TGCTAGATCT TGAAGCCCAT CTAGGCGAGC GGGTGATTGG TCAGCCCGAA GCGGTTCAGG CGGTGGCGGC AGCGATTCGT CGGGCTCGAG CGGGCATGAA GGATCCGCGC CGTCCTGTTG GTTCGTTTTT GTTTCTTGGA CCCACTGGCG TGGGCAAGAC AGAACTGGCA AAAGCGCTTG CGGCCTCCTT GTTCGATGAG GAAGAGGCGC TGGTGCGACT CGATATGAGT GAATTCATGG AACGCAATGC CGTGGCGCGT CTTTTGGGGG CTCCTCCCGG CTATGTCGGC TATGAGGAGG GTGGACAGCT CACCGAAGCA GTGTGTCGCC GTCCTTATGC GGTGTTACTT CTTGATGAAA TTGAGAAGGC TCATCCTGAT GTGTTCAATG TGCTCCTGCA GGTTCTGGAC GATGGCCGTC TCACTGACTC GCAGGGCCGC ACCGTGGACT TCCGTCACAC CGTGGTTGTG ATGACTAGCA ACTTGGCTAG TCGCGCCATC CTTGATGCGT CTCGCTTGGC GCAGCAGGAG GGGACAGATG GGGATTGTCT TGATCAGAGC CTGGCTCAAA AGGTTGATCA GGCCCTCAAT AAGCAATTTC GGCCTGAATT TTTGAATCGC ATCGATGAGG TAATACGTTT TCGGCCGCTC AAGGCTGATG ACCTTCAGCG AATTGTGCAG CTGCAATTGG CCGATCTCTC CAGCCTGCTT GCTGAGCAAG GTCTTGAATT GCGTGTGGAG GCAGACGCGG TTGAGGCTTT GGCTTTGCAA GGTTATGAGC CGGAATATGG CGCTCGCCCT TTACGGCGTG TACTGCGTCG TCGGGTTGAG AACCCGTTGG CCACGGAATT GCTGGAAGAG CGTTTCAGCG GTGCCCGCGC CGTGCGCGTG ATCCCCGGTG CTCAGGCCTC CGAACCATTT CAGTTCCTTC CTGAGGATTG A
|
Protein sequence | MHPETSQAAS AQGSLTSDPD RFSDEAWDLL LVGQDVARRW RHGQLDVEHL IQVLFSDRRY SDWVLALPID STALLDRLEG FLAEQPMARG DELFIGDDLE DLLEEADRSR VRWGSRLIDV SHLLIALGRD PRIGSELFEE LGLPSERLEA ELRRLPKGRR QLRTSTPPSP PAPDTAPPAA VASQTRPVSS ATPPIIQEPA IAAGEPTPLQ LQQEPQALEA YGRDLTAAAQ AGQLDPVIGR DPEIRRLIKV LSRRGKNNPV LIGAPGVGKT AIAELLAQRI VAGEVPDSLK GLRLISLDLG ALIAGAKFRG QFEERLRSVL KEVSDPDAGV VLFIDELHTV VSSDRSSADA GSLLKPALAR GDLRCIGATT PENYRRTVEK DQALNRRFQQ VLIKEPSLEL SVEILRGLKE RYELHHGVTI TDGAVTAANR LADRYISDRC LPDKAIDLID EAAAQLKMDV TSKPQVVEEA EAELRRVELA LLAAEQAPEV ERVQLQAARI AAASQLTDLR ERWQIERDHL AELRDLLQQD EDLRNAIAEA ERLGDLEAAA RLQYDQLHRV QQRRADLEET LAVAQASGSA LLREQVEAED IADVVARWTG IPVQRLLAAE RQKLLDLEAH LGERVIGQPE AVQAVAAAIR RARAGMKDPR RPVGSFLFLG PTGVGKTELA KALAASLFDE EEALVRLDMS EFMERNAVAR LLGAPPGYVG YEEGGQLTEA VCRRPYAVLL LDEIEKAHPD VFNVLLQVLD DGRLTDSQGR TVDFRHTVVV MTSNLASRAI LDASRLAQQE GTDGDCLDQS LAQKVDQALN KQFRPEFLNR IDEVIRFRPL KADDLQRIVQ LQLADLSSLL AEQGLELRVE ADAVEALALQ GYEPEYGARP LRRVLRRRVE NPLATELLEE RFSGARAVRV IPGAQASEPF QFLPED
|
| |