Gene P9303_27741 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9303_27741 
SymbolclpB2 
ID4778902 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9303 
KingdomBacteria 
Replicon accessionNC_008820 
Strand
Start bp2441230 
End bp2444010 
Gene Length2781 bp 
Protein Length926 aa 
Translation table11 
GC content56% 
IMG OID640088297 
Productputative ATP-dependent Clp protease, Hsp 100, ATP-binding subunit ClpB 
Protein accessionYP_001018769 
Protein GI124024462 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0542] ATPases with chaperone activity, ATP-binding subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCATCCTG AGACCAGCCA AGCCGCTTCC GCCCAGGGAA GTCTCACAAG TGATCCTGAT 
CGATTCAGTG ATGAAGCTTG GGATTTGTTG CTTGTCGGTC AGGATGTCGC CCGCCGTTGG
CGACATGGTC AGCTGGATGT GGAGCACTTG ATCCAGGTGC TTTTCAGTGA TCGGCGTTAC
AGCGATTGGG TGTTGGCTCT GCCCATTGAT TCAACTGCGC TGCTGGATCG CTTGGAAGGT
TTTTTGGCCG AGCAGCCTAT GGCTCGAGGC GATGAGTTGT TCATCGGTGA TGACCTTGAA
GACCTCCTTG AAGAAGCAGA CCGTTCCAGA GTTCGCTGGG GATCTCGGCT GATTGATGTG
TCTCATTTGC TGATTGCGTT GGGGCGCGAT CCGCGCATTG GATCCGAGCT ATTTGAAGAG
TTAGGACTGC CCAGTGAACG CTTGGAAGCG GAGTTGAGAC GCTTGCCCAA AGGCAGGCGC
CAGTTGCGTA CATCAACGCC GCCTTCTCCT CCTGCGCCCG ATACTGCGCC ACCTGCAGCG
GTGGCCTCAC AAACGAGACC TGTCTCCTCT GCTACCCCGC CGATTATCCA AGAGCCTGCC
ATCGCTGCTG GTGAGCCAAC TCCATTGCAA CTCCAGCAAG AACCTCAGGC TCTTGAGGCC
TATGGTCGTG ACCTTACCGC TGCAGCTCAG GCAGGACAGC TAGATCCTGT GATCGGTAGA
GATCCAGAGA TTCGCCGTTT GATCAAGGTT TTATCCCGTC GCGGCAAGAA CAATCCGGTG
TTGATTGGAG CTCCTGGTGT TGGTAAAACG GCCATCGCCG AGCTGTTGGC TCAACGCATT
GTTGCTGGGG AGGTTCCTGA TTCCCTGAAG GGTTTGCGCT TGATCTCCCT TGATCTTGGT
GCACTGATTG CTGGGGCCAA GTTTCGTGGT CAATTTGAGG AACGGCTTCG GTCTGTGTTG
AAAGAGGTGA GCGATCCAGA TGCAGGTGTG GTGTTGTTCA TCGATGAACT GCACACGGTG
GTGAGCAGCG ATCGTTCCAG TGCGGATGCT GGAAGTTTGC TCAAGCCGGC TTTAGCAAGG
GGAGACCTCC GCTGTATTGG GGCCACGACC CCTGAGAACT ATCGCCGCAC AGTGGAGAAG
GATCAGGCAC TCAATCGGCG TTTTCAGCAG GTTTTAATCA AGGAGCCGAG TCTTGAATTG
AGCGTTGAAA TCTTGCGAGG CCTGAAAGAG CGCTACGAAC TTCACCACGG CGTCACGATC
ACCGATGGTG CGGTGACTGC TGCAAATCGC TTGGCTGATC GCTACATCAG TGACCGCTGC
TTACCTGATA AGGCCATCGA TCTCATCGAT GAGGCTGCAG CCCAACTCAA GATGGATGTC
ACCTCGAAGC CTCAGGTGGT TGAAGAGGCG GAAGCAGAGC TGCGTCGTGT CGAGCTTGCT
TTGCTGGCGG CTGAGCAGGC ACCGGAGGTT GAACGGGTGC AGTTACAGGC TGCTCGAATT
GCTGCGGCGT CTCAGCTCAC AGATTTACGA GAGCGCTGGC AGATCGAACG GGATCATCTG
GCCGAGTTGC GGGATTTGTT GCAACAAGAC GAGGATCTCC GCAATGCCAT TGCTGAAGCA
GAGCGACTCG GTGATCTAGA GGCGGCTGCA CGTCTTCAAT ACGACCAATT GCATAGGGTT
CAGCAACGGC GGGCCGACCT TGAAGAAACG TTGGCTGTTG CCCAGGCATC TGGTTCGGCG
CTCCTACGCG AGCAAGTGGA GGCTGAAGAC ATTGCCGATG TCGTTGCACG CTGGACTGGC
ATTCCTGTTC AGCGGTTGCT GGCGGCTGAA CGGCAAAAGT TGCTAGATCT TGAAGCCCAT
CTAGGCGAGC GGGTGATTGG TCAGCCCGAA GCGGTTCAGG CGGTGGCGGC AGCGATTCGT
CGGGCTCGAG CGGGCATGAA GGATCCGCGC CGTCCTGTTG GTTCGTTTTT GTTTCTTGGA
CCCACTGGCG TGGGCAAGAC AGAACTGGCA AAAGCGCTTG CGGCCTCCTT GTTCGATGAG
GAAGAGGCGC TGGTGCGACT CGATATGAGT GAATTCATGG AACGCAATGC CGTGGCGCGT
CTTTTGGGGG CTCCTCCCGG CTATGTCGGC TATGAGGAGG GTGGACAGCT CACCGAAGCA
GTGTGTCGCC GTCCTTATGC GGTGTTACTT CTTGATGAAA TTGAGAAGGC TCATCCTGAT
GTGTTCAATG TGCTCCTGCA GGTTCTGGAC GATGGCCGTC TCACTGACTC GCAGGGCCGC
ACCGTGGACT TCCGTCACAC CGTGGTTGTG ATGACTAGCA ACTTGGCTAG TCGCGCCATC
CTTGATGCGT CTCGCTTGGC GCAGCAGGAG GGGACAGATG GGGATTGTCT TGATCAGAGC
CTGGCTCAAA AGGTTGATCA GGCCCTCAAT AAGCAATTTC GGCCTGAATT TTTGAATCGC
ATCGATGAGG TAATACGTTT TCGGCCGCTC AAGGCTGATG ACCTTCAGCG AATTGTGCAG
CTGCAATTGG CCGATCTCTC CAGCCTGCTT GCTGAGCAAG GTCTTGAATT GCGTGTGGAG
GCAGACGCGG TTGAGGCTTT GGCTTTGCAA GGTTATGAGC CGGAATATGG CGCTCGCCCT
TTACGGCGTG TACTGCGTCG TCGGGTTGAG AACCCGTTGG CCACGGAATT GCTGGAAGAG
CGTTTCAGCG GTGCCCGCGC CGTGCGCGTG ATCCCCGGTG CTCAGGCCTC CGAACCATTT
CAGTTCCTTC CTGAGGATTG A
 
Protein sequence
MHPETSQAAS AQGSLTSDPD RFSDEAWDLL LVGQDVARRW RHGQLDVEHL IQVLFSDRRY 
SDWVLALPID STALLDRLEG FLAEQPMARG DELFIGDDLE DLLEEADRSR VRWGSRLIDV
SHLLIALGRD PRIGSELFEE LGLPSERLEA ELRRLPKGRR QLRTSTPPSP PAPDTAPPAA
VASQTRPVSS ATPPIIQEPA IAAGEPTPLQ LQQEPQALEA YGRDLTAAAQ AGQLDPVIGR
DPEIRRLIKV LSRRGKNNPV LIGAPGVGKT AIAELLAQRI VAGEVPDSLK GLRLISLDLG
ALIAGAKFRG QFEERLRSVL KEVSDPDAGV VLFIDELHTV VSSDRSSADA GSLLKPALAR
GDLRCIGATT PENYRRTVEK DQALNRRFQQ VLIKEPSLEL SVEILRGLKE RYELHHGVTI
TDGAVTAANR LADRYISDRC LPDKAIDLID EAAAQLKMDV TSKPQVVEEA EAELRRVELA
LLAAEQAPEV ERVQLQAARI AAASQLTDLR ERWQIERDHL AELRDLLQQD EDLRNAIAEA
ERLGDLEAAA RLQYDQLHRV QQRRADLEET LAVAQASGSA LLREQVEAED IADVVARWTG
IPVQRLLAAE RQKLLDLEAH LGERVIGQPE AVQAVAAAIR RARAGMKDPR RPVGSFLFLG
PTGVGKTELA KALAASLFDE EEALVRLDMS EFMERNAVAR LLGAPPGYVG YEEGGQLTEA
VCRRPYAVLL LDEIEKAHPD VFNVLLQVLD DGRLTDSQGR TVDFRHTVVV MTSNLASRAI
LDASRLAQQE GTDGDCLDQS LAQKVDQALN KQFRPEFLNR IDEVIRFRPL KADDLQRIVQ
LQLADLSSLL AEQGLELRVE ADAVEALALQ GYEPEYGARP LRRVLRRRVE NPLATELLEE
RFSGARAVRV IPGAQASEPF QFLPED