Gene P9301_02271 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9301_02271 
SymbolclpB2 
ID4912842 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9301 
KingdomBacteria 
Replicon accessionNC_009091 
Strand
Start bp209214 
End bp211970 
Gene Length2757 bp 
Protein Length918 aa 
Translation table11 
GC content31% 
IMG OID640159793 
Productputative ATP-dependent Clp protease, Hsp 100, ATP-binding subunit ClpB 
Protein accessionYP_001090451 
Protein GI126695565 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0542] ATPases with chaperone activity, ATP-binding subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGAGAAA CCCTTACATC CAGTCCCGAA TTATTTAGCG ATATCAGTTG GAACCTTCTT 
TTATTAGGGG AAGAAACCGC AAAAAAATGG GATCATAGCG AATTTAATAT TGAACACATA
ATTCATACAT TGTTCTCATC AAGTGAATTC TTTGCTTTCA TTGAAAAATT ATCAATCGAC
CAAGATACAG TTTTAGACAT AACAGAAGAT TTTTTAGAAG AGACACCAAC AAATGAGTCA
GATATTTTTA CTATCGGAGA AGATTTAGAA ATTTTATTAG ATAACGCGAA TCAGATTAAA
ACTCAATGGG GATCGAGATT AATAGAAATC CCTCATTTAC TAATTGCTCT TGGAAGAGAT
TTAAGAATTG GGAATTATGT TTTTGAAGAA GGAAACCTTT CAATGGAAAA ATTAGAGGAA
GAATTAAAGT TTTACCCAAA TATTAATCAA TCAAAAGATT CTTTTAATTA TGGGAATGTA
ATTGAAATAA ATAATCAATC TAATTTTGAA TCAGATAAAG AAACTTTAGT TAAAGAAGAA
AAATTGAAAA AAGCTATTGT TCCATTACCG AAAAGTGAAC TTCAAATTGA AACCAAAAAA
CAAGTTGGAA AAGATGAAAA TGCTCTTTCA ATTTATGGAA AAGATTTAAC AGAATCAGCT
AAAAAAGGCT TAATGGATCC CATTTTAGGA AGAGAAAATG AGATCAATAA TTTAATGAGG
GTACTCTGCA GAAGAAACAA AAATAACCCT ATACTCATTG GCAATCCTGG AGTTGGTAAA
ACCTCAATTG CAAAATTACT TGCTCAATTA ATTGTAGACA AAAAAGTTCC TGATACTTTA
AAGGACTTAA AAATTATTTC ACTTGACTTA GGTGCATTAG TTTCTGGGAC CAAATTTAGA
GGTCAACTAG AAGAAAGACT AAGTTTAATA ATACAGGAAC TAAATAATCC AAACCAAGGA
ATGATTCTAT TTATTGATGA AATTCACTCA ATATTAAGTT CTGACAGATC TTCTACCAAC
ATCAGTAATG TTTTAAAACC TTTACTAGCT GAAGGAGAAC TTAGATGTAT CGGTACAACA
ACACCTGAGA AATTTCGTGA AACTATTGAA AAAGATCAGG CATTAAATAA TTGCTTTCAA
AAGATAGCTG TTAATGAACC TTCAGTAGAA TTAAGCGCAA AAATACTACA AGGTATCAAA
AAGAAATATG AATCACATCA TGGCATAAAA ATTTCTGAAG AGGCTGTAAA CTATTCTGCA
AAATTAGCCG ATAGATACAT CAGCGATAAA TGTCTCCCTG ATAGCGCAAT AGATTTAATT
GATGAAGCAG CTGCACAGTT GAAAATCGAG TCTAATAATG TGCCTCAAAT CATTCTTCAA
CAAGAAAACA AACTTAATAC AATTGATGAA AAATTGAATA ATTTGCAAGG AGAAAATATC
GAAGCTCAAG AAAAACTATT GAATAATAGA CAACAATCAG AGGCAAAATT GAACCTTCTT
TTAGAAAATT GGAACAATTT ACGTGAAGAA ATGGAGGAAT TATCTATTTT AATGAAAGAA
GAAGATAAGA TAACCAAACA AATTAAAGAT AAATCAAATC GAGAAATTGA AAATGATCTA
GATTATTTAG AAAAGCTTGA AGAAGAGTTA AGTGAAATAG AGAATGACAT ACAAAAAGTT
GAAGAGAACT TTAATAAAAT AAAGAAAAAT AGAAATTTCC CTTTTAAATA TCAAGTTGAA
CCTGATGATA TTGCTGATGT TGTCTCAAAA ATTACAGGTA TTCCAATTTC TAAAGTAGTT
TCAAATGAGC GTAAGAAATT AGTCAATCTA GAGACAGAAC TAAGTGAAAA AGTTATTGGA
CAAGAAAAAG CTATAGAAGC TGTTTCTGCT GCAATTAGAA GAGCTCGCGT TGGCATGAAA
AGTCCTAAAA GACCTATCGG ATCTTTTTTA TTTATGGGTC CTACAGGCGT TGGTAAAACA
GAATTAGCAA AATCTCTTGC AACAGCTTTA TTTGATGAAG AAGACGCACT TTTAAGATTA
GACATGAGTG AATATATGGA GAAAAATGCT GTAGCAAGGC TTTTGGGAGC TCCTCCAGGT
TATGTTGGTT ATGAAGAGGG AGGTCAATTA ACTGAAGCTG TAAGACGTAA ACCCTATTCA
GTAATACTTC TTGATGAGAT AGAAAAAGCA CATTCAGAAG TTTTTAATAT CCTTTTACAA
GTTTTAGATG AAGGAAGATT AACGGACTCT CAAGGAAGGA CAGTAGACTT CAAAAATACC
GTAATCATTA TGACAAGCAA CCTAGCTGGT AAATCTATAC TGGAATATTC ACAAAAAATT
TCTAGAAGTG ATGGAAAGTT AGAAAAAGAC CAACAAAACT TAGATGACTC CATTAGTAAT
ACATTGTCTT CAATTTTTAG ACCTGAATTT TTAAATAGAA TTGATGAAGT GGTAAAGTTT
AATCCATTAT CTATTGATGA ACTTCAAAAA ATAATAATTT TACAAACAGA AGATTTAAAA
AACCTGCTAC TTGAACAAAA AATAAATATC GCTATAGACA AAAAAGTTAT TAATAAAATT
GCAAATGATT CTTACGAACC TGAATATGGT GCTAGGCCAC TTAGCAGGGA ACTTAGAAGA
CAAATAGAAA ATCCCTTGGC TGCAAAACTT TTAGAGGAAA ACTTCAAAAA CAAAAAAAAT
ATAATAATTA AACTTAACCC TGCTAAAAAA GATGAGATCA TTTTCAAACC TAGCTGA
 
Protein sequence
MRETLTSSPE LFSDISWNLL LLGEETAKKW DHSEFNIEHI IHTLFSSSEF FAFIEKLSID 
QDTVLDITED FLEETPTNES DIFTIGEDLE ILLDNANQIK TQWGSRLIEI PHLLIALGRD
LRIGNYVFEE GNLSMEKLEE ELKFYPNINQ SKDSFNYGNV IEINNQSNFE SDKETLVKEE
KLKKAIVPLP KSELQIETKK QVGKDENALS IYGKDLTESA KKGLMDPILG RENEINNLMR
VLCRRNKNNP ILIGNPGVGK TSIAKLLAQL IVDKKVPDTL KDLKIISLDL GALVSGTKFR
GQLEERLSLI IQELNNPNQG MILFIDEIHS ILSSDRSSTN ISNVLKPLLA EGELRCIGTT
TPEKFRETIE KDQALNNCFQ KIAVNEPSVE LSAKILQGIK KKYESHHGIK ISEEAVNYSA
KLADRYISDK CLPDSAIDLI DEAAAQLKIE SNNVPQIILQ QENKLNTIDE KLNNLQGENI
EAQEKLLNNR QQSEAKLNLL LENWNNLREE MEELSILMKE EDKITKQIKD KSNREIENDL
DYLEKLEEEL SEIENDIQKV EENFNKIKKN RNFPFKYQVE PDDIADVVSK ITGIPISKVV
SNERKKLVNL ETELSEKVIG QEKAIEAVSA AIRRARVGMK SPKRPIGSFL FMGPTGVGKT
ELAKSLATAL FDEEDALLRL DMSEYMEKNA VARLLGAPPG YVGYEEGGQL TEAVRRKPYS
VILLDEIEKA HSEVFNILLQ VLDEGRLTDS QGRTVDFKNT VIIMTSNLAG KSILEYSQKI
SRSDGKLEKD QQNLDDSISN TLSSIFRPEF LNRIDEVVKF NPLSIDELQK IIILQTEDLK
NLLLEQKINI AIDKKVINKI ANDSYEPEYG ARPLSRELRR QIENPLAAKL LEENFKNKKN
IIIKLNPAKK DEIIFKPS