Gene P9211_02251 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9211_02251 
SymbolclpB2 
ID5731225 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9211 
KingdomBacteria 
Replicon accessionNC_009976 
Strand
Start bp214924 
End bp217686 
Gene Length2763 bp 
Protein Length920 aa 
Translation table11 
GC content37% 
IMG OID641284569 
Productputative ATP-dependent Clp protease, Hsp 100, ATP-binding subunit ClpB 
Protein accessionYP_001550110 
Protein GI159902766 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0542] ATPases with chaperone activity, ATP-binding subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.260089 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones41 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAAAAA GTCTTACAAC TAACCCTGAT CTCTTTAGCA ATTCTGCATG GGAGCTATTA 
ATAGGAAGTG AGCATGAAGC TCGTAGATGG AGACATGAAT ATCTAGATGT TGAGCATCTT
CTTCAAGTGC TATTTACTGA TTCTAATTAC AAAAATATTG TAGATCCACT GCCAATAAAT
AATGCCGAAC TACTTGATGA AGTGGAAGAA TTTCTTGCAA ACTTGCCAGC ATCTAGTTCA
AACAGAATTT TTATAGGTGA AGATCTGGAG GAGTTACTAG ACACAGCTGA AAATTTCAGA
GCAAGGTGGG GATCTCGTTT AATTGAGATC TCACATATTC TAATTGCCCT AGGTAGAGAT
AAGCGCATCG GGACGAAGCT ATTCACAGAA CTCGGACTCC CAAGTGAGAT TCTAGAAAGC
GAGTTAAGGC GATTACCAAA ACCAATAAAT AAAAAACGCA GCAGATCCGA AGAATCAATT
CTTATAAAAG AATCAATTCC TTATCAGACA ACCATTAATG ATAAGGAAGA AAACTCTAAA
ATTTTTGATC AAGAATCTTC TATCCAAAAA ACCCAGACAG CCCAAATTCA ATTAAAAAAT
GGTTCAAATA TTAATGAAGA AATAAATCCA CTGAAGGAAT TTGGAAAAGA TTTAACCCTA
GCAGCCATGA ACGGCAAACT AGACCCTGTA ATCGGCAGAA ATGAAGAAAT TCAATTAGTC
ATCAAAGTCT TATCTCGAAG AGGTAAAAAT AATCCTGTGT TAATTGGTGC TCCAGGAGTT
GGCAAAACAG CCATTGCTGA ATTATTAGCC CAAAAAATAA TTTGTAACGA AGTACCAGAT
TCTTTAAAGG GTCTAAAGCT AATTTCGTTA GATATAGGGG CTTTAATTGC AGGTACAAAG
TTTCGTGGTC AATTTGAAGA GCGATTAAGA TCGGTATTAG CTAGAGCTAG CAACCCTGAT
GCGGGGGTCA TCTTATTTAT TGACGAATTA CATACAGTTT TAAGTACAGA TCGCTCTAGC
GCAGATGCAG GAAGCTTACT AAAGCCTGTT CTTGCAAGCG GTGATCTACG TTGTATTGGT
GCTACTACCC CTGAAAGCTT TCAACGCACA ATAGAGAAAG ACCAGGCTCT TAATCGAAGA
TTTCAACAAG TCCCAATCAA AGAACCAAAT CTTGAAGTAA GCGTAGATAT TCTAAGAGGA
CTTAAAGAGC GCTATGAATT ACATCATGGA GTCAAAATTA GTGATGAAGC ATTAATTGCT
GCCAATCGAT TAGCAGATAG ATATATTGGT GATCGCTGCC TTCCAGATAA AGCAATTGAT
CTAATTGATG AAGCTTCTGC TCAGCTAAAG ATGGAAGCAA CCTCAAAACC ATTAGCTTTA
GAAGAATTAG AATCAAGCCT TCATAAGTTA AGCGTCGATT TAATTAAAGC CGAAGAAAAT
TCATTAGAAA CAGAAGTAAT TAGAATAAAA TCGAAACGTG ATCTTATAAT TCAAGAGTCC
AAAAAAATCT CTGCTCAATG GGAAAATGAA AAACTAATGG CTAAAGAACT TAGTGAGTTA
GTCAATCAAG AAAATATTAT TTGCAATTCA ATAGAAGACG CAGAAGAGAA AGGAGATTTA
GAAACAGTAG CCAGGTTGAA ATATGATGAA CTTCACTATT TACATGAAGC AATTAATGAC
TTAAAAGCTT CGATTGAGAG ATTCAAAAGT GATGGAACAT CTTTAATAAG AGACCAAGTA
GAGCCTGAAG ATATAGCTGA TGTTGTTTCT AGATGGACTG GGATACCTGT CAATAAAGTT
ATGGCTGGTG AAAAACAAAA GCTTTTAAAT TTAGAAACAG ATCTAGGAAA TAAAGTAATT
GGTCAATCTG AAGCAGTCAA AGCAGTTGCA GAGGCTATAA AACGAGCAAG AGCTGGGATG
AAAGATAGTT ATAGACCAAT TGGATCGTTT CTTTTCTTAG GGCCAACAGG CGTAGGAAAA
ACAGAGCTTG CAAAAGCCCT TGCTGCTTCA CTGTTTGATG AAGAGGAGGC CTTAATCAGA
TTGGATATGA GTGAATTTAT GGAAAGAAAT GCAGTTGCAA GATTGCTTGG AGCGCCACCA
GGATATGTAG GCTATGAAGA AGGAGGTCAG CTCACTGAGG CAGTAAGACG GCGGCCATAT
TCAGTACTAC TTTTAGACGA AATAGAAAAA GGTCACCCAG ATGTATTCAA TATTCTTCTT
CAAGTTTTGG ATGATGGACG TCTTACAGAC TCTCAAGGGA GAACAGTAGA TTTTCGTCAC
ACAGTTGTTG TTATGACAAG TAATCTTGCA AGTCGCACAA TTCTAGAAAA CGCTAACTCC
TTATCTGCAG AGATAAATGA AACTACTCTT AGCAATGAGA AATTAACCAA AAGTATTGAT
CAAGCATTAA GGCAGCAGTT CCGACCAGAG TTTCTCAATC GTATAGATGA AATAATCTGT
TTCAAGCCAC TTTCTATAGA TCATCTCCAA CGCATTGTGA GATTACAGCT AACTGAGCTG
AGAGAATTAC TAGCTGAGCA AGGCTTAGAG CTGCGTGTAG ATAGTTCAAC TATTGAAGCG
CTTGCAAAGG AAGGTTATGA GCCTGAATAT GGCGCAAGAC CTTTAAGACG CATCATCAGA
AGGCGCATAG AGAACCCTCT TGCAAACCAA TTATTAGAAG ATAAATTTTT TGGGGCAAAT
GCGGTGAGAA TAAAAGTTTC TTCAGATAAA TCTGAATCAT TAGAATTTAT TGGAGAGAAC
TAA
 
Protein sequence
MKKSLTTNPD LFSNSAWELL IGSEHEARRW RHEYLDVEHL LQVLFTDSNY KNIVDPLPIN 
NAELLDEVEE FLANLPASSS NRIFIGEDLE ELLDTAENFR ARWGSRLIEI SHILIALGRD
KRIGTKLFTE LGLPSEILES ELRRLPKPIN KKRSRSEESI LIKESIPYQT TINDKEENSK
IFDQESSIQK TQTAQIQLKN GSNINEEINP LKEFGKDLTL AAMNGKLDPV IGRNEEIQLV
IKVLSRRGKN NPVLIGAPGV GKTAIAELLA QKIICNEVPD SLKGLKLISL DIGALIAGTK
FRGQFEERLR SVLARASNPD AGVILFIDEL HTVLSTDRSS ADAGSLLKPV LASGDLRCIG
ATTPESFQRT IEKDQALNRR FQQVPIKEPN LEVSVDILRG LKERYELHHG VKISDEALIA
ANRLADRYIG DRCLPDKAID LIDEASAQLK MEATSKPLAL EELESSLHKL SVDLIKAEEN
SLETEVIRIK SKRDLIIQES KKISAQWENE KLMAKELSEL VNQENIICNS IEDAEEKGDL
ETVARLKYDE LHYLHEAIND LKASIERFKS DGTSLIRDQV EPEDIADVVS RWTGIPVNKV
MAGEKQKLLN LETDLGNKVI GQSEAVKAVA EAIKRARAGM KDSYRPIGSF LFLGPTGVGK
TELAKALAAS LFDEEEALIR LDMSEFMERN AVARLLGAPP GYVGYEEGGQ LTEAVRRRPY
SVLLLDEIEK GHPDVFNILL QVLDDGRLTD SQGRTVDFRH TVVVMTSNLA SRTILENANS
LSAEINETTL SNEKLTKSID QALRQQFRPE FLNRIDEIIC FKPLSIDHLQ RIVRLQLTEL
RELLAEQGLE LRVDSSTIEA LAKEGYEPEY GARPLRRIIR RRIENPLANQ LLEDKFFGAN
AVRIKVSSDK SESLEFIGEN