Gene A9601_02251 details


Gene Information

Locus tag: A9601_02251
Symbol: clpB2
ID: 4716909
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. AS9601
Kingdom: Bacteria
Replicon accession: NC_008816
Strand:
Start bp: 208224
End bp: 210986
Gene Length: 2763 bp
Protein Length: 920 aa
Translation table: 11
GC content: 31%
IMG OID: 640077924
Product: putative ATP-dependent Clp protease, Hsp 100, ATP-binding subunit ClpB
Protein accession: YP_001008620
Protein GI: 123967762
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0542] ATPases with chaperone activity, ATP-binding subunit
TIGRFAM ID:
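The start, end, gene length, and protein length fields above are related by simple arithmetic. The following is a minimal sketch of that consistency check, assuming the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon (neither convention is stated in the record; the variable names are illustrative only).

```python
# Values copied from the Gene Information fields of this record.
start_bp = 208224
end_bp = 210986

# Assumption: coordinates are 1-based and inclusive, so the span
# covers (end - start + 1) bases.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends in exactly one stop codon, which is not
# counted in the protein length.
stop_codons = 1
codons = gene_length // 3
protein_length = codons - stop_codons

# A coding sequence should contain a whole number of codons.
assert gene_length % 3 == 0

print(f"gene length: {gene_length} bp, protein length: {protein_length} aa")
```

Under these assumptions the computed values can be compared directly with the Gene Length and Protein Length fields.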


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGAGAAA CACTTACATC CAGTCCTGAA CTATTTAGCG ATATTAGTTG GAATCTTCTT 
TTATTAGGGG AAGAAACCGC AAAAAAATGG GATCATAGCG AATTTAATAT TGAACACATA
ATTCATACAT TGTTCTCATC AAGTGAATTC TTTGCCTTCA TTGAAAAATT ATCAATCGAC
CAAGATACAG TTTTAGACAT AACAGAAGAT TTTTTAGAAG AGACACCAAC AAATGAGTCA
GATATTTTTA CTATCGGAGA AGATTTAGAA ATTTTATTAG ATAACGCGAA TCAGATTAAA
ACTCAATGGG GATCGAGATT AATAGAAATC CCTCATTTAC TAATTGCTCT TGGAAGAGAT
TTAAGAATTG GAAATTATGT TTTTGAAGAA GGAAACCTTT CAATGGAAAA ATTAGAGGAA
GAATTAAAGT TTTTCCCAAA TATTAATCAA TCAAAAGATT CTTTTAATTA TGGGAATGTA
ATTGAAATAA ATAATCAATC CAATTTTGAA TCAAACAATG AGACTAACGA GACTTTTGTA
AAAGAAGAAA AATTTAAAAA AGCTATTGTT CCATTACAGA AAAGTGAACT TCAAATTGAA
ACAAAACAAG TTGGAAAAGA TGAAAATGCT CTTTCAATTT ATGGAAAAGA TTTAACAGAA
TCAGCTAAAA AAGGGTTACT GGATCCCGTT TTAGGAAGAG AAAATGAGAT CAATAATTTA
ATGAGGGTAC TCTGCAGAAG AAACAAAAAT AACCCTATAC TTATTGGCAA TCCTGGAGTT
GGTAAAACCT CAATTGCAAA ATTACTTGCT CAATTAATTG TAGACAAAAA AGTTCCTGAT
ACTTTAAAGG ACTTAAAAAT TATTTCACTT GACTTAGGTG CATTAGTTTC TGGGACTAAA
TTTAGAGGTC AACTAGAGGA AAGACTAAGC TTAATAATGC AGGAACTAAA TAATCCAAAC
CAAGGAATGA TCCTATTTAT TGATGAAATT CACTCAATAT TAAGTTCTGA CAGATCTTCT
ACCGACATCA GTAATATCTT AAAACCTTTA CTAGCTGAAG GAGAACTTAG ATGTATCGGT
ACAACTACAC CTGAGAAATT TCGTGAAACT ATTGAAAAAG ATCAGGCATT AAATAATTGC
TTTCAAAAGA TAACTGTTAA TGAACCTTCA GTAGAATTAA GCGCAAAAAT ATTACAAGGG
ATCAAAAAGA AATATGAATC ACATCATGGC ATAAAAATTT CTGAAGAGGC TGTAAACTAT
TCTGCAAAAT TGGCCGATAG ATACATCAGC GATAAATGTC TCCCTGATAG TGCAATAGAT
TTAATTGATG AAGCAGCCGC ACAGTTAAAA ATCGAGTCTA ATAATATGCC TCAAATCATT
CTCCAACAAG AAAACAAACT TAATACTATC GATGAAAAAT TGAATAATTT GCAAGGAGAC
AATATCGAAG CTCAAGAAAA ACTATTGAAT AATAGACAAC AATCAGAGGC AAAATTGAAC
GTTCTTTTAG AAAATTGGAA CAATTTACGT GAAGAGATGG AGGAATTATC CATTTTAATG
AAAGAAGAAG ATAAGCTAAC CAAACAAATC AAAGATAAAT CAAATCGCGA AATTGAAAAT
GATCTAGATT ATTTAGAAAA GCTTGAAGAA GAGTTAAGTA AAATAGAGAA TGAAATACAA
AAACTTGAAG AGAACTTTAC TAAAATAAAG AAAAATAGAA ATTTCCCTTT TAAATATCAA
GTTGAACCTG ATGATATTGC AGATGTTATA TCGAAAATCA CAGGTATTCC AATTTCTAAA
GTAGTTTCAA ATGAACGTAA GAAATTAGTC AATCTAGAAA CAGAACTAAG TGAAAAAGTT
ATTGGACAAG AAAAAGCCAT AGAAGTTGTT TCTGCTGCAA TTAGAAGAGC TCGAGTTGGC
ATGAAAAGTC CCAAAAGACC TATTGGATCT TTTTTATTTA TGGGTCCTAC TGGTGTTGGT
AAAACAGAAT TAGCAAAATC TCTTGCAACA GTTTTATTTG ATGAAGAAGA CGCACTTTTA
AGATTAGACA TGAGTGAATA TATGGAGAAA AATGCCGTAG CAAGACTTTT AGGAGCTCCC
CCAGGTTATA TTGGTTATGA AGAGGGAGGT CAATTAACTG AAGCTGTAAG ACGTAAACCC
TACTCAGTAA TACTTCTTGA CGAGATAGAA AAAGCTCATG CAGAAGTATT TAATATCCTT
TTGCAAGTCT TAGATGAAGG AAGATTAACG GACTCTCAAG GAAGGACCGT AGATTTCAAA
AATACGGTAA TCATTATGAC AAGTAACCTA GCTGGTAAAT CTATACTGGA GTATTCACAA
AAAATTTCTA AAAGTGAGGG AAAGTTAGAA AAAGATCAAC AAACCCTAGA TGATTCAATT
AGTAATGCAT TGTCTTCAAT TTTTAGACCT GAATTTTTAA ATAGAATTGA CGAAGTGGTA
AAGTTTGATC CACTTTCTAT TGATGAACTT CAAAAAATAA TCATTCTACA AACAGAAGAT
TTAAAGAACC TGCTACTTGA GCAGAAAATA AATATCGCTA TAGACAAAAA AGTTATCAAT
AAAATTGCAA ACGATTCTTA CGAACCTGAA TATGGTGCTA GGCCACTTAG CAGGGAACTT
AGAAGACAAA TAGAAAATCC CTTGGCTGCA AAACTTTTAG AGGATAGTTT CAAAAATAAA
AAAAATATAA CAATTAAACT TAACCCTACT AAAAAAGATG AGATCGTTTT CAAACCTAGC
TGA
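
The gene sequence above is laid out in space-separated blocks of 10 bases, so it needs light cleaning before any analysis. The sketch below shows one way to strip the layout and compute a GC fraction; the helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first row of the sequence is used as sample input, so the result is not the whole-gene GC content reported in the record.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First row of the gene sequence, copied as displayed (sample input only).
first_row = "ATGAGAGAAA CACTTACATC CAGTCCTGAA CTATTTAGCG ATATTAGTTG GAATCTTCTT"

seq = clean_sequence(first_row)
print(len(seq), "bases, GC fraction", round(gc_fraction(seq), 2))
```

Pasting the full sequence block into `clean_sequence` in place of `first_row` would give a whole-gene figure to compare with the GC content field.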
 
Protein sequence
MRETLTSSPE LFSDISWNLL LLGEETAKKW DHSEFNIEHI IHTLFSSSEF FAFIEKLSID 
QDTVLDITED FLEETPTNES DIFTIGEDLE ILLDNANQIK TQWGSRLIEI PHLLIALGRD
LRIGNYVFEE GNLSMEKLEE ELKFFPNINQ SKDSFNYGNV IEINNQSNFE SNNETNETFV
KEEKFKKAIV PLQKSELQIE TKQVGKDENA LSIYGKDLTE SAKKGLLDPV LGRENEINNL
MRVLCRRNKN NPILIGNPGV GKTSIAKLLA QLIVDKKVPD TLKDLKIISL DLGALVSGTK
FRGQLEERLS LIMQELNNPN QGMILFIDEI HSILSSDRSS TDISNILKPL LAEGELRCIG
TTTPEKFRET IEKDQALNNC FQKITVNEPS VELSAKILQG IKKKYESHHG IKISEEAVNY
SAKLADRYIS DKCLPDSAID LIDEAAAQLK IESNNMPQII LQQENKLNTI DEKLNNLQGD
NIEAQEKLLN NRQQSEAKLN VLLENWNNLR EEMEELSILM KEEDKLTKQI KDKSNREIEN
DLDYLEKLEE ELSKIENEIQ KLEENFTKIK KNRNFPFKYQ VEPDDIADVI SKITGIPISK
VVSNERKKLV NLETELSEKV IGQEKAIEVV SAAIRRARVG MKSPKRPIGS FLFMGPTGVG
KTELAKSLAT VLFDEEDALL RLDMSEYMEK NAVARLLGAP PGYIGYEEGG QLTEAVRRKP
YSVILLDEIE KAHAEVFNIL LQVLDEGRLT DSQGRTVDFK NTVIIMTSNL AGKSILEYSQ
KISKSEGKLE KDQQTLDDSI SNALSSIFRP EFLNRIDEVV KFDPLSIDEL QKIIILQTED
LKNLLLEQKI NIAIDKKVIN KIANDSYEPE YGARPLSREL RRQIENPLAA KLLEDSFKNK
KNITIKLNPT KKDEIVFKPS
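
The protein sequence above corresponds to a codon-by-codon translation of the gene sequence. As a rough illustration, the sketch below translates the first row of the gene sequence. It assumes that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two tables differ in permitted start codons, which this sketch does not model); the function name `translate` is hypothetical.

```python
from itertools import product

# Codon-to-amino-acid map built in the conventional T, C, A, G order.
# Assumption: table 11 uses the standard amino acid assignments.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate dna in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First row of the gene sequence, copied as displayed (sample input only).
first_row = "ATGAGAGAAA CACTTACATC CAGTCCTGAA CTATTTAGCG ATATTAGTTG GAATCTTCTT"
print(translate(first_row))
```

The printed residues can be compared with the first two blocks of the protein sequence listed in this record.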