Gene P9211_04501 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9211_04501 
SymbolgroEL 
ID5731113 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9211 
KingdomBacteria 
Replicon accessionNC_009976 
Strand
Start bp424806 
End bp426503 
Gene Length1698 bp 
Protein Length565 aa 
Translation table11 
GC content42% 
IMG OID641284807 
Productchaperonin GroEL 
Protein accessionYP_001550335 
Protein GI159902991 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0459] Chaperonin GroEL (HSP60 family) 
TIGRFAM ID[TIGR02348] chaperonin GroL 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.383214 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCAAAC TCCTTAGTTC TTCAGATGAA TCTAGAGGCG CCCTTGAGAA AGGGGTAGAT 
GCACTTGCCA ACGCAGTAAA GGTAACCATT GGTCCCAAAG GCAGAAATGT AGTACTAGAA
AAGAAGTTTG GAGCTCCAGA TATAGTTAAC GATGGGGTCT CCATAGCCAA AGACATAGAA
CTAGAAGACC CCTTTGAAAA CTTAGGTGCA AAGCTTATTG AGCAGGTTGC TTCTAAAACG
AAAGATAAAG CTGGTGATGG CACAACAACA GCCACTGTTT TAGCTCAAGT AATGGTTCAT
GAGGGACTAA AAAATACTGC CGCAGGGGCA AGCCCTATCG AGCTTCGTCG TGGCATGGAA
AAAGCAGTTT CATTCATAGT TGAAAAATTG CAACAAAAAA GTAAAGGCAT AAGTGGCAAT
GAAATTCTTC AAGTAGCAAC GGTTAGTTCG GGTGGTGATG AAGAGATCGG GGAAATGGTG
GCTGAGGCCA TGGAGAAAGT CAGTGTAGAT GGTGTAATTA CAGTCGAAGA ATCAAAGTCC
TTAAACACTG AGCTGGAAAT AACCGAAGGG ATGGCTTTTG ATAGAGGTTA TAGTTCGCCT
TACTTTGTTA CTGATGCTGA CCGTCAAATT TGCGAGTTTG AAAACCCTTT ACTCTTAATA
ACCGATAGAA AAATTAGCTC CATAGGTGAC CTAGTCCCTG TTTTAGAAGC AGTCCAAAAA
AGTGGCTCTC CTTTAGTGAT TCTTTCTGAA GAAGTTGAAG GAGAAGCATT GGCAACTTTA
GTAGTAAATA AAAATCGTGG AGTTTTACAA GTAGCAGCTG TTCGCGCCCC ATCATTTGGG
GAAAGGCGTA AAGCAGCTCT TGCAGATATT AGTGTTCTAA CTGGAGGGAC ATTAATAAGC
GAAGATAAAG CAATGTCATT AGAAAAAGTT TCTCTCTCAG ATTTAGGTAA AGCCAGAAAA
ATAACCATTA CAAAAGACTC GACAACTATC GTTGCTAATG ATGACCATCG CAAAGCTGTG
GAGTCACGAG TAGCTTCTAT TAAAAGAGAA TTAGATAGCA CTGATTCTGA TTACGACCGA
GAGAAGTTGA ATGAGCGAAT AGCAAAACTT GCTGGGGGAG TAGCTGTAAT TAAAGTAGGG
GCGCCAACTG AAACAGAGTT AAAGAATCGA AAACTTAGGA TTGAAGACGC TTTAAATGCA
ACTCGTGCTG CAGTAGAAGA AGGAATTGTT GCAGGAGGTG GGAGCACTCT TCTTCAATTA
AGTAATGAGC TCAATAGTCT TTCAAAAGAG TTAAGTGGTG ATAAGAAAAC TGGAGTTGAC
ATAATTAAAA AAGCCTTATC AGCTCCAGCC AGGCAAATAG CTGTAAATGC AGGAGAGAAT
GGAGATGTTG TTGTATCTCA AATTGAACAA CTGGGGAAAG GCTTTAATGC TGCCACAGGA
CAATATGAGG ACCTTCTTTC CACTGGCATA ATCGATGCAG TGAAAGTAAT ACGACTAGCA
CTTCAAGATG CAGTTTCAAT CGCTTCACTA ATCATCACTA CAGAAGTAGT AATTGCCGAC
AAGCCTGAAC CACCAGCAGC TCCAGGGGCA GAAGGGGCTG GAGACCCAAT GGGTGGTATG
GGTGGTATGG GTGGTATGGG TGGTATGGGT GGTATGATGG GTGGCATGGG TGGCATGGGT
ATGCCTGGAA TGATGTAA
 
Protein sequence
MAKLLSSSDE SRGALEKGVD ALANAVKVTI GPKGRNVVLE KKFGAPDIVN DGVSIAKDIE 
LEDPFENLGA KLIEQVASKT KDKAGDGTTT ATVLAQVMVH EGLKNTAAGA SPIELRRGME
KAVSFIVEKL QQKSKGISGN EILQVATVSS GGDEEIGEMV AEAMEKVSVD GVITVEESKS
LNTELEITEG MAFDRGYSSP YFVTDADRQI CEFENPLLLI TDRKISSIGD LVPVLEAVQK
SGSPLVILSE EVEGEALATL VVNKNRGVLQ VAAVRAPSFG ERRKAALADI SVLTGGTLIS
EDKAMSLEKV SLSDLGKARK ITITKDSTTI VANDDHRKAV ESRVASIKRE LDSTDSDYDR
EKLNERIAKL AGGVAVIKVG APTETELKNR KLRIEDALNA TRAAVEEGIV AGGGSTLLQL
SNELNSLSKE LSGDKKTGVD IIKKALSAPA RQIAVNAGEN GDVVVSQIEQ LGKGFNAATG
QYEDLLSTGI IDAVKVIRLA LQDAVSIASL IITTEVVIAD KPEPPAAPGA EGAGDPMGGM
GGMGGMGGMG GMMGGMGGMG MPGMM