Gene P9303_05041 details


Gene Information

Locus tag: P9303_05041
Symbol: groEL
ID: 4776858
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. MIT 9303
Kingdom: Bacteria
Replicon accession: NC_008820
Strand:
Start bp: 495716
End bp: 497350
Gene Length: 1635 bp
Protein Length: 544 aa
Translation table: 11
GC content: 55%
IMG OID: 640086008
Product: chaperonin GroEL
Protein accession: YP_001016521
Protein GI: 124022214
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0459] Chaperonin GroEL (HSP60 family)
TIGRFAM ID: [TIGR02348] chaperonin GroL
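
The coordinates and lengths in this record can be cross-checked against each other. The sketch below is only an illustration: it assumes the start and end positions are 1-based and inclusive, and that the gene length includes one terminal stop codon that is not counted in the protein length. The variable names are hypothetical; the numbers are copied from the record.

```python
# Values copied from the gene record.
start_bp = 495716
end_bp = 497350
gene_length_bp = 1635
protein_length_aa = 544

# Assumption: coordinates are 1-based and inclusive on the replicon.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS ends with a single stop codon, which is not
# counted in the reported protein length.
total_codons = gene_length_bp // 3
coding_codons = total_codons - 1

print("span matches gene length:", span_bp == gene_length_bp)
print("length is a multiple of 3:", gene_length_bp % 3 == 0)
print("codons match protein length:", coding_codons == protein_length_aa)
```

Under these assumptions all three checks hold for this record.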


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCTAAGC GCATTATTTA CAACGAGCAA GCTCGACGTG CCCTTGAGCG CGGCATTGAC 
ATCCTGGCCG AATCCGTGGC CGTCACACTG GGGCCCAAAG GCCGCAATGT GGTGCTTGAG
AAAAAGTTTG GTGCACCCCA AATCATCAAT GATGGCGTCA CCATCGCCAA AGAGATTGAG
CTTGAAGATC ACATTGAGAA CACCGGTGTT GCTCTGATCC GTCAGGCAGC TTCGAAGACC
AATGACGCTG CAGGTGATGG CACGACCACA GCCACTGTTT TGGCCCATGC CATGGTGAAG
GCCGGCTTGC GCAATGTGGC TGCTGGTGCC AATGCAATCT CCTTGAAGAA GGGGATCGAC
AAGGCTTCCG ACTTCCTCGT TTCCAAGATT GAGGAGCTGG CCAAGCCGAT CAGTGACAGC
AACGCCATTG CTCAGTGTGG AACCATTGCT GCCGGCAACG ACGAGGAAGT GGGTCAGATG
ATTGCTGACG CCATGGACAA AGTTGGCAAG GAAGGTGTGA TCTCTCTTGA AGAGGGCAAG
TCGATGACCA CTGAACTGGA GGTCACCGAG GGGATGCGTT TCGATAAGGG CTACATCTCC
CCTTACTTCG CCACTGACAC TGAGCGGATG GAGGCTGTGC TGGATGAGCC CTACATTCTC
CTCACTGATA AGAAGATTGG CTTGGTACAG GATCTGGTGC CTGTGCTTGA ACAGATTGCC
CGCACTGGCA AGCCCCTGCT GATCATTGCT GAGGACATTG AGAAGGAAGC TCTCGCAACC
TTGGTGGTCA ATCGTCTGCG TGGTGTGCTG AATGTGGCTG CCGTGAAGGC GCCTGGGTTT
GGTGATCGCC GTAAGGCCAT GCTCGAAGAC ATGGCAGTGC TGACCAATGG TCAGTTGATC
ACTGAAGACG CCGGTCTTAA ATTGGACAAT GCCAAGCTGG AAATGCTTGG TACAGCGCGC
CGTGTGACCA TTAATAAGGA CACCACAACC ATCGTTGCCG AAGGCAATGA GACGGCGGTG
AAAGGGCGTT GCGAGCAGAT CAAAAAGCAG ATGGACGAGA CTGACTCGAC CTACGACAAG
GAGAAGCTGC AGGAACGACT GGCCAAGTTG GCCGGTGGCG TTGCTGTTGT GAAGGTGGGT
GCTGCTACCG AAACCGAGAT GAAGGACAAG AAGCTCCGTC TTGAAGACGC CATCAACGCC
ACCAAGGCAG CAGTTGAGGA AGGCATCGTC CCAGGTGGTG GCACCACACT GACTCATCTC
GCTGCTGATC TGCAGAAGTG GGCCAATAGC AACCTCAGCG GTGAGGAGCT GATCGGTGCC
AACATCGTGG AGGCTTCCCT GGCTGCTCCC TTGATGCGCA TTGCTGAGAA TGCCGGCGCC
AATGGTGCTG TCGTGGCCGA GAACGTTAAG AGCAGGCCCA TCAGTGACGG CTATAACGCT
GCTACTGGGG ATTACATCGA CATGCTTGCT GCAGGCATTG TTGACCCTGC CAAAGTCACC
CGTTCTGGTT TGCAGAATGC GGCCTCGATT GCTGGCATGG TGCTGACCAC TGAGTGCATC
GTTGCTGATT TGCCAGAGAA GAAGGAGGCA GCTCCTGCAG GCGGTGGCGG CATGGGTGGT
GACTTCGACT ACTGA
 
Protein sequence
MAKRIIYNEQ ARRALERGID ILAESVAVTL GPKGRNVVLE KKFGAPQIIN DGVTIAKEIE 
LEDHIENTGV ALIRQAASKT NDAAGDGTTT ATVLAHAMVK AGLRNVAAGA NAISLKKGID
KASDFLVSKI EELAKPISDS NAIAQCGTIA AGNDEEVGQM IADAMDKVGK EGVISLEEGK
SMTTELEVTE GMRFDKGYIS PYFATDTERM EAVLDEPYIL LTDKKIGLVQ DLVPVLEQIA
RTGKPLLIIA EDIEKEALAT LVVNRLRGVL NVAAVKAPGF GDRRKAMLED MAVLTNGQLI
TEDAGLKLDN AKLEMLGTAR RVTINKDTTT IVAEGNETAV KGRCEQIKKQ MDETDSTYDK
EKLQERLAKL AGGVAVVKVG AATETEMKDK KLRLEDAINA TKAAVEEGIV PGGGTTLTHL
AADLQKWANS NLSGEELIGA NIVEASLAAP LMRIAENAGA NGAVVAENVK SRPISDGYNA
ATGDYIDMLA AGIVDPAKVT RSGLQNAASI AGMVLTTECI VADLPEKKEA APAGGGGMGG
DFDY
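
The relationship between the gene sequence and the protein sequence can be sketched in a few lines of Python. This is a minimal illustration, not the method used to produce the record. It assumes the sequence is given in the coding orientation, strips the 10-base grouping spaces, and uses the standard codon assignments, which translation table 11 shares with the standard code (table 11 differs only in the start codons it permits). The helper names `clean`, `translate` and `gc_fraction` are hypothetical. Only the first and last lines of the gene sequence are used here to keep the example short.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
# Table 11 uses the same assignments for elongation and stop codons.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace used to group the sequence into blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate complete codons in frame 1, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    return sum(base in "GC" for base in dna) / len(dna)


# First and last lines of the gene sequence from the record.
first_line = clean(
    "ATGGCTAAGC GCATTATTTA CAACGAGCAA GCTCGACGTG CCCTTGAGCG CGGCATTGAC"
)
last_line = clean("GACTTCGACT ACTGA")

print(translate(first_line))
print(translate(last_line))
print(round(gc_fraction(first_line), 3))
```

The two translated fragments correspond to the first 20 residues and the last 4 residues of the protein sequence listed above. Running `gc_fraction` on the full cleaned gene sequence would give the value to compare with the reported GC content.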