Gene Syncc9605_2169 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSyncc9605_2169 
SymbolgroEL 
ID3735595 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSynechococcus sp. CC9605 
KingdomBacteria 
Replicon accessionNC_007516 
Strand
Start bp1985081 
End bp1986715 
Gene Length1635 bp 
Protein Length544 aa 
Translation table11 
GC content62% 
IMG OID637776757 
Productchaperonin GroEL 
Protein accessionYP_382463 
Protein GI78213684 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0459] Chaperonin GroEL (HSP60 family) 
TIGRFAM ID[TIGR02348] chaperonin GroL 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value0.824864 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.120928 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAAAGC GCATCATCTA CAACGAGAAC GCCCGTCGCG CTCTCGAAAA AGGCATCGAC 
ATCCTCTGCG AAGCCGTCGC CGTGACCCTG GGCCCCAAAG GCCGCAACGT GGTGCTCGAG
AAAAAGTTCG GTGCACCCCA GATCATCAAT GACGGCGTCA CCATCGCCAA GGAGATCGAG
CTCGAAGACC ACATCGAGAA CACCGGTGTT GCTCTGATTC GTCAGGCCGC CTCCAAAACC
AACGACGCTG CCGGTGACGG CACCACCACC GCTACCGTTT TGGCCCATGC CATGGTCAAG
GCAGGTCTGC GCAACGTGGC TGCCGGTGCC AATGCCATCA CCCTGAAGAA GGGCATCGAC
AAGGCGTCCG ATTTCCTGGT CAGCAAGATC AAGGAAATGG CCAAGCCCAT CGCTGACAGC
AATGCCATCG CCCAGGTGGG CACCATCTCC GCTGGCAACG ACGAGGAAGT CGGCAAGATG
ATCGCCGACG CCATGGACAA GGTTGGCAAG GAAGGTGTGA TTTCCCTCGA AGAGGGCAAG
TCCATGGAGA CCGAACTCGA GGTCACCGAG GGCATGCGCT TCGACAAGGG CTACATCTCC
CCTTATTTCG CCACCGACAC CGAGCGGATG GAAGCCGTCC TCGACGAGCC CTACATCCTG
CTCACCGACA AGAAGATCGG TCTGGTGCAG GATCTGGTGC CCGTGCTGGA GCAAATCGCC
CGCACCGGCA AGCCTCTGCT GATCATTGCA GAGGACATCG AGAAGGAAGC CCTCGCCACT
CTGGTGGTGA ACCGCCTGCG CGGTGTGCTG AACGTGGCCG CCGTGAAGGC CCCTGGTTTC
GGTGATCGCC GTAAGGCCAT GTTGGAAGAC ATGGCAGTAC TGACCAACGG TCAGCTGATC
ACCGAGGACG CTGGTCTCAA GCTGGAGAAC GCCAAGCTGG AGATGCTGGG CACCGCCCGT
CGCATCACAA TCAACAAGGA CACCACCACC ATCGTTGCCG AGGGCAACGA GGCGGCTGTC
GGCGCCCGCT GCGAACAGAT CAAGAAGCAG ATGGACGAGA CCGACTCCAC CTACGACAAG
GAGAAGCTGC AGGAGCGTCT GGCCAAGCTG GCCGGTGGCG TGGCTGTGGT GAAGGTGGGT
GCGGCCACCG AAACCGAGAT GAAGGACAAG AAACTCCGTC TCGAGGACGC CATCAACGCC
ACCAAGGCGG CTGTTGAGGA AGGCATCGTT CCTGGTGGCG GCACCACCCT GGCGCACCTG
GCTCCGGCCC TGGAGCAGTG GGCAGCCTCC AGCCTGTCCG GCGAAGAACT GATCGGCGCC
AACATCGTGG CTGCTGCTCT CACCGCTCCG CTGATGCGGA TCGCTGAAAA CGCTGGTGCC
AACGGTGCTG TTGTGGCTGA GAACGTCAAG GCTCGCGCTG GTGCCGAAGG CTTCAACGCT
GCCTCCGGTG AGTACGTCGA CATGCTGGCT GCCGGCATCG TCGATCCCGC CAAGGTGACC
CGCTCCGGTC TGCAAAATGC CGCATCCATC GCTGGCATGG TGCTAACCAC CGAGTGCATC
GTGGCTGATT TGCCTGAGAA GAAGGAAGCA GCTCCTGCCG GCGGCGGCAT GGGCGGCGGC
GACTTCGACT ACTGA
 
Protein sequence
MAKRIIYNEN ARRALEKGID ILCEAVAVTL GPKGRNVVLE KKFGAPQIIN DGVTIAKEIE 
LEDHIENTGV ALIRQAASKT NDAAGDGTTT ATVLAHAMVK AGLRNVAAGA NAITLKKGID
KASDFLVSKI KEMAKPIADS NAIAQVGTIS AGNDEEVGKM IADAMDKVGK EGVISLEEGK
SMETELEVTE GMRFDKGYIS PYFATDTERM EAVLDEPYIL LTDKKIGLVQ DLVPVLEQIA
RTGKPLLIIA EDIEKEALAT LVVNRLRGVL NVAAVKAPGF GDRRKAMLED MAVLTNGQLI
TEDAGLKLEN AKLEMLGTAR RITINKDTTT IVAEGNEAAV GARCEQIKKQ MDETDSTYDK
EKLQERLAKL AGGVAVVKVG AATETEMKDK KLRLEDAINA TKAAVEEGIV PGGGTTLAHL
APALEQWAAS SLSGEELIGA NIVAAALTAP LMRIAENAGA NGAVVAENVK ARAGAEGFNA
ASGEYVDMLA AGIVDPAKVT RSGLQNAASI AGMVLTTECI VADLPEKKEA APAGGGMGGG
DFDY