Gene Syncc9902_0506 details


Gene Information

Locus tag: Syncc9902_0506
Symbol: groEL
ID: 3742273
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Synechococcus sp. CC9902
Kingdom: Bacteria
Replicon accession: NC_007513
Strand:
Start bp: 519237
End bp: 520871
Gene Length: 1635 bp
Protein Length: 544 aa
Translation table: 11
GC content: 53%
IMG OID: 637770677
Product: chaperonin GroEL
Protein accession: YP_376518
Protein GI: 78184083
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0459] Chaperonin GroEL (HSP60 family)
TIGRFAM ID: [TIGR02348] chaperonin GroL
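
The coordinates and lengths in this record are mutually consistent, which a few lines of arithmetic can confirm. The sketch below is illustrative Python, assuming 1-based inclusive coordinates and a single terminal stop codon; the variable names are not part of the database record.

```python
# Values copied from the gene record.
start_bp = 519237
end_bp = 520871

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with exactly one stop codon, which is not translated.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")      # compare with "Gene Length"
print(protein_length, "aa")   # compare with "Protein Length"
```

Under these assumptions the computed values agree with the listed gene length of 1635 bp and protein length of 544 aa.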


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.982033
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCTAAGC GCATTATTTA TAACGAGAAC GCTCGCCGCG CTCTCGAAAA AGGCATTGAC 
ATTCTTGCCG AATCCGTTGC TGTCACGTTG GGACCTAAAG GTCGCAACGT TGTTCTCGAG
AAGAAATTCG GTTCTCCTCA GATCATCAAT GATGGTGTCA CCATCGCCAA AGAGATTGAG
CTTGAAGATC ACATTGAGAA CACCGGTGTT GCTCTGATTC GCCAGGCCGC ATCGAAAACC
AACGATGCCG CTGGTGATGG CACCACAACG GCTACCGTCC TTGCCCACGC CATGGTGAAG
GCCGGCCTGC GTAACGTTGC CGCAGGAGCC AACGCCATCA CCCTTAAGAA AGGGATTGAT
AAGGCGTCCG ATTTCTTGGT CGGCAAGATC AAAGACATGG CAAAGCCCAT TGCAGATAGC
AATGCCATTG CCCAGGTCGG CACGATTTCC GCCGGTAATG ACGAAGAAGT CGGCAAAATG
ATCGCCGATG CCATGGATAA GGTCGGTAAA GAGGGCGTGA TTTCCCTGGA AGAAGGCAAG
TCCATGGAGA CTGAGCTCGA GGTCACGGAA GGCATGCGTT TCGATAAGGG ATATATCTCC
CCTTATTTCG CCACCGACAC CGAGCGGATG GAAGCTGTTC TCGATGAGCC TTACATCCTC
TTAACCGATA AGAAGATTGG TCTCGTCCAA GATCTCGTTC CTGTGCTCGA GCAAATTGCA
CGCACTGGAA AACCTCTTCT GATCATTGCT GAGGACATTG AGAAAGAAGC TCTCGCCACC
CTCGTGGTGA ACCGCCTGCG CGGTGTGCTG AATGTGGCAG CCGTTAAGGC TCCTGGTTTT
GGTGATCGCC GTAAGGCCAT GCTTGAAGAC ATGGCTGTGC TGACGAATGG TCAGCTGATC
ACAGAGGACG CTGGCCTCAA ACTCGAGAAC GCCAAGCTGG AGATGTTGGG CACAGCCCGT
CGCATCACCA TCAACAAAGA CACCACCACC ATCGTTGCCG AAGGCAATGA GGCTGCTGTT
GGCGCTCGTT GCGAGCAGAT TAAGAAGCAG ATGGACGAAA CCGACTCCAC CTACGACAAA
GAAAAGCTGC AAGAACGTCT GGCCAAATTG GCTGGTGGTG TTGCTGTTGT GAAAGTTGGT
GCAGCGACCG AAACCGAGAT GAAGGACAAA AAACTTCGTC TGGAAGACGC CATCAATGCC
ACCAAGGCCG CCGTTGAAGA AGGCATTGTT CCAGGTGGTG GTACCACCTT GGCTCACCTG
GCTCCAGCCC TCGAAGAGTG GGCCAATGGC AACCTCTCCG GTGAAGAGTT GATTGGTGCA
AACATTGTTG CGGCAGCTTT AACAGCTCCA TTGATGCGCA TCGCTGAAAA TGCTGGTGCA
AACGGTGCTG TGGTTGCCGA AAATGTTAAG TCGCGTTCAA ACAACGAGGG CTATAACGCT
GCCAATGGCG ACTACGTCGA CATGCTGGCC GCCGGCATTG TTGACCCTGC CAAGGTGACG
CGTTCTGGCT TGCAGAATGC TGCGTCAATC GCTGGCATGG TTCTTACCAC TGAGTGCATT
GTGGCTGACT TACCCGAGAA GAAAGACGCT GCTCCTGCCG GTGGTGGCAT GGGTGGTGGC
GACTTCGATT ATTGA
 
Protein sequence
MAKRIIYNEN ARRALEKGID ILAESVAVTL GPKGRNVVLE KKFGSPQIIN DGVTIAKEIE 
LEDHIENTGV ALIRQAASKT NDAAGDGTTT ATVLAHAMVK AGLRNVAAGA NAITLKKGID
KASDFLVGKI KDMAKPIADS NAIAQVGTIS AGNDEEVGKM IADAMDKVGK EGVISLEEGK
SMETELEVTE GMRFDKGYIS PYFATDTERM EAVLDEPYIL LTDKKIGLVQ DLVPVLEQIA
RTGKPLLIIA EDIEKEALAT LVVNRLRGVL NVAAVKAPGF GDRRKAMLED MAVLTNGQLI
TEDAGLKLEN AKLEMLGTAR RITINKDTTT IVAEGNEAAV GARCEQIKKQ MDETDSTYDK
EKLQERLAKL AGGVAVVKVG AATETEMKDK KLRLEDAINA TKAAVEEGIV PGGGTTLAHL
APALEEWANG NLSGEELIGA NIVAAALTAP LMRIAENAGA NGAVVAENVK SRSNNEGYNA
ANGDYVDMLA AGIVDPAKVT RSGLQNAASI AGMVLTTECI VADLPEKKDA APAGGGMGGG
DFDY
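
The protein sequence can be reproduced from the gene sequence by translating it with translation table 11. The sketch below is a minimal, dependency-free Python illustration, not the database's own method. It relies on the fact that table 11 assigns the same amino acid to each codon as the standard code (the two differ only in permitted start codons), and it translates only the first 30 and last 15 bases of the gene sequence above. The names `CODON_TABLE` and `translate` are hypothetical helpers introduced for this example.

```python
# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, ignoring spaces and case."""
    dna = dna.replace(" ", "").upper()
    usable = len(dna) - len(dna) % 3  # drop any trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First 30 bases and last 15 bases of the gene sequence, copied from above.
first_block = translate("ATGGCTAAGC GCATTATTTA TAACGAGAAC")
last_block = translate("GACTTCGATT ATTGA")

print(first_block)  # compare with the first 10 residues of the protein
print(last_block)   # last residues followed by '*' for the stop codon
```

The two fragments translate to MAKRIIYNEN and DFDY followed by a stop, matching the start and end of the listed protein sequence.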