Gene Syncc9902_1747 details


Gene Information

Locus tag: Syncc9902_1747
Symbol: groEL
ID: 3742157
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Synechococcus sp. CC9902
Kingdom: Bacteria
Replicon accession: NC_007513
Strand:
Start bp: 1679160
End bp: 1680848
Gene Length: 1689 bp
Protein Length: 562 aa
Translation table: 11
GC content: 56%
IMG OID: 637771938
Product: chaperonin GroEL
Protein accession: YP_377748
Protein GI: 78185313
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0459] Chaperonin GroEL (HSP60 family)
TIGRFAM ID: [TIGR02348] chaperonin GroL
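
The coordinate and length fields in this record can be cross-checked against each other. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The variable names are hypothetical and not part of the record.

```python
# Values copied from the gene record.
start_bp = 1679160
end_bp = 1680848

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS is a whole number of codons and ends with a stop codon,
# which contributes no residue to the protein.
assert gene_length % 3 == 0
protein_length = gene_length // 3 - 1

print(gene_length, "bp")
print(protein_length, "aa")
```

Under these assumptions the computed values agree with the Gene Length (1689 bp) and Protein Length (562 aa) fields.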


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCCAAAC TTCTTTCTTT CTCCGACGAA TCACGCAGTT CCCTTGAGCG TGGTGTGAAC 
GCCCTTGCCA ATGCTGTCCG AGTCACCATC GGACCCAAGG GTCGGAATGT CGTTCTCGAG
AAGAAATTTG GCGCCCCAGA CATCGTTAAT GACGGCGATA CCATTGCCCG CGATATCGAG
CTGGAAGATC CTTTTGAAAA TCTCGGCGCC AAGCTGATTC AACAGGTGGC ATCCAGAACT
AAAGACAAAG CTGGAGACGG CACCACCACG GCCACGGTTT TAGCTCAGGC CATGGTTCGC
GAAGGACTCC GCAACACAGC AGCAGGAGCT AGCCCTGTAG AGCTTCGTCG TGGGATGGAG
AAAGCGGCAG CACAGGTGGT TGCCGGTTTG GCGAGCCGAA GTCAGGCCGT CGAAGGTGAT
TCCATCCAAC AGGTGGCCAC GGTGAGTTCC AGTGGCGATG AAGAAGTGGG TCGGATGATC
GCTGAAGCGA TGGATCGGGT CAGCGTGGAC GGCGTCATCA CCGTTGAAGA ATCCAAATCG
CTCGCCACCG AAATGGAGGT GACTGAAGGC ATGGCATTCG ATCGCGGATA CAGCTCGCCC
TATTTCGTCA CGGATGCTGA TCGTCAGGTT TGTGAATTCG AAAATCCATT GATCCTGCTG
ACCGATCGAA AGATCAGCAC CGTCATCGAT TTAGTGCCCG TTCTTGAAGC GGTTCAAAAA
AGTGGCTCGC CGCTTTTAAT CCTCTCGGAA GAGGTGGAGG GGGAAGCCCT GGCCACCTTG
GTGATGAACA AGAGCCGTGG CGTCCTCCAA GTGGCAGCAG TGCGTGCTCC TTCCTTCGGA
GACCGTCGCA AAGCAGCCTT GGCTGATATC GCCATCCTCA CGGGGGGCAC CTTAATCAGC
GAAGACCAAG CGATGACTCT CGACAAGGTG ACGCTCGAGG ATCTCGGTCA CGCCCGTCGG
GTGACGATCA GCAAAGAGAG CACCACCATC GTTGCGAATG ACAATCACAG TGAAGCGGTG
AGCAATCGTG TTGCCGCAAT CAAGCGAGAG CTCGACGCGA CAGAGTCGGA TTACGACCGC
GAAAAGCTGA ATGAGCGGAT TGCCAAACTG GCCGGTGGTG TTGCCGTCAT CAAGGTGGGT
GCTGCAACAG AAACCGAACT GAAAAACCGC AAACTGCGAA TTGAAGACGC CCTGAATGCC
ACCCGTGCCG CTGTGGAAGA AGGAATCGTG GCTGGAGGCG GAAGCACGTT GCTTCAGCTC
GCTGAAGACC TCAACGCCCT AGCGGCACAA CTGGACGGCG ATCAACGCAC CGGCGTAGAA
ATTGTGCAGC GATCACTCAC CGCACCCGTC CACCAGATCG CAACCAATGC AGGACATAAC
GGTGACGTGG TGATCGAAAC GATGCGCCAA AGCGGTCAGG GATTCAATGC CCTAACGGGT
GTGTACGAAG ACTTGATGGC GACAGGCATC GTTGATGCCA CCAAAGTTGT TCGACTTGCA
GTACAGGACG CGGTGTCGAT TGCATCCCTG CTGGTCACAA CTGAGGTAGT GATTGCTGAC
AAACCAGAAC CAGAACCTCC TGCTGGAGCT GGAGGTGAAG ATCCCATGGG TGGAATGGGC
GGCATGGGTG GCATGGGCGG TATGGGTATG CCTGGCATGG GCGGCATGGG CATGCCTGGA
ATGATGTGA
 
Protein sequence
MAKLLSFSDE SRSSLERGVN ALANAVRVTI GPKGRNVVLE KKFGAPDIVN DGDTIARDIE 
LEDPFENLGA KLIQQVASRT KDKAGDGTTT ATVLAQAMVR EGLRNTAAGA SPVELRRGME
KAAAQVVAGL ASRSQAVEGD SIQQVATVSS SGDEEVGRMI AEAMDRVSVD GVITVEESKS
LATEMEVTEG MAFDRGYSSP YFVTDADRQV CEFENPLILL TDRKISTVID LVPVLEAVQK
SGSPLLILSE EVEGEALATL VMNKSRGVLQ VAAVRAPSFG DRRKAALADI AILTGGTLIS
EDQAMTLDKV TLEDLGHARR VTISKESTTI VANDNHSEAV SNRVAAIKRE LDATESDYDR
EKLNERIAKL AGGVAVIKVG AATETELKNR KLRIEDALNA TRAAVEEGIV AGGGSTLLQL
AEDLNALAAQ LDGDQRTGVE IVQRSLTAPV HQIATNAGHN GDVVIETMRQ SGQGFNALTG
VYEDLMATGI VDATKVVRLA VQDAVSIASL LVTTEVVIAD KPEPEPPAGA GGEDPMGGMG
GMGGMGGMGM PGMGGMGMPG MM
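
The relationship between the gene sequence and the protein sequence can be checked by translating the DNA codon by codon. The following is a minimal sketch, not the database's own method. It assumes the codon-to-amino-acid assignments of NCBI translation table 11, which match the standard table for amino acids, and the helper names `CODON_TABLE` and `translate` are hypothetical. Only the first three blocks and the final block of the gene sequence are used here; the full sequence can be pasted in the same way.

```python
# Build a codon table in the conventional NCBI ordering (bases T, C, A, G).
# Assumption: amino-acid assignments for translation table 11 are the same
# as in the standard genetic code; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in this record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence, copied from the record.
first_blocks = "ATGGCCAAAC TTCTTTCTTT CTCCGACGAA"
# Final 9 bases of the gene sequence, copied from the record.
last_block = "ATGATGTGA"

print(translate(first_blocks))
print(translate(last_block))
```

The first call reproduces the opening ten residues of the protein sequence (MAKLLSFSDE), and the second reproduces its final two residues (MM) followed by the stop codon TGA.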