Gene Cyan8802_1217 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCyan8802_1217 
Symbol 
ID8390528 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCyanothece sp. PCC 8802 
KingdomBacteria 
Replicon accessionNC_013161 
Strand
Start bp1236916 
End bp1238583 
Gene Length1668 bp 
Protein Length555 aa 
Translation table11 
GC content50% 
IMG OID644979226 
Productchaperonin GroEL 
Protein accessionYP_003136977 
Protein GI257059089 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0459] Chaperonin GroEL (HSP60 family) 
TIGRFAM ID[TIGR02348] chaperonin GroL 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAAAAA TAGTTTCCTT CAGTGATGAA TCAAGACGGT CCTTAGAGCA AGGCGTTAAT 
GCTTTGGCCA ATGCGGTTCG TATTACCCTC GGTCCAAAAG GCCGCAATGT CTTACTCGAA
AAGAAATTTG GAGCCCCCCA AATCGTTAAC GATGGGATCA CCGTCGCCAA AGAAATTGAA
CTCGAAGATC CCTTAGAAAA CACTGGGGCT AGACTCATTC AAGAGATCGC CTCTAAAACC
AAAGAAGTCG CCGGAGATGG AACCACCACC GCCACCGTGA TCGGTCAAGC CCTCATTCGG
GAAGGATTAA AAAACGTCAT TGCCGGAGCC AATCCTGTAG CCTTACGACG AGGTATTGAA
AAAACCGTCG CGTATTTAGT CGAAGAAATT GCCTCGGTTT CTCAACCGGT CGCTGGTGAT
GCCATTGCCC AAGTCGCTAC CGTTTCCTCA GGAAATGACG AAGAAGTGGG CAAAATGATC
GCCCTAGCCA TGGATAAAGT GACCACTGAT GGGGTGATCA CCGTCGAAGA ATCAAAATCC
CTCACCACCG AATTAGACGT AGTAGAAGGG ATGCAAATCG ATCGCGGCTA TCTATCCCCC
TACTTCATCA GTGATCCCGA ACGACAACTG GTGGAATTTG AAAAGCCCTA TATTCTGATT
ACCGACAAAA AAATTAGCGC GATCGCCGAT TTAGTTCCCG TTCTCGAAAA CGTAGCGCGT
TCTGGCAGTC CCCTACTGAT CATTGCTGAA GACGTAGAAG GAGAAGCCCT CGCTACCCTG
GTGGTGAACA AAGCCAGAGG GGTCTTAAAC GTCGCAGCAA TTAAAGCCCC TGCCTTTGGC
GAACGACGCA AAGCTGTGTT ACAAGATATT GCCATCCTTA CGGGAGGGAG CGTCATTTCC
GAAGAAGTCG GGCTGACCTT AGATGCAGTA TCCGTTGATA TGCTCGGACA AGCCGATAAA
ATCACTATTG AGAAAGACAA TACTACCATT GTCGCCACCG GAGACAGCAA AACCAAAGGG
GCGATCGAAA AACGGGTCGC CCAACTACGC AAACAACTCG AAGAAACCGA CTCAGAGTAC
GATAAAGAAA AACTCACCGA ACGCATTGCT AAATTAGCGG GTGGGGTCGC CGTGATTAAA
GTGGGAGCAG CCACGGAAAC CGAACTCAAG GATCGTAAGC TGCGCATTGA AGATGCTCTT
AATGCCACTA AAGCGGCCAT CGAAGAAGGC ATCGTCCCCG GTGGTGGAAC CACGTTAATT
CACTTGGCCA AAAAGGTACT TGAGTTTAAG CAAAGCTTAA CCAACCCTGA AGAACAGGTC
GCGGCCGATA TCGTTGCTAA AGCCTTAGAA GCTCCTTTGC GCCAATTGGC TGACAATGCA
GGGGTTGAAG GCTCTGTGAT CATCGACCGT GTGCGTAACA CAGACTTTAA TGTGGGCTAC
AATGCCATGT CAGGGGAATT TGAGGATATG ATCGCGGCGG GCATCATTGA TCCGGCTAAA
GTGGTTCGTT GTGCTGTGCA AAATGCAGCC TCGATTGCCG GAATGGTCTT AACCACAGAA
GCTCTCGTCG TTGAAAAACC TGAACCCGCG GCTCCCCCTC CCCCTGATAT GGGCGGTATG
GGCGGTATGG GTGGTATGGG CGGCATGGGC GGCATGGGCA TGATGTAG
 
Protein sequence
MAKIVSFSDE SRRSLEQGVN ALANAVRITL GPKGRNVLLE KKFGAPQIVN DGITVAKEIE 
LEDPLENTGA RLIQEIASKT KEVAGDGTTT ATVIGQALIR EGLKNVIAGA NPVALRRGIE
KTVAYLVEEI ASVSQPVAGD AIAQVATVSS GNDEEVGKMI ALAMDKVTTD GVITVEESKS
LTTELDVVEG MQIDRGYLSP YFISDPERQL VEFEKPYILI TDKKISAIAD LVPVLENVAR
SGSPLLIIAE DVEGEALATL VVNKARGVLN VAAIKAPAFG ERRKAVLQDI AILTGGSVIS
EEVGLTLDAV SVDMLGQADK ITIEKDNTTI VATGDSKTKG AIEKRVAQLR KQLEETDSEY
DKEKLTERIA KLAGGVAVIK VGAATETELK DRKLRIEDAL NATKAAIEEG IVPGGGTTLI
HLAKKVLEFK QSLTNPEEQV AADIVAKALE APLRQLADNA GVEGSVIIDR VRNTDFNVGY
NAMSGEFEDM IAAGIIDPAK VVRCAVQNAA SIAGMVLTTE ALVVEKPEPA APPPPDMGGM
GGMGGMGGMG GMGMM