Gene PCC8801_1190 details

Gene Information

Locus tag: PCC8801_1190
Symbol:
ID: 7104892
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Cyanothece sp. PCC 8801
Kingdom: Bacteria
Replicon accession: NC_011726
Strand:
Start bp: 1238236
End bp: 1239903
Gene Length: 1668 bp
Protein Length: 555 aa
Translation table: 11
GC content: 50%
IMG OID: 643474276
Product: chaperonin GroEL
Protein accession: YP_002371414
Protein GI: 218246043
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0459] Chaperonin GroEL (HSP60 family)
TIGRFAM ID: [TIGR02348] chaperonin GroL
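
The start, end, and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for a CDS, assuming one terminal stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1  # drop the stop codon


length = gene_length_bp(1238236, 1239903)
print(length)                     # 1668
print(protein_length_aa(length))  # 555
```

Under these assumptions both values agree with the Gene Length and Protein Length fields listed above.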


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCAAAAA TAGTTTCCTT CAGTGATGAA TCAAGACGGT CCTTAGAGCA AGGCGTTAAT 
GCTTTGGCCA ATGCGGTTCG GATTACCCTC GGTCCCAAAG GCCGTAACGT CCTACTCGAA
AAGAAATTTG GAGCCCCCCA AATCGTTAAC GATGGGATCA CCGTCGCCAA AGAAATTGAA
CTCGAAGATC CCTTAGAAAA CACTGGGGCT AGACTCATTC AAGAGATTGC CTCGAAAACC
AAAGAAGTGG CCGGAGATGG AACCACCACC GCCACCGTGA TCGGTCAAGC CCTCATTCGG
GAAGGATTAA AAAACGTCAT TGCCGGAGCC AATCCTGTAG CCTTGCGACG GGGTATTGAG
AAAACCGTCG CCTATTTAGT CGAAGAAATT GCCTCCGTTT CTCAACCGGT CGCTGGTGAT
GCCATTGCCC AAGTCGCTAC CGTTTCCTCA GGAAATGACG AAGAAGTGGG CAAAATGATC
GCCCTAGCCA TGGATAAAGT GACCACCGAT GGGGTGATCA CCGTCGAAGA ATCAAAATCC
CTCACCACCG AATTAGACGT AGTAGAAGGG ATGCAAATCG ATCGCGGCTA TCTATCCCCC
TACTTCATCA GTGATCCCGA ACGACAACTG GTGGAATTTG AAAAGCCCTA TATTCTGATT
ACCGACAAAA AAATTAGCGC GATCGCCGAT TTAGTTCCCG TTCTCGAAAA CGTAGCGCGT
TCTGGCAGTC CCCTACTGAT TATTGCTGAA GACGTAGAAG GAGAAGCCCT CGCTACCCTG
GTGGTGAACA AAGCCAGAGG GGTCTTAAAC GTCGCAGCAA TTAAAGCCCC TGCCTTTGGC
GAACGACGCA AAGCTGTGTT ACAAGATATT GCTATCCTCA CGGGAGGGAG CGTCATTTCC
GAAGAAGTCG GGCTGACCTT AGATGCAGTA TCCGTTGATA TGCTCGGACA AGCCGATAAA
ATCACTATTG AGAAAGACAA TACTACCATT GTCGCCACTG GAGACAGCAA AACCAAAGGG
GCGATCGAAA AACGGGTCGC CCAACTACGC AAACAACTCG AAGAAACCGA CTCAGAGTAC
GATAAAGAAA AACTCACCGA ACGCATTGCT AAATTAGCGG GTGGGGTCGC CGTGATTAAA
GTGGGAGCAG CCACAGAAAC TGAACTCAAG GATCGTAAGC TGCGTATTGA AGATGCTCTT
AATGCCACTA AAGCGGCCAT CGAAGAAGGC ATCGTCCCCG GTGGTGGAAC CACGTTAATT
CACTTGGCCA AAAAGGTACT TGAATTTAAG CAAAGCTTAA CCAACCCTGA AGAACAGGTC
GCGGCCGATA TCGTTGCTAA AGCCTTAGAA GCTCCTTTGC GCCAATTGGC TGACAATGCA
GGGGTTGAAG GGTCTGTGAT CATCGACCGT GTGCGTAACA CAGACTTTAA TGTGGGCTAC
AATGCCATGT CAGGGGAATT TGAGGATATG ATCGCGGCGG GCATCATTGA TCCGGCTAAA
GTGGTTCGTT GTGCTGTGCA AAATGCAGCC TCGATTGCCG GAATGGTCTT AACCACAGAA
GCTCTCGTCG TTGAAAAACC TGAACCCGCA GCTCCTCCTC CTCCTGATAT GGGCGGCATG
GGCGGTATGG GCGGCATGGG CGGCATGGGC GGCATGGGTA TGATGTAG
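
The GC content field can be recomputed from the gene sequence. This is a minimal sketch, assuming GC content is defined as the fraction of G and C bases over all bases and that the sequence is pasted in the 10-base blocks shown above. The helper name `gc_content` is hypothetical.

```python
def gc_content(sequence: str) -> float:
    """Fraction of G/C bases; whitespace between blocks is ignored."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(1 for base in bases if base in "GC") / len(bases)


# First line of the gene sequence, used here only as a short demonstration.
first_line = "ATGGCAAAAA TAGTTTCCTT CAGTGATGAA TCAAGACGGT CCTTAGAGCA AGGCGTTAAT"
print(f"{gc_content(first_line):.0%}")
```

Passing the full 1668 bp sequence instead of `first_line` gives the value to compare against the rounded figure in the record.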
 
Protein sequence
MAKIVSFSDE SRRSLEQGVN ALANAVRITL GPKGRNVLLE KKFGAPQIVN DGITVAKEIE 
LEDPLENTGA RLIQEIASKT KEVAGDGTTT ATVIGQALIR EGLKNVIAGA NPVALRRGIE
KTVAYLVEEI ASVSQPVAGD AIAQVATVSS GNDEEVGKMI ALAMDKVTTD GVITVEESKS
LTTELDVVEG MQIDRGYLSP YFISDPERQL VEFEKPYILI TDKKISAIAD LVPVLENVAR
SGSPLLIIAE DVEGEALATL VVNKARGVLN VAAIKAPAFG ERRKAVLQDI AILTGGSVIS
EEVGLTLDAV SVDMLGQADK ITIEKDNTTI VATGDSKTKG AIEKRVAQLR KQLEETDSEY
DKEKLTERIA KLAGGVAVIK VGAATETELK DRKLRIEDAL NATKAAIEEG IVPGGGTTLI
HLAKKVLEFK QSLTNPEEQV AADIVAKALE APLRQLADNA GVEGSVIIDR VRNTDFNVGY
NAMSGEFEDM IAAGIIDPAK VVRCAVQNAA SIAGMVLTTE ALVVEKPEPA APPPPDMGGM
GGMGGMGGMG GMGMM
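
The protein sequence follows from the gene sequence by codon translation. The sketch below builds a codon table from the standard amino-acid assignments, which translation table 11 shares with the standard code for internal codons, and stops at the first stop codon. It ignores table 11's alternative start codons, which is safe here because this gene begins with ATG. The `translate` function is an illustrative helper, not a library call.

```python
from itertools import product

# Amino acids in NCBI order for codons enumerated over T, C, A, G.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product("TCAG", repeat=3), AMINO_ACIDS)
}


def translate(sequence: str) -> str:
    """Translate a coding sequence up to, but not including, the first stop."""
    bases = "".join(sequence.split()).upper()
    residues = []
    for i in range(0, len(bases) - len(bases) % 3, 3):
        aa = CODON_TABLE[bases[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 60 bases of the gene sequence above.
first_line = "ATGGCAAAAA TAGTTTCCTT CAGTGATGAA TCAAGACGGT CCTTAGAGCA AGGCGTTAAT"
print(translate(first_line))  # MAKIVSFSDESRRSLEQGVN
```

The output matches the first 20 residues of the protein sequence listed above.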