Gene PCC8801_2854 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPCC8801_2854 
SymbolgroEL 
ID7104377 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCyanothece sp. PCC 8801 
KingdomBacteria 
Replicon accessionNC_011726 
Strand
Start bp2944457 
End bp2946082 
Gene Length1626 bp 
Protein Length541 aa 
Translation table11 
GC content49% 
IMG OID643475890 
Productchaperonin GroEL 
Protein accessionYP_002373009 
Protein GI218247638 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0459] Chaperonin GroEL (HSP60 family) 
TIGRFAM ID[TIGR02348] chaperonin GroL 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCTAAAT CTATTGTCTA TAACGAAGAT GCACGTCGCG CCCTAGAGAA AGGGATGGAC 
ATTCTGGCTG AAGCAGTGGC CGTTACCCTC GGTCCCAAAG GTCGTAACGT CGTACTAGAG
AAAAAATTTG GCGCACCCCA AATTATTAAC GATGGGATCA CCATTGCTAA AGAAATTGAA
CTAGAAGATC ATATCGAGAA TACTGGGGTA GCTTTGATCC GTCAAGCTGC CTCTAAAACG
AACGATGTCG CTGGAGATGG AACTACCACC GCTACCGTTC TGGCCCATGC TATTGTTAAA
GAAGGATTAC GAAACGTAGC GGCGGGAGCT AACCCCATTT CCCTCAAACG AGGCATAGAC
AAAGCCACCG AATTTTTAGT GGAAAAAATT GCTGCTTACG CTAAACCCGT TGAAGACTCC
AAAGCGATCG CTCAAGTGGG AGCCATCTCT GCGGGGAATG ATGACGAAGT GGGTCAAATG
ATCGCTAATG CCATGGACAA AGTGGGCAAA GAAGGGGTTA TTTCCCTTGA AGAAGGTAAG
TCCATGACGA CCGAATTGGA AATTACCGAA GGGATGCGCT TTGATAAAGG CTACATTTCT
CCCTACTTTG TCACCGACAC CGAACGGATG GAATGTGTCT TAGACGATCC TGCCATTCTC
TTAACCGATA AGAAAATTAC CCTAGTTCAA GACTTAGTAC CCGTTCTCGA ACAAGTTGCC
CGTCAAGGCA AACCTTTAGT TATCATTGCT GAAGATATCG AAAAAGAAGC CCTCGCTACC
TTAGTGGTGA ACCGTCTGCG GGGTGTTCTG ACCGTTGCTG CGGTTAAAGC CCCTGGTTTT
GGCGATCGCC GTAAGGCTAT GCTTGAAGAT ATCGCCGTTC TCACGGGTGG TCAGGTGATC
AGCGAAGATG CGGGTCTGAA GTTGGAAAAT ACCAAGATTG AAATGCTCGG TACTGCCCGC
CGTATCACCT TAACCAAAGA CAATACCACC ATTGTGGCTG AAGGACACGA CGCTGCCGTT
AAAAGCCGTT GTGAACTGAT CCGCCGTCAA ATGGAAGATA CTGAGTCTTC CTACGACAAG
GAAAAACTCC AAGAACGCTT AGCGAAATTG TCTGGTGGGG TCGCTGTTAT CAAAGTAGGG
GCTGCTACTG AGACGGAAAT GAAAGATCGC AAGCTACGCC TCGAAGATGC CATCAACGCG
ACAAAAGCTG CCGTTGAAGA GGGGATCGTT CCGGGGGGTG GTACTACCTT AGCTCACTTG
GCTCCTCAAT TGGAAGCCTG GGCTAATAGT ACCCTCAGCA ATGAAGAGTT GACCGGTGCT
CTTATTGTAT CGCGCGCTTT GACGGCTCCT CTCAAACGTA TTGCCGAAAA TGCGGGACAA
AACGGTGCGG TTATCGCTGA ACGGGTGAAG GAAAAAGACT TTAACGTTGG TTACGATGCT
GCTACTGGAG AGTTTGCCGA TATGTTCGAG GTGGGAGTCG TTGACCCCGC TAAAGTGACC
CGTTCTGGAC TGCAAAATGC GGCTTCTATT GCAGGAATGA TCCTCACTAC TGAATGTATT
GTTGTGGATA AGCCTGAGAA AGACAAACCC GCGGCCGGTG GTGGCGGTGG AGATTTTGAC
TACTAA
 
Protein sequence
MAKSIVYNED ARRALEKGMD ILAEAVAVTL GPKGRNVVLE KKFGAPQIIN DGITIAKEIE 
LEDHIENTGV ALIRQAASKT NDVAGDGTTT ATVLAHAIVK EGLRNVAAGA NPISLKRGID
KATEFLVEKI AAYAKPVEDS KAIAQVGAIS AGNDDEVGQM IANAMDKVGK EGVISLEEGK
SMTTELEITE GMRFDKGYIS PYFVTDTERM ECVLDDPAIL LTDKKITLVQ DLVPVLEQVA
RQGKPLVIIA EDIEKEALAT LVVNRLRGVL TVAAVKAPGF GDRRKAMLED IAVLTGGQVI
SEDAGLKLEN TKIEMLGTAR RITLTKDNTT IVAEGHDAAV KSRCELIRRQ MEDTESSYDK
EKLQERLAKL SGGVAVIKVG AATETEMKDR KLRLEDAINA TKAAVEEGIV PGGGTTLAHL
APQLEAWANS TLSNEELTGA LIVSRALTAP LKRIAENAGQ NGAVIAERVK EKDFNVGYDA
ATGEFADMFE VGVVDPAKVT RSGLQNAASI AGMILTTECI VVDKPEKDKP AAGGGGGDFD
Y