Gene RPB_3234 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_3234 
SymbolgroEL 
ID3911035 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp3697445 
End bp3699088 
Gene Length1644 bp 
Protein Length547 aa 
Translation table11 
GC content66% 
IMG OID637885136 
Productchaperonin GroEL 
Protein accessionYP_486841 
Protein GI86750345 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0459] Chaperonin GroEL (HSP60 family) 
TIGRFAM ID[TIGR02348] chaperonin GroL 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.554891 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCGCTA AAGAAGTCAA ATTCGGCGTC GACGCCCGCG ACCGCATGCT GCGCGGCGTG 
GACATTCTCG CCAATGCCGT GAAGGTCACG CTCGGCCCGA AGGGCCGCAA CGTCGTGCTC
GACAAGTCGT TCGGCGCGCC CCGCATCACC AAGGACGGCG TCACCGTCGC CAAGGAGATC
GAGCTCGAGG ACAAGTTCGA GAACATGGGC GCCCAGATGG TGCGCGAAGT GGCCTCGAAG
TCCGCCGATC TCGCCGGCGA CGGCACCACT ACCGCGACCG TGCTGGCCGC GGCGATCGTA
CGTGAAGGCG CCAAGTCGGT GGCCGCCGGC ATGAACCCGA TGGATCTGAA GCGCGGCATC
GACCTGGCTG TGGAAGCCGT GGTCGCCGAC CTCGTCAAGA ACTCCAAGAA GGTCACCTCG
AACGACGAGA TCGCCCAGGT CGGCACCATC TCGGCCAATG GCGACGCGGA AATCGGCAAG
TTCCTCGCCG ACGCGATGAA GAAGGTCGGC AACGAGGGTG TCATCACCGT CGAGGAAGCC
AAGTCGCTCG AGACCGAACT CGACGTCGTC GAGGGCATGC AGTTCGACCG CGGCTACATC
TCGCCCTACT TCGTCACCAA CGCCGACAAG ATGCGCGTCG AATTCGACGA CGCCTACATC
CTGATCAACG AGAAGAAGCT CTCCAACCTC AACGAACTGC TGCCGCTGCT CGAAGCCGTG
GTGCAGACCG GCAAGCCGCT GGTGATCGTC GCTGAGGACG TCGAAGGCGA AGCGCTCGCC
ACCCTCGTCG TCAACCGCCT GCGTGGCGGC CTGAAGGTCG CCGCCGTCAA GGCGCCGGGC
TTCGGCGATC GCCGCAAGGC GATGCTGCAG GACATCGCGA TCCTGACCGG CGGCCAGGCG
ATCTCGGAAG ACCTCGGCAT CAAGATGGAG AACGTCACGC TCCAGATGCT CGGTCGCGCC
AAGAAGGTGA TGATCGACAA GGAGAACACC ACGATCGTCA ACGGCGCCGG TAAGAAGGTC
GACATCGAAG CCCGCGTCGC CCAGATCAAG GCGCAGATCG AGGAGACCAC CTCGGACTAC
GATCGCGAGA AGCTGCAGGA GCGCCTGGCC AAGCTCGCCG GCGGCGTCGC GGTGATCCGC
GTCGGCGGCG CCACCGAGAT CGAGGTCAAG GAGCGCAAGG ATCGCGTTGA CGACGCGATG
CACGCCACCC GCGCTGCGGT CGAGGAAGGC ATCGTCCCGG GCGGCGGCGT CGCTCTGCTG
CGCGCCTCCG AGCAGCTCAA GCGCATCAAG ACCGCGAACG ACGACCAGAA GACCGGCGTC
GAGATCGTGC GCAAGGCGCT CTCCGCCCCG GCCCGCCAGA TCGCCATCAA CGCCGGCGAA
GACGGTTCGG TGATCGTCGG CAAGGTGCTG GAGAAGGATC AGTACAACTA CGGCTTCGAC
AGCCAGACCG GCGAATACGG CGACATGGTC AAGAAGGGCA TCATCGACCC GACCAAGGTC
GTGCGTGCGG CGATCCAGAA CGCGGCCTCG GTCGCGGCGC TCTTGATCAC CACCGAAGCC
ATGATCGCGG AGCTGCCCAA GAAGGGCGGC GCCGGCGCCG GCGGCATGCC CCCGGGCGGC
GGCATGGGCG GCATGGATTT CTGA
 
Protein sequence
MSAKEVKFGV DARDRMLRGV DILANAVKVT LGPKGRNVVL DKSFGAPRIT KDGVTVAKEI 
ELEDKFENMG AQMVREVASK SADLAGDGTT TATVLAAAIV REGAKSVAAG MNPMDLKRGI
DLAVEAVVAD LVKNSKKVTS NDEIAQVGTI SANGDAEIGK FLADAMKKVG NEGVITVEEA
KSLETELDVV EGMQFDRGYI SPYFVTNADK MRVEFDDAYI LINEKKLSNL NELLPLLEAV
VQTGKPLVIV AEDVEGEALA TLVVNRLRGG LKVAAVKAPG FGDRRKAMLQ DIAILTGGQA
ISEDLGIKME NVTLQMLGRA KKVMIDKENT TIVNGAGKKV DIEARVAQIK AQIEETTSDY
DREKLQERLA KLAGGVAVIR VGGATEIEVK ERKDRVDDAM HATRAAVEEG IVPGGGVALL
RASEQLKRIK TANDDQKTGV EIVRKALSAP ARQIAINAGE DGSVIVGKVL EKDQYNYGFD
SQTGEYGDMV KKGIIDPTKV VRAAIQNAAS VAALLITTEA MIAELPKKGG AGAGGMPPGG
GMGGMDF