Gene RPC_4726 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPC_4726 
SymbolgroEL 
ID3972702 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB18 
KingdomBacteria 
Replicon accessionNC_007925 
Strand
Start bp5286835 
End bp5288490 
Gene Length1656 bp 
Protein Length551 aa 
Translation table11 
GC content66% 
IMG OID637927838 
Productchaperonin GroEL 
Protein accessionYP_534567 
Protein GI90426197 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0459] Chaperonin GroEL (HSP60 family) 
TIGRFAM ID[TIGR02348] chaperonin GroL 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAGCCA AGGACGTTAA ATTTGCCGGC GACGCGCGCG ATCGCATGTT GCGCGGCGTC 
GACATTCTCG CCAACGCGGT CAAGGTGACG CTCGGTCCGA AGGGCCGCAA CGTCTTGATC
GAGCGCTCGT TCGGCGCCGC CCGCATCACC AAGGACGGCG TCACCGTCGC CAAGGAGATC
GAGCTCGAGG ACAAGTTCGA GAACATGGGC GCGCAGATGC TGCGCGAGGT CGCCTCCAAG
ACCAACGACC TCGCCGGTGA CGGCACCACC ACCGCCACCG TGCTGGCGCA GGCGATCGTG
CGCGAAGGCG CCAAGTCGGT CGCCGCCGGC ATGAACCCGA TGGACCTCAA GCGCGGCATC
GAGATCGCCG TCGCCGCGGT GATCAAGGAC CTCGTCAAGC GCGCCAAGCC GGTGGCCTCC
TCCGCCGAGA TCGCCCAGGT CGGCACCATC TCGTCGAACG GCGACGCCGC GATCGGCAAG
ATGATCGCCC AGGCGATGCA GAAGGTCGGC AACGAAGGCG TCATCACCGT CGAAGAGAAC
AAGTCGCTGA CCACCGAAGT CGACATCGTC GAAGGCATGA AGTTCGATCG CGGCTACCTC
AGCCCGTACT TCGTCACCAA CGCCGAAAAG ATGGCCGTCG AGTTCGACGA CGCCTATGTG
CTGCTGCATG AGAAGAAGGT GTCGGGCCTG CAGTCGATGC TGCCGCTGCT CGAAGCCGTG
GTGCAGTCCG GCAAGCCGCT GGTGATCATC GCCGAGGACG TCGAAGGCGA GGCGCTGGCC
ACCCTGGTGG TCAACCGGCT GCGTGGCGGC CTCAAGGTCG CCGCCGTCAA GGCGCCGGGC
TTCGGCGATC GCCGCAAGGC GATGCTGGAA GACCTCGCGA TCCTGACCGG CGGTCAGCTG
ATCTCCGACG ACCTCGGCAT GAAGCTCGAG AACGTCACGC TGAAGATGCT GGGTCGCGCC
AAGAAGCTGG TGATCGACAA GGAGAACACC ACCATCGTCG GCGGCGCCGG CAAGAAGGCC
GATATCGAAA CCCGGGTCGG CCAGATCAAG GCGCAGATCG AGGAAACCAC CTCGGACTAC
GACCGCGAGA AGCTGCAGGA ACGGCTCGCC AAGCTGGCCG GCGGCGTCGC GGTGATCCGC
GTCGGCGGCG CCACCGAGGT CGAGGTGAAG GAAAAGAAGG ACCGCGTCGA AGACGCGCTC
AACGCCACCC GCGCCGCGGT GCAGGAAGGC ATCGTCCCCG GCGGCGGCGT CGCCCTGCTG
CGCGCCAAGA AGGCAGTCGG CCGTATCAGC AACGACAATC CCGACGTCCA GGCCGGCATC
AACATCGTGC TGAAGGCGCT GGAAGCTCCG ATCCGCCAGA TCGCCGAGAA CGCCGGCGTC
GAAGGCTCGA TCGTGGTCGG CAAGATCCTC GAGAACAAGT CGGAGACCTT CGGCTTCGAC
GCCCAGACCG AGGAATATGT CGACATGCTC GCCAAGGGCA TCGTCGACCC GGCCAAGGTG
GTGCGTACCG CGCTGCAGGA CGCCTCCTCG GTCGCGGCCT TGCTGGTGAC CACCGAATGC
ATGGTCGCGG AAATGCCGCG CGACGCGGCC CCGGCGATGC CGGGCGGCGG CGGCGGCATG
GGCGGAATGG GTGGCATGGG CGGCATGGGC TTCTAA
 
Protein sequence
MAAKDVKFAG DARDRMLRGV DILANAVKVT LGPKGRNVLI ERSFGAARIT KDGVTVAKEI 
ELEDKFENMG AQMLREVASK TNDLAGDGTT TATVLAQAIV REGAKSVAAG MNPMDLKRGI
EIAVAAVIKD LVKRAKPVAS SAEIAQVGTI SSNGDAAIGK MIAQAMQKVG NEGVITVEEN
KSLTTEVDIV EGMKFDRGYL SPYFVTNAEK MAVEFDDAYV LLHEKKVSGL QSMLPLLEAV
VQSGKPLVII AEDVEGEALA TLVVNRLRGG LKVAAVKAPG FGDRRKAMLE DLAILTGGQL
ISDDLGMKLE NVTLKMLGRA KKLVIDKENT TIVGGAGKKA DIETRVGQIK AQIEETTSDY
DREKLQERLA KLAGGVAVIR VGGATEVEVK EKKDRVEDAL NATRAAVQEG IVPGGGVALL
RAKKAVGRIS NDNPDVQAGI NIVLKALEAP IRQIAENAGV EGSIVVGKIL ENKSETFGFD
AQTEEYVDML AKGIVDPAKV VRTALQDASS VAALLVTTEC MVAEMPRDAA PAMPGGGGGM
GGMGGMGGMG F