Gene RPD_2227 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_2227 
SymbolgroEL 
ID4022712 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp2491839 
End bp2493482 
Gene Length1644 bp 
Protein Length547 aa 
Translation table11 
GC content65% 
IMG OID637962422 
Productchaperonin GroEL 
Protein accessionYP_569363 
Protein GI91976704 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0459] Chaperonin GroEL (HSP60 family) 
TIGRFAM ID[TIGR02348] chaperonin GroL 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.182713 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.929536 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCGCTA AAGAAGTGAA ATTCGGCGTC GACGCCCGCG ACCGCATGAT GCGCGGCGTG 
GACATTCTCG CCAATGCGGT GAAGGTCACG CTCGGCCCGA AGGGCCGCAA CGTCGTGCTC
GACAAGTCGT TCGGCGCTCC GCGTATCACC AAGGACGGCG TCACCGTCGC CAAGGAGATC
GAGCTCGACG ACAAGTTCGA GAACATGGGC GCGCAGATGG TGCGCGAAGT CGCCTCGAAG
TCGGCCGACG CCGCCGGTGA CGGCACCACC ACCGCGACCG TACTGGCCCA GGCGATCGTC
CGCGAAGGCG GCAAGGCCGT CGCCGCCGGC ATGAACCCGA TGGATCTGAA GCGTGGCATC
GACCTCGCGG TCGAAGCGGT CGTCGCGGAT CTCGTCAAGA ACTCCAAGAA GGTCACCTCG
AACGAGGAGA TCGCCCAGGT CGGCACGATT TCGGCCAATG GCGACGTCGA GATCGGCAAG
TTCCTGTCGG ACGCGATGAA GAAGGTCGGC AACGAGGGTG TCATCACCGT CGAGGAAGCC
AAGTCGCTCG AGACCGAACT CGACGTCGTC GAGGGCATGC AGTTCGATCG CGGCTACATC
TCGCCCTACT TCGTCACCAA CGCCGACAAG ATGCGCGTTG AATTCGACGA CGCCTACATC
CTGATCAACG AGAAGAAGCT CTCCAACCTC AACGAGCTGC TGCCGCTGCT CGAAGCCGTC
GTCCAGACCG GCAAGCCGCT GGTGATCGTC GCTGAGGACG TCGAAGGCGA AGCGCTCGCC
ACCCTCGTCG TCAACCGCCT GCGCGGCGGC CTCAAGGTCG CGGCCGTCAA GGCTCCGGGC
TTCGGCGATC GCCGCAAGGC CATGCTGCAG GACATCGCGA TCCTGACCGG CGGCCAGGCG
ATCTCGGAAG ACCTCGGCAT CAAGATGGAG AACGTCACGC TCCAGATGCT CGGCAAGGCC
AAGAAGGTGA TGATCGACAA GGAAAACACC ACGATCGTCA ACGGCGCCGG CAAGAAGGCC
GACATCGAAG CCCGCGTCGC GCAGATCAAG GCGCAGATCG AGGAAACCAC CTCGGACTAC
GACCGCGAGA AGCTGCAGGA GCGTCTGGCC AAGCTCGCCG GCGGCGTCGC GGTGATCCGC
GTCGGCGGCG CGACCGAGAT CGAAGTGAAG GAGCGCAAGG ATCGCGTTGA TGACGCGATG
CACGCCACCC GCGCTGCGGT CGAGGAAGGC ATCGTCCCGG GCGGCGGCGT CGCTCTGCTG
CGCGCCTCCG AGCAGCTCAA GCGCATCAAG ACCCAGAACG ACGACCAGAA GACCGGCGTC
GAGATCGTGC GCAAGGCGCT CTCCGCCCCG GCCCGCCAGA TCGCCATCAA CGCCGGCGAA
GACGGCTCGG TGATCGTCGG CAAGGTGCTC GAGAAGGACC AGTACAACTA CGGCTTCGAC
AGCCAGACTG GCGAATACGG CGACCTGGTC AAGAAGGGCA TCATCGACCC GACCAAGGTG
GTCCGCACCG CGATCCAGAA CGCAGCCTCC GTTGCCGCGC TGCTGATCAC CACCGAAGCG
ATGGTGGCCG AGCTGCCGAA GAAGGGTGGC GCTGCCGGTG GCATGCCCCC GGGCGGCGGC
GGCATGGGCG GCATGGACTT CTGA
 
Protein sequence
MSAKEVKFGV DARDRMMRGV DILANAVKVT LGPKGRNVVL DKSFGAPRIT KDGVTVAKEI 
ELDDKFENMG AQMVREVASK SADAAGDGTT TATVLAQAIV REGGKAVAAG MNPMDLKRGI
DLAVEAVVAD LVKNSKKVTS NEEIAQVGTI SANGDVEIGK FLSDAMKKVG NEGVITVEEA
KSLETELDVV EGMQFDRGYI SPYFVTNADK MRVEFDDAYI LINEKKLSNL NELLPLLEAV
VQTGKPLVIV AEDVEGEALA TLVVNRLRGG LKVAAVKAPG FGDRRKAMLQ DIAILTGGQA
ISEDLGIKME NVTLQMLGKA KKVMIDKENT TIVNGAGKKA DIEARVAQIK AQIEETTSDY
DREKLQERLA KLAGGVAVIR VGGATEIEVK ERKDRVDDAM HATRAAVEEG IVPGGGVALL
RASEQLKRIK TQNDDQKTGV EIVRKALSAP ARQIAINAGE DGSVIVGKVL EKDQYNYGFD
SQTGEYGDLV KKGIIDPTKV VRTAIQNAAS VAALLITTEA MVAELPKKGG AAGGMPPGGG
GMGGMDF