Gene RPD_4127 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_4127 
SymbolgroEL 
ID4024649 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp4591484 
End bp4593136 
Gene Length1653 bp 
Protein Length550 aa 
Translation table11 
GC content67% 
IMG OID637964335 
Productchaperonin GroEL 
Protein accessionYP_571247 
Protein GI91978588 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0459] Chaperonin GroEL (HSP60 family) 
TIGRFAM ID[TIGR02348] chaperonin GroL 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.176849 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.953611 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAGCCA AAGACGTTAA ATTCGCCGGA GACGCGCGCG ATCGCATGCT GCGCGGCGTC 
GACATCCTCG CCAACGCGGT CAAGGTCACG CTCGGCCCCA AGGGCCGCAA CGTGCTGATC
GAAAAGAGCT TCGGCGCTCC CCGCATCACC AAGGACGGCG TCACCGTCGC CAAGGAGATC
GAGCTCGAGG ACAAGTTCGA GAACATGGGC GCGCAGATGC TGCGCGAAGT GGCCTCGAAG
ACCAACGACC TCGCCGGCGA CGGCACCACC ACGGCGACCG TGCTGGCCCA GGCGATCGTC
CGCGAGGGCG CCAAGTCGGT GGCCGCCGGC ATGAACCCGA TGGATCTGCG CCGCGGCATC
GAGATCGCGG TCGCCGCGGT GATCAAGGAC ATCGGCAAGC GCGCCAAGCC GGTCGCCTCC
TCGGCCGAAA TCGCCCAGGT CGGCACCATC TCCGCCAACG GCGACGCGCC GATCGGCAAG
ATGATCGCCC AGGCGATGCA GAAGGTCGGC AACGAGGGCG TCATCACCGT CGAGGAGAAC
AAATCGCTCG ATACCGAAGT CGACATCGTC GAGGGCATGA AGTTCGACCG CGGCTACCTG
TCGCCGTACT TCGTCACCAA CGCCGAGAAG ATGACGGTCG AGCTCGACGA CGTCTACATC
CTGCTGCACG AGAAGAAGGT GTCGGGTCTG CAGTCGATGC TGCCGGTGCT CGAAGCGGTG
GTGCAGTCGG GCAAGCCGCT GCTGATCATC GCCGAGGACG TCGAAGGCGA AGCGCTGGCG
ACGCTGGTGG TGAACCGTCT GCGCGGCGGC CTCAAGGTCT CCGCCGTCAA GGCGCCGGGC
TTCGGCGACC GCCGCAAGGC GATGCTGGAA GACATCGCGA TCCTGACCGG CGGTCAGCTG
ATCTCGGAAG AACTCGGCGT CAAGCTCGAG AGCGTCACGC TGAAAATGCT CGGCCGCGCC
AAGAAGGTGG TGATCGACAA GGAGAACACC ACGATCGTCA ACGGCGCCGG CAAGAAGGCC
GACATCGAGG CGCGCGTGCA GCAGATCAAG GCGCAGATCG AGGAGACCTC CTCGGACTAC
GACCGTGAGA AGCTGCAGGA GCGTCTGGCC AAGCTCGCGG GCGGCGTCGC GGTGATCCGC
GTCGGCGGCG CGACCGAGGT CGAGGTCAAG GAGAAGAAGG ACCGTGTCGA GGACGCGCTG
AACGCGACCC GCGCCGCGGT GCAGGAAGGC ATCGTACCGG GCGGCGGCGT CGCGCTGCTG
CGCGCCAAGA AGGCGGTCGG CCGCATCAAC AACGACAATG CCGACGTCCA GGCCGGCATC
AACATCGTGC TGAAGGCGCT CGAAGCTCCG ATCCGCCAGA TCGCCGAGAA CGCCGGCGTC
GAGGGCTCGA TCGTGGTCGG CAAGATCCTC GAGAACAAGT CGGAGACGTT CGGCTTCGAC
GCGCAGACCG AGGAATATGT CGACATGCTC GCCAAGGGCA TCGTCGATCC GGCCAAGGTG
GTGCGCACTG CGCTGCAGGA CGCCGCGTCG GTCGCGGCGC TGCTCGTCAC CACCGAAGCG
ATGGTCGCCG AGCTGCCGCG CGAAGCCGCT CCGGCGATGC CGGGCGGCGG CGGGATGGGC
GGCATGGGCG GAATGGGTGG CATGGGCTTC TGA
 
Protein sequence
MAAKDVKFAG DARDRMLRGV DILANAVKVT LGPKGRNVLI EKSFGAPRIT KDGVTVAKEI 
ELEDKFENMG AQMLREVASK TNDLAGDGTT TATVLAQAIV REGAKSVAAG MNPMDLRRGI
EIAVAAVIKD IGKRAKPVAS SAEIAQVGTI SANGDAPIGK MIAQAMQKVG NEGVITVEEN
KSLDTEVDIV EGMKFDRGYL SPYFVTNAEK MTVELDDVYI LLHEKKVSGL QSMLPVLEAV
VQSGKPLLII AEDVEGEALA TLVVNRLRGG LKVSAVKAPG FGDRRKAMLE DIAILTGGQL
ISEELGVKLE SVTLKMLGRA KKVVIDKENT TIVNGAGKKA DIEARVQQIK AQIEETSSDY
DREKLQERLA KLAGGVAVIR VGGATEVEVK EKKDRVEDAL NATRAAVQEG IVPGGGVALL
RAKKAVGRIN NDNADVQAGI NIVLKALEAP IRQIAENAGV EGSIVVGKIL ENKSETFGFD
AQTEEYVDML AKGIVDPAKV VRTALQDAAS VAALLVTTEA MVAELPREAA PAMPGGGGMG
GMGGMGGMGF