Gene RPB_1836 details


Gene Information

Locus tag: RPB_1836
Symbol: groEL
ID: 3908995
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris HaA2
Kingdom: Bacteria
Replicon accession: NC_007778
Strand:
Start bp: 2102255
End bp: 2103907
Gene Length: 1653 bp
Protein Length: 550 aa
Translation table: 11
GC content: 66%
IMG OID: 637883730
Product: chaperonin GroEL
Protein accession: YP_485455
Protein GI: 86748959
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0459] Chaperonin GroEL (HSP60 family)
TIGRFAM ID: [TIGR02348] chaperonin GroL
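
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes one terminal stop codon; the function names are hypothetical and not part of the record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    # The stop codon is part of the gene but encodes no amino acid.
    return codons - 1 if has_stop_codon else codons


# Values taken from the record above.
length_bp = gene_length(2102255, 2103907)
print(length_bp)                  # 1653
print(protein_length(length_bp))  # 550
```

Both results agree with the Gene Length (1653 bp) and Protein Length (550 aa) fields listed in the record.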


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.253263
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0261483
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCAGCCA AGGACGTCAA ATTCGGTGGA GACGCGCGCG ATCGGATGCT GCGCGGCGTC 
GACATCCTCG CCAATGCGGT CAAGGTCACG CTCGGCCCGA AGGGCCGGAA CGTGCTGATC
GAGAAGAGCT TCGGCGCTCC CCGCATCACC AAGGACGGCG TCACCGTCGC CAAGGAGATC
GAGCTCGACG ACAAGTTCGA GAACATGGGC GCGCAGATGC TGCGCGAAGT CGCCTCCAAG
ACCAACGACC TCGCCGGTGA CGGCACCACC ACAGCGACCG TGCTGGCCCA GGCGATCGTC
CGCGAAGGCG CCAAGTCGGT GGCCGCCGGC ATGAACCCGA TGGATCTGCG CCGCGGCATC
GAGATCGCGG TCCAGGCCGT GGTCAAGGAC ATCCAGAAGC GCGCCCGTCC GGTCGCCTCC
TCGGCCGAGA TCGCCCAGGT CGGTACCATC TCGGCCAATG GCGACGCGCC GATCGGCAAG
ATGATCGCCC AGGCGATGCA GAAGGTCGGC AACGAGGGCG TCATCACCGT CGAAGAGAAC
AAGTCGCTCG AGACCGAAGT CGACATCGTC GAGGGCATGA AGTTCGATCG CGGCTACCTG
TCGCCCTATT TCGTCACCAA CGCCGAGAAG ATGACCGTCG AGCTCGACGA CGTCTACATC
CTGCTGCACG AGAAGAAGGT GTCGGGCCTG CAGTCGATGC TGCCGGTGCT CGAAGCCGTG
GTGCAGTCTG GCAAGCCGCT GCTGATCATC GCCGAGGATG TCGAAGGCGA AGCGCTGGCG
ACGCTGGTGG TCAACCGGCT GCGCGGCGGC CTCAAGGTCT CGGCCGTCAA GGCGCCGGGC
TTCGGCGATC GCCGCAAGGC GATGCTGGAA GACATCGCGA TCCTGACCGG CGGTCAGCTG
ATCTCGGAAG AAATCGGCAT CAAGCTCGAG AGCGTCACGC TGAAGATGCT CGGCCGCGCC
AAGAAGGTGG TGATCGACAA GGAGAACACC ACCATCGTCG GCGGCGCCGG CAAGAAGCCG
GACATCGAGG CCCGCGTCCA GCAGATCAAG GCGCAGATCG AGGAGACCTC CTCGGACTAC
GACCGTGAGA AGCTGCAGGA GCGTCTGGCC AAGCTCGCCG GCGGCGTCGC GGTGATCCGC
GTCGGCGGCG CCACCGAGGT CGAGGTCAAG GAGAAGAAGG ACCGTGTCGA GGACGCGCTG
AACGCGACCC GCGCCGCGGT GCAGGAAGGC ATCGTCCCGG GCGGCGGCGT CGCGCTGCTG
CGCGCCAAGA AGGCGGTCGG CCGCATCCAC AACGACAATG CCGACGTCCA GGCCGGCATC
AACATCGTGC TGAAGGCGCT GGAAGCTCCG ATCCGCCAGA TCGCCGAGAA CGCCGGCGTC
GAAGGCTCGA TCGTGGTCGG CAAGATCCTC GAGAACAAGT CGGAGACGTT CGGCTTCGAC
GCCCAGACCG AGGACTATGT CGACATGCTC GCCAAGGGCA TCGTCGATCC GGCCAAGGTG
GTCCGCACCG CGCTGCAGGA CGCCTCGTCG GTCGCGGCGC TGCTGGTGAC CACCGAAGCC
ATGGTCGCCG AACTGCCGAA GGAAGCCGCG CCGGCGATGC CGGGTGGCGG CGGCATGGGC
GGAATGGGGG GCATGGGCGG CATGGGCTTC TGA
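
The GC content field can be compared against the gene sequence itself. The sketch below is one plausible way to compute it, assuming GC content means the fraction of G and C bases over the whole gene; the function name `gc_fraction` is hypothetical, and the record does not state how its 66% figure was derived or rounded.

```python
def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence.

    Whitespace is removed first, so the space-separated 10-base blocks
    used in this record can be pasted in unchanged.
    """
    cleaned = "".join(seq.split()).upper()
    if not cleaned:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in cleaned if base in "GC")
    return gc_count / len(cleaned)


# Usage: paste the full gene sequence between the quotes to check the
# whole gene. Only the first 30 bases are shown here for brevity.
first_blocks = "ATGTCAGCCA AGGACGTCAA ATTCGGTGGA"
print(f"{gc_fraction(first_blocks):.0%}")
```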
 
Protein sequence
MSAKDVKFGG DARDRMLRGV DILANAVKVT LGPKGRNVLI EKSFGAPRIT KDGVTVAKEI 
ELDDKFENMG AQMLREVASK TNDLAGDGTT TATVLAQAIV REGAKSVAAG MNPMDLRRGI
EIAVQAVVKD IQKRARPVAS SAEIAQVGTI SANGDAPIGK MIAQAMQKVG NEGVITVEEN
KSLETEVDIV EGMKFDRGYL SPYFVTNAEK MTVELDDVYI LLHEKKVSGL QSMLPVLEAV
VQSGKPLLII AEDVEGEALA TLVVNRLRGG LKVSAVKAPG FGDRRKAMLE DIAILTGGQL
ISEEIGIKLE SVTLKMLGRA KKVVIDKENT TIVGGAGKKP DIEARVQQIK AQIEETSSDY
DREKLQERLA KLAGGVAVIR VGGATEVEVK EKKDRVEDAL NATRAAVQEG IVPGGGVALL
RAKKAVGRIH NDNADVQAGI NIVLKALEAP IRQIAENAGV EGSIVVGKIL ENKSETFGFD
AQTEDYVDML AKGIVDPAKV VRTALQDASS VAALLVTTEA MVAELPKEAA PAMPGGGGMG
GMGGMGGMGF
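
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below uses the standard codon assignments, which translation table 11 shares for elongation (table 11 differs only in the set of permitted start codons). The function name `translate` and the compact table construction are illustrative choices, not something the record prescribes.

```python
from itertools import product

# Standard codon assignments, listed in TCAG order for each codon position.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a CDS, stopping at the first stop codon."""
    cleaned = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(cleaned) - 2, 3):
        aa = CODON_TABLE[cleaned[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
print(translate("ATGTCAGCCA AGGACGTCAA ATTCGGTGGA"))  # MSAKDVKFGG
```

The printed peptide matches the first ten residues of the protein sequence above. Passing the full gene sequence to the same function allows the whole 550-residue protein to be compared.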