Gene Rpal_2457 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRpal_2457 
SymbolgroEL 
ID6410119 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris TIE-1 
KingdomBacteria 
Replicon accessionNC_011004 
Strand
Start bp2653092 
End bp2654729 
Gene Length1638 bp 
Protein Length545 aa 
Translation table11 
GC content65% 
IMG OID642712336 
Productchaperonin GroEL 
Protein accessionYP_001991446 
Protein GI192290841 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0459] Chaperonin GroEL (HSP60 family) 
TIGRFAM ID[TIGR02348] chaperonin GroL 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCCGCTA AAGAAGTCAA ATTCGGCGTC GACGCCCGCG ACCGCATGCT GCGTGGCGTG 
GACATTCTCG CCAATGCCGT GAAGGTCACG CTGGGTCCGA AGGGCCGCAA CGTCGTCCTC
GACAAGTCGT TCGGCGCGCC GCGCATCACC AAGGACGGCG TCACCGTCGC CAAGGACATC
GAGCTCGACG ACAAGTTCGA GAACATGGGC GCGCAGATGG TGCGCGAAGT CGCCTCGAAG
TCGGCCGACG CCGCGGGTGA CGGCACCACC ACCGCGACCG TGCTGGCCCA GGCGATCGTC
CGCGAAGGCG CCAAGGCGGT TGCCGCCGGC ATGAACCCGA TGGATCTGAA GCGCGGTATC
GATCTGGCGG TGGAAGCCGT CGTCGCCGAC CTCGTCAAGA ACTCCAAGAA GGTCACCTCG
AACGACGAGA TTGCCCAGGT CGGCACCATC TCGGCCAACG GTGACGCCGA GATCGGCAAG
TTCCTCGCCG ACGCGATGAA GAAGGTCGGC AACGAGGGTG TCATCACCGT CGAGGAAGCC
AAGTCGCTCG AGACCGAACT CGACGTCGTC GAGGGCATGC AGTTCGACCG CGGCTACATC
TCGCCCTACT TCGTCACCAA CGCCGACAAG ATGCGCGTCG AATTCGACGA CGCCTACATC
CTGATCAATG AGAAGAAGCT CTCCAACCTC AACGAGCTGC TGCCGCTGCT CGAGGCGGTG
GTGCAGACCG GCAAGCCGCT GGTGATCGTT GCGGAAGACG TCGAAGGCGA GGCTCTCGCC
ACCCTCGTCG TCAACCGTCT GCGCGGCGGC CTCAAGGTCG CGGCCGTCAA GGCGCCGGGC
TTCGGTGATC GCCGCAAGGC CATGCTGCAG GACATCGCGA TCCTGACCGG CGGCCAGGCG
ATCTCGGAAG ACCTCGGCAT CAAGATGGAG AACGTCACCC TGCAGATGCT GGGTCGCGCC
AAGAAGGTGA TGATCGACAA GGAAAACACC ACGATCGTCA ACGGCGCCGG CAAGAAGGCC
GACATCGAGG CCCGCGTCGC ACAGATCAAG GCGCAGATCG AGGAAACCAC CTCGGACTAC
GACCGCGAGA AGCTGCAGGA GCGTCTGGCC AAGCTCGCCG GCGGCGTCGC GGTGATCCGC
GTCGGCGGTG CGACCGAGGT CGAGGTGAAG GAGCGCAAGG ATCGCGTTGA TGACGCGATG
CACGCCACCC GCGCCGCGGT CGAAGAAGGC ATCGTCCCGG GCGGCGGCGT CGCACTGCTG
CGCGCCTCCG AGCAGCTCAA GGGCCTCAAG ACCAAGAACG ACGACCAGAA GACCGGCGTC
GAGATCGTGC GCCGCGCCCT CTCCGCTCCG GCCCGCCAGA TCGCCATCAA CGCCGGCGAA
GATGGCTCGG TGATCGTCGG CAAGGTGCTC GAGAAGGAGC AGTACGCGTT CGGCTTCGAC
TCGCAGTCGG GCGAATACGG CGACCTGGTC AAGAAGGGCA TCATCGACCC GACCAAGGTG
GTGCGCACCG CGATCCAGAA CGCCGCCTCG GTGGCCGCGC TGCTGATCAC CACCGAAGCG
ATGATCGCCG AACTGCCGAA GAAGAACGCC GGCCCCGCAA TGCCCCCGGG CGGCGGCATG
GGCGGCATGG ACTTCTAA
 
Protein sequence
MSAKEVKFGV DARDRMLRGV DILANAVKVT LGPKGRNVVL DKSFGAPRIT KDGVTVAKDI 
ELDDKFENMG AQMVREVASK SADAAGDGTT TATVLAQAIV REGAKAVAAG MNPMDLKRGI
DLAVEAVVAD LVKNSKKVTS NDEIAQVGTI SANGDAEIGK FLADAMKKVG NEGVITVEEA
KSLETELDVV EGMQFDRGYI SPYFVTNADK MRVEFDDAYI LINEKKLSNL NELLPLLEAV
VQTGKPLVIV AEDVEGEALA TLVVNRLRGG LKVAAVKAPG FGDRRKAMLQ DIAILTGGQA
ISEDLGIKME NVTLQMLGRA KKVMIDKENT TIVNGAGKKA DIEARVAQIK AQIEETTSDY
DREKLQERLA KLAGGVAVIR VGGATEVEVK ERKDRVDDAM HATRAAVEEG IVPGGGVALL
RASEQLKGLK TKNDDQKTGV EIVRRALSAP ARQIAINAGE DGSVIVGKVL EKEQYAFGFD
SQSGEYGDLV KKGIIDPTKV VRTAIQNAAS VAALLITTEA MIAELPKKNA GPAMPPGGGM
GGMDF