Gene Rpal_1331 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRpal_1331 
SymbolgroEL 
ID6408988 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris TIE-1 
KingdomBacteria 
Replicon accessionNC_011004 
Strand
Start bp1403319 
End bp1404962 
Gene Length1644 bp 
Protein Length547 aa 
Translation table11 
GC content66% 
IMG OID642711230 
Productchaperonin GroEL 
Protein accessionYP_001990346 
Protein GI192289741 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0459] Chaperonin GroEL (HSP60 family) 
TIGRFAM ID[TIGR02348] chaperonin GroL 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCTGCTA AAGACGTCAA GTTTGCCGGC GATGCCCGCG ACCGCATGCT GCGCGGCGTG 
GACGTGCTCG CCAACGCCGT GAAGGTCACG CTCGGTCCGA AGGGCCGGAA CGTGCTGATC
GAGAAGAGCT TCGGTGCCCC GCGCATCACC AAGGACGGCG TCACCGTCGC CAAGGAAGTG
GAGCTCGAGG ACAAGTTCGA GAACATGGGC GCGCAGATGG TGCGCGAGGT GGCCTCGAAG
ACCAACGACC TCGCCGGCGA CGGCACCACC ACCGCCACCG TGCTGGCCCA GGCGATCGTC
CGTGAGGGCG CCAAGGCCGT CGCGGCCGGC ATGAACCCGA TGGACCTGAA GCGCGGCATC
GAGATCGCGG TCGCGGCCGT GGTCAAGGAC ATCCAGAAGC GCGCCAAGCC GGTCGCCTCC
TCGGCTGAAA TCGCCCAGGT CGGCACCATC TCGGCCAACG GCGACGCGCC GATCGGCAAG
ATGATCGCCC AGGCGATGCA GAAGGTCGGC AACGAGGGTG TCATCACGGT CGAAGAGAAC
AAGTCGCTCG AGACCGAAGT CGACATCGTC GAAGGCATGA AGTTCGACCG CGGCTATCTG
TCGCCGTACT TCGTGACCAA CGCCGAAAAG ATGACCGTCG AACTCGACGA CGCCTACATC
CTGCTGCACG AGAAGAAGCT CTCGGGCCTG CAGTCGATGC TGCCGGTGCT CGAAGCGGTG
GTGCAGTCGG GCAAGCCGCT GCTGATCATC GCCGAGGACG TCGAAGGCGA AGCGCTCGCG
ACCCTCGTGG TCAACCGCCT GCGCGGCGGC CTCAAGGTTT CGGCCGTGAA GGCGCCGGGC
TTCGGCGATC GCCGCAAGGC GATGCTGGAA GACATCGCGA TCCTGACCGG CGGCCAGCTG
ATCTCGGAAG ACCTCGGCAT CAAGCTCGAG ACCGTGACGC TGAAGATGCT CGGCCGCGCC
AAGAAGGTGG TGATCGACAA GGAGAACACC ACCATCGTCA ACGGCGCCGG CAAGAAGCCG
GAGATCGAGG CCCGCGTTTC GCAGATCAAG GCGCAGATCG AGGAAACCTC CTCGGACTAC
GACCGTGAGA AGCTGCAGGA GCGTCTGGCC AAGCTCGCGG GCGGCGTCGC GGTGATCCGC
GTCGGCGGCG CGACCGAAGT CGAGGTCAAG GAGAAGAAGG ACCGTGTCGA GGACGCGCTG
AACGCGACCC GCGCTGCGGT TCAGGAAGGC ATCGTGCCGG GCGGCGGCGT CGCCCTGCTG
CGCGCCAAGA AGGCCGTCGG CCGCATCAGC AACGACAATC CGGACGTCCA GGCCGGCATC
AACATCGTGC TGAAGGCGCT CGAAGCTCCG ATCCGCCAGA TCGCCGAGAA CGCCGGTGTC
GAAGGCTCGA TCGTGGTCGG CAAGATCCTC GAGAACAAGA CCGAGACGTT CGGCTTCGAC
GCCCAGACCG AGGAATATGT CGACATGCTC GCCAAGGGTA TCGTCGATCC GGCCAAGGTC
GTGCGCACCG CGCTGCAGGA CGCCTCGTCG GTCGCCTCGC TGCTGGTCAC CACCGAAGCG
ATGGTCGCCG AGCTGCCGAA GGCCGACGCT CCGGCAATGC CGGCCGGTGG TGGCATGGGC
GGTATGGGCG GCATGGGCTT CTAA
 
Protein sequence
MAAKDVKFAG DARDRMLRGV DVLANAVKVT LGPKGRNVLI EKSFGAPRIT KDGVTVAKEV 
ELEDKFENMG AQMVREVASK TNDLAGDGTT TATVLAQAIV REGAKAVAAG MNPMDLKRGI
EIAVAAVVKD IQKRAKPVAS SAEIAQVGTI SANGDAPIGK MIAQAMQKVG NEGVITVEEN
KSLETEVDIV EGMKFDRGYL SPYFVTNAEK MTVELDDAYI LLHEKKLSGL QSMLPVLEAV
VQSGKPLLII AEDVEGEALA TLVVNRLRGG LKVSAVKAPG FGDRRKAMLE DIAILTGGQL
ISEDLGIKLE TVTLKMLGRA KKVVIDKENT TIVNGAGKKP EIEARVSQIK AQIEETSSDY
DREKLQERLA KLAGGVAVIR VGGATEVEVK EKKDRVEDAL NATRAAVQEG IVPGGGVALL
RAKKAVGRIS NDNPDVQAGI NIVLKALEAP IRQIAENAGV EGSIVVGKIL ENKTETFGFD
AQTEEYVDML AKGIVDPAKV VRTALQDASS VASLLVTTEA MVAELPKADA PAMPAGGGMG
GMGGMGF