Gene Rsph17029_0985 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17029_0985 
SymbolgroEL 
ID4896901 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17029 
KingdomBacteria 
Replicon accessionNC_009049 
Strand
Start bp1016081 
End bp1017724 
Gene Length1644 bp 
Protein Length547 aa 
Translation table11 
GC content65% 
IMG OID640111571 
Productchaperonin GroEL 
Protein accessionYP_001042868 
Protein GI126461754 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0459] Chaperonin GroEL (HSP60 family) 
TIGRFAM ID[TIGR02348] chaperonin GroL 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTGCCA AGGACGTCAA GTTCGACACC GATGCCCGCG ACCGCATGCT GCGTGGCGTG 
AACATCCTCG CCGATGCGGT GAAGGTCACG CTGGGCCCGA AAGGCCGCAA CGTCGTGATC
GACAAGTCGT TCGGCGCGCC GCGCATCACC AAGGACGGTG TGTCGGTCGC CAAGGAGATC
GAACTCTCCG ACAAGTTCGA GAACATGGGC GCCCAGATGG TGAAGGAAGT CGCTTCCCGC
ACCAACGACG AGGCGGGCGA CGGCACCACC ACCGCCACCG TGCTCGCCCA GGCGATCATC
AAGGAAGGCC TCAAGGCCGT CGCCGCCGGC ATGAACCCGA TGGACCTCAA GCGCGGCATC
GACCTCGCGA CCTCGAAAGT CGTCGAGGCG ATCAAGGCCG CCGCCCGTCC GGTGAACGAC
TCGCACGAAG TGGCTCAGGT CGGCACGATC TCGGCCAACG GCGAAGCGCA GATCGGCCGC
TTCATCGCTG AAGCGATGCA GAAGGTCGGC AACGAGGGTG TCATCACCGT CGAAGAGAAC
AAGGGCCTCG AGACCGAAGT CGAAGTCGTC GAAGGCATGC AGTTCGACCG CGGCTACCTC
TCGCCCTACT TCGTCACCAA CGCCGACAAG ATGACGGCCG AGCTCGACGA CGTCTACATC
CTGCTCCACG AGAAAAAGCT CTCGTCGCTG CAGCCGATGG TCCCGCTGCT CGAGGCCGTG
ATCCAGTCGC AGAAGCCGCT GCTGATCATC GCCGAGGACG TGGAAGGCGA AGCCCTCGCC
ACGCTCGTGG TCAACAAGCT GCGCGGTGGC CTGAAGATCG CTGCCGTCAA GGCTCCGGGC
TTCGGTGACC GTCGCAAGGC CATGCTGCAG GACATCGCGA TCCTGACCGG CGGCCAGGTG
ATCTCGGAAG ACCTCGGCAT GAAGCTCGAG AATGTCACCA TCGACATGCT CGGCCGCGCC
AAGAAGATCT CGATCAACAA GGACAACACC ACGATCGTGG ACGGCAACGG CGACAAGGCC
GAGATCGACG CGCGCGTGGC CCAGATCCGC AACCAGATCG AGGAAACCTC GTCCGACTAC
GACCGCGAGA AGCTGCAGGA GCGGGTCGCG AAACTGGCGG GCGGCGTGGC CGTCATCCGC
GTCGGCGGCA TGACCGAAGT CGAAGTGAAA GAGCGCAAGG ACCGCGTCGA TGACGCCCTG
AACGCGACCC GTGCGGCCGT GCAGGAAGGC ATCGTCGTCG GCGGCGGCGT GGCCCTGATC
CAGGGCGGCA AGGCGCTCGA CGGTCTGACC GGCGAGAACC CCGACCAGAA CGCGGGCATC
ACCATCGTGC GTCGCGCGCT GGAAGCTCCG CTGCGCCAGA TCGCCCAGAA CGCGGGCGTG
GACGGTTCGG TCGTGGCCGG CAAGGTGCGC GAGTCGAACG AGAAGTCCTT CGGCTTCAAC
GCCCAGACCG AAGAATATGG CGACATGTTC AAGTTCGGCG TGATCGACCC GGCCAAGGTG
GTTCGCACCG CCCTGGAAGA CGCGGCCTCG GTCGCTTCGC TGCTCATCAC CACCGAAGCC
ATGATCGCCG ACAAGCCGGA GCCGAAATCT CCGGCCGGCG GTCCGGGCAT GGGCGGCATG
GGCGGCATGG ACGGCATGAT GTAA
 
Protein sequence
MAAKDVKFDT DARDRMLRGV NILADAVKVT LGPKGRNVVI DKSFGAPRIT KDGVSVAKEI 
ELSDKFENMG AQMVKEVASR TNDEAGDGTT TATVLAQAII KEGLKAVAAG MNPMDLKRGI
DLATSKVVEA IKAAARPVND SHEVAQVGTI SANGEAQIGR FIAEAMQKVG NEGVITVEEN
KGLETEVEVV EGMQFDRGYL SPYFVTNADK MTAELDDVYI LLHEKKLSSL QPMVPLLEAV
IQSQKPLLII AEDVEGEALA TLVVNKLRGG LKIAAVKAPG FGDRRKAMLQ DIAILTGGQV
ISEDLGMKLE NVTIDMLGRA KKISINKDNT TIVDGNGDKA EIDARVAQIR NQIEETSSDY
DREKLQERVA KLAGGVAVIR VGGMTEVEVK ERKDRVDDAL NATRAAVQEG IVVGGGVALI
QGGKALDGLT GENPDQNAGI TIVRRALEAP LRQIAQNAGV DGSVVAGKVR ESNEKSFGFN
AQTEEYGDMF KFGVIDPAKV VRTALEDAAS VASLLITTEA MIADKPEPKS PAGGPGMGGM
GGMDGMM