Gene Rsph17025_2189 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17025_2189 
SymbolgroEL 
ID5084204 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17025 
KingdomBacteria 
Replicon accessionNC_009428 
Strand
Start bp2220978 
End bp2222618 
Gene Length1641 bp 
Protein Length546 aa 
Translation table11 
GC content65% 
IMG OID640483752 
Productchaperonin GroEL 
Protein accessionYP_001168384 
Protein GI146278225 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0459] Chaperonin GroEL (HSP60 family) 
TIGRFAM ID[TIGR02348] chaperonin GroL 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTGCCA AGGACGTCAA GTTCGATACC GATGCCCGCG ACCGCATGCT GCGCGGCGTG 
AACATCCTCG CCGATGCCGT GAAGGTCACG CTGGGCCCGA AAGGCCGCAA CGTCGTGATC
GACAAGTCGT TCGGCGCGCC GCGCATCACC AAGGACGGTG TGTCGGTCGC CAAGGAGATC
GAACTCTCCG ACAAGTTCGA GAACATGGGC GCCCAGATGG TGAAGGAAGT CGCTTCCCGC
ACCAACGACG AGGCGGGCGA CGGCACCACC ACCGCCACCG TTCTGGCCCA GGCCATCATC
AAGGAAGGCC TCAAGGCCGT GGCCGCCGGC ATGAACCCGA TGGACCTCAA GCGCGGCATC
GACCTCGCCA CTGCGAAGGT CGTGGAGTCG ATCAAGGCCG CCTCGCGTCC GGTCAACGAC
CAGCATGAAG TCGCCCAGGT CGGCACCATC TCGGCCAACG GCGAAGCCCA GATCGGCCGC
TTCATCGCCG ACGCGATGCA GAAGGTCGGC AACGAGGGTG TCATCACCGT CGAGGAGAAC
AAGGGCCTCG AGACCGAAGT CGAAGTCGTC GAAGGCATGC AGTTCGACCG CGGCTACCTC
TCGCCCTACT TCGTCACCAA CGCCGACAAG ATGACGGCCG AGCTGGAAGA CGTGTTCATC
CTGCTGCACG AGAAGAAGCT CTCGTCGCTG CAGCCGATGG TCCCGCTGCT CGAGTCGGTG
ATCCAGGCCC AGCGTCCGCT GCTGATCGTG GCCGAGGACG TGGAAGGCGA AGCCCTCGCG
ACCCTCGTGG TCAACAAGCT GCGTGGTGGC CTGAAGATCG CCGCCGTCAA GGCTCCGGGC
TTCGGCGACC GTCGCAAGGC CATGCTGCAG GACATCGCGA TCCTGACCGG TGGTCAGGTG
ATCTCGGAAG ACCTCGGCAT GAAGCTCGAG AATGTCACCA TCGACATGCT CGGCCGCGCC
AAGAAGGTCT CGATCAACAA GGACAACACG ACCATCGTGG ACGGTGCCGG CGAAAAGGCC
GAGATCGAAG CCCGCGTGTC GCAGATCCGC ACCCAGATCG AAGAGACCAC CTCGGACTAC
GACCGCGAGA AGCTGCAGGA GCGTGTGGCC AAGCTGGCGG GCGGCGTGGC CGTCATCCGC
GTCGGCGGCA TGACCGAAGT CGAAGTGAAA GAGCGCAAGG ACCGCGTCGA TGACGCCCTG
AACGCGACCC GCGCGGCCGT TCAGGAAGGC ATCGTGGTCG GTGGCGGCGT CGCCCTGATC
CAGGCGGGCA AGGTCCTCGA CGGGCTGACC GGCGAGAACC CGGACCAGAA CGCCGGCATC
ACCATCGTGC GCCGCGCGCT GGAAGCTCCG CTGCGCCAGA TCGCCCAGAA CGCGGGCGTG
GACGGCTCGG TCGTGGCCGG CAAGGTCCGC GAGTCCGACG ACAAGGCCTT CGGCTTCAAC
GCCCAGACCG AAGAATATGG CGACATGTTC AAGTTCGGCG TGATCGACCC GGCCAAGGTG
GTTCGCACCG CTCTGGAAGA CGCGGCCTCG GTCGCCTCGC TGCTGATCAC CACCGAAGCC
ATGATCGCCG ACAAGCCCGA GCCGAAGTCG GCTCCGGCCG GCGGCATGGG CGGCATGGGC
GGCATGGACG GCATGATGTA A
 
Protein sequence
MAAKDVKFDT DARDRMLRGV NILADAVKVT LGPKGRNVVI DKSFGAPRIT KDGVSVAKEI 
ELSDKFENMG AQMVKEVASR TNDEAGDGTT TATVLAQAII KEGLKAVAAG MNPMDLKRGI
DLATAKVVES IKAASRPVND QHEVAQVGTI SANGEAQIGR FIADAMQKVG NEGVITVEEN
KGLETEVEVV EGMQFDRGYL SPYFVTNADK MTAELEDVFI LLHEKKLSSL QPMVPLLESV
IQAQRPLLIV AEDVEGEALA TLVVNKLRGG LKIAAVKAPG FGDRRKAMLQ DIAILTGGQV
ISEDLGMKLE NVTIDMLGRA KKVSINKDNT TIVDGAGEKA EIEARVSQIR TQIEETTSDY
DREKLQERVA KLAGGVAVIR VGGMTEVEVK ERKDRVDDAL NATRAAVQEG IVVGGGVALI
QAGKVLDGLT GENPDQNAGI TIVRRALEAP LRQIAQNAGV DGSVVAGKVR ESDDKAFGFN
AQTEEYGDMF KFGVIDPAKV VRTALEDAAS VASLLITTEA MIADKPEPKS APAGGMGGMG
GMDGMM