Gene Rleg_5038 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_5038 
SymbolgroEL 
ID8007631 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012848 
Strand
Start bp423707 
End bp425335 
Gene Length1629 bp 
Protein Length542 aa 
Translation table11 
GC content61% 
IMG OID644821953 
Productchaperonin GroEL 
Protein accessionYP_002973213 
Protein GI241113378 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0459] Chaperonin GroEL (HSP60 family) 
TIGRFAM ID[TIGR02348] chaperonin GroL 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.473372 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTGCTA AAGAAATCAA ATTCAGCACC GAAGCCCGCG AGAAGATGCT GCGTGGCGTC 
GACATCCTGG CCAACGCCGT GAAGGCGACC CTCGGCCCGA AAGGCCGCAA CGTCGTGATC
GAACGATCTT TCGGCGCCCC GCGCATCACC AAGGACGGCG TTTCCGTCGC CAAGGAAATC
GAACTCGAAG ACAAGTTCGA GAACATGGGC GCCCAGATGG TCCGCGAAGT CGCCTCGAAG
ACCAGCGACA TCGCCGGCGA CGGCACCACG ACGGCAACGG TACTGGCCCA GGCGATCGTC
AAGGAGGGCG CCAAGGCGGT TACCTCAGGC ATGAACCCGA TGGACCTGAA ACGCGGCATC
GATCTTGCGG TCGGCGCCAT CGTTGCGGAA CTGAAGGCCA ATGCCCGAAA GATCTCCAAC
AATTCCGAAA TCGCCCAGGT CGGCACGATC TCCGCCAATG GCGATGCCGA AATCGGCCGC
TTTTTGGCGG AAGCCATGGA AAGGGTCGGC AATGATGGCG TCATCACCGT TGAAGAAGCC
AAGACCGCCG AAACCGAACT CGAAGTCGTC GAAGGCATGC AGTTCGACCG CGGCTATCTC
AGCCCCTACT TCGTCACCAA TGCCGACAAG ATGCGGGTCG AGTTTGAAGA CCCTTATATC
CTCATCCATG AGAAGAAGCT CTCGAACCTG CAGTCGATGC TGCCGGTTCT CGAAGCTGTC
GTCCAATCCA GCAAGCCGCT GCTCATCATC GCTGAAGACG TCGAAGGCGA AGCCCTGGCA
ACGCTCGTCG TCAACAAGCT GCGCGGCGGC CTGAAGATCG CCGCCGTCAA GGCTCCTGGC
TTCGGTGACC GCCGCAAGGC CATGCTCGAA GACATCGCCA TCCTGACCGC CGGCACCGTC
ATCTCCGAAG ATCTCGGCAT CAAGCTCGAA TCCGTCACGC TCGATATGCT CGGCCGGGCC
AAGAAGGTTT CGATTGAAAA GGAAAACACC ACGATCGTCG ATGGGTCAGG CGCCAAGTCC
GACATCGAAG GCCGCGTTGC CCAGATCAAG GCCCAGATCG AAGAAACCAC GTCGGACTAT
GACCGCGAGA AGCTGCAGGA ACGTCTTGCC AAGCTCGCCG GCGGCGTTGC CGTCATCCGT
GTCGGCGGCT CGACGGAAGT CGAAGTGAAG GAAAAGAAGG ACCGCGTCGA CGACGCGCTT
CATGCAACCC GCGCTGCCGT TCAGGAAGGC ATTCTGCCTG GTGGCGGCGT GGCGCTGCTG
CGCGCCGTCA AGGCGCTCGA CAATGTCAAA ACCGCCAATG GCGACCAGCG CGTCGGCGTC
GACATCGTTC GCCGCGCGGT CGAGGCACCG GCTCGCCAGA TCGCCGAAAA CGCCGGAGCG
GAAGGCTCGG TCATCGTCGG TAAGCTGCGC GAGAAAAGCG AGTTCTCCTA CGGCTGGAAC
GCTCAGACGG GCGAATATGG CGACCTCTAT GCGCAGGGCG TCATCGATCC GGCCAAGGTG
GTTCGCACCG CGCTGCAGGA TGCGGCCTCC ATCGCCGGTC TTCTCGTCAC GACGGAAGCT
ATGATCGCCG AGAAACCCAA GAAGGACGCG CCACCGCCAA TGCCCGCCGG CCCCGGTATG
GACTTCTAA
 
Protein sequence
MAAKEIKFST EAREKMLRGV DILANAVKAT LGPKGRNVVI ERSFGAPRIT KDGVSVAKEI 
ELEDKFENMG AQMVREVASK TSDIAGDGTT TATVLAQAIV KEGAKAVTSG MNPMDLKRGI
DLAVGAIVAE LKANARKISN NSEIAQVGTI SANGDAEIGR FLAEAMERVG NDGVITVEEA
KTAETELEVV EGMQFDRGYL SPYFVTNADK MRVEFEDPYI LIHEKKLSNL QSMLPVLEAV
VQSSKPLLII AEDVEGEALA TLVVNKLRGG LKIAAVKAPG FGDRRKAMLE DIAILTAGTV
ISEDLGIKLE SVTLDMLGRA KKVSIEKENT TIVDGSGAKS DIEGRVAQIK AQIEETTSDY
DREKLQERLA KLAGGVAVIR VGGSTEVEVK EKKDRVDDAL HATRAAVQEG ILPGGGVALL
RAVKALDNVK TANGDQRVGV DIVRRAVEAP ARQIAENAGA EGSVIVGKLR EKSEFSYGWN
AQTGEYGDLY AQGVIDPAKV VRTALQDAAS IAGLLVTTEA MIAEKPKKDA PPPMPAGPGM
DF