Gene Rleg_1044 details

Gene Information

Locus tag: Rleg_1044
Symbol: groEL
ID: 8012173
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM1325
Kingdom: Bacteria
Replicon accession: NC_012850
Strand:
Start bp: 1019810
End bp: 1021444
Gene Length: 1635 bp
Protein Length: 544 aa
Translation table: 11
GC content: 63%
IMG OID: 644823627
Product: chaperonin GroEL
Protein accession: YP_002974878
Protein GI: 241203782
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0459] Chaperonin GroEL (HSP60 family)
TIGRFAM ID: [TIGR02348] chaperonin GroL
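
The coordinate and length fields in this record are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1019810
end_bp = 1021444

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1
print(gene_length)  # 1635

# Assumption: the final codon is a stop codon and is not counted
# in the reported protein length.
codons = gene_length // 3
protein_length = codons - 1
print(protein_length)  # 544
```

Both results match the Gene Length (1635 bp) and Protein Length (544 aa) values listed above.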


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 28
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCTGCCA AGGAAATCAG GTTCTCCACC GACGCGCGCG ACCGCCTGCT GCGCGGCGTC 
GAATTGCTCA ACAACGCCGT CAAGGTGACG CTTGGCCCGA AGGGTCGCAA CGTTGTCATC
GACAAGGCCT ATGGCGCGCC GCGCATTACC AAGGACGGCG TTAGCGTCGC CAAGGAGATC
GAGCTCGCGG ACAAGTTCGA GAACATGGGC GCGCAAATGG TGCGTGAAGT GGCTTCGAAG
ACCAATGATC TTGCCGGCGA CGGCACGACG ACGGCAACCG TTCTCGCCGC CTCCATCTTC
CGCGAAGGCG CCAAGCTCGT CGCTGCCGGC ATGAACCCTA TGGATCTCAG GCGCGGCATC
GATCTCGGCG TTACCGCCGT CGTCAAGGAA ATCAAGGCGC GGGCGATGAA GGTCAAATCG
TCAGGTGAGA TCGCCCAAGT CGGCACCATT GCCGCCAATG GCGACGCCGC CATCGGTGAG
ATGATCGCCA AGGCGATGGA CAAGGTCGGC AATGAGGGCG TCATAACGGT CGAAGAGGCG
CGAACCGCCG AGACCGAACT CGACGTCGTC GAGGGTATGC AGTTCGACCG CGGCTATCTC
TCACCCTATT TCGTCACCAA TGCCGAGAAG ATGCGCGTGG AACTGGAGGA TCCCTATATC
CTCGTTCACG AGAAGAAACT CGGCAGCCTG CAGGCGATGC TGCCGATCCT GGAAGCCGTC
GTACAGACCG GCAAACCGCT TCTGCTCATC TCGGAGGACG TCGAAGGCGA GGCCTTGGCG
ACGCTTGTTG TCAACAAGCT GCGCGGCGGC CTGAAGGTCG CAGCCGTCAA GGCGCCGGGT
TTCGGCGATC GCCGTAAGGC GATGCTCGAA GACATTGCCG TGCTTACATC AGGCCAGATG
ATTTCCGAGG ATCTCGGTAT CAAGCTCGAC AACGTCACGC TCGATATGCT CGGCCGCGCC
AAGCGCGTGC TGATCGACAA GGAGAGCACC ACGATCATCG ACGGCTCCGG CGAGAAAGCG
GCCATCCAGG CGCGCATCCA GCAGATCAAG GCGCAGATCG AGGAGACCAC CTCCGATTAC
GATAAGGAAA AGCTGCAGGA GCGCCTGGCG AAACTCGCCG GCGGCGTCGC GGTCATCCGT
GTCGGCGGCG CCACCGAGAC GGAAGTCAAG GAAAAGAAGG ACCGCATCGA CGATGCACTG
AACGCCACCC GTGCGGCGGT CGAAGAGGGC ATCGTGCCCG GCGGCGGCGT GGCACTGTTG
CGCGCCAAGT CGGCGCTGAC GGGGCTGACC GGGGAGAACG CCGACGTGAC GGCGGGCATC
TCGATCGTGC TCAGGGCGCT CGAAGCCCCG ATCCGGCAGA TCGCCGACAA TGCCGGGTTC
GAGGGCTCCA TCGTCGTTGG AAAACTCGCC GGCAGCAATA ATCACAATCA GGGCTTCGAC
GCACAGACGG AGACCTATGT CGATATGATC GAGGCCGGCA TCGTCGATCC CGCCAAGGTC
GTGCGCACCG CTCTGCAGGA CGCCGGCTCG ATTGCGGCGC TGCTGATCAC CGCCGAGGTG
ATGATCGCCG ACATTCCCGC AAGAGACTCC GCCCCTGCAG CCGGAAATGG CGGCATGGGA
GATATGGGAT ACTGA
 
Protein sequence
MSAKEIRFST DARDRLLRGV ELLNNAVKVT LGPKGRNVVI DKAYGAPRIT KDGVSVAKEI 
ELADKFENMG AQMVREVASK TNDLAGDGTT TATVLAASIF REGAKLVAAG MNPMDLRRGI
DLGVTAVVKE IKARAMKVKS SGEIAQVGTI AANGDAAIGE MIAKAMDKVG NEGVITVEEA
RTAETELDVV EGMQFDRGYL SPYFVTNAEK MRVELEDPYI LVHEKKLGSL QAMLPILEAV
VQTGKPLLLI SEDVEGEALA TLVVNKLRGG LKVAAVKAPG FGDRRKAMLE DIAVLTSGQM
ISEDLGIKLD NVTLDMLGRA KRVLIDKEST TIIDGSGEKA AIQARIQQIK AQIEETTSDY
DKEKLQERLA KLAGGVAVIR VGGATETEVK EKKDRIDDAL NATRAAVEEG IVPGGGVALL
RAKSALTGLT GENADVTAGI SIVLRALEAP IRQIADNAGF EGSIVVGKLA GSNNHNQGFD
AQTETYVDMI EAGIVDPAKV VRTALQDAGS IAALLITAEV MIADIPARDS APAAGNGGMG
DMGY
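
The gene and protein sequences above can be cross-checked by translating the nucleotide blocks. The following is a minimal sketch using only the Python standard library. It assumes that translation table 11 assigns the same amino acid to each codon as the standard genetic code (the tables differ in which codons may act as starts), and the helper names `clean` and `translate` are hypothetical, introduced only for illustration.

```python
# Codon assignments with bases ordered T, C, A, G.
# Assumption: table 11 shares these assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(blocks: str) -> str:
    """Remove the spaces and line breaks used to display 10-base blocks."""
    return "".join(blocks.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First and last lines of the gene sequence shown in this record.
first_line = "ATGTCTGCCA AGGAAATCAG GTTCTCCACC GACGCGCGCG ACCGCCTGCT GCGCGGCGTC"
last_line = "GATATGGGAT ACTGA"

print(translate(first_line))  # MSAKEIRFSTDARDRLLRGV
print(translate(last_line))   # DMGY
```

The two outputs agree with the first 20 and the last 4 residues of the protein sequence listed above. The last line can be translated on its own because the 27 preceding lines hold 1620 bases, a multiple of three, so it starts in frame.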