Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg_1044 |
Symbol | groEL |
ID | 8012173 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012850 |
Strand | + |
Start bp | 1019810 |
End bp | 1021444 |
Gene Length | 1635 bp |
Protein Length | 544 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 644823627 |
Product | chaperonin GroEL |
Protein accession | YP_002974878 |
Protein GI | 241203782 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0459] Chaperonin GroEL (HSP60 family) |
TIGRFAM ID | [TIGR02348] chaperonin GroL |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 28 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCTGCCA AGGAAATCAG GTTCTCCACC GACGCGCGCG ACCGCCTGCT GCGCGGCGTC GAATTGCTCA ACAACGCCGT CAAGGTGACG CTTGGCCCGA AGGGTCGCAA CGTTGTCATC GACAAGGCCT ATGGCGCGCC GCGCATTACC AAGGACGGCG TTAGCGTCGC CAAGGAGATC GAGCTCGCGG ACAAGTTCGA GAACATGGGC GCGCAAATGG TGCGTGAAGT GGCTTCGAAG ACCAATGATC TTGCCGGCGA CGGCACGACG ACGGCAACCG TTCTCGCCGC CTCCATCTTC CGCGAAGGCG CCAAGCTCGT CGCTGCCGGC ATGAACCCTA TGGATCTCAG GCGCGGCATC GATCTCGGCG TTACCGCCGT CGTCAAGGAA ATCAAGGCGC GGGCGATGAA GGTCAAATCG TCAGGTGAGA TCGCCCAAGT CGGCACCATT GCCGCCAATG GCGACGCCGC CATCGGTGAG ATGATCGCCA AGGCGATGGA CAAGGTCGGC AATGAGGGCG TCATAACGGT CGAAGAGGCG CGAACCGCCG AGACCGAACT CGACGTCGTC GAGGGTATGC AGTTCGACCG CGGCTATCTC TCACCCTATT TCGTCACCAA TGCCGAGAAG ATGCGCGTGG AACTGGAGGA TCCCTATATC CTCGTTCACG AGAAGAAACT CGGCAGCCTG CAGGCGATGC TGCCGATCCT GGAAGCCGTC GTACAGACCG GCAAACCGCT TCTGCTCATC TCGGAGGACG TCGAAGGCGA GGCCTTGGCG ACGCTTGTTG TCAACAAGCT GCGCGGCGGC CTGAAGGTCG CAGCCGTCAA GGCGCCGGGT TTCGGCGATC GCCGTAAGGC GATGCTCGAA GACATTGCCG TGCTTACATC AGGCCAGATG ATTTCCGAGG ATCTCGGTAT CAAGCTCGAC AACGTCACGC TCGATATGCT CGGCCGCGCC AAGCGCGTGC TGATCGACAA GGAGAGCACC ACGATCATCG ACGGCTCCGG CGAGAAAGCG GCCATCCAGG CGCGCATCCA GCAGATCAAG GCGCAGATCG AGGAGACCAC CTCCGATTAC GATAAGGAAA AGCTGCAGGA GCGCCTGGCG AAACTCGCCG GCGGCGTCGC GGTCATCCGT GTCGGCGGCG CCACCGAGAC GGAAGTCAAG GAAAAGAAGG ACCGCATCGA CGATGCACTG AACGCCACCC GTGCGGCGGT CGAAGAGGGC ATCGTGCCCG GCGGCGGCGT GGCACTGTTG CGCGCCAAGT CGGCGCTGAC GGGGCTGACC GGGGAGAACG CCGACGTGAC GGCGGGCATC TCGATCGTGC TCAGGGCGCT CGAAGCCCCG ATCCGGCAGA TCGCCGACAA TGCCGGGTTC GAGGGCTCCA TCGTCGTTGG AAAACTCGCC GGCAGCAATA ATCACAATCA GGGCTTCGAC GCACAGACGG AGACCTATGT CGATATGATC GAGGCCGGCA TCGTCGATCC CGCCAAGGTC GTGCGCACCG CTCTGCAGGA CGCCGGCTCG ATTGCGGCGC TGCTGATCAC CGCCGAGGTG ATGATCGCCG ACATTCCCGC AAGAGACTCC GCCCCTGCAG CCGGAAATGG CGGCATGGGA GATATGGGAT ACTGA
|
Protein sequence | MSAKEIRFST DARDRLLRGV ELLNNAVKVT LGPKGRNVVI DKAYGAPRIT KDGVSVAKEI ELADKFENMG AQMVREVASK TNDLAGDGTT TATVLAASIF REGAKLVAAG MNPMDLRRGI DLGVTAVVKE IKARAMKVKS SGEIAQVGTI AANGDAAIGE MIAKAMDKVG NEGVITVEEA RTAETELDVV EGMQFDRGYL SPYFVTNAEK MRVELEDPYI LVHEKKLGSL QAMLPILEAV VQTGKPLLLI SEDVEGEALA TLVVNKLRGG LKVAAVKAPG FGDRRKAMLE DIAVLTSGQM ISEDLGIKLD NVTLDMLGRA KRVLIDKEST TIIDGSGEKA AIQARIQQIK AQIEETTSDY DKEKLQERLA KLAGGVAVIR VGGATETEVK EKKDRIDDAL NATRAAVEEG IVPGGGVALL RAKSALTGLT GENADVTAGI SIVLRALEAP IRQIADNAGF EGSIVVGKLA GSNNHNQGFD AQTETYVDMI EAGIVDPAKV VRTALQDAGS IAALLITAEV MIADIPARDS APAAGNGGMG DMGY
|
| |