Gene Information |
Locus tag | Smed_0804 |
Symbol | groEL |
ID | 5321641 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009636 |
Strand | - |
Start bp | 866544 |
End bp | 868172 |
Gene Length | 1629 bp |
Protein Length | 542 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 640789741 |
Product | chaperonin GroEL |
Protein accession | YP_001326495 |
Protein GI | 150396028 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0459] Chaperonin GroEL (HSP60 family) |
TIGRFAM ID | [TIGR02348] chaperonin GroL |
| |
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.451933 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 0.636529 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTGTCA AGGAAGTCAG ATTCACCAGC GATGCCCGCG ATCGCATGCT GCGCGGCGTG GATATCATGG CCAACGCCGT GCGCGTGACT TTGGGACCGA AGGGGCGGAA CGTCGTCATC GACAAGTCCT TTGGAGCGCC GCGGATAACC AAGGACGGCG TTTCCGTCGC CAAGGAAATC GAGCTCGAGG ACAAGTTCGA GAACATGGGG GCGCAGATGC TGCGCGAGGT GGCGTCGCGC ACCAGCGACA TCGCCGGCGA TGGCACCACC ACGGCCACCG TACTCGCGCA AGCAATCGTC AGGGAAGGCG CGAAGGCCGT GGCGGCAGGC ATGAACCCGA TGGACCTGAA GCGTGGCATC GATCTGGCCG TCGAGGCGAT CGTACGGGAA CTCAGGACCA ACGCCCGCAA GGTCTCCAAG AACGCCGAGA TCGCTCAGGT AGCCACGATT TCGGCCAATG GCGACGCAGA AATCGGTCGC TACCTTGCCG AAGCCATGGA AAAGGTCGGC AACGAGGGCG TAATCACCGT CGAAGAGGCC AAGACCGCCG AGATCGAGCT TGAAGTCGTG GAGGGAATGC AGTTCGACCG CGGCTATCTC TCACCCTATT TCATCACCAA CCAGGAGAAG ATGAGGGTGG AACTGGAAGA CGCCTACATA CTGCTGCACG AGAAGAAGCT CTCCAACCTG CAGGCGATGA TCCCAATTCT CGAATCGGTC ATCCAGTCCG GAAAGCCCCT GCTGATCATT GCCGAGGACG TCGAGGGCGA GGCACTCGCG ACGCTGGTGG TCAACAAGCT GCGCGGCGGC CTGAAGATCG CCGCGGTCAA GGCGCCCGGC TTCGGCGACC GCCGCAAGTC CATGCTCGAG GACATCGCGA TCCTGACCGG CGGAACCGTC ATCTCCGAGG AACTCGGGAC TAAGCTCGAG AGCGCGACGA TCGACATCCT CGGCCGCGCG AAACGCGTGA TGGTCGAAAA GGAGACGACG ACGATCGTCG ACGGCGCCGG GTCGAAGGCG GACATCGGTG GCCGCGTCGC CCAGATCAAG GCGCAGATCG AGGACACCAC TTCCGACTAC GATCGGGAGA AGCTGCAGGA GCGGCTCGCC AAGCTCGCGG GCGGTGTCGC CGTGATCCGC GTCGGCGGCT CGACAGAGAT CGAAGTCAAG GAGAAGAAGG ATCGCGTCGA CGATGCCCTT CATGCGACGC GGGCGGCGGT CGAAGAAGGC ATCCTGCCGG GCGGCGGCGT GGCGCTGCTA CGGGTCGTCA GCGTGCTCAA CGGTCTTGCG ACGGCCAACG ACGATCAGCG CGTGGGTATC GAGATCGTCC GCCGCGCTAT CGAGGCACCC GTCCGCCAGA TCGCCGAGAA CGCCGGCGCC GAGGGATCCA TTATCGTCGG GAAGTTGCGG GAGAAAGAGG ATTTTGCCTT TGGCTGGAAT GCCCAGACTG GTGAATTCGG CGATCTCTTT CAAATGGGCG TCATCGACCC AGCAAAGGTC GTGCGCGCCG CCTTGCAGGA CGCCGCCTCC GTCGCCGGTC TTCTTGTGAC GACGGAAGCG ATGATCGCCG AGAAACCGAA GAAGGACGGG CAGCCGCAGA TGCCGCCCGC GCCGGGCATG GATTTCTGA
|
Protein sequence | MAVKEVRFTS DARDRMLRGV DIMANAVRVT LGPKGRNVVI DKSFGAPRIT KDGVSVAKEI ELEDKFENMG AQMLREVASR TSDIAGDGTT TATVLAQAIV REGAKAVAAG MNPMDLKRGI DLAVEAIVRE LRTNARKVSK NAEIAQVATI SANGDAEIGR YLAEAMEKVG NEGVITVEEA KTAEIELEVV EGMQFDRGYL SPYFITNQEK MRVELEDAYI LLHEKKLSNL QAMIPILESV IQSGKPLLII AEDVEGEALA TLVVNKLRGG LKIAAVKAPG FGDRRKSMLE DIAILTGGTV ISEELGTKLE SATIDILGRA KRVMVEKETT TIVDGAGSKA DIGGRVAQIK AQIEDTTSDY DREKLQERLA KLAGGVAVIR VGGSTEIEVK EKKDRVDDAL HATRAAVEEG ILPGGGVALL RVVSVLNGLA TANDDQRVGI EIVRRAIEAP VRQIAENAGA EGSIIVGKLR EKEDFAFGWN AQTGEFGDLF QMGVIDPAKV VRAALQDAAS VAGLLVTTEA MIAEKPKKDG QPQMPPAPGM DF