Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Smed_6084 |
Symbol | groEL |
ID | 5320386 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009621 |
Strand | + |
Start bp | 1017063 |
End bp | 1018700 |
Gene Length | 1638 bp |
Protein Length | 545 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 640777729 |
Product | chaperonin GroEL |
Protein accession | YP_001314661 |
Protein GI | 150378066 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0459] Chaperonin GroEL (HSP60 family) |
TIGRFAM ID | [TIGR02348] chaperonin GroL |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.879265 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 36 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCAGCTA AAGAAGTCAA GTTCGGCCGT AGCGCGCGCG AAAAGATGCT GCGCGGCGTC GACATCCTCG CCGACGCAGT CAAGGTGACG CTCGGCCCGA AGGGCCGCAA CGTCGTCATC GACAAGTCGT TCGGCGCGCC GCGCATCACC AAGGACGGCG TCACCGTGGC CAAGGAGATC GAACTCGAAG ACAAGTTCGA GAACATGGGC GCCCAGATGG TCCGCGAAGT CGCTTCGAAG ACCAACGACA TTGCCGGTGA CGGCACGACG ACTGCAACCG TTCTCGCTCA GGCAATCGTT CGCGAAGGCG CGAAGGCCGT TGCCGCCGGC ATGAACCCGA TGGACCTTAA GCGCGGTATC GACCTCGCCG TCGCGGAAGT CGTCAAGGAC CTGCTCGCCA AGGCCAAGAC GATCAACACC TCGGACGAAG TCGCCCAGGT CGGCACGATC TCGGCAAACG GCGAAAAGCA GATCGGTCTC GACATTGCGG AAGCGATGCA GAAGGTCGGC AACGAAGGCG TCATCACGGT TGAAGAAGCC AAGACCGCCG AGACCGAGCT CGAAGTCGTC GACGGCATGC AGTTCGACCG CGGCTACCTG TCGCCCTACT TCGTCACCAA CCCGGAAAAG ATGGTCGCCG ACCTCGAAGA CGCTTACATT CTCCTGCACG AGAAGAAGCT GTCGAACCTG CAGGCGATGC TCCCGGTTCT CGAAGCCGTC GTCCAGACCG GCAAGCCGCT CCTCATCATT GCTGAAGACG TCGAAGGCGA AGCACTCGCA ACGCTCGTCG TCAACAAGCT GCGTGGCGGC CTGAAGATCG CTGCCGTCAA GGCCCCGGGC TTCGGCGACC GCCGCAAGGC CATGCTCGAA GACATCGCCA TCCTGACGGG CGGCACGGTG ATCTCGGAAG ACCTCGGCAT CAAGCTCGAA AGCGTCACGC TCGACATGCT CGGCCGTGCG AAGAAGGTTT CGATCACCAA GGAAAATACG ACGATCGTCG ACGGTGCCGG CCAGAAGTCC GACATCGAAG GCCGCGTCGC CCAGATCAAG GCCCAGATCG AAGAAACCAC TTCCGACTAC GACCGTGAGA AGCTGCAGGA GCGCCTTGCC AAGCTCGCTG GCGGCGTTGC CGTCATCCGC GTCGGCGGTG CGACGGAAGT CGAAGTGAAG GAAAAGAAGG ACCGCATCGA CGACGCTCTC AACGCGACGC GCGCTGCAGT TCAGGAAGGC ATCGTACCGG GCGGCGGCGT TGCTCTGCTG CGTTCCTCCG TCAAGATCAC CGTCAAGGGT GAAAACGACG ACCAGGATGC CGGCGTCAAC ATCGTTCGCC GCGCTCTGCA GTCTCCGGCC CGCCAGATCG TCGAAAACGC TGGCGACGAA GCATCCATCG TCGTCGGCAA GATCCTCGAG AAGGACACCG ACGACTTCGG TTACAACGCA CAGACCGGCG AATATGGCGA CATGATCGCC ATGGGCATCA TCGACCCGGT CAAGGTCGTT CGCACCGCGC TCCAGGACGC AGCCTCGGTT GCTTCGCTGC TCATCACCAC CGAAGCCATG ATCGCCGAGC TGCCGAAGAA GGACGCTCCG GCAATGCCTG GCGGCATGGG CGGCATGGGC GGCATGGACA TGATGTGA
|
Protein sequence | MAAKEVKFGR SAREKMLRGV DILADAVKVT LGPKGRNVVI DKSFGAPRIT KDGVTVAKEI ELEDKFENMG AQMVREVASK TNDIAGDGTT TATVLAQAIV REGAKAVAAG MNPMDLKRGI DLAVAEVVKD LLAKAKTINT SDEVAQVGTI SANGEKQIGL DIAEAMQKVG NEGVITVEEA KTAETELEVV DGMQFDRGYL SPYFVTNPEK MVADLEDAYI LLHEKKLSNL QAMLPVLEAV VQTGKPLLII AEDVEGEALA TLVVNKLRGG LKIAAVKAPG FGDRRKAMLE DIAILTGGTV ISEDLGIKLE SVTLDMLGRA KKVSITKENT TIVDGAGQKS DIEGRVAQIK AQIEETTSDY DREKLQERLA KLAGGVAVIR VGGATEVEVK EKKDRIDDAL NATRAAVQEG IVPGGGVALL RSSVKITVKG ENDDQDAGVN IVRRALQSPA RQIVENAGDE ASIVVGKILE KDTDDFGYNA QTGEYGDMIA MGIIDPVKVV RTALQDAASV ASLLITTEAM IAELPKKDAP AMPGGMGGMG GMDMM
|
| |