Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Smed_1054 |
Symbol | thrS |
ID | 5321900 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009636 |
Strand | + |
Start bp | 1121556 |
End bp | 1123538 |
Gene Length | 1983 bp |
Protein Length | 660 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 640789997 |
Product | threonyl-tRNA synthetase |
Protein accession | YP_001326742 |
Protein GI | 150396275 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0441] Threonyl-tRNA synthetase |
TIGRFAM ID | [TIGR00418] threonyl-tRNA synthetase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.290687 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 0.56724 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCGCATT CCGTTTCCCT TACATTCCCC GATGGCTCCG TGCGAGAATT CGCCCCCGGC ACGACCGGCC GCGATGTTGC GGAATCGATC TCGAAGTCGC TCGCCAAAAA GTCCGTCGCG ATCGCCATCG ACGGCGAGCT GCGTGACCTC TCGGATCCCG TGACGGAAGG CAGGATTGAG ATCGTGACGC GCGAGGACAA GCGCGCGCTA GAGCTCATCC GCCACGACGC GGCCCATGTC ATGGCGGAAG CCGTCCAGGA GTTGTGGCCG GGAACGCAGG TGACCATTGG CCCGGTCATC GACAATGGTT TCTATTACGA CTTCGCAAAG AACGAGCCCT TCACGCCCGA CGACCTGCCG GTCATCGAGA AGAGGATGAG GGAGATCATC GCGCGCAACA AGCCTTTCAC CAAGGAAGTC TGGTCGCGCG ACAAGGCCAA GGAGGTCTTT GCCGTCAAGG GCGAGAGTTA CAAGGTTGAA CTCGTCGACG CTATCCCCGA AGGCCAGGAC CTGAAGATTT ATTACCAGGG CGACTGGTTC GATCTCTGCC GGGGGCCACA TATGGCTTCG ACGGGCCAGA TCGGCACCGC CTTCAAATTG ATGAAGGTCG CCGGCGCCTA CTGGCGTGGC GACAGCAACA ATCCGATGCT GACGCGCATC TACGGCACCG CCTGGCACAC GCAGGAGGAG CTCGATCAGT ATCTGCACGT GCTGGCCGAG GCCGAAAAGC GCGACCATCG CCGGCTCGGC CGCGAGATGG ACCTGTTCCA TTTCCAGGAG GAAGGACCGG GCGTGGTCTT CTGGCACGGA AAGGGCTGGC GCATCTTCCA GAGCCTCGTC GCCTATATGC GCCGCCGCCT CGAAGGCGAT TATCAGGAGG TCAACGCTCC GCAGGTGCTC GACAAGTCTC TCTGGGAGAC CTCCGGCCAC TGGGGATGGT ATCGTGACAA CATGTTCAAG GTGACTGTCG CCGGGGACGA TACGGATGAC GATCGCGTCT TCGCGCTGAA GCCGATGAAT TGCCCCGGGC ACATCCAGAT CTTCAAGCAT GGCTTGAAGT CTTACCGGGA ACTACCTGTA AGGCTGGCAG AATTCGGTGC CGTCCATCGC TACGAGCCAT CAGGTGCGCT GCACGGCCTG ATGCGCGTGC GCGGCTTCAC CCAGGACGAC GCGCACATCT TCTGCACCGA TGAGCAGATG GCGGCCGAAT GCCTGAAGAT CAACGATCTT ATCCTCTCCG TCTATGAGGA CTTCGGCTTC AAGGAGATCG TCGTGAAGCT GTCGACGCGG CCGGAAAAGC GTGTGGGCTC CGATGAACTG TGGGACCGCG CCGAGGCGGT GATGACGGAG GTCTTGAAGA CGATCGAGGC GCAGTCCGAG GGCCGCATCA AGACCGGCAT CCTGCCGGGC GAGGGTGCGT TCTACGGTCC GAAATTCGAA TATACGCTGA AGGACGCAAT CGGCCGCGAA TGGCAGTGCG GAACGACGCA GGTCGATTTC AATCTGCCGG AGCGCTTCGG CGCCTTCTAC ATCGACAGCG AATCCGAGAA GCGTCAGCCG GTCATGATCC ATCGCGCCAT TTGCGGGTCG ATGGAGCGTT TCCTCGGCAT CCTGCTCGAG AATTTTGCGG GCCATATGCC GCTGTGGATC TCGCCTCTGC AGGTGGTGGT CGCAACGATC ACCTCGGAAG CGGATGATTA TGGCCGCGAA GTGGCCGAAC GCTTGCGGGA CGCCGGCCTG ACGGTCGAGA CCGATTTCCG AAACGAGAAG ATCAACTACA AGGTCCGCGA ACACTCGGTA ACAAAGGTTC CGGTGATCGT CGTCTGCGGC AAGCGGGAAG CAGAGGAACG TTCCGTCAAC ATCCGTCGCC TGGGCTCCCA GGCGCAGACG GCTATGTCGC TCGACGAGGC AGTTGCTTCG CTCTCCGCCG AAGCCACGGC GCCGGACCTC AAGCGCAAGG CGGAGCGGAC CGCCCGCGCC TGA
|
Protein sequence | MSHSVSLTFP DGSVREFAPG TTGRDVAESI SKSLAKKSVA IAIDGELRDL SDPVTEGRIE IVTREDKRAL ELIRHDAAHV MAEAVQELWP GTQVTIGPVI DNGFYYDFAK NEPFTPDDLP VIEKRMREII ARNKPFTKEV WSRDKAKEVF AVKGESYKVE LVDAIPEGQD LKIYYQGDWF DLCRGPHMAS TGQIGTAFKL MKVAGAYWRG DSNNPMLTRI YGTAWHTQEE LDQYLHVLAE AEKRDHRRLG REMDLFHFQE EGPGVVFWHG KGWRIFQSLV AYMRRRLEGD YQEVNAPQVL DKSLWETSGH WGWYRDNMFK VTVAGDDTDD DRVFALKPMN CPGHIQIFKH GLKSYRELPV RLAEFGAVHR YEPSGALHGL MRVRGFTQDD AHIFCTDEQM AAECLKINDL ILSVYEDFGF KEIVVKLSTR PEKRVGSDEL WDRAEAVMTE VLKTIEAQSE GRIKTGILPG EGAFYGPKFE YTLKDAIGRE WQCGTTQVDF NLPERFGAFY IDSESEKRQP VMIHRAICGS MERFLGILLE NFAGHMPLWI SPLQVVVATI TSEADDYGRE VAERLRDAGL TVETDFRNEK INYKVREHSV TKVPVIVVCG KREAEERSVN IRRLGSQAQT AMSLDEAVAS LSAEATAPDL KRKAERTARA
|
| |