Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg_2044 |
Symbol | thrS |
ID | 8013075 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012850 |
Strand | - |
Start bp | 2037176 |
End bp | 2039161 |
Gene Length | 1986 bp |
Protein Length | 661 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 644824630 |
Product | threonyl-tRNA synthetase |
Protein accession | YP_002975861 |
Protein GI | 241204765 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0441] Threonyl-tRNA synthetase |
TIGRFAM ID | [TIGR00418] threonyl-tRNA synthetase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.565288 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 32 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCCAAG CTATTTCCCT GACATTTCCC GATGGTTCCG TGCGCGGCTA TGATGCCGGC GCAACCGGCC GGGATGTCGC TGAATCCATT TCCAAGTCGC TGGCCAAGAA GGCGGTCGCC GTTGCGATCG ACGGCACCGT GCGCGACCTT TCCGATCCCG TGACGACGGG CAGGATCGAG ATCATCACCC GCAATGACGA CCGCGCGCTC GAACTCATCC GCCATGACGC GGCGCATGTC ATGGCCGAAG CGGTGCAGGA GATCTGGCCC GGCACCCAGG TGACGATCGG CCCGGTCATC GAAAACGGCT TCTATTACGA CTTTGCCAAG AACGAGCCCT TCACACTCGA CGATCTGCCG AAGATCGAAA AGAAGATGAA GGAGATCATC GCCCGCAACG CCGCCTTCAC CAAGCAGGTC TGGTCGCGCG AGAAGGCGAA ACAGGTCTTC GCCGACAAGG GCGAGCAGTA CAAGGTCGAA CTCGTCGATG CGATCCCGGA AGGGCAAGAT CTCAAGATCT ATTACCAGGG CGACTGGTTC GACCTCTGCC GCGGCCCGCA CATGGCCTCG ACCGGCCAGA TCGGCAGCGC CTTCAAGCTC TTGAAGGTGG CTGGCGCCTA TTGGCGCGGC GACAGCAACA ATCCGATGCT GAGCCGCATC TACGGCACGG CCTTCGCCGA GCAGTCGGAA CTCGACAACT ACCTGCATAT GCTTGCCGAA GCCGAAAAGC GCGATCATCG CCGGCTCGGC CGCGAGATGG ATCTCTTCCA TTTCCAGGAA GAGGGTCCCG GCGTCGTCTT CTGGCACGGC AAGGGCTGGC GCGTCTTTCA GACGCTGGTT GCCTATATGC GCCGCCGGCT GGCCGGCGAC TATCAGGAAG TCAATGCGCC ACAGGTGCTC GACAAGTCGC TGTGGGAGAC CTCCGGCCAC TGGGGCTGGT ATCGCGACAA CATGTTCAAG GTAACGGTTG CCGGCGACGA CACCGACGAT GATCGCGTCT TCGCGCTGAA GCCGATGAAC TGCCCCGGCC ATATCCAGAT CTTCAAGCAC GGGCTGAAAT CCTACCGCGA ACTGCCGATC CGGCTGGCCG AATTCGGCAA TGTCCACCGT TACGAGCCGT CGGGCGCGCT GCACGGGCTG ATGCGCGTGC GCGGCTTCAC GCAGGACGAT GCGCATATCT TCTGCACGGA CGAGCAGATG GCCGCCGAAT GCCTGAAGAT CAACGACCTG ATTCTTTCGG TCTATAAGGA TTTCGGCTTC GACGAAGTCA CCATCAAGCT CTCGACAAGG CCGGACAAGC GCGTCGGCTC GGACGATCTC TGGGACCGCG CCGAAAGCGT GATGATGGGC GTGCTGGAGA CGATCCAGCA GCAGTCGAAC AACATCAAGA CCGGCATCCT GCCGGGCGAG GGCGCCTTCT ACGGTCCGAA GTTCGAATAT ACGCTGAAGG ATGCGATCGG CCGAGAATGG CAGTGCGGCA CGACGCAGGT CGACTTCAAC CTGCCGGAAC GCTTCGGCGC CTTCTACATC GACAGCAACT CCGAAAAGAC GCAGCCGGTG ATGATCCACC GCGCCATCTG CGGCTCGATG GAGCGCTTCC TCGGCATCCT GATCGAGAAC TTTGCCGGCC ATATGCCACT CTGGGTATCG CCGCTGCAGG TGGTGGTCGC AACGATCACC TCGGAAGCCG ACGCCTACGG GCTTGAAGTG GCCGAGGCGC TGCGCGAGGC CGGTCTCAAC GTTGAAACCG ATTTCCGCAA CGAGAAGATC AACTACAAGG TCCGCGAGCA TTCGGTCACG AAGGTTCCTG TCATCATCGT CTGCGGCAGG AAGGAAGCCG AGGAGCGCAC GGTCAACATC CGCCGCCTCG GCAGCCAGGA CCAGGTTTCG ATGGGGCTCG ACGCCGCCGT CGAGAGCCTT GCCCTCGAAG CGACACCGCC TGACATCCGT CGGAAGGCCG AAGCAAAAAA AGCCAAGGCG GCCTGA
|
Protein sequence | MSQAISLTFP DGSVRGYDAG ATGRDVAESI SKSLAKKAVA VAIDGTVRDL SDPVTTGRIE IITRNDDRAL ELIRHDAAHV MAEAVQEIWP GTQVTIGPVI ENGFYYDFAK NEPFTLDDLP KIEKKMKEII ARNAAFTKQV WSREKAKQVF ADKGEQYKVE LVDAIPEGQD LKIYYQGDWF DLCRGPHMAS TGQIGSAFKL LKVAGAYWRG DSNNPMLSRI YGTAFAEQSE LDNYLHMLAE AEKRDHRRLG REMDLFHFQE EGPGVVFWHG KGWRVFQTLV AYMRRRLAGD YQEVNAPQVL DKSLWETSGH WGWYRDNMFK VTVAGDDTDD DRVFALKPMN CPGHIQIFKH GLKSYRELPI RLAEFGNVHR YEPSGALHGL MRVRGFTQDD AHIFCTDEQM AAECLKINDL ILSVYKDFGF DEVTIKLSTR PDKRVGSDDL WDRAESVMMG VLETIQQQSN NIKTGILPGE GAFYGPKFEY TLKDAIGREW QCGTTQVDFN LPERFGAFYI DSNSEKTQPV MIHRAICGSM ERFLGILIEN FAGHMPLWVS PLQVVVATIT SEADAYGLEV AEALREAGLN VETDFRNEKI NYKVREHSVT KVPVIIVCGR KEAEERTVNI RRLGSQDQVS MGLDAAVESL ALEATPPDIR RKAEAKKAKA A
|
| |