Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GWCH70_2661 |
Symbol | thrS |
ID | 7978320 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacillus sp. WCH70 |
Kingdom | Bacteria |
Replicon accession | NC_012793 |
Strand | - |
Start bp | 2691249 |
End bp | 2693183 |
Gene Length | 1935 bp |
Protein Length | 644 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 644799462 |
Product | threonyl-tRNA synthetase |
Protein accession | YP_002950621 |
Protein GI | 239827997 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0441] Threonyl-tRNA synthetase |
TIGRFAM ID | [TIGR00418] threonyl-tRNA synthetase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.0000386283 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCGGAAA TGATTCGCAT TACATTCCCT GATGGAGCGG TAAAGGAGTT TCCAAAAGGA ACAACGACAG AACAAATCGC TGCATCGATT AGCCCGGGAT TAAAGAAAAA AGCGATTGCC GGCAAACTAA ACGATCGTTT TATTGATTTG CGCACACCGA TTCAGGAAGA TGGATCGATT TCGATCATTA CGCAAGATAT GCCAGAAGCG CTGGACATTT TGCGCCATAG TACTGCCCAT TTAATGGCCC AAGCGATTAA GCGTCTGTAT AAAAATGTAA AACTTGGCGT CGGTCCGGTC ATTGAAAATG GTTTCTATTA CGATATCGAT ATGGAAGAAT CATTAACGCC GGAAGACTTG CCGAAAATTG AACAAGAAAT GCGGAAAATT GTGAAAGAAA ACTTGGAAAT TGTCCGGAAA GAAGTGAGCC GCGAAGAAGC GATTCGCCTT TACGAAGAAA TTGGCGATGA TTTAAAACTG GAATTAATTA ACGATATTCC AGAAGGAGAA ACGATCTCCA TTTACGAACA AGGCGAATTT TTTGACCTTT GCCGCGGTGT CCACGTTCCA TCGACAGGAA AAATTAAAGA GTTTAAGTTG TTGAACATCT CAGGAGCGTA CTGGCGCGGA GACAGCAATA ATAAAATGCT GCAGCGCATT TACGGAACGG CGTTCTTCAA AAAAGAAGAT TTAGATGAAT ATCTTCGCCA GTTGCAAGAA GCAAAAGAAC GCGATCATCG CAAATTAGGA AAAGAGCTTG AATTGTTTAT GACTTCGCAA AAAGTCGGAC AAGGGCTGCC GCTTTGGCTG CCAAAAGGGG CAACGATTCG CCGCATTATC GAGCGGTATA TTGTGGACAA AGAAATTGAA TTAGGTTATC AACATGTTTA TACACCAGTG CTTGGTAGTG TCGAATTATA TAAAACTTCC GGCCACTGGG ATCATTACAA AGACAACATG TTCCCGCCGA TGGAAATGGA CAATGAACAG CTTGTGCTGC GCCCAATGAA CTGTCCGCAT CATATGATGA TTTATAAAAG CAAAATCCAT AGCTATCGGG AGCTTCCGAT TCGTATCGCA GAGCTCGGCA CGATGCATCG CTACGAAATG TCCGGAGCGC TTTCCGGCTT GCAGCGCGTC CGCGGCATGA CATTAAACGA CGCTCACATT TTTGTGCGTC CAGACCAAAT TAAAGATGAG TTCAAACGCG TCGTTAACTT AATTTTAGAA GTATACAAAG ACTTTGGCTT GGATGAATAT TCGTTCCGGC TTTCTTACCG CGATCCACAT GATAAAGAAA AATATTACGA TGATGATGAA ATGTGGGAAA AAGCGCAAAA CATGCTGCGT GAAGCAATGG ATGAATTGGG ATTAGAGTAT TATGAAGCCG AAGGGGAAGC GGCGTTTTAT GGTCCGAAAT TAGACGTGCA AGTGCGCACA GCGCTTGGAA AAGACGAAAC ATTGTCAACG GTGCAGCTTG ATTTCTTATT GCCGGAACGC TTTGATTTAA CATACATCGG CGAAGACGGC AAACCGCATC GCCCGGTTGT CATCCATCGC GGGGTTGTTT CCACGATGGA ACGTTTCGTT GCGTTTCTGA TTGAAGAATA TAAAGGCGCG TTCCCAACTT GGCTTGCCCC AGTGCAAGTC GAAGTGATCC CTGTGTCGCC AGCAGCGCAT CTCGACTATG CGTATAAAGT GAAAGAAGCG TTGCAATCGC AAGGATTCCG CGTCGAAGTC GACGAACGCG ATGAAAAAAT TGGCTACAAA ATTCGTGAAG CGCAAATTCA AAAAATCCCT TACATGCTTG TCGTTGGTGA CAAAGAAATG GCAGAAAATG CCGTCAACGT CCGTAAATAC GGCGAACAAA AAAGCGAAAC GATGTCTCTC GACGATTTTA TTGCCGCTCT GAAAGCGGAA GTGCGTCGAA ACTAG
|
Protein sequence | MSEMIRITFP DGAVKEFPKG TTTEQIAASI SPGLKKKAIA GKLNDRFIDL RTPIQEDGSI SIITQDMPEA LDILRHSTAH LMAQAIKRLY KNVKLGVGPV IENGFYYDID MEESLTPEDL PKIEQEMRKI VKENLEIVRK EVSREEAIRL YEEIGDDLKL ELINDIPEGE TISIYEQGEF FDLCRGVHVP STGKIKEFKL LNISGAYWRG DSNNKMLQRI YGTAFFKKED LDEYLRQLQE AKERDHRKLG KELELFMTSQ KVGQGLPLWL PKGATIRRII ERYIVDKEIE LGYQHVYTPV LGSVELYKTS GHWDHYKDNM FPPMEMDNEQ LVLRPMNCPH HMMIYKSKIH SYRELPIRIA ELGTMHRYEM SGALSGLQRV RGMTLNDAHI FVRPDQIKDE FKRVVNLILE VYKDFGLDEY SFRLSYRDPH DKEKYYDDDE MWEKAQNMLR EAMDELGLEY YEAEGEAAFY GPKLDVQVRT ALGKDETLST VQLDFLLPER FDLTYIGEDG KPHRPVVIHR GVVSTMERFV AFLIEEYKGA FPTWLAPVQV EVIPVSPAAH LDYAYKVKEA LQSQGFRVEV DERDEKIGYK IREAQIQKIP YMLVVGDKEM AENAVNVRKY GEQKSETMSL DDFIAALKAE VRRN
|
| |