Gene Information |
Locus tag | Aasi_0559 |
Symbol | thrS |
ID | 6376647 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Amoebophilus asiaticus 5a2 |
Kingdom | Bacteria |
Replicon accession | NC_010830 |
Strand | + |
Start bp | 718614 |
End bp | 720548 |
Gene Length | 1935 bp |
Protein Length | 644 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 642681713 |
Product | threonyl-tRNA synthetase |
Protein accession | YP_001957689 |
Protein GI | 189501972 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0441] Threonyl-tRNA synthetase |
TIGRFAM ID | [TIGR00418] threonyl-tRNA synthetase |
|
|
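The coordinate and length fields above are consistent with each other, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the coding sequence ends with a single stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 718614
end_bp = 720548

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not part of the protein.
protein_length = gene_length // 3 - 1

print(gene_length)      # 1935
print(protein_length)   # 644
```

Under these assumptions the computed values match the listed Gene Length (1935 bp) and Protein Length (644 aa).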
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.00454252 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCATAACC ATACAGTTAA TATTGCACTG CCAGATGGTA CTATAAAATC TTTTGCTAAA GGGGTAACCA GCTTAGAAAT AGCACAATCT ATTAGTGAAC GTCTATCTCA GCAAATTTTA GCAAGTCTAG TTAATGGTGA GGTATGGGAT ATAACCAGGC CTATTACTGA GGATGCTGCA GTAAAATTGC TTACTTGGCA GGATGAAGAT GGTAAAAAAG CATTTTGGCA TTCTTCAGCT CACCTAATGG CAGAAGCGCT AGAGTCGCTT TACCCTGGTA TTAAACTAGG AATTGGTCCT GCTATTGCTA ATGGATTTTA TTATGATATA GACTTTGGTG ATTATGATTT TGATGCTACC CATCTGCCTC GCATAGAAGA GAAGATGCTA GAATTAGCTC GCCAAAATAA CCTTTATCAA GGAATTGTCG TTAATAAGCC TGCGGCCATT AGCTTCTTTC AGAAAAAAGG TGATCCTTAC AAGGTAGAGT TGTTGGAAGG CTTACAAGAC GGAAGTATAA CCTTTTATAA GCATGGTAAT TTCACCGACC TCTGTAGAGG TCCTCATATT CCCCATACAG GTTTTATAAA AGCTGTAAAG CTCTTAAACA TATCAGGCGC GTATTGGCGT GGTAATGAGA AGAATAAACA GCTTACAAGA ATTTATGGTA TTACTTTCCC TCAGCAAAAA GAACTAAAAG CTTATTTAGA ATTATTAGAA GAAGCTCAAA AGAGAAATCA TCAGAAGATA GGTAAAGAGT TAAAGTTATT TACTTTTTCC GAAAAGGTTG GTATTGGTTT GCCTCTCTGG TTGCCTAGAG GTACTGTTTT GCGCGAGCAG CTAGAACAGT TTTTACGTCG GGCCCAGGTA AAAGCAGGCT ACCAACCAGT AGTTACCCCT CACATAGGTC ATAAAGAGCT GTATATGACC TCAGGTCATT ACGATAAGTA TGGGGAAGAT TCTTTTCAGC CTATCCGTAC CCCTCATGAA GGAGAAGAGT TCTTCTTGAA ACCTATGAAT TGCCCCCACC ATTGTGAAAT TTATAAGCAT GAGCCTAGAT CTTATCGGGA CTTACCGGTG CGCTTGGCAG AATTTGGTAC TGTGTATCGA TATGAGCAGC ATGGTGAGTT ACATGGGTTA GTACGTACAA GGGGCTTTAC ACAAGATGAT GCGCACATAT TTTGTAGAAA TGATCAAGTA AGAGAAGAGT TTGCTAAAGT TATAGACTTG GTAACTTATG TGTTTAGTGC ATTAGGTTTT AGCGACTATA CAGCTCGGTT GTCATTTAGA GACCCTGAAC AGCTACATAA GTATATAGGG GATCAGGAAG ATTGGGATAA GGCAGAAGAA GCTATAGAAG AAGTTGCTAA GTTAAGGAAG CTAAATACAA CGAAGGCATT AGGTGAAGCG GCTTTTTATG GACCTAAGTT AGATTTTATG GTGAAAGATG CGCTTGGTAG AAATTGGCAG CTTGGCACAG TACAGCTAGA TTATCAATTA CCTGTACGGT TCGATCTGTC TTATACTGGT GCTGATAACA AAAAGCACAG GCCTGTAATG ATTCATAGAG CACCCTTTGG TTCATTAGAG CGATTTATTG CTATACTATT GGAGCATACT GCTGGAAAAC TTCCATTATG GCTTGCACCA GAGCAAGTAG CTATATTACC TATTTCTGAA AAATTTGCTG CGTATGCAGA AGAAGTAAAT CAGAATCTGC GCAATAAAGA TATTAGATGC TTTATAGATC ATAGAGACGA GAAAATAGGT AAAAAGATAA GAGAAGCCGA ACTAAGCAAA 
ATTCCTTATA TGTTTATTAT AGGTGAAAAA GAACAACTTG AAAGGACAGT TTCTGTAAGA AAACAAGGTG CTGGAGATCA AGGGAGCTTC CCTATCGATA AATTAGTACA AGAAATTGTT GAAGATATTT CATAA
|
Protein sequence | MHNHTVNIAL PDGTIKSFAK GVTSLEIAQS ISERLSQQIL ASLVNGEVWD ITRPITEDAA VKLLTWQDED GKKAFWHSSA HLMAEALESL YPGIKLGIGP AIANGFYYDI DFGDYDFDAT HLPRIEEKML ELARQNNLYQ GIVVNKPAAI SFFQKKGDPY KVELLEGLQD GSITFYKHGN FTDLCRGPHI PHTGFIKAVK LLNISGAYWR GNEKNKQLTR IYGITFPQQK ELKAYLELLE EAQKRNHQKI GKELKLFTFS EKVGIGLPLW LPRGTVLREQ LEQFLRRAQV KAGYQPVVTP HIGHKELYMT SGHYDKYGED SFQPIRTPHE GEEFFLKPMN CPHHCEIYKH EPRSYRDLPV RLAEFGTVYR YEQHGELHGL VRTRGFTQDD AHIFCRNDQV REEFAKVIDL VTYVFSALGF SDYTARLSFR DPEQLHKYIG DQEDWDKAEE AIEEVAKLRK LNTTKALGEA AFYGPKLDFM VKDALGRNWQ LGTVQLDYQL PVRFDLSYTG ADNKKHRPVM IHRAPFGSLE RFIAILLEHT AGKLPLWLAP EQVAILPISE KFAAYAEEVN QNLRNKDIRC FIDHRDEKIG KKIREAELSK IPYMFIIGEK EQLERTVSVR KQGAGDQGSF PIDKLVQEIV EDIS
|
| |
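The relationship between the gene sequence and the protein sequence can be illustrated with a small translation routine. This is a sketch only, not the database's own pipeline: it assumes the amino acid assignments of translation table 11 are the same as those of the standard genetic code (table 11 is generally described as differing in its permitted start codons), it ignores ambiguous bases, and the helper names `clean`, `translate` and `gc_percent` are hypothetical. The two test fragments are the first 30 and last 15 bases of the gene sequence above.

```python
# Codon table built in the conventional T, C, A, G ordering.
# Assumption: amino acid assignments follow the standard genetic code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used in the record and uppercase."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate in frame 1 and stop at the first stop codon.

    Raises KeyError on characters other than A, C, G, T.
    """
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_percent(dna):
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


first_30 = "ATGCATAACC ATACAGTTAA TATTGCACTG"  # start of the gene sequence
last_15 = "GAAGATATTT CATAA"                  # end of the gene sequence

print(translate(first_30))  # MHNHTVNIAL
print(translate(last_15))   # EDIS
```

The two outputs agree with the first ten and last four residues of the protein sequence listed above. Passing the full gene sequence to `gc_percent` would give a value that can be compared against the GC content field.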