Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Msed_1585 |
Symbol | thrS |
ID | 5104030 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | + |
Start bp | 1532863 |
End bp | 1534488 |
Gene Length | 1626 bp |
Protein Length | 541 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 640507471 |
Product | threonyl-tRNA synthetase |
Protein accession | YP_001191664 |
Protein GI | 146304348 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0441] Threonyl-tRNA synthetase |
TIGRFAM ID | [TIGR00418] threonyl-tRNA synthetase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.000347084 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.0207238 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAGTCCT ATAGAGGATT GTGGCTAAAG GGAGCAATAG TAATGGCATT AAACATGTAT GAGGCTGGGC TCACTCCAGT GGAGATAGGC CTAGGAGAGA GAGATTTTTA CATTGATGTT CAGTCAGATT CGGCTCTTTC CCTTCAGGAG TCTGAGAAGT TTGCCCAGTG GAAAGATCAC AAGTACGAGA TCAAGGACGG AAAGGTAACC TATAACGGAA AACAAATACT TCTTCAAGGG GACGTGACAC CCTCTGGAGA ACCAAGATAT TTCAAGGTTC TGAACATCTC AGTACATCAT CCCTCAGCTA ACGTTCAGTT GGTGAGGATT AGGGGTATTG CCTTTGAGAC CAAGGAACAA ATGGACGATT ACCTTCAGTG GTTGGAGAAG GCCTCCGAGA CTGATCATCG TATTATAGGG GAGAGGATGG ATCTCTTCAG TTTCCACGAG GAATCTGGTC CAGGTCTAGT CCTGTTCCAT CCCAAGGGCC AGCTAATTAG AAATGAGATG ATAAACTACA TGAGGGAGAT TAACGCTTCC ATGGGATATC AAGAGGTCTA CACATCTCAC GTGTTTAGGA CTGTTCTTTG GAAGATAAGC GGTCATTACG ATACTTACAG GGACAAGATG TTGATCTTCC AAAAGGATGA TGACGAACTA GGAATAAAAC CCATGAATTG TCCCGCTCAC ATATTAATCT ACAAGTCAAG AGTTAGGAGT TACAGAGATC TTCCCATAAG GTTCTCCGAA TTCGGCAACG TTTATAGATG GGAGAAGAAG GGGGAGCTTT ACGGCTTACT TAGAACAAGG GGATTCACGC AGGATGACGG TCATATCTTT TTAAGGGAGG ATCAGCTGAA GGACGAGGTA AAGAATCTAG TTAGGAAGAC TCTTGACGTC CTCGGTAAGT TCGGGTTTAA GGGAGAGGAC GTTCGGATAA ATCTGAGCAC AAGACCAGAT GAAAGTATAG GAAGCGATGA ACAGTGGGAG AAGGCGACTA AGGCATTGCT AGATGTACTA AAGGAACTTA ACGTTCCCTA TGTCGTGAAG GAGAAGGAGG GGGCGTTTTA TGGACCCAAA ATAGATTTTG ACATAAGGGA CAGCTTAAAC AGATGGTGGC AATTGTCCAC TATTCAGGTA GATTTCAATC TGCCTGAAAG ATTCAAACTG GAGTACGTGG ATGAGGATGG AAGCAAGAAG AGGCCGGTCA TGGTTCACAG GGCCATATAC GGCTCGCTCG ACAGAATGAT AGCCATACTT CTTGAACATT TCCGTGGAAA GTTACCCACC TGGTTGTCTC CCGTTCAGGT AAGGGTTCTA CCCATAAGTG AGGACAACCT AGATTACGCT AAGAGGGTTA TGGACGTGCT AGTGCAGAGA GGTATCAGAA CGGAAATCGA TCCGAGCGGG GAAACGCTTT CCAAGAGGAT AAAGAGAGGT TATGATGACG GTGTTCCTTA CCTTGTCATT GTTGGTAGGA AAGAGGCCTC TGAGGAAAAG GTAACCATCA GGGCCAGAGG AAACGTGGAG ATAAAGGGGG TTCCTCTTTC CAGATTTGTG GATGAGCTCT CCCTAGAAAT CGGGAACAGG GACGCTGAAA ATACTCTGAT TAAGAGGATT GGATAA
|
Protein sequence | MESYRGLWLK GAIVMALNMY EAGLTPVEIG LGERDFYIDV QSDSALSLQE SEKFAQWKDH KYEIKDGKVT YNGKQILLQG DVTPSGEPRY FKVLNISVHH PSANVQLVRI RGIAFETKEQ MDDYLQWLEK ASETDHRIIG ERMDLFSFHE ESGPGLVLFH PKGQLIRNEM INYMREINAS MGYQEVYTSH VFRTVLWKIS GHYDTYRDKM LIFQKDDDEL GIKPMNCPAH ILIYKSRVRS YRDLPIRFSE FGNVYRWEKK GELYGLLRTR GFTQDDGHIF LREDQLKDEV KNLVRKTLDV LGKFGFKGED VRINLSTRPD ESIGSDEQWE KATKALLDVL KELNVPYVVK EKEGAFYGPK IDFDIRDSLN RWWQLSTIQV DFNLPERFKL EYVDEDGSKK RPVMVHRAIY GSLDRMIAIL LEHFRGKLPT WLSPVQVRVL PISEDNLDYA KRVMDVLVQR GIRTEIDPSG ETLSKRIKRG YDDGVPYLVI VGRKEASEEK VTIRARGNVE IKGVPLSRFV DELSLEIGNR DAENTLIKRI G
|
| |