Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Athe_0801 |
Symbol | |
ID | 7407988 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anaerocellum thermophilum DSM 6725 |
Kingdom | Bacteria |
Replicon accession | NC_012034 |
Strand | + |
Start bp | 890949 |
End bp | 892892 |
Gene Length | 1944 bp |
Protein Length | 647 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 643715179 |
Product | threonyl-tRNA synthetase |
Protein accession | YP_002572689 |
Protein GI | 222528807 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0441] Threonyl-tRNA synthetase |
TIGRFAM ID | [TIGR00418] threonyl-tRNA synthetase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.00121365 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGACAAAA TAGCTGTGAC ACTTCCAGAC GGAAAAGTGT TGGAAGCAGA AAAAGGAATA TCTGGTATGG AGTTTATAAA GACAATCTCC ATGAGACTTT ACAAAGAAGC AGTTGCGTGT AAGATAGATG GTGTTTTGAA GGATTTGTGG ACATCTCTTG AAAGAGATTG CAGCTTTGAG GTTGTGACAT TTTCAAGTGA TGAGGGTAAA AAGGTTTATT GGCACACAAC GTCTCATATT TTAGCTCAGG CTGTTAAAAG GCTCTTTGGC GATAAAGTAA AGCTTGGCAT AGGACCTGCA ATTGACAATG GTTTTTATTA CGACTTTGAT ATTGAAGAGT CAATTACGAG GGAGATTTTA GAGAAGATTG AAGAAGAGAT GCAAAAAATA ATAAAAGAAG ATTTAAAGAT TGAGAGATTT GAACTTTCAA GAGAAGAAGC AATAAAACTT ATGCAACAAA GAGGAGAAAA CTACAAGGTT GAGCTTATAA ATGATATTCC AGAGGGAGAG ATTATATCTT TTTATAGGCA AGGTGAATTT GTTGACCTGT GCACAGGACC ACATCTTCCT TCAACAGGGA GAGTAAAAGC ATTCAAACTA CTTTCTGTAG CAGGTGCGTA TTGGCGCGGA AATTCTAAAA ATAAGATGCT TCAAAGAATT TATGGAATTT CTTTTGAAAA GAAATCTGAA CTTGATGAGT ATCTCAAAAA GCTTGAAGAG GCAAAAAAAC GTGACCACAA CAGAATAGGA AGAGAGCTTG AGATTTTTAC AACATCCGAG GTTGTAGGGC AAGGTCTTCC TCTTTTAATG CCAAAAGGCG CAAAAATCCT CCAGATTCTT CAAAGGTTTG TTGAGGATGA AGAGGAAAAG AGAGGTTATC TTCTGACAAA GACTCCTTAC ATGGCAAAAA GTGATTTGTA TAAAATCTCT GGGCACTGGG ACCATTACAG AGATGGCATG TTTGTAATAG AGGAAGATGA AAATGAAGTT CTGGCTTTAA GACCAATGAC ATGTCCATTT CAGTTTTTGA TATATAACTC AAAGCAGAGA AGCTACAGGG ATTTGCCAAT AAGATACAGT GAGACATCAA CACTTTTCAG AAATGAAAGT TCAGGTGAGA TGCACGGACT TATAAGGGTT AGGCAGTTTA CTCTTTCAGA TGCACACATA ATCTGCAGAC CCGACCAGGT TGAGGAAGAG TTCAAAGGAG TTTTAGATTT GATTCAGTAT ATAATGAAAA TACTTGGTAT CGAGAATGAT ATATGGTACA GATTTTCGCG CTGGGACCCG AACAACAAGG AAAAATATAT TGACAATCCT GAGGCTTGGG AAAAGACAGA AAATGATATG AAAAATATCT TAGATAAGCT TGGAATAAAT TATAAAGAGG CAAAAGGTGA GGCTGCTTTT TATGGACCAA AACTTGACAT TCAGTTCAAA AACGTCTATG GTAAAGAGGA TACAATAATA ACAATACAGA TAGATTTTGC TCTGGCAGAA CGGTTTGATA TGACGTATGT AGACAGAGAT GGGCAGAAGA AAAGACCGAT TATTATCCAC CGCTCATCGA TAGGTTGTTA TGAAAGAACG CTTGCGATGC TTATTGAAAA GTACAACGGC GCTTTTCCGC TGTGGCTGGC GCCTGTGCAG ATTAGAGTGA TTCCTGTTTC TGATAATTTC AATGAGTATG CTAAAAATGT CGCAAGGATA TTAAAAGAAA ATGGGTTTAG GGTAGAAGAA GATTACAGGT CAGAAACAGT AGGGTACAAG ATAAGAGATG CTCAGCTTCA AAAGATACCG TATATGGTGA TTGTAGGTGA AAAGGAGCAA AAAGAGAATA CTGTGGCAGT AAGAGATAGG AAAAAAGGAG ATTTGGGTTC GTTTACCATT GAAGATTTTA TTGCAATGGT AAAAGAAAAA GTAGACAAAA AAGTGATAGA GTAA
|
Protein sequence | MDKIAVTLPD GKVLEAEKGI SGMEFIKTIS MRLYKEAVAC KIDGVLKDLW TSLERDCSFE VVTFSSDEGK KVYWHTTSHI LAQAVKRLFG DKVKLGIGPA IDNGFYYDFD IEESITREIL EKIEEEMQKI IKEDLKIERF ELSREEAIKL MQQRGENYKV ELINDIPEGE IISFYRQGEF VDLCTGPHLP STGRVKAFKL LSVAGAYWRG NSKNKMLQRI YGISFEKKSE LDEYLKKLEE AKKRDHNRIG RELEIFTTSE VVGQGLPLLM PKGAKILQIL QRFVEDEEEK RGYLLTKTPY MAKSDLYKIS GHWDHYRDGM FVIEEDENEV LALRPMTCPF QFLIYNSKQR SYRDLPIRYS ETSTLFRNES SGEMHGLIRV RQFTLSDAHI ICRPDQVEEE FKGVLDLIQY IMKILGIEND IWYRFSRWDP NNKEKYIDNP EAWEKTENDM KNILDKLGIN YKEAKGEAAF YGPKLDIQFK NVYGKEDTII TIQIDFALAE RFDMTYVDRD GQKKRPIIIH RSSIGCYERT LAMLIEKYNG AFPLWLAPVQ IRVIPVSDNF NEYAKNVARI LKENGFRVEE DYRSETVGYK IRDAQLQKIP YMVIVGEKEQ KENTVAVRDR KKGDLGSFTI EDFIAMVKEK VDKKVIE
|
| |