Gene Information |
Locus tag | Moth_1759 |
Symbol | thrS |
ID | 3831049 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | - |
Start bp | 1812365 |
End bp | 1814266 |
Gene Length | 1902 bp |
Protein Length | 633 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 637829683 |
Product | threonyl-tRNA synthetase |
Protein accession | YP_430603 |
Protein GI | 83590594 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0441] Threonyl-tRNA synthetase |
TIGRFAM ID | [TIGR00418] threonyl-tRNA synthetase |
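
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single untranslated stop codon. Neither assumption is stated in the record, and the variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 1812365
end_bp = 1814266

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not translated.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)
```

Under these assumptions the computed values agree with the Gene Length (1902 bp) and Protein Length (633 aa) fields, and the remainder of zero indicates a whole number of codons.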
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.0000000991077 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGCATAA CTCTACCTGA CGGTACAGTA AAGGAGTATG CCCCCGGCAC CACAGCCCTG CAGGTTGCCA GGGATATTTC CCCCAGGTTG GCCCGGGAGG CCCTGGCGGC GCGGGTCAAT GGAGAGGTCT GGGATCTGAC CCGTCCCCTG CCGGAGGAGT GCCAGCTGGA ACTCCTGACC TTTGCCGACG AAGGCGGGCG TCTCGCCTAC CGCCACACGG CGGCCCACGT TCTGGCCCAG GCTGTCAAGC ACCTTTTCCC GGGTACTGAA CTGGGCATCG GGCCGGCTAT AACCGACGGG TTTTATTACG ACTTTGACAG TGAGCATAAA TTCACCCCTG AAGACCTGAC GGCCCTTGAG GCTGAGATGC AGAGGATTAT CAAGGCCGAC CTGCCCCTGG AACGGCGGGA GGTGAGCCGG GAGGAAGCCC TGGAGCTATT CCGGCGGCTG GGCGAGCCTT ATAAGGTTGA GCTTATTAAC GACCTGCCGG AGGGGGTGCC CATCAGCACC TACCGCCAGG GTGACTTCAT CGACCTTTGC GCCGGCCCCC ACCTCCCCAG CACCGGTTAC CTGAAGGCTG TAAAGCTTAC CAGCCTGGCC GGGGCCTACT GGCGTGGCAG CGAAAGAAAT CCCATGCTCC AACGCATTTA CGGTACCGCC TTCCCTAAAG CAAAAGACCT GGAGGAGTAC CTGCACCGTC TGGAAGAGGC CCGCAAGAGG GACCACCGCC GCCTGGGAGC CCAGCTGGGG ATCTTCAGCC TCCATGAGGA AGGTCCAGGC TTTCCTTTCT TCCATCACAA GGGTATGATT ATCCGTAACG AACTGGAACA GTTCTGGCGG GAGGAGCACC GCCGTGCCGG CTACCTGGAG ATTCGCACCC CGGTTATTTT AAGCCGTACC CTCTGGGAGC AGTCGGGTCA CTGGGACCAC TACCGGGAGA ATATGTACTT CACCAAAATT GACGGGGCTG ACTATGCCAT CAAGCCCATG AACTGTCCCG GCGCCATCCT GGTTTATAAA ACCGAGCAGC ACAGCTACCG CGACCTGCCC CTGCGCCTGG CAGAGCTGGG GCTGGTCCAT CGCCACGAGA AATCGGGTGT CCTTCACGGC CTGATGCGGG TACGGGCCTT CACCCAGGAC GATTCCCACA TCTTTATGCT GCCCTCCCAG ATTGCCAGCG AGATCCAGGG AGTTATCGAC CTGGTGGACC GTTTCTACAA TCTTTTCGGC TTCAAGTACC ACGTAGAGCT TTCCACCCGG CCGGACAATG CCATGGGCTC GGAGGAAATA TGGGAAACGG CTACCAGCGC CCTGCGCCAG GCCCTGGAGG CCAAGGGCAT GCCTTACGCC GTCAATGAAG GCGATGGCGC CTTCTACGGC CCCAAGATTG ATTTCCATCT GGAGGATTCC CTGGGCCGTA CCTGGCAGTG CGGCACCATC CAGCTGGACT TCCTCATGCC CGAAAAGTTC GATCTGACCT ACATCGGTGA AGACGGCCAA AAGCATCGGC CGGTTATGAT CCACCGGGTG GTCTTCGGTA GCATTGAGCG CTTTATCGGC ATCCTCATTG AACACTACGG CGGTTCCTTC CCGGTCTGGC TGGCGCCGGT ACAGGTGCGG GTGCTACCCA TTACCGACCG CCACAACGAT TACGCCTTTA AAGTCAGGGC GGAACTGATC CGGGCCGGCA TCCGGGCGGA GGTGAACGAC CGCAACGACA AAATCGGCTA CAAGATCCGG GCCGCCCAGA TGGAGCATAT ACCCTATATG CTGGTAGTGG GGGATAAGGA AGCGGCCGAA 
GGCACCGTGG CCGTGCGGGA ACGGCAGGCC GGGGATACCG GCAGGGTACC CCTGGCAGAG TTTATTGCCA GGGTCACCAG GGAGATAAGC AGGCGGGAAT AA
|
Protein sequence | MRITLPDGTV KEYAPGTTAL QVARDISPRL AREALAARVN GEVWDLTRPL PEECQLELLT FADEGGRLAY RHTAAHVLAQ AVKHLFPGTE LGIGPAITDG FYYDFDSEHK FTPEDLTALE AEMQRIIKAD LPLERREVSR EEALELFRRL GEPYKVELIN DLPEGVPIST YRQGDFIDLC AGPHLPSTGY LKAVKLTSLA GAYWRGSERN PMLQRIYGTA FPKAKDLEEY LHRLEEARKR DHRRLGAQLG IFSLHEEGPG FPFFHHKGMI IRNELEQFWR EEHRRAGYLE IRTPVILSRT LWEQSGHWDH YRENMYFTKI DGADYAIKPM NCPGAILVYK TEQHSYRDLP LRLAELGLVH RHEKSGVLHG LMRVRAFTQD DSHIFMLPSQ IASEIQGVID LVDRFYNLFG FKYHVELSTR PDNAMGSEEI WETATSALRQ ALEAKGMPYA VNEGDGAFYG PKIDFHLEDS LGRTWQCGTI QLDFLMPEKF DLTYIGEDGQ KHRPVMIHRV VFGSIERFIG ILIEHYGGSF PVWLAPVQVR VLPITDRHND YAFKVRAELI RAGIRAEVND RNDKIGYKIR AAQMEHIPYM LVVGDKEAAE GTVAVRERQA GDTGRVPLAE FIARVTREIS RRE
|
| |