Gene Moth_1759 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_1759 
SymbolthrS 
ID3831049 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp1812365 
End bp1814266 
Gene Length1902 bp 
Protein Length633 aa 
Translation table11 
GC content60% 
IMG OID637829683 
Productthreonyl-tRNA synthetase 
Protein accessionYP_430603 
Protein GI83590594 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0441] Threonyl-tRNA synthetase 
TIGRFAM ID[TIGR00418] threonyl-tRNA synthetase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000000991077 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCATAA CTCTACCTGA CGGTACAGTA AAGGAGTATG CCCCCGGCAC CACAGCCCTG 
CAGGTTGCCA GGGATATTTC CCCCAGGTTG GCCCGGGAGG CCCTGGCGGC GCGGGTCAAT
GGAGAGGTCT GGGATCTGAC CCGTCCCCTG CCGGAGGAGT GCCAGCTGGA ACTCCTGACC
TTTGCCGACG AAGGCGGGCG TCTCGCCTAC CGCCACACGG CGGCCCACGT TCTGGCCCAG
GCTGTCAAGC ACCTTTTCCC GGGTACTGAA CTGGGCATCG GGCCGGCTAT AACCGACGGG
TTTTATTACG ACTTTGACAG TGAGCATAAA TTCACCCCTG AAGACCTGAC GGCCCTTGAG
GCTGAGATGC AGAGGATTAT CAAGGCCGAC CTGCCCCTGG AACGGCGGGA GGTGAGCCGG
GAGGAAGCCC TGGAGCTATT CCGGCGGCTG GGCGAGCCTT ATAAGGTTGA GCTTATTAAC
GACCTGCCGG AGGGGGTGCC CATCAGCACC TACCGCCAGG GTGACTTCAT CGACCTTTGC
GCCGGCCCCC ACCTCCCCAG CACCGGTTAC CTGAAGGCTG TAAAGCTTAC CAGCCTGGCC
GGGGCCTACT GGCGTGGCAG CGAAAGAAAT CCCATGCTCC AACGCATTTA CGGTACCGCC
TTCCCTAAAG CAAAAGACCT GGAGGAGTAC CTGCACCGTC TGGAAGAGGC CCGCAAGAGG
GACCACCGCC GCCTGGGAGC CCAGCTGGGG ATCTTCAGCC TCCATGAGGA AGGTCCAGGC
TTTCCTTTCT TCCATCACAA GGGTATGATT ATCCGTAACG AACTGGAACA GTTCTGGCGG
GAGGAGCACC GCCGTGCCGG CTACCTGGAG ATTCGCACCC CGGTTATTTT AAGCCGTACC
CTCTGGGAGC AGTCGGGTCA CTGGGACCAC TACCGGGAGA ATATGTACTT CACCAAAATT
GACGGGGCTG ACTATGCCAT CAAGCCCATG AACTGTCCCG GCGCCATCCT GGTTTATAAA
ACCGAGCAGC ACAGCTACCG CGACCTGCCC CTGCGCCTGG CAGAGCTGGG GCTGGTCCAT
CGCCACGAGA AATCGGGTGT CCTTCACGGC CTGATGCGGG TACGGGCCTT CACCCAGGAC
GATTCCCACA TCTTTATGCT GCCCTCCCAG ATTGCCAGCG AGATCCAGGG AGTTATCGAC
CTGGTGGACC GTTTCTACAA TCTTTTCGGC TTCAAGTACC ACGTAGAGCT TTCCACCCGG
CCGGACAATG CCATGGGCTC GGAGGAAATA TGGGAAACGG CTACCAGCGC CCTGCGCCAG
GCCCTGGAGG CCAAGGGCAT GCCTTACGCC GTCAATGAAG GCGATGGCGC CTTCTACGGC
CCCAAGATTG ATTTCCATCT GGAGGATTCC CTGGGCCGTA CCTGGCAGTG CGGCACCATC
CAGCTGGACT TCCTCATGCC CGAAAAGTTC GATCTGACCT ACATCGGTGA AGACGGCCAA
AAGCATCGGC CGGTTATGAT CCACCGGGTG GTCTTCGGTA GCATTGAGCG CTTTATCGGC
ATCCTCATTG AACACTACGG CGGTTCCTTC CCGGTCTGGC TGGCGCCGGT ACAGGTGCGG
GTGCTACCCA TTACCGACCG CCACAACGAT TACGCCTTTA AAGTCAGGGC GGAACTGATC
CGGGCCGGCA TCCGGGCGGA GGTGAACGAC CGCAACGACA AAATCGGCTA CAAGATCCGG
GCCGCCCAGA TGGAGCATAT ACCCTATATG CTGGTAGTGG GGGATAAGGA AGCGGCCGAA
GGCACCGTGG CCGTGCGGGA ACGGCAGGCC GGGGATACCG GCAGGGTACC CCTGGCAGAG
TTTATTGCCA GGGTCACCAG GGAGATAAGC AGGCGGGAAT AA
 
Protein sequence
MRITLPDGTV KEYAPGTTAL QVARDISPRL AREALAARVN GEVWDLTRPL PEECQLELLT 
FADEGGRLAY RHTAAHVLAQ AVKHLFPGTE LGIGPAITDG FYYDFDSEHK FTPEDLTALE
AEMQRIIKAD LPLERREVSR EEALELFRRL GEPYKVELIN DLPEGVPIST YRQGDFIDLC
AGPHLPSTGY LKAVKLTSLA GAYWRGSERN PMLQRIYGTA FPKAKDLEEY LHRLEEARKR
DHRRLGAQLG IFSLHEEGPG FPFFHHKGMI IRNELEQFWR EEHRRAGYLE IRTPVILSRT
LWEQSGHWDH YRENMYFTKI DGADYAIKPM NCPGAILVYK TEQHSYRDLP LRLAELGLVH
RHEKSGVLHG LMRVRAFTQD DSHIFMLPSQ IASEIQGVID LVDRFYNLFG FKYHVELSTR
PDNAMGSEEI WETATSALRQ ALEAKGMPYA VNEGDGAFYG PKIDFHLEDS LGRTWQCGTI
QLDFLMPEKF DLTYIGEDGQ KHRPVMIHRV VFGSIERFIG ILIEHYGGSF PVWLAPVQVR
VLPITDRHND YAFKVRAELI RAGIRAEVND RNDKIGYKIR AAQMEHIPYM LVVGDKEAAE
GTVAVRERQA GDTGRVPLAE FIARVTREIS RRE