Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_1228 |
Symbol | thrS |
ID | 4809920 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 1468610 |
End bp | 1470517 |
Gene Length | 1908 bp |
Protein Length | 635 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 640106651 |
Product | threonyl-tRNA synthetase |
Protein accession | YP_001037653 |
Protein GI | 125973743 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0441] Threonyl-tRNA synthetase |
TIGRFAM ID | [TIGR00418] threonyl-tRNA synthetase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.000338672 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATTAAAA TAACACTTAA GGATGGAAGT GTTAGAGAAT ACAATGAAGG GATTACAATC AAAGAAGTGG CGGAAAGTAT AAGTGCGGGG CTTGCGAGGG TTGCCCTTGC AGGCGAGGTG AACGGTGAGG TAAAAGATTT GAGTTACCCG CTTGAAAATG ATTGCACTCT TAACTTGTTG ACTTTTGATG ATGAAGGCGG AAGAGACGCC TACAGGCATA CCACATCCCA TATACTGGCA CAGGCCGTCA AGAGGCTGTA TCCCGATGCA AAGCTGGCAA TAGGGCCTGC AATAGAGAAC GGTTTTTATT ATGACTTTGA TGTGGAAAAG CCTTTTTCGG TGGAAGATCT GGAAAAGATC GAAGAGGAAA TGAAGAAAAT CATAAAAGAG GACTACAAGC TTGAAAGATT TACTCTTCCA AGGGAAGAAG CCATAAAATT TATGGAAGAA AGAAACGAAC CGTACAAAGT GGAATTGATT CGGGATTTGC CGGAAGGTGA AACAATATCT TTTTACAAAC AGGGTGATTT TGTTGATTTG TGTGCCGGAC CACACATAGA ATCCACAGGA AAGGTAAAGG CTTTTAAGCT GATGTCGGTT GCCGGAGCAT ACTGGAGAGG CAACGAAAAG AATAAGATGC TCCAGAGAAT TTACGGTACT TCCTTTACCA AGAAGAGTGA CCTGGATGCG TATATTACAA GAATTGAGGA AGCAAAGAAG AGAGATCACA GAAAGCTTGG AAGGGAGCTG GATCTCTTTG ATATTTATGA AGAAGGTCCC GGTTTTCCGT TCTTTATGCC AAAGGGAATG GTTTTAAGAA ACGTTTTGGA GGAGTATTGG AGAGAAGAAC ACAGAAAAGC CGGATATCAG GAAATAAAGA CTCCGATAAT TTTGAACGAA GAACTGTGGC ACCGCTCCGG GCACTGGGAT CATTACAAGG AGAATATGTA TTTTACGAAA ATAGATGAGG CCGACTTTGC CATCAAGCCG ATGAACTGTC CCGGAGGCAT GCTGGTATAC AAGAGAAAGC TTCATTCCTA CAGGGATTTG CCTCAGAGGC TGGCGGAGCT TGGTCTGGTG CACAGGCACG AGCTTTCCGG AGTTTTGCAC GGCTTAATGA GGGTTAGATG CTTTACCCAG GACGATGCCC ACATATTTAT GACTCCGGAT CAAATTGAAA GCGAAATTCT TGGTGTTATA AGTCTGATTG ATGACTTCTA TAAAGTTTTC GGTTTCAAAT ATCATGTGGA ACTGTCCACG AGACCGGAAA ATTCCATGGG TTCGGATGAA GACTGGGAAA GGGCTACAAA TGCACTGAAG AACGCTCTTG AAAAGAAAGG AATAGATTAT AAGATAAATG AAGGAGACGG AGCATTCTAC GGACCTAAGA TAGATTTCCA CCTTGAAGAC TCCATTGGAC GTACCTGGCA GTGCGGAACA ATACAGCTCG ATTTCCAGAT GCCGGAGAGA TTTGACTTGA CGTATATAGG TCCTGACGGA GAAAAGCACA GACCTGTTAT GATTCATAGG GTTGTGTTCG GCAGCATAGA AAGGTTCATC GCCATTTTGA CCGAGCATTA TGCGGGAGCT TTCCCTGTAT GGCTTTCACC GGTGCAGGTA AAGATACTGC CCATACTTGA AAAACAGCAT GACTATGTTG CTGAAGTTAA AAAAGCGCTG GAAGAGAAAG GCGTCAGAGT GGAAGCTGAT TTGAGGAATG AAAAGATTGG CTACAAAATC AGGGAGGCGC AACTTGAAAA AGTGCCTTAC ATGCTTGTAA TTGGTGACAA AGAGATGGAG AACAGAACTG TTGCGGTAAG ATCAAGAAAA GACGGAGATT TGGGGCCCAT GCGTCTTGAA GATTTTGTAA ACAGAATTGT TGAAGCAATC AAGAATAAAG AAAATTAA
|
Protein sequence | MIKITLKDGS VREYNEGITI KEVAESISAG LARVALAGEV NGEVKDLSYP LENDCTLNLL TFDDEGGRDA YRHTTSHILA QAVKRLYPDA KLAIGPAIEN GFYYDFDVEK PFSVEDLEKI EEEMKKIIKE DYKLERFTLP REEAIKFMEE RNEPYKVELI RDLPEGETIS FYKQGDFVDL CAGPHIESTG KVKAFKLMSV AGAYWRGNEK NKMLQRIYGT SFTKKSDLDA YITRIEEAKK RDHRKLGREL DLFDIYEEGP GFPFFMPKGM VLRNVLEEYW REEHRKAGYQ EIKTPIILNE ELWHRSGHWD HYKENMYFTK IDEADFAIKP MNCPGGMLVY KRKLHSYRDL PQRLAELGLV HRHELSGVLH GLMRVRCFTQ DDAHIFMTPD QIESEILGVI SLIDDFYKVF GFKYHVELST RPENSMGSDE DWERATNALK NALEKKGIDY KINEGDGAFY GPKIDFHLED SIGRTWQCGT IQLDFQMPER FDLTYIGPDG EKHRPVMIHR VVFGSIERFI AILTEHYAGA FPVWLSPVQV KILPILEKQH DYVAEVKKAL EEKGVRVEAD LRNEKIGYKI REAQLEKVPY MLVIGDKEME NRTVAVRSRK DGDLGPMRLE DFVNRIVEAI KNKEN
|
| |