Gene Cthe_1228 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_1228 
SymbolthrS 
ID4809920 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp1468610 
End bp1470517 
Gene Length1908 bp 
Protein Length635 aa 
Translation table11 
GC content43% 
IMG OID640106651 
Productthreonyl-tRNA synthetase 
Protein accessionYP_001037653 
Protein GI125973743 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0441] Threonyl-tRNA synthetase 
TIGRFAM ID[TIGR00418] threonyl-tRNA synthetase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000338672 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATTAAAA TAACACTTAA GGATGGAAGT GTTAGAGAAT ACAATGAAGG GATTACAATC 
AAAGAAGTGG CGGAAAGTAT AAGTGCGGGG CTTGCGAGGG TTGCCCTTGC AGGCGAGGTG
AACGGTGAGG TAAAAGATTT GAGTTACCCG CTTGAAAATG ATTGCACTCT TAACTTGTTG
ACTTTTGATG ATGAAGGCGG AAGAGACGCC TACAGGCATA CCACATCCCA TATACTGGCA
CAGGCCGTCA AGAGGCTGTA TCCCGATGCA AAGCTGGCAA TAGGGCCTGC AATAGAGAAC
GGTTTTTATT ATGACTTTGA TGTGGAAAAG CCTTTTTCGG TGGAAGATCT GGAAAAGATC
GAAGAGGAAA TGAAGAAAAT CATAAAAGAG GACTACAAGC TTGAAAGATT TACTCTTCCA
AGGGAAGAAG CCATAAAATT TATGGAAGAA AGAAACGAAC CGTACAAAGT GGAATTGATT
CGGGATTTGC CGGAAGGTGA AACAATATCT TTTTACAAAC AGGGTGATTT TGTTGATTTG
TGTGCCGGAC CACACATAGA ATCCACAGGA AAGGTAAAGG CTTTTAAGCT GATGTCGGTT
GCCGGAGCAT ACTGGAGAGG CAACGAAAAG AATAAGATGC TCCAGAGAAT TTACGGTACT
TCCTTTACCA AGAAGAGTGA CCTGGATGCG TATATTACAA GAATTGAGGA AGCAAAGAAG
AGAGATCACA GAAAGCTTGG AAGGGAGCTG GATCTCTTTG ATATTTATGA AGAAGGTCCC
GGTTTTCCGT TCTTTATGCC AAAGGGAATG GTTTTAAGAA ACGTTTTGGA GGAGTATTGG
AGAGAAGAAC ACAGAAAAGC CGGATATCAG GAAATAAAGA CTCCGATAAT TTTGAACGAA
GAACTGTGGC ACCGCTCCGG GCACTGGGAT CATTACAAGG AGAATATGTA TTTTACGAAA
ATAGATGAGG CCGACTTTGC CATCAAGCCG ATGAACTGTC CCGGAGGCAT GCTGGTATAC
AAGAGAAAGC TTCATTCCTA CAGGGATTTG CCTCAGAGGC TGGCGGAGCT TGGTCTGGTG
CACAGGCACG AGCTTTCCGG AGTTTTGCAC GGCTTAATGA GGGTTAGATG CTTTACCCAG
GACGATGCCC ACATATTTAT GACTCCGGAT CAAATTGAAA GCGAAATTCT TGGTGTTATA
AGTCTGATTG ATGACTTCTA TAAAGTTTTC GGTTTCAAAT ATCATGTGGA ACTGTCCACG
AGACCGGAAA ATTCCATGGG TTCGGATGAA GACTGGGAAA GGGCTACAAA TGCACTGAAG
AACGCTCTTG AAAAGAAAGG AATAGATTAT AAGATAAATG AAGGAGACGG AGCATTCTAC
GGACCTAAGA TAGATTTCCA CCTTGAAGAC TCCATTGGAC GTACCTGGCA GTGCGGAACA
ATACAGCTCG ATTTCCAGAT GCCGGAGAGA TTTGACTTGA CGTATATAGG TCCTGACGGA
GAAAAGCACA GACCTGTTAT GATTCATAGG GTTGTGTTCG GCAGCATAGA AAGGTTCATC
GCCATTTTGA CCGAGCATTA TGCGGGAGCT TTCCCTGTAT GGCTTTCACC GGTGCAGGTA
AAGATACTGC CCATACTTGA AAAACAGCAT GACTATGTTG CTGAAGTTAA AAAAGCGCTG
GAAGAGAAAG GCGTCAGAGT GGAAGCTGAT TTGAGGAATG AAAAGATTGG CTACAAAATC
AGGGAGGCGC AACTTGAAAA AGTGCCTTAC ATGCTTGTAA TTGGTGACAA AGAGATGGAG
AACAGAACTG TTGCGGTAAG ATCAAGAAAA GACGGAGATT TGGGGCCCAT GCGTCTTGAA
GATTTTGTAA ACAGAATTGT TGAAGCAATC AAGAATAAAG AAAATTAA
 
Protein sequence
MIKITLKDGS VREYNEGITI KEVAESISAG LARVALAGEV NGEVKDLSYP LENDCTLNLL 
TFDDEGGRDA YRHTTSHILA QAVKRLYPDA KLAIGPAIEN GFYYDFDVEK PFSVEDLEKI
EEEMKKIIKE DYKLERFTLP REEAIKFMEE RNEPYKVELI RDLPEGETIS FYKQGDFVDL
CAGPHIESTG KVKAFKLMSV AGAYWRGNEK NKMLQRIYGT SFTKKSDLDA YITRIEEAKK
RDHRKLGREL DLFDIYEEGP GFPFFMPKGM VLRNVLEEYW REEHRKAGYQ EIKTPIILNE
ELWHRSGHWD HYKENMYFTK IDEADFAIKP MNCPGGMLVY KRKLHSYRDL PQRLAELGLV
HRHELSGVLH GLMRVRCFTQ DDAHIFMTPD QIESEILGVI SLIDDFYKVF GFKYHVELST
RPENSMGSDE DWERATNALK NALEKKGIDY KINEGDGAFY GPKIDFHLED SIGRTWQCGT
IQLDFQMPER FDLTYIGPDG EKHRPVMIHR VVFGSIERFI AILTEHYAGA FPVWLSPVQV
KILPILEKQH DYVAEVKKAL EEKGVRVEAD LRNEKIGYKI REAQLEKVPY MLVIGDKEME
NRTVAVRSRK DGDLGPMRLE DFVNRIVEAI KNKEN