Gene Cthe_0787 details


Gene Information

Locus tag: Cthe_0787
Symbol: ileS
ID: 4810405
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 950810
End bp: 953605
Gene Length: 2796 bp
Protein Length: 931 aa
Translation table: 11
GC content: 43%
IMG OID: 640106204
Product: isoleucyl-tRNA synthetase
Protein accession: YP_001037215
Protein GI: 125973305
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0060] Isoleucyl-tRNA synthetase
TIGRFAM ID: [TIGR00392] isoleucyl-tRNA synthetase
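
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, that the gene is unspliced, and that the reported gene length includes the stop codon. The function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length(950810, 953605)
    print(length, protein_length(length))
```

Under these assumptions, the values for Cthe_0787 agree with the record: 953605 - 950810 + 1 = 2796 bp, and 2796 / 3 - 1 = 931 aa.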


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0590417
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a

Sequence

Gene sequence
ATGGCAGAGG ATTACGGAAA AACTTTAAAT CTACCCAATA CTGATTTTCC TATGAGGGCC 
AATCTTCCTC AAAGAGAGCC GGAGTTTTTG AAAAAGTGGG AAGAGATGGA TATTTACCAC
AAGCAACTTA AGAAAAATGC CGGAAAGCCT AAGTTTGTAC TCCATGACGG TCCTCCTTAC
GCAAACGGCG GAATTCATCT GGGAACTGCT TTAAATAAAA TATTAAAGGA TATTATTATA
AAGTATTACA GTATGAAGGG CTATGAAACA CCTTATGTTC CTGGATGGGA TACCCATGGA
CTTCCGATTG AGCAGCGGGC TATAAAAGAG TTTGGGCTTA AAAGACATGA GGTAAGTCCT
GTGGAATTCA GGCAGGCATG CAAGAAATTT GCCTTAAAGT ACCTGGATGT ACAAAGGGAA
GCCTTTAAAA GACTTGGTGT AAGAGCGGAC TGGGATAATC CTTACATTAC ATTGAACAAG
GAGTTTGAAG CAAAGCAGAT AGAAGTATTC GGTGAAATGG CCAAAAAAGG TTATATATAC
AAGGGATTAA GGCCTGTGTA CTGGTGTCCC GAATGTGAGA CTGCCCTTGC GGAAGCTGAA
ATTGAGTATG CGGAAGACAC CACACTTTCT ATATACGTTA AGTTTGAGGT AAAAGACGAC
AAGGGCAAGT TTGGCGGTCT GGTGGATAGT TTAAAAAATA CATATTTTGT AATATGGACC
ACCACCACCT GGACACTTCC GGGCAACCTT GCAATCTGCT TAAACGGAGA ATTTGAGTAT
GGATTGGTGA AGGCAAACGG CGAGTACTAT ATAATTGCGC TGGAGCTTCT TGAAAATGTT
ATGAAGGTTG CAGGGATAGA AGAATACGAG GTTAAGGCGA AATTTACGGG AGCGGAACTG
GAAGGCATGC TTTGCAGGCA TCCGTTCCTT GACAGGGATT CTGTCATTAT ATTGGGAGAC
CATGTTACTG CAGAAGCAGG TACGGGATGT GTTCATACCG CTCCGGGACA TGGTGCGGAA
GACTTTGAAG TATGTAAAAA GTATGACATA CCTGTGATAG TGCCGGTGGA TGACAAAGGT
TATCTTACGT CCGAAGCAGG ACAGTTTGCA GGTTTGTATT ATGAGAAGTC AAATGCTGCA
ATTATTGAGC AGCTGAAATC CACCAACAAT CTTTTGGCAT CGGAAAAGAT TGTTCACCAG
TATCCTCACT GTTGGAGATG CAAGGATCCG ATAATATTCA GGGCTACAGA GCAGTGGTTT
GCGTCGATAG AAGGCTTCAG AAAGGAAGCA ATTGATGCCA TTTCAACTGT AAAGTGGATT
CCTGAATGGG GTCAGGAAAG AATTACAAAC ATGGTAAGAG AACGCGGTGA CTGGTGTATC
TCAAGGCAAA GGATATGGGG AGTGCCTATT CCGATATTCT ACTGCAAGGA TTGCAAGAAA
GAACTTATAA ATGATGACAC GATTAAAGCT GTGGCCAAGC TGTTTGCGGA AAAGGGCTCC
GATGCATGGT ATGAGTATGA TGCAAAGGAT ATCCTGCCCG AAGGTACGAA ATGTGAATGC
GGATGCAAAG AATTTACCAA GGAAAAGGAC ATCATGGACG TATGGTTCGA TTCCGGTTCC
AGCCATGCTG CCGTTCTTGA AACAACGGAA GGCTTGACAT GGCCTGCGGA CATGTATCTG
GAAGGAAGCG ACCAGCACAG GGGATGGTTC CAGTCATCGC TCCTTACGGC GGTTGCCACA
AAAGGCCAGG CTCCTTACAA GACAGTGCTG ACCCACGGCT ATGTGGTTGA CGGTGAAGGA
AGAAAAATGT CCAAATCCCT GGGCAACGGA ATAGACCCTG AAGATGTGAT AAAAGAATAC
GGAGCGGATA TATTAAGATT ATGGGTTGCA TCTTCGGACT ACAGGACGGA CATCAGGATA
TCCAAGGATA TATTAAAGCA GCTTTCTGAA GTATACAGAA AGATTAGAAA TACTTGCAGA
TATATTTTGG GCAATATATA TGATTTTGAC CCGAATAAAG ATATGGTAAG CTATGATGAG
ATGAACGAGC TTGATAAATG GGCACTCATG AAGCTCAACG GGCTTATTAA GAAAGTTAAC
GATGCCTATG AAAAATATGA GTTCCATTTG ATGTTCCATG CAATACACAA TTTCTGTGTT
GTTGATATGA GCAACTTCTA TCTTGATATT ATAAAAGACA GACTTTACAC CAGCAGGGCT
GACTCGAAAG AAAGAAGATC GGCGCAGACT GCAATGTATG AAATATTGGA GGCTCTTGTG
AAAATGCTGG CTCCGGTACT GGCGTTTACC AGCGAGGAAG TGTGGCAGTT TATGCCTCAC
AGAAGCACCA ACGACCCTGA AAGTGTGCAG CTCAACTATT GGCCTGAACC GAATGAGAAG
TATGAAAATG CGGCGCTTAA GGAAAAGTGG GACAGGATAA TTGAAATCCG TGACGTGGTA
TCTAAGGCTC TTGAAATTGC AAGAACGGAA AAAATAATAG GGCATTCTTT GAATGCAAAG
GTTACAATCT TTGCGGACAA GGAAAACTAT GACTTTATAG AGCCTATTAA GAAAGACCTT
GTTACAGTTT TCATAGTTTC AGACTTTGAG CTTAAGGGTT ATGATGAAGC TTCGGACGGT
AAGTACTATG AAGATCCGGA TGTTAAGGGA ATAAGAGTTA ATATTTCCAT GGCTTCGGGA
TCTAAATGCG AAAGATGCTG GATGATTAGC GAATCAGTTG GAAAGAATGA AAAACATCCC
ACTCTTTGCG ACAGATGTGC AGAAGTAGTT GGCTAA
 
Protein sequence
MAEDYGKTLN LPNTDFPMRA NLPQREPEFL KKWEEMDIYH KQLKKNAGKP KFVLHDGPPY 
ANGGIHLGTA LNKILKDIII KYYSMKGYET PYVPGWDTHG LPIEQRAIKE FGLKRHEVSP
VEFRQACKKF ALKYLDVQRE AFKRLGVRAD WDNPYITLNK EFEAKQIEVF GEMAKKGYIY
KGLRPVYWCP ECETALAEAE IEYAEDTTLS IYVKFEVKDD KGKFGGLVDS LKNTYFVIWT
TTTWTLPGNL AICLNGEFEY GLVKANGEYY IIALELLENV MKVAGIEEYE VKAKFTGAEL
EGMLCRHPFL DRDSVIILGD HVTAEAGTGC VHTAPGHGAE DFEVCKKYDI PVIVPVDDKG
YLTSEAGQFA GLYYEKSNAA IIEQLKSTNN LLASEKIVHQ YPHCWRCKDP IIFRATEQWF
ASIEGFRKEA IDAISTVKWI PEWGQERITN MVRERGDWCI SRQRIWGVPI PIFYCKDCKK
ELINDDTIKA VAKLFAEKGS DAWYEYDAKD ILPEGTKCEC GCKEFTKEKD IMDVWFDSGS
SHAAVLETTE GLTWPADMYL EGSDQHRGWF QSSLLTAVAT KGQAPYKTVL THGYVVDGEG
RKMSKSLGNG IDPEDVIKEY GADILRLWVA SSDYRTDIRI SKDILKQLSE VYRKIRNTCR
YILGNIYDFD PNKDMVSYDE MNELDKWALM KLNGLIKKVN DAYEKYEFHL MFHAIHNFCV
VDMSNFYLDI IKDRLYTSRA DSKERRSAQT AMYEILEALV KMLAPVLAFT SEEVWQFMPH
RSTNDPESVQ LNYWPEPNEK YENAALKEKW DRIIEIRDVV SKALEIARTE KIIGHSLNAK
VTIFADKENY DFIEPIKKDL VTVFIVSDFE LKGYDEASDG KYYEDPDVKG IRVNISMASG
SKCERCWMIS ESVGKNEKHP TLCDRCAEVV G