Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_0787 |
Symbol | ileS |
ID | 4810405 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 950810 |
End bp | 953605 |
Gene Length | 2796 bp |
Protein Length | 931 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 640106204 |
Product | isoleucyl-tRNA synthetase |
Protein accession | YP_001037215 |
Protein GI | 125973305 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0060] Isoleucyl-tRNA synthetase |
TIGRFAM ID | [TIGR00392] isoleucyl-tRNA synthetase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.0590417 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCAGAGG ATTACGGAAA AACTTTAAAT CTACCCAATA CTGATTTTCC TATGAGGGCC AATCTTCCTC AAAGAGAGCC GGAGTTTTTG AAAAAGTGGG AAGAGATGGA TATTTACCAC AAGCAACTTA AGAAAAATGC CGGAAAGCCT AAGTTTGTAC TCCATGACGG TCCTCCTTAC GCAAACGGCG GAATTCATCT GGGAACTGCT TTAAATAAAA TATTAAAGGA TATTATTATA AAGTATTACA GTATGAAGGG CTATGAAACA CCTTATGTTC CTGGATGGGA TACCCATGGA CTTCCGATTG AGCAGCGGGC TATAAAAGAG TTTGGGCTTA AAAGACATGA GGTAAGTCCT GTGGAATTCA GGCAGGCATG CAAGAAATTT GCCTTAAAGT ACCTGGATGT ACAAAGGGAA GCCTTTAAAA GACTTGGTGT AAGAGCGGAC TGGGATAATC CTTACATTAC ATTGAACAAG GAGTTTGAAG CAAAGCAGAT AGAAGTATTC GGTGAAATGG CCAAAAAAGG TTATATATAC AAGGGATTAA GGCCTGTGTA CTGGTGTCCC GAATGTGAGA CTGCCCTTGC GGAAGCTGAA ATTGAGTATG CGGAAGACAC CACACTTTCT ATATACGTTA AGTTTGAGGT AAAAGACGAC AAGGGCAAGT TTGGCGGTCT GGTGGATAGT TTAAAAAATA CATATTTTGT AATATGGACC ACCACCACCT GGACACTTCC GGGCAACCTT GCAATCTGCT TAAACGGAGA ATTTGAGTAT GGATTGGTGA AGGCAAACGG CGAGTACTAT ATAATTGCGC TGGAGCTTCT TGAAAATGTT ATGAAGGTTG CAGGGATAGA AGAATACGAG GTTAAGGCGA AATTTACGGG AGCGGAACTG GAAGGCATGC TTTGCAGGCA TCCGTTCCTT GACAGGGATT CTGTCATTAT ATTGGGAGAC CATGTTACTG CAGAAGCAGG TACGGGATGT GTTCATACCG CTCCGGGACA TGGTGCGGAA GACTTTGAAG TATGTAAAAA GTATGACATA CCTGTGATAG TGCCGGTGGA TGACAAAGGT TATCTTACGT CCGAAGCAGG ACAGTTTGCA GGTTTGTATT ATGAGAAGTC AAATGCTGCA ATTATTGAGC AGCTGAAATC CACCAACAAT CTTTTGGCAT CGGAAAAGAT TGTTCACCAG TATCCTCACT GTTGGAGATG CAAGGATCCG ATAATATTCA GGGCTACAGA GCAGTGGTTT GCGTCGATAG AAGGCTTCAG AAAGGAAGCA ATTGATGCCA TTTCAACTGT AAAGTGGATT CCTGAATGGG GTCAGGAAAG AATTACAAAC ATGGTAAGAG AACGCGGTGA CTGGTGTATC TCAAGGCAAA GGATATGGGG AGTGCCTATT CCGATATTCT ACTGCAAGGA TTGCAAGAAA GAACTTATAA ATGATGACAC GATTAAAGCT GTGGCCAAGC TGTTTGCGGA AAAGGGCTCC GATGCATGGT ATGAGTATGA TGCAAAGGAT ATCCTGCCCG AAGGTACGAA ATGTGAATGC GGATGCAAAG AATTTACCAA GGAAAAGGAC ATCATGGACG TATGGTTCGA TTCCGGTTCC AGCCATGCTG CCGTTCTTGA AACAACGGAA GGCTTGACAT GGCCTGCGGA CATGTATCTG GAAGGAAGCG ACCAGCACAG GGGATGGTTC CAGTCATCGC TCCTTACGGC GGTTGCCACA AAAGGCCAGG CTCCTTACAA GACAGTGCTG ACCCACGGCT ATGTGGTTGA CGGTGAAGGA AGAAAAATGT CCAAATCCCT GGGCAACGGA ATAGACCCTG AAGATGTGAT AAAAGAATAC GGAGCGGATA TATTAAGATT ATGGGTTGCA TCTTCGGACT ACAGGACGGA CATCAGGATA TCCAAGGATA TATTAAAGCA GCTTTCTGAA GTATACAGAA AGATTAGAAA TACTTGCAGA TATATTTTGG GCAATATATA TGATTTTGAC CCGAATAAAG ATATGGTAAG CTATGATGAG ATGAACGAGC TTGATAAATG GGCACTCATG AAGCTCAACG GGCTTATTAA GAAAGTTAAC GATGCCTATG AAAAATATGA GTTCCATTTG ATGTTCCATG CAATACACAA TTTCTGTGTT GTTGATATGA GCAACTTCTA TCTTGATATT ATAAAAGACA GACTTTACAC CAGCAGGGCT GACTCGAAAG AAAGAAGATC GGCGCAGACT GCAATGTATG AAATATTGGA GGCTCTTGTG AAAATGCTGG CTCCGGTACT GGCGTTTACC AGCGAGGAAG TGTGGCAGTT TATGCCTCAC AGAAGCACCA ACGACCCTGA AAGTGTGCAG CTCAACTATT GGCCTGAACC GAATGAGAAG TATGAAAATG CGGCGCTTAA GGAAAAGTGG GACAGGATAA TTGAAATCCG TGACGTGGTA TCTAAGGCTC TTGAAATTGC AAGAACGGAA AAAATAATAG GGCATTCTTT GAATGCAAAG GTTACAATCT TTGCGGACAA GGAAAACTAT GACTTTATAG AGCCTATTAA GAAAGACCTT GTTACAGTTT TCATAGTTTC AGACTTTGAG CTTAAGGGTT ATGATGAAGC TTCGGACGGT AAGTACTATG AAGATCCGGA TGTTAAGGGA ATAAGAGTTA ATATTTCCAT GGCTTCGGGA TCTAAATGCG AAAGATGCTG GATGATTAGC GAATCAGTTG GAAAGAATGA AAAACATCCC ACTCTTTGCG ACAGATGTGC AGAAGTAGTT GGCTAA
|
Protein sequence | MAEDYGKTLN LPNTDFPMRA NLPQREPEFL KKWEEMDIYH KQLKKNAGKP KFVLHDGPPY ANGGIHLGTA LNKILKDIII KYYSMKGYET PYVPGWDTHG LPIEQRAIKE FGLKRHEVSP VEFRQACKKF ALKYLDVQRE AFKRLGVRAD WDNPYITLNK EFEAKQIEVF GEMAKKGYIY KGLRPVYWCP ECETALAEAE IEYAEDTTLS IYVKFEVKDD KGKFGGLVDS LKNTYFVIWT TTTWTLPGNL AICLNGEFEY GLVKANGEYY IIALELLENV MKVAGIEEYE VKAKFTGAEL EGMLCRHPFL DRDSVIILGD HVTAEAGTGC VHTAPGHGAE DFEVCKKYDI PVIVPVDDKG YLTSEAGQFA GLYYEKSNAA IIEQLKSTNN LLASEKIVHQ YPHCWRCKDP IIFRATEQWF ASIEGFRKEA IDAISTVKWI PEWGQERITN MVRERGDWCI SRQRIWGVPI PIFYCKDCKK ELINDDTIKA VAKLFAEKGS DAWYEYDAKD ILPEGTKCEC GCKEFTKEKD IMDVWFDSGS SHAAVLETTE GLTWPADMYL EGSDQHRGWF QSSLLTAVAT KGQAPYKTVL THGYVVDGEG RKMSKSLGNG IDPEDVIKEY GADILRLWVA SSDYRTDIRI SKDILKQLSE VYRKIRNTCR YILGNIYDFD PNKDMVSYDE MNELDKWALM KLNGLIKKVN DAYEKYEFHL MFHAIHNFCV VDMSNFYLDI IKDRLYTSRA DSKERRSAQT AMYEILEALV KMLAPVLAFT SEEVWQFMPH RSTNDPESVQ LNYWPEPNEK YENAALKEKW DRIIEIRDVV SKALEIARTE KIIGHSLNAK VTIFADKENY DFIEPIKKDL VTVFIVSDFE LKGYDEASDG KYYEDPDVKG IRVNISMASG SKCERCWMIS ESVGKNEKHP TLCDRCAEVV G
|
| |