Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_0324 |
Symbol | valS |
ID | 4808470 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 407909 |
End bp | 410563 |
Gene Length | 2655 bp |
Protein Length | 884 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 640105735 |
Product | valyl-tRNA synthetase |
Protein accession | YP_001036755 |
Protein GI | 125972845 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0525] Valyl-tRNA synthetase |
TIGRFAM ID | [TIGR00422] valyl-tRNA synthetase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.0384006 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCGAAA AAAATATAGC GAAAACATAT GACCCCAAAC AGGTTGAAGA CAGATTATAT GCGGAGTGGA TGGAGAAAGG CTATTTTCAT GCTGAAATAG ACAAAGAAAA AACGCCTTTC ACCATTGTTA TTCCACCGCC AAACATTACA GGTCAGCTTC ATATGGGACA TGCTTTGGAT GAAACTCTTC AGGATATCCT TATAAGATGG AAGAGGATGC AGGGGTATTG CACTCTTTGG CTGCCCGGTA CTGACCATGC CAGCATAGCC ACAGAGGCAA AAATTGTTGA AGCCATGGCC AAAGAAGGTA TTACCAAAGA GGATATCGGA AGAGAAAAGT TTTTAGAGAG AGCCTGGGAA TGGAAGAGGC ATTACGGCGG AAGAATCGTC GAGCAGCTTA AAAAGCTTGG ATGCTCCTGT GACTGGCAGA GGGAAAGATT CACCATGGAT GAAGGACTTT CCAGGGCTGT AATTGAGGTT TTCGTAAGGC TTTACAAAAA GGGCCTTATA CACAGGGGCG AAAGAATTAT AAACTGGTGT CCGAAGTGCA ACACTTCCAT ATCCGATGCG GAAGTTGAAT ACGAAGAAAA AGCAGGTCAT TTCTGGCATA TAAAATATCC TGTCAAAGAC AGCGATGAAT TTGTGGTTGT TGCTACCACA AGACCTGAAA CCATGCTGGG TGATACTGCC GTTGCCGTAC ATCCTGAGGA TGAAAGATAT AAGCATCTTA TCGGCAAAAC AGTTGTTCTT CCTCTTATGA ACAGAGAGAT ACCGGTTATA GCCGACGAGT ATGTTGAAAA GGATTTCGGA ACGGGAGTTG TTAAAATAAC TCCTGCTCAT GACCCCAATG ACTTTGAATT GGGGCTTAGG CACAATCTTC CGCAGATAAG GGTTATGAAT GATGATGCAA CAATGAACGA ACTTGCAGGC AAGTATCAAG GTATGGACAG ATATGAAGCC AGAAAGCAGA TTGTAAAGGA TTTGGAAGAA CTGGGCCTGT TGCTTAAGGT TGAAGACCAT ACCCATAATG TGGGTACTTG CTACAGGTGT GCCACAGTAA TTGAGCCGCT GATTTCAAAA CAGTGGTTTG TTAAAATGAA GCCTCTGGCA GAGCCTGCTA TAGAAGTGGT TAAAAACGGA ACAATCAAAT TTGTACCGGA AAGATTCTCA AAGATATACT TTAACTGGAT GGAAAACATT CAGGATTGGT GTATTTCAAG GCAGCTTTGG TGGGGACACA GAATACCGGC TTATTACTGT CAGGAATGCG GCTATATGAT GGTTGAAAAT GAAATGCCCG ATGTTTGTCC AAAATGCGGA AGCTCCAGAA TAGAGCAGGA TCCGGATACT CTTGACACAT GGTTCAGTTC GGCACTGTGG CCTTTCTCAA CTCTTGGTTG GCCGGATGAA ACAGAGGATT TGAAATATTT CTACCCAACG GATGTTCTTG TTACAGGGTA TGATATTATA TTCTTCTGGG TTGCAAGAAT GATATTTTCA GCCTTGGAGC ATACGGGAAA AGAACCCTTC AAATATGTGT TCATACATGG TATTGTAAGG GATGCACTGG GAAGAAAGAT GAGTAAATCT TTGGGCAACG GTATTGACCC TTTGGAGATA ATTGACAAAT ACGGTACCGA TGCTTTAAGA TTTGCCCTTA CGATAGGAAC TTCACCGGGA AATGATTTGA GGTTCTCCGA GGAAAAAGCT GAATCCAGCA GGAACTTTGC AAACAAGATA TGGAATGCAT CAAGATTTGT TCTTATGAAT TTTGATGACA ATCTTGATTT TTCAAAGGTT GATCCAAATA AATTTACTAC TTCCGACAAG TGGATATTAA GCCGGGTAAA CAATCTGACC AGGGAAGTTA CCGAAAACAT GGAAAAGTTT GAGTTGGGTA TAGCTCTCCA GAAAATATAT GAATTTATCT GGGAAGAGTT CTGCGACTGG TATATTGAGC TTGTAAAGCC GAGACTTTAT GACAAGGATG ATGAAACAAG ACTGGAAGCC CAGTATGTTT TAAATTATGT TTTAGGTACT GCCATGAAAC TTCTGCACCC GTATATGCCG TTTATTACCG AGGAGATTTA CCGTCACCTG GTAGTGGATG ATGAAAGCAT CATGATTTCA AAATGGCCGG TTTACAGGGA AGACTATAAT TTCCCTGAGG AAGAAAAGAA GATGAGCCTT ATCATGGATG CTATAAAGAG CATTAGAAAT ATACGGGCGG AGATGAATGT TCCTCATTCC AGAAAGGCAA AAGCCATATT TGTCGCGCCC GGCGGCAGCG AACAGGATAT ATTGAAAGAA GGAACGGTAT TCTTTGAAAG ACTTGCTTCA TGTTCGGAAG TTGTTATCCA GCCTGACAAG TCAGGAATAC CTTCCAATGC CGTAGCTGCG ATATTGGCAG GGGTTGAAAT ATTCCTTCCT TTGGAAGACC TTATTGATAT TGAGAAGGAA ATTGAAAGAC TCGAGAAGGA ACTTTCCAAT CTTCAGAAAG AATTGGACAG GGTAAACAGC AAACTGGCAA ACGAAGGGTT TGTTTCAAAA GCCCCGCAAA AAGTGGTTGA AGAAGAGAAG AAAAAGAAGG AAAAATATCA GGAAATGTAT GATAAGGTAG TGGAAAGACT TAATGGATTA AAAAATAAGA ATTAA
|
Protein sequence | MSEKNIAKTY DPKQVEDRLY AEWMEKGYFH AEIDKEKTPF TIVIPPPNIT GQLHMGHALD ETLQDILIRW KRMQGYCTLW LPGTDHASIA TEAKIVEAMA KEGITKEDIG REKFLERAWE WKRHYGGRIV EQLKKLGCSC DWQRERFTMD EGLSRAVIEV FVRLYKKGLI HRGERIINWC PKCNTSISDA EVEYEEKAGH FWHIKYPVKD SDEFVVVATT RPETMLGDTA VAVHPEDERY KHLIGKTVVL PLMNREIPVI ADEYVEKDFG TGVVKITPAH DPNDFELGLR HNLPQIRVMN DDATMNELAG KYQGMDRYEA RKQIVKDLEE LGLLLKVEDH THNVGTCYRC ATVIEPLISK QWFVKMKPLA EPAIEVVKNG TIKFVPERFS KIYFNWMENI QDWCISRQLW WGHRIPAYYC QECGYMMVEN EMPDVCPKCG SSRIEQDPDT LDTWFSSALW PFSTLGWPDE TEDLKYFYPT DVLVTGYDII FFWVARMIFS ALEHTGKEPF KYVFIHGIVR DALGRKMSKS LGNGIDPLEI IDKYGTDALR FALTIGTSPG NDLRFSEEKA ESSRNFANKI WNASRFVLMN FDDNLDFSKV DPNKFTTSDK WILSRVNNLT REVTENMEKF ELGIALQKIY EFIWEEFCDW YIELVKPRLY DKDDETRLEA QYVLNYVLGT AMKLLHPYMP FITEEIYRHL VVDDESIMIS KWPVYREDYN FPEEEKKMSL IMDAIKSIRN IRAEMNVPHS RKAKAIFVAP GGSEQDILKE GTVFFERLAS CSEVVIQPDK SGIPSNAVAA ILAGVEIFLP LEDLIDIEKE IERLEKELSN LQKELDRVNS KLANEGFVSK APQKVVEEEK KKKEKYQEMY DKVVERLNGL KNKN
|
| |