Gene Cthe_0324 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_0324 
SymbolvalS 
ID4808470 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp407909 
End bp410563 
Gene Length2655 bp 
Protein Length884 aa 
Translation table11 
GC content41% 
IMG OID640105735 
Productvalyl-tRNA synthetase 
Protein accessionYP_001036755 
Protein GI125972845 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0525] Valyl-tRNA synthetase 
TIGRFAM ID[TIGR00422] valyl-tRNA synthetase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0384006 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCGAAA AAAATATAGC GAAAACATAT GACCCCAAAC AGGTTGAAGA CAGATTATAT 
GCGGAGTGGA TGGAGAAAGG CTATTTTCAT GCTGAAATAG ACAAAGAAAA AACGCCTTTC
ACCATTGTTA TTCCACCGCC AAACATTACA GGTCAGCTTC ATATGGGACA TGCTTTGGAT
GAAACTCTTC AGGATATCCT TATAAGATGG AAGAGGATGC AGGGGTATTG CACTCTTTGG
CTGCCCGGTA CTGACCATGC CAGCATAGCC ACAGAGGCAA AAATTGTTGA AGCCATGGCC
AAAGAAGGTA TTACCAAAGA GGATATCGGA AGAGAAAAGT TTTTAGAGAG AGCCTGGGAA
TGGAAGAGGC ATTACGGCGG AAGAATCGTC GAGCAGCTTA AAAAGCTTGG ATGCTCCTGT
GACTGGCAGA GGGAAAGATT CACCATGGAT GAAGGACTTT CCAGGGCTGT AATTGAGGTT
TTCGTAAGGC TTTACAAAAA GGGCCTTATA CACAGGGGCG AAAGAATTAT AAACTGGTGT
CCGAAGTGCA ACACTTCCAT ATCCGATGCG GAAGTTGAAT ACGAAGAAAA AGCAGGTCAT
TTCTGGCATA TAAAATATCC TGTCAAAGAC AGCGATGAAT TTGTGGTTGT TGCTACCACA
AGACCTGAAA CCATGCTGGG TGATACTGCC GTTGCCGTAC ATCCTGAGGA TGAAAGATAT
AAGCATCTTA TCGGCAAAAC AGTTGTTCTT CCTCTTATGA ACAGAGAGAT ACCGGTTATA
GCCGACGAGT ATGTTGAAAA GGATTTCGGA ACGGGAGTTG TTAAAATAAC TCCTGCTCAT
GACCCCAATG ACTTTGAATT GGGGCTTAGG CACAATCTTC CGCAGATAAG GGTTATGAAT
GATGATGCAA CAATGAACGA ACTTGCAGGC AAGTATCAAG GTATGGACAG ATATGAAGCC
AGAAAGCAGA TTGTAAAGGA TTTGGAAGAA CTGGGCCTGT TGCTTAAGGT TGAAGACCAT
ACCCATAATG TGGGTACTTG CTACAGGTGT GCCACAGTAA TTGAGCCGCT GATTTCAAAA
CAGTGGTTTG TTAAAATGAA GCCTCTGGCA GAGCCTGCTA TAGAAGTGGT TAAAAACGGA
ACAATCAAAT TTGTACCGGA AAGATTCTCA AAGATATACT TTAACTGGAT GGAAAACATT
CAGGATTGGT GTATTTCAAG GCAGCTTTGG TGGGGACACA GAATACCGGC TTATTACTGT
CAGGAATGCG GCTATATGAT GGTTGAAAAT GAAATGCCCG ATGTTTGTCC AAAATGCGGA
AGCTCCAGAA TAGAGCAGGA TCCGGATACT CTTGACACAT GGTTCAGTTC GGCACTGTGG
CCTTTCTCAA CTCTTGGTTG GCCGGATGAA ACAGAGGATT TGAAATATTT CTACCCAACG
GATGTTCTTG TTACAGGGTA TGATATTATA TTCTTCTGGG TTGCAAGAAT GATATTTTCA
GCCTTGGAGC ATACGGGAAA AGAACCCTTC AAATATGTGT TCATACATGG TATTGTAAGG
GATGCACTGG GAAGAAAGAT GAGTAAATCT TTGGGCAACG GTATTGACCC TTTGGAGATA
ATTGACAAAT ACGGTACCGA TGCTTTAAGA TTTGCCCTTA CGATAGGAAC TTCACCGGGA
AATGATTTGA GGTTCTCCGA GGAAAAAGCT GAATCCAGCA GGAACTTTGC AAACAAGATA
TGGAATGCAT CAAGATTTGT TCTTATGAAT TTTGATGACA ATCTTGATTT TTCAAAGGTT
GATCCAAATA AATTTACTAC TTCCGACAAG TGGATATTAA GCCGGGTAAA CAATCTGACC
AGGGAAGTTA CCGAAAACAT GGAAAAGTTT GAGTTGGGTA TAGCTCTCCA GAAAATATAT
GAATTTATCT GGGAAGAGTT CTGCGACTGG TATATTGAGC TTGTAAAGCC GAGACTTTAT
GACAAGGATG ATGAAACAAG ACTGGAAGCC CAGTATGTTT TAAATTATGT TTTAGGTACT
GCCATGAAAC TTCTGCACCC GTATATGCCG TTTATTACCG AGGAGATTTA CCGTCACCTG
GTAGTGGATG ATGAAAGCAT CATGATTTCA AAATGGCCGG TTTACAGGGA AGACTATAAT
TTCCCTGAGG AAGAAAAGAA GATGAGCCTT ATCATGGATG CTATAAAGAG CATTAGAAAT
ATACGGGCGG AGATGAATGT TCCTCATTCC AGAAAGGCAA AAGCCATATT TGTCGCGCCC
GGCGGCAGCG AACAGGATAT ATTGAAAGAA GGAACGGTAT TCTTTGAAAG ACTTGCTTCA
TGTTCGGAAG TTGTTATCCA GCCTGACAAG TCAGGAATAC CTTCCAATGC CGTAGCTGCG
ATATTGGCAG GGGTTGAAAT ATTCCTTCCT TTGGAAGACC TTATTGATAT TGAGAAGGAA
ATTGAAAGAC TCGAGAAGGA ACTTTCCAAT CTTCAGAAAG AATTGGACAG GGTAAACAGC
AAACTGGCAA ACGAAGGGTT TGTTTCAAAA GCCCCGCAAA AAGTGGTTGA AGAAGAGAAG
AAAAAGAAGG AAAAATATCA GGAAATGTAT GATAAGGTAG TGGAAAGACT TAATGGATTA
AAAAATAAGA ATTAA
 
Protein sequence
MSEKNIAKTY DPKQVEDRLY AEWMEKGYFH AEIDKEKTPF TIVIPPPNIT GQLHMGHALD 
ETLQDILIRW KRMQGYCTLW LPGTDHASIA TEAKIVEAMA KEGITKEDIG REKFLERAWE
WKRHYGGRIV EQLKKLGCSC DWQRERFTMD EGLSRAVIEV FVRLYKKGLI HRGERIINWC
PKCNTSISDA EVEYEEKAGH FWHIKYPVKD SDEFVVVATT RPETMLGDTA VAVHPEDERY
KHLIGKTVVL PLMNREIPVI ADEYVEKDFG TGVVKITPAH DPNDFELGLR HNLPQIRVMN
DDATMNELAG KYQGMDRYEA RKQIVKDLEE LGLLLKVEDH THNVGTCYRC ATVIEPLISK
QWFVKMKPLA EPAIEVVKNG TIKFVPERFS KIYFNWMENI QDWCISRQLW WGHRIPAYYC
QECGYMMVEN EMPDVCPKCG SSRIEQDPDT LDTWFSSALW PFSTLGWPDE TEDLKYFYPT
DVLVTGYDII FFWVARMIFS ALEHTGKEPF KYVFIHGIVR DALGRKMSKS LGNGIDPLEI
IDKYGTDALR FALTIGTSPG NDLRFSEEKA ESSRNFANKI WNASRFVLMN FDDNLDFSKV
DPNKFTTSDK WILSRVNNLT REVTENMEKF ELGIALQKIY EFIWEEFCDW YIELVKPRLY
DKDDETRLEA QYVLNYVLGT AMKLLHPYMP FITEEIYRHL VVDDESIMIS KWPVYREDYN
FPEEEKKMSL IMDAIKSIRN IRAEMNVPHS RKAKAIFVAP GGSEQDILKE GTVFFERLAS
CSEVVIQPDK SGIPSNAVAA ILAGVEIFLP LEDLIDIEKE IERLEKELSN LQKELDRVNS
KLANEGFVSK APQKVVEEEK KKKEKYQEMY DKVVERLNGL KNKN