Gene Cthe_0686 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_0686 
Symbol 
ID4810304 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp845641 
End bp846630 
Gene Length990 bp 
Protein Length329 aa 
Translation table11 
GC content42% 
IMG OID640106103 
Producttryptophanyl-tRNA synthetase 
Protein accessionYP_001037114 
Protein GI125973204 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0180] Tryptophanyl-tRNA synthetase 
TIGRFAM ID[TIGR00233] tryptophanyl-tRNA synthetase 


Plasmid Coverage information

Num covering plasmid clones37 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAAGAAAG GTACTATTTT GAGCGGCATG AGACCTACCG GAGCTTTGCA TCTTGGCAAT 
TATTTCGGAG CTCTGGAAAA CTGGGTAAAA CTTCAGGATG AATACGAGTG TTATTTTTTT
GTGGCTGATT GGCATGCCCT TACAACAGGA TATGAAGATA CATCTCAAAT CAAAAATAAT
ATAAATGACC TTGTTATAGA TTGGCTAAGT GCAGGACTTG ACCCTGAAAA ATGCGTCATA
TTTTTGCAGT CAAGTATAAA AGAACATGCA GAGCTTCATC TGTTGTTTTC CATGACAACG
CCTCTTTCCT GGCTGCTTCG CTGTCCGACA TACAAGGATC AGATTAATCA ATTGAAGGAC
AAGAATATTA CGACCTACGG ATTTTTAGGA TATCCGTGTC TTCAGGCAGC CGACATATTA
ATTTACAAAG CCGGTTTTGT ACCTGTGGGA GAAGACCAGC TTCCGCACCT TGAGTTGACG
AGGGAAATTG CAAGAAGATT TAATTATTTG TTTGGCGAGG TATTCCCTGA GCCGCAGGCA
ATTTTGACCA AGGCAAAAGT ATTGCCCGGA ACCGACGGCA GAAAGATGAG CAAAAGCTAT
GGCAATACCA TAGCTCTGTC CGACAGTCCC GATACAATCA GAAAGAAAGT CAGCTCAATG
ATAACCGACC CTGCAAGAAT CAGAAAGGAC GATCCCGGTC ATCCCGAGGT GTGTACGGTA
TTTTCCTTCC ACAAAGTATT TAATGAAAAT GAAGTGCCTG AAATTGAGCA GCACTGCAGA
GGCGGAAAAA TTGGGTGTGT GCAATGTAAA AAGAACCTTG CTGACAAAAT GGTGGAGCAT
TTGGAGCCCA TATATGAAAA AAGGCAAAAG ATAGTTGAAA ATCCGTCCAT AGTCAAAGAA
ATTCTCGCAG ACGGAAATGA AAAAGCCAGA AAGGTTGCGC AAAAGACTCT TGAAGAAGTA
CGAAAAGCCA TGAAAATAGA TTTTATTTAG
 
Protein sequence
MKKGTILSGM RPTGALHLGN YFGALENWVK LQDEYECYFF VADWHALTTG YEDTSQIKNN 
INDLVIDWLS AGLDPEKCVI FLQSSIKEHA ELHLLFSMTT PLSWLLRCPT YKDQINQLKD
KNITTYGFLG YPCLQAADIL IYKAGFVPVG EDQLPHLELT REIARRFNYL FGEVFPEPQA
ILTKAKVLPG TDGRKMSKSY GNTIALSDSP DTIRKKVSSM ITDPARIRKD DPGHPEVCTV
FSFHKVFNEN EVPEIEQHCR GGKIGCVQCK KNLADKMVEH LEPIYEKRQK IVENPSIVKE
ILADGNEKAR KVAQKTLEEV RKAMKIDFI