Gene Cthe_1211 details


Gene Information

Locus tag: Cthe_1211
Symbol:
ID: 4809903
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 1445603
End bp: 1446967
Gene Length: 1365 bp
Protein Length: 454 aa
Translation table: 11
GC content: 45%
IMG OID: 640106634
Product: tryptophan synthase subunit beta
Protein accession: YP_001037636
Protein GI: 125973726
COG category: [R] General function prediction only
COG ID: [COG1350] Predicted alternative tryptophan synthase beta-subunit (paralog of TrpB)
TIGRFAM ID: [TIGR01415] pyridoxal-phosphate dependent TrpB-like enzyme
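
The coordinates and lengths listed above are mutually consistent, which can be checked with a small sketch. It assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the stop codon; the function names are illustrative only and are not part of any database API.

```python
# Values copied from the gene record; function names are hypothetical.
START_BP = 1445603
END_BP = 1446967


def gene_length(start: int, end: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count, assuming the final codon is a stop codon."""
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder:
        raise ValueError("gene length is not a multiple of 3")
    return codons - 1  # drop the stop codon


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1365
print(protein_length(length_bp))  # 454
```

Both results match the "Gene Length" and "Protein Length" fields of the record.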


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00000141811
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGTAAAA AAGATGTAAC CAAGGTAGTT TTAGATGAGT CGGATATTCC AAAGCAATGG 
TACAATATTT TGGCAGACAT GCCCAACAAG CCCGCACCTT ACTTCAGCTC AAAAACCGGC
AAACCGGTTA CGTTGGACGA ACTTCAGGCA ATTTTCCCGA TGGAACTGAT TCAACAGGAA
AATTCCCAGG AGAGATGGAT TGACATTCCG GAAGAAGTAA GAGAAATGTA CCGTCAATGG
AGACCGAGTC CGTTGTACAG GGCTAGGGCT CTGGAAAAGC ATTTGGGGAC TCCTGCCAGA
ATCTATTACA AATATGAAGG AACGAACGCA ACCGGAAGCC ACAAGCTTAA CACTTCATTG
CCGCAGGCTT ATTACAACAA GATTGCCGGC ATAAAAAGAC TTTCAACGGA GACCGGCGCA
GGACAGTGGG GAAGTGCACT GAGCCTTGCA TGCAATCATT TCGGACTTGA GTGTACGGTT
TACATGGTTA AGGTAAGTTA TGAGCAAAAG CCCTACAGAC GTTCTTTCAT GAAAACTTTC
GGAGCCCAGG TGTATGCAAG TCCTACCAAT CTTACAAGCA GCGGCAGGGC GATTTTGGAA
AAAGATCCTG ATTGTACCGG AAGTCTCGGT ATTGCGATAA GTGAAGCTGT TGAAGATGCG
GCTACGCACG ATGATACCAA TTATGCCTTG GGAAGTGTTT TAAATCACGT ATGTTTGCAT
CAGACCATTA TCGGTCTTGA GGCCAAGAAG CAGTTGGAAT ATCTGGATGA ATACCCTGAT
GTGGTCTTTG CCTGCTGCGG CGGAGGATCA AACTTTGCCG GAATAGCTTT TCCGTTCCTG
ATGGACAAGT TTAAGGGAAC AAAAGTGAGA GCAGTGGCTG TTGAACCGAC TGCATGCCCC
ACTCTCACAA AAGGTGTGTA TGCTTATGAT TATTCCGACA CGGGAAAGAT CGGTCCGTTG
GCAAAGATGT ATACGGTTGG TCATGACTTT GTACCTGCCG GTATCCATGC AGGCGGGTTG
AGATATCACG GAGTTTCACC AATAGTCAGC CAGCTTTATG AGGATAAGTT GATTGAAGCA
AAAGCTTACG GACAGAGTTC GGTTTTTGAA GCGGCTGTTA TTTTTGCAAG AACGGAAGGA
ATTGTTCCCG CTCCTGAGTC TTCCCATGCA ATAAGGGCTG CTATTGACGA AGCCCTGTTG
TGCAAAGAAT CGGGAGAGGC GAAAGTTATT CTGTTTAATT TGAGTGGACA CGGATATTTT
GACATGGCCG CTTATGACAA CTACTTTAGC GGAAAACTTA GTGACGTGGA TTATTCGGAA
GAGGAAATTG CAAGAAGTAT GAAAAATTTG CCAAAGGTTG ACTAA
 
Protein sequence
MSKKDVTKVV LDESDIPKQW YNILADMPNK PAPYFSSKTG KPVTLDELQA IFPMELIQQE 
NSQERWIDIP EEVREMYRQW RPSPLYRARA LEKHLGTPAR IYYKYEGTNA TGSHKLNTSL
PQAYYNKIAG IKRLSTETGA GQWGSALSLA CNHFGLECTV YMVKVSYEQK PYRRSFMKTF
GAQVYASPTN LTSSGRAILE KDPDCTGSLG IAISEAVEDA ATHDDTNYAL GSVLNHVCLH
QTIIGLEAKK QLEYLDEYPD VVFACCGGGS NFAGIAFPFL MDKFKGTKVR AVAVEPTACP
TLTKGVYAYD YSDTGKIGPL AKMYTVGHDF VPAGIHAGGL RYHGVSPIVS QLYEDKLIEA
KAYGQSSVFE AAVIFARTEG IVPAPESSHA IRAAIDEALL CKESGEAKVI LFNLSGHGYF
DMAAYDNYFS GKLSDVDYSE EEIARSMKNL PKVD
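
The relationship between the gene sequence and the protein sequence can be illustrated with a minimal sketch. It assumes the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code (the two differ only in permitted start codons). Only the first 60 and last 45 bases of the gene are embedded here to keep the example short; the helper names are illustrative and not taken from any library.

```python
# Standard genetic code with bases ordered T, C, A, G (NCBI layout).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO_ACIDS)
)


def clean(seq: str) -> str:
    """Remove the blank-separated 10-base grouping and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate an in-frame DNA sequence; '*' marks a stop codon."""
    seq = clean(seq)
    if len(seq) % 3:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


# First and last lines of the gene sequence, copied from the record.
first_60 = "ATGAGTAAAA AAGATGTAAC CAAGGTAGTT TTAGATGAGT CGGATATTCC AAAGCAATGG"
last_45 = "GAGGAAATTG CAAGAAGTAT GAAAAATTTG CCAAAGGTTG ACTAA"

print(translate(first_60))  # MSKKDVTKVVLDESDIPKQW
print(translate(last_45))   # EEIARSMKNLPKVD*
```

The two outputs agree with the start and end of the protein sequence listed above, with the trailing `*` corresponding to the TAA stop codon. Passing the full gene sequence to `gc_percent` would give a value to compare against the "GC content" field.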