Gene Cthe_1412 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_1412 
Symbol 
ID4809073 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp1730642 
End bp1731826 
Gene Length1185 bp 
Protein Length394 aa 
Translation table11 
GC content47% 
IMG OID640106835 
Producttryptophan synthase subunit beta 
Protein accessionYP_001037836 
Protein GI125973926 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0133] Tryptophan synthase beta chain 
TIGRFAM ID[TIGR00263] tryptophan synthase, beta subunit 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00140909 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCGAAAG GAAGATTTGG AATCCACGGC GGACAATACA TTCCGGAAAC ATTGATGAAC 
GCCGTGATTG AGCTGGAAGA GGCATATAAC CACTTTAAAA ACGACCCCGA CTTTTTGGCG
GAACTTGACG ACCTGTTGAA AAATTATGCA GGACGTCCGT CTTTGCTTTA TTATGCGGAA
AAGATGACAA AAGACCTGAA CGGAGCAAAA ATTTATCTGA AACGTGAGGA TCTCAACCAT
ACAGGTTCGC ATAAAATAAA CAATGTTTTG GGCCAGGTGC TTCTTGCAAA ACGTATGGGT
AAAAAACGCG TAATTGCCGA GACCGGCGCC GGACAGCATG GTGTTGCGAC GGCAACCGCT
GCGGCGCTTA TGGGATTGGA ATGTGAAATT TTCATGGGCA AGGAGGATAC CGAGCGTCAG
GCGTTGAATG TATTTAGGAT GGAGCTTTTG GGCGCGAAAG TTCATGCGGT GACAAGCGGA
ACACAGACAC TTAAAGATGC GGTAAATGAG ACTTTGAGAG AGTGGTCGAG AAGGGTTCAT
GACACTCATT ATGTGCTGGG TTCTGTCATG GGACCACACC CGTTTCCTAC CATTGTAAGG
GATTTTCAGA GCGTAATCGG CCGGGAAATT AAAAAGCAGA TTATGGAGAA GGAAGGAAAA
CTTCCTGACG TTGTTATGGC CTGTGTGGGC GGAGGCAGCA ATGCAATCGG AGCGTTTTAT
GAGTTTATTG GTGATTCAAG TGTTCGGCTT ATAGGATGTG AGGCTGCCGG AAAGGGATTG
GACACTGACA AGCATGCGGC AACAATGTCA AAAGGTACAT TGGGAATTTT CCACGGCATG
AAGTCTTATT TCTGCCAGGA TGAGTATGGA CAGATTGCAC CGGTTTATTC TATTTCCGCA
GGTCTGGACT ACCCAGGAGT GGGCCCGGAA CATGCGTATC TGAAGGATAT TGGAAGGGCT
CAATACGTAG CCGTTACCGA TGATGAGGCC GTTGAGGCGT TTGAATACCT TTCCCGAACA
GAAGGTATAA TTCCGGCAAT TGAGAGTTCC CATGCGGTTG CATATGCAAT GAAACTTGCT
CCGACGATGA GCAAAGATCA GATAATTGTA ATTTGCCTTT CGGGAAGAGG CGATAAAGAT
GTTGCGGCGA TAGCGCGTTA CAGGGGGGTT CAAATCTATG AATAA
 
Protein sequence
MSKGRFGIHG GQYIPETLMN AVIELEEAYN HFKNDPDFLA ELDDLLKNYA GRPSLLYYAE 
KMTKDLNGAK IYLKREDLNH TGSHKINNVL GQVLLAKRMG KKRVIAETGA GQHGVATATA
AALMGLECEI FMGKEDTERQ ALNVFRMELL GAKVHAVTSG TQTLKDAVNE TLREWSRRVH
DTHYVLGSVM GPHPFPTIVR DFQSVIGREI KKQIMEKEGK LPDVVMACVG GGSNAIGAFY
EFIGDSSVRL IGCEAAGKGL DTDKHAATMS KGTLGIFHGM KSYFCQDEYG QIAPVYSISA
GLDYPGVGPE HAYLKDIGRA QYVAVTDDEA VEAFEYLSRT EGIIPAIESS HAVAYAMKLA
PTMSKDQIIV ICLSGRGDKD VAAIARYRGV QIYE