Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_1412 |
Symbol | |
ID | 4809073 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 1730642 |
End bp | 1731826 |
Gene Length | 1185 bp |
Protein Length | 394 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 640106835 |
Product | tryptophan synthase subunit beta |
Protein accession | YP_001037836 |
Protein GI | 125973926 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0133] Tryptophan synthase beta chain |
TIGRFAM ID | [TIGR00263] tryptophan synthase, beta subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.00140909 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCGAAAG GAAGATTTGG AATCCACGGC GGACAATACA TTCCGGAAAC ATTGATGAAC GCCGTGATTG AGCTGGAAGA GGCATATAAC CACTTTAAAA ACGACCCCGA CTTTTTGGCG GAACTTGACG ACCTGTTGAA AAATTATGCA GGACGTCCGT CTTTGCTTTA TTATGCGGAA AAGATGACAA AAGACCTGAA CGGAGCAAAA ATTTATCTGA AACGTGAGGA TCTCAACCAT ACAGGTTCGC ATAAAATAAA CAATGTTTTG GGCCAGGTGC TTCTTGCAAA ACGTATGGGT AAAAAACGCG TAATTGCCGA GACCGGCGCC GGACAGCATG GTGTTGCGAC GGCAACCGCT GCGGCGCTTA TGGGATTGGA ATGTGAAATT TTCATGGGCA AGGAGGATAC CGAGCGTCAG GCGTTGAATG TATTTAGGAT GGAGCTTTTG GGCGCGAAAG TTCATGCGGT GACAAGCGGA ACACAGACAC TTAAAGATGC GGTAAATGAG ACTTTGAGAG AGTGGTCGAG AAGGGTTCAT GACACTCATT ATGTGCTGGG TTCTGTCATG GGACCACACC CGTTTCCTAC CATTGTAAGG GATTTTCAGA GCGTAATCGG CCGGGAAATT AAAAAGCAGA TTATGGAGAA GGAAGGAAAA CTTCCTGACG TTGTTATGGC CTGTGTGGGC GGAGGCAGCA ATGCAATCGG AGCGTTTTAT GAGTTTATTG GTGATTCAAG TGTTCGGCTT ATAGGATGTG AGGCTGCCGG AAAGGGATTG GACACTGACA AGCATGCGGC AACAATGTCA AAAGGTACAT TGGGAATTTT CCACGGCATG AAGTCTTATT TCTGCCAGGA TGAGTATGGA CAGATTGCAC CGGTTTATTC TATTTCCGCA GGTCTGGACT ACCCAGGAGT GGGCCCGGAA CATGCGTATC TGAAGGATAT TGGAAGGGCT CAATACGTAG CCGTTACCGA TGATGAGGCC GTTGAGGCGT TTGAATACCT TTCCCGAACA GAAGGTATAA TTCCGGCAAT TGAGAGTTCC CATGCGGTTG CATATGCAAT GAAACTTGCT CCGACGATGA GCAAAGATCA GATAATTGTA ATTTGCCTTT CGGGAAGAGG CGATAAAGAT GTTGCGGCGA TAGCGCGTTA CAGGGGGGTT CAAATCTATG AATAA
|
Protein sequence | MSKGRFGIHG GQYIPETLMN AVIELEEAYN HFKNDPDFLA ELDDLLKNYA GRPSLLYYAE KMTKDLNGAK IYLKREDLNH TGSHKINNVL GQVLLAKRMG KKRVIAETGA GQHGVATATA AALMGLECEI FMGKEDTERQ ALNVFRMELL GAKVHAVTSG TQTLKDAVNE TLREWSRRVH DTHYVLGSVM GPHPFPTIVR DFQSVIGREI KKQIMEKEGK LPDVVMACVG GGSNAIGAFY EFIGDSSVRL IGCEAAGKGL DTDKHAATMS KGTLGIFHGM KSYFCQDEYG QIAPVYSISA GLDYPGVGPE HAYLKDIGRA QYVAVTDDEA VEAFEYLSRT EGIIPAIESS HAVAYAMKLA PTMSKDQIIV ICLSGRGDKD VAAIARYRGV QIYE
|
| |