Gene Information |
Locus tag | Cthe_1211 |
Symbol | |
ID | 4809903 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 1445603 |
End bp | 1446967 |
Gene Length | 1365 bp |
Protein Length | 454 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 640106634 |
Product | tryptophan synthase subunit beta |
Protein accession | YP_001037636 |
Protein GI | 125973726 |
COG category | [R] General function prediction only |
COG ID | [COG1350] Predicted alternative tryptophan synthase beta-subunit (paralog of TrpB) |
TIGRFAM ID | [TIGR01415] pyridoxal-phosphate dependent TrpB-like enzyme |
| |
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.00000141811 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTAAAA AAGATGTAAC CAAGGTAGTT TTAGATGAGT CGGATATTCC AAAGCAATGG TACAATATTT TGGCAGACAT GCCCAACAAG CCCGCACCTT ACTTCAGCTC AAAAACCGGC AAACCGGTTA CGTTGGACGA ACTTCAGGCA ATTTTCCCGA TGGAACTGAT TCAACAGGAA AATTCCCAGG AGAGATGGAT TGACATTCCG GAAGAAGTAA GAGAAATGTA CCGTCAATGG AGACCGAGTC CGTTGTACAG GGCTAGGGCT CTGGAAAAGC ATTTGGGGAC TCCTGCCAGA ATCTATTACA AATATGAAGG AACGAACGCA ACCGGAAGCC ACAAGCTTAA CACTTCATTG CCGCAGGCTT ATTACAACAA GATTGCCGGC ATAAAAAGAC TTTCAACGGA GACCGGCGCA GGACAGTGGG GAAGTGCACT GAGCCTTGCA TGCAATCATT TCGGACTTGA GTGTACGGTT TACATGGTTA AGGTAAGTTA TGAGCAAAAG CCCTACAGAC GTTCTTTCAT GAAAACTTTC GGAGCCCAGG TGTATGCAAG TCCTACCAAT CTTACAAGCA GCGGCAGGGC GATTTTGGAA AAAGATCCTG ATTGTACCGG AAGTCTCGGT ATTGCGATAA GTGAAGCTGT TGAAGATGCG GCTACGCACG ATGATACCAA TTATGCCTTG GGAAGTGTTT TAAATCACGT ATGTTTGCAT CAGACCATTA TCGGTCTTGA GGCCAAGAAG CAGTTGGAAT ATCTGGATGA ATACCCTGAT GTGGTCTTTG CCTGCTGCGG CGGAGGATCA AACTTTGCCG GAATAGCTTT TCCGTTCCTG ATGGACAAGT TTAAGGGAAC AAAAGTGAGA GCAGTGGCTG TTGAACCGAC TGCATGCCCC ACTCTCACAA AAGGTGTGTA TGCTTATGAT TATTCCGACA CGGGAAAGAT CGGTCCGTTG GCAAAGATGT ATACGGTTGG TCATGACTTT GTACCTGCCG GTATCCATGC AGGCGGGTTG AGATATCACG GAGTTTCACC AATAGTCAGC CAGCTTTATG AGGATAAGTT GATTGAAGCA AAAGCTTACG GACAGAGTTC GGTTTTTGAA GCGGCTGTTA TTTTTGCAAG AACGGAAGGA ATTGTTCCCG CTCCTGAGTC TTCCCATGCA ATAAGGGCTG CTATTGACGA AGCCCTGTTG TGCAAAGAAT CGGGAGAGGC GAAAGTTATT CTGTTTAATT TGAGTGGACA CGGATATTTT GACATGGCCG CTTATGACAA CTACTTTAGC GGAAAACTTA GTGACGTGGA TTATTCGGAA GAGGAAATTG CAAGAAGTAT GAAAAATTTG CCAAAGGTTG ACTAA
|
Protein sequence | MSKKDVTKVV LDESDIPKQW YNILADMPNK PAPYFSSKTG KPVTLDELQA IFPMELIQQE NSQERWIDIP EEVREMYRQW RPSPLYRARA LEKHLGTPAR IYYKYEGTNA TGSHKLNTSL PQAYYNKIAG IKRLSTETGA GQWGSALSLA CNHFGLECTV YMVKVSYEQK PYRRSFMKTF GAQVYASPTN LTSSGRAILE KDPDCTGSLG IAISEAVEDA ATHDDTNYAL GSVLNHVCLH QTIIGLEAKK QLEYLDEYPD VVFACCGGGS NFAGIAFPFL MDKFKGTKVR AVAVEPTACP TLTKGVYAYD YSDTGKIGPL AKMYTVGHDF VPAGIHAGGL RYHGVSPIVS QLYEDKLIEA KAYGQSSVFE AAVIFARTEG IVPAPESSHA IRAAIDEALL CKESGEAKVI LFNLSGHGYF DMAAYDNYFS GKLSDVDYSE EEIARSMKNL PKVD
|
| |
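The length fields in the record above are related by simple arithmetic, which can be used as a consistency check. The following is a minimal sketch, assuming 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length. The function names (`gene_length`, `protein_length`, `gc_percent`) are hypothetical helpers introduced for illustration and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Protein length in aa for an unspliced CDS, assuming one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


def gc_percent(seq: str) -> float:
    """Percent G+C over the A/C/G/T characters of seq; spaces are ignored."""
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no nucleotide characters found")
    gc = sum(b in "GC" for b in bases)
    return 100.0 * gc / len(bases)


# Values taken from the record: Start bp 1445603, End bp 1446967.
length = gene_length(1445603, 1446967)
print(length)                  # 1365, matching "Gene Length"
print(protein_length(length))  # 454, matching "Protein Length"
```

Passing the full "Gene sequence" string to `gc_percent` gives a value that can be compared with the reported GC content; the spaces that group the sequence into blocks of ten are skipped by the function.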