Gene Information |
Locus tag | Cthe_3097 |
Symbol | |
ID | 4809723 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 3654193 |
End bp | 3655158 |
Gene Length | 966 bp |
Protein Length | 321 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 640108525 |
Product | glycosyl transferase family protein |
Protein accession | YP_001039485 |
Protein GI | 125975575 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0463] Glycosyltransferases involved in cell wall biogenesis |
TIGRFAM ID | |
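
As a consistency check on the coordinates and lengths listed above, the following minimal Python sketch recomputes the gene length from Start bp and End bp and derives the protein length from it. It assumes the coordinates are 1-based and inclusive and that the final codon of this non-spliced CDS is a stop codon. Neither assumption is stated in the record itself, and the variable names are illustrative only.

```python
# Values copied from the Gene Information record above.
start_bp = 3654193
end_bp = 3655158
gene_length_bp = 966      # "Gene Length" field
protein_length_aa = 321   # "Protein Length" field

# Assumption: coordinates are 1-based and inclusive, so both ends count.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS is read in 3 bp codons and the last codon is a stop
# codon, which is not translated into an amino acid.
codons = span_bp // 3
coding_codons = codons - 1

print(span_bp, codons, coding_codons)  # prints: 966 322 321
```

Under these assumptions the computed span matches the listed gene length of 966 bp, and the 322 codons minus one stop codon match the listed protein length of 321 aa.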
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.0000000063803 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGGGTACAT ATTTGATTTC AGTTGCTATC CCGGTATACA ATGAAGCAAA ACAAATTTAT GAAAATATAA ATATAATTCA CAAAATTTTG ACAGAAAACA ACATTAATCA TGAATTTATC CTTGTGGACG ACGGGTCCCG GGACAACACG TGGGAAGAGC TTAAAAGGCT TTCTGTGGAT CTTCCGAATG TCCTGGCCAT CAGACTTAGC AGGAATTTCG GCAAGGAAGC CGCTCTTTGT GCGGCGTTGG AAGCCGCCCA CGGAGATGCT TGTGTGGTTA TGGATTCCGA CCTTCAGCAT CCGCCGGAAA TCATTCCCCA AATGGTGCGA ATGTGGAGAG AAGAAGGTTA TGACGTCGTG GAAGGTGTAA AGTCTTCAAG GGGTAAGGAA AGTATTACAA ACAAGGTGGG TGCCCATTTA TTCTACTCAA TATTGAAAAA TTTGTCGGGC TTTAACTTAG ACGGCGCATC CGATTTCAAA CTTCTTGATT CGAAAGTGGT GGCTTCATGG CGCATCATGC CGGAGCGCAA CACCTTTTTC AGAGGAATGT CCGCCTGGCT TGGATACAAA AGAGCCTCCA TTCCCTTCGA AGTGGTGGAA AGAAGGGAAG GAAAATCAAA ATGGTCCACT TTAAAACTTT TCAAACTGGC AATAACCGCA ATAACATCTT TTTCGTCGCT TCCGCTTCAT TTGGTTACAT TAATGGGCAT GTTGTTTCTC TTCTGCTCCA TTATAATGGG CATTTACACC CTGTATATGA AGTTCAGAGG TTTGGCCGTA AGCGGTTTCA CCACTGTCAT TCTTCTGCTT TTGATAATCG GGAGCACATT AATGATAAGT CTCGGCATTA TAGGAACCTA CATTGCAAAA ATATTTGACG AGGTCAAGTT CCGTCCGCGG TACATAATAA GCGAAAAAGC GACAAGTAAA AAAACAGAAC ATGAAAATAT CGGCACAAAC AGTTAA
Protein sequence | MGTYLISVAI PVYNEAKQIY ENINIIHKIL TENNINHEFI LVDDGSRDNT WEELKRLSVD LPNVLAIRLS RNFGKEAALC AALEAAHGDA CVVMDSDLQH PPEIIPQMVR MWREEGYDVV EGVKSSRGKE SITNKVGAHL FYSILKNLSG FNLDGASDFK LLDSKVVASW RIMPERNTFF RGMSAWLGYK RASIPFEVVE RREGKSKWST LKLFKLAITA ITSFSSLPLH LVTLMGMLFL FCSIIMGIYT LYMKFRGLAV SGFTTVILLL LIIGSTLMIS LGIIGTYIAK IFDEVKFRPR YIISEKATSK KTEHENIGTN S