Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_1788 |
Symbol | |
ID | 4810033 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 2110040 |
End bp | 2111296 |
Gene Length | 1257 bp |
Protein Length | 418 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 640107202 |
Product | glycosyl transferase family protein |
Protein accession | YP_001038202 |
Protein GI | 125974292 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1215] Glycosyltransferases, probably involved in cell wall biogenesis |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTATCAC AATTAAACGG CATTATATAC GGCACAACCC AAATTATGCA GGTGGTAATA TTCATAGCGG GATGTTATTT TTTCGGAATA TCAATCTTTG GCTGGATAAA GCGCAGGGAA ACCAGCCCTA AAGAATATGT TCCCCAAAAA AAGTTTGCTT TGATCGTTGC CGCCCATAAT GAAGAAGCGG TAATCGGCCA TATAGTTGAC AGCCTCTTTA GGCAAAATTA CCCCCGCAAT CTTTTTGACG TATATGTAGT TGCCGACAAC TGTACGGACA GAACCGCCGA AATTGCTGAA GAACACGGCG CAATTGTATA CAAAAGGTAC AACAACTCCG CCCGGGGAAA GGGATATGCC CTGGAGTGGA TGTTTGAAAA AATATATAAT ATGGAAGAAA AATATGATGC AATAAGTGTA TTCGATGCCG ACAACCTCGT TTCAGCCAAT TACCTTTTGG AAATGAACAA GCAGCTTTGC AAGGGCCACA AGGTTGTTCA GGGATATGTT GACAGCAAAA ATCCTTTTGA TTCATGGATT ACATTGTCAT ACTCCATAGC TTTCTGGCTT TCAAACAGGA TATTCCAGCT TCCCAGGTAC TATTTGGGCT TGAGCTGCGG TCTTTGCGGC ACGGGCTTTT GCATTTCCGT GGATGTTTTA AAAGAAATAG GCTGGGGAGC AACATGCCTT ACAGAAGATT TGGAATTTAC AATGAAGCTG GCCCTGAACA ACTACAAAGT CGCATGGGCT CACAACGCTG TGGTTTATGA CGAAAAACCC ATTACATTAA AGCAGTCATG GAACCAGCGT AAAAGGTGGA TGCAGGGTCA TGCCGACTGT GCAAGCCGAT ATTTGGGCCC GTTGTTTAAA AAAGCCTTCA GGGAAGGAGA TTTAATAGCT TTTGACTGCG CGGTTTACTT GTTTCAGCCC ATAAGGCTGG TTTTTATCGG GCTGATAACA ATAATGATGT GGATTCAAAC CGTTTTCCCT GAATCCCCTT TTTATAATCT TAAGTATGTT TTTCCCACAG AAGTATGGTC CGTGTTCGTA ACGCTGCAGT TTCTCTATGG TCCTTTGGTG GTGCTTTCGG AGAAAAAATT CAATCTCAAG GTGCTTTACG GCTTTTTGAT TTATCCTTTT TACTGCATTA CGTGGATACC AATCACCATA CAGGGATTCA TGAGTAAAAA CAATAAAGAC TGGAGCCACA CTCAGCATTC AAGGAAAATA AGCATATCCG ATCTTGAAAA GGCATAA
|
Protein sequence | MLSQLNGIIY GTTQIMQVVI FIAGCYFFGI SIFGWIKRRE TSPKEYVPQK KFALIVAAHN EEAVIGHIVD SLFRQNYPRN LFDVYVVADN CTDRTAEIAE EHGAIVYKRY NNSARGKGYA LEWMFEKIYN MEEKYDAISV FDADNLVSAN YLLEMNKQLC KGHKVVQGYV DSKNPFDSWI TLSYSIAFWL SNRIFQLPRY YLGLSCGLCG TGFCISVDVL KEIGWGATCL TEDLEFTMKL ALNNYKVAWA HNAVVYDEKP ITLKQSWNQR KRWMQGHADC ASRYLGPLFK KAFREGDLIA FDCAVYLFQP IRLVFIGLIT IMMWIQTVFP ESPFYNLKYV FPTEVWSVFV TLQFLYGPLV VLSEKKFNLK VLYGFLIYPF YCITWIPITI QGFMSKNNKD WSHTQHSRKI SISDLEKA
|
| |