Gene Information |
Locus tag | Cthe_3094 |
Symbol | |
ID | 4809720 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 3649007 |
End bp | 3649981 |
Gene Length | 975 bp |
Protein Length | 324 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 640108522 |
Product | glycosyl transferase family protein |
Protein accession | YP_001039482 |
Protein GI | 125975572 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0463] Glycosyltransferases involved in cell wall biogenesis |
TIGRFAM ID | |
|
|
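The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions and that the last codon of the CDS is a stop codon that is not counted in the protein length. Neither convention is stated in the record, and the function names are hypothetical.

```python
# Values copied from the Gene Information table above.
START_BP = 3649007
END_BP = 3649981
GENE_LENGTH_BP = 975
PROTEIN_LENGTH_AA = 324


def span_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming the final codon is a stop."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons hold for this record under the stated assumptions.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 3649981 - 3649007 + 1 gives 975 bp, and 975 / 3 - 1 gives 324 aa, which agrees with the Gene Length and Protein Length fields.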
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.000830791 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCGGAGA AAATTGTATG TTCAGTTGTT GTACCTCTTT ACAACGAAGA AGAGGTAATT CTGGAAACCT ACAAAAGGCT TAAAAATGTA ATGGAATCAC TGAATGAGCC TTATGAAATA ATATTTGTCA ACGATGGAAG CAAAGACAGA ACAGGAATTA TCGCCAATGA AATATGTAAC AAGGATAAAA CCGTAAAGCT GGTGGACTTT GCAAGAAATT TCGGCCATCA AACCGCAATA ACCGCCGGAA TGGATTATTC TGAAGGTGAA GCCATAGTTG TCATTGATGC AGACCTTCAG GACCCTCCGG AGCTTATTCC CAAAATGATT GAAAAATGGC GCGAAGGCTA TGATGTTGTA TACGGAAAAA GAAAAGAAAG AAAAGGCGAA ACTTTCTTTA AAAAGTTTAC GGCAAAGGTA TTCTACCGCT TCTTAAGAAG AATGACCGAT GTAGATATTC CTGTTGACAC AGGCGACTTC AGACTCATAG ACCGGAAGGT CTGTGAAGCT CTCAAGCTGG TTAATGAGCG CAACAGATAT ATACGCGGCA TAATAAGCTG GCTTGGCTTT AAACAAACAG GAATTGAGTT TGTAAGGGAA AAACGCTTTG CCGGCGAAAC AAAGTATCCC TTAAAGAAAA TGCTGAAGTT TGCCGCCGAT GCCATTACAT CATTCTCCTA TAAACCTTTG AAGCTGGCGT CATACTTTGG TATGCTGCTC TCATTTTGCA GTTTCGTATA TCTGCTTGTG GTTATCTGGA TGAAGCTTTT TACGGACCAT GTACAACAAG GTTGGGCGTC AACCGTCGCA ATCAACCTCT TTTTCCACGG CATTACTCTT ATCATTTTAG GTATCATGGG AGAGTATATA GGAAGAATTT ATGACGAAGC CAAAGGAAGA CCTTTATATA TCGTAAAACA GACCAGAAAC TTCTCTGAAG ACAAAACCGA CAAGATAACC ATAAGAAAAA AATAA
|
Protein sequence | MSEKIVCSVV VPLYNEEEVI LETYKRLKNV MESLNEPYEI IFVNDGSKDR TGIIANEICN KDKTVKLVDF ARNFGHQTAI TAGMDYSEGE AIVVIDADLQ DPPELIPKMI EKWREGYDVV YGKRKERKGE TFFKKFTAKV FYRFLRRMTD VDIPVDTGDF RLIDRKVCEA LKLVNERNRY IRGIISWLGF KQTGIEFVRE KRFAGETKYP LKKMLKFAAD AITSFSYKPL KLASYFGMLL SFCSFVYLLV VIWMKLFTDH VQQGWASTVA INLFFHGITL IILGIMGEYI GRIYDEAKGR PLYIVKQTRN FSEDKTDKIT IRKK
|
| |
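The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It assumes that the gene sequence is listed in coding orientation (it begins with ATG even though the gene is on the minus strand) and relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code, differing only in which codons may serve as starts. The `translate` helper is hypothetical and is not part of any database tooling.

```python
from itertools import product

# Codon-to-amino-acid assignments in TCAG order. These assignments are shared
# by the standard code and translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon.

    Whitespace is ignored, so the 10-base blocks used in the record can be
    pasted in directly. Codons with unexpected characters become 'X'.
    """
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


if __name__ == "__main__":
    # First three 10-base blocks of the gene sequence in the record.
    print(translate("ATGTCGGAGA AAATTGTATG TTCAGTTGTT"))  # MSEKIVCSVV
```

The first 30 bases translate to MSEKIVCSVV, matching the first block of the protein sequence. The sketch does not handle alternative start codons such as GTG or TTG, which table 11 permits and which are read as M at the first position; that case does not arise here because the gene starts with ATG.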