Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_2554 |
Symbol | |
ID | 4809161 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 3023268 |
End bp | 3024521 |
Gene Length | 1254 bp |
Protein Length | 417 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 640107969 |
Product | glycosyl transferase, group 1 |
Protein accession | YP_001038948 |
Protein GI | 125975038 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0438] Glycosyltransferase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 49 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATAAAGG TTGTGCACAT AACTGCACAT TTAGGCGGAG GAATTGGAAG AATACTTTCG AGTATAGCTA TATATTCGCA AATTGAAAAA GAAGTTGAAC ATGTTATTGT TACGCTGGAA AAAACCGAGA ATTCTCATTT TGAACAGTTA CTGAAAGAAC ACAGCATAAA AGTTTTCCTG CAAAACCAAT GCTGTCTAAA ACAAATATTA CAAGAGGCTG ACATTGTCGA AGTGGATTGG TGGCATCATC CTCTAACCTC TGCGTTTATG CATAATTATT TTAATGATAT AGAATGCAGA TTGCTGATTT GGAGTCATGT TTCAGGCTGT ACCTATCCGT ATATAAAATA TGAACTTATT AAATGTGCAG ATAAATTTGT GTTTTCCACT CCCTTTTCAT TTGAAAATGA GTATTGGTCA AATGAAGAAA AAGAAGAAGT CATGAAAAGA GTAGAAATAA TAGTAAGCTC TGGAATAGAT TTTGATGCTC CTGTAAAGAA AAAGCCACAT CACGGTTATA ATGTCGGCTA TATAGGTTTT TTAAGCTATT CCAAAACACA CCCTGATTTT GTAAGGTTTC TGGAAGCTGC TGCCGACATC CCTGACATAT GTTTTAAAGT AGTAGGTGAT ACAGCATACG GGAAAGAATT GATTAAAGAT GTTCAAAATT CCAAGCTTGT ACGCAACAAG GTTATATTTG AAGGTTACGC CCTGGATGTG AAAGAAAAGT TTGCTGAATT TGATGTATTT GGGTATCCCC TAAATCCAAT GCACTACGGA ACTGCTGAAA ATGCGTTGCT TGAAGCCATG GCGGCAGGAG TTGTTCCGGT TGTTCTGAAT CAATGTACCG AAAAATACAT GGTAAGGCAT ATGGAAACAG GAATAATAGT AAACAGTATC GAAGAATACG GAACTGCATT AAGGTGGCTA AAAGATAATG CGGACAAGAG AATCCACATG GGCAATAACG CTTCTGAATT TGTCATTAAA AAGCTCCATA TTCGCGAAAC TGTCAACAGA TTAAATGCCT GTTATTCTGA TATGATGAGC CAGAATAAAA GGCTGCATGA TATATACTCT GCAATAGGAA CTAATCCATA TGAATGGTTT GTTAGTGCTT ATTGGGGTGA TGTCAATTGT TTGGAAGGCA ATTCGTTTGC CGAAACAAAG GGCTCTGCCA AGCACTATTT AAGGTACTTT CCGGAAGACA AAATATTAAG AAAGGTTGTG GAAACTAATG AAAGCCGAAT TTAA
|
Protein sequence | MIKVVHITAH LGGGIGRILS SIAIYSQIEK EVEHVIVTLE KTENSHFEQL LKEHSIKVFL QNQCCLKQIL QEADIVEVDW WHHPLTSAFM HNYFNDIECR LLIWSHVSGC TYPYIKYELI KCADKFVFST PFSFENEYWS NEEKEEVMKR VEIIVSSGID FDAPVKKKPH HGYNVGYIGF LSYSKTHPDF VRFLEAAADI PDICFKVVGD TAYGKELIKD VQNSKLVRNK VIFEGYALDV KEKFAEFDVF GYPLNPMHYG TAENALLEAM AAGVVPVVLN QCTEKYMVRH METGIIVNSI EEYGTALRWL KDNADKRIHM GNNASEFVIK KLHIRETVNR LNACYSDMMS QNKRLHDIYS AIGTNPYEWF VSAYWGDVNC LEGNSFAETK GSAKHYLRYF PEDKILRKVV ETNESRI
|
| |